Software Engineer, AI Inference
$100k - $300kSkild
At Skild AI, we are building the world's first general purpose robotic intelligence that is robust and adapts to unseen scenarios without failing. We believe massive scale through data-driven machine learning is the key to unlocking these capabilities for the widespread deployment of robots within society. Our team consists of individuals with varying levels of experience and backgrounds, from new graduates to domain experts. Relevant industry experience is important, but ultimately less so than your demonstrated abilities and attitude. We are looking for passionate individuals who are eager to explore uncharted waters and contribute to our innovative projects.
Position Overview
We are looking for a Software Engineer to work at the forefront of deploying our cutting-edge AI models, enhancing the performance and capabilities of our embodied systems. You will be responsible for optimizing AI inference processes from lightweight to billion-parameter models, ensuring our robots operate with unmatched efficiency and intelligence in real-world environments. You will work at the intersection of systems and machine learning, directly contributing to making our AI models more powerful and adaptive by ensuring consistent performance in light of variable and perhaps unforeseen compute and hardware constraints.
Responsibilities
- Develop and optimize runtime AI inference pipelines for real-world robotic deployment.
- Build infrastructure, frameworks, and tooling to enable reliable integration of models into robotic systems and informative analysis of production models to drive the direction of architecture choice and deployment system design.
- Formulate specialized optimization solutions for various inference paradigms and scenarios (autoregressive models, denoising models, hierarchical models, state machines, multi-agent systems, cloud-based inference).
- Adapt optimization solutions to various compute, hardware, and networking constraints.
Preferred Qualifications
- BS, MS or higher degree in Computer Science, Robotics, Engineering or a related field, or equivalent practical experience.
- Minimum of 3 years of industry experience.
- Proficiency developing in low-level systems languages (C, C++, Rust, Go), Python and at least one deep learning library such as PyTorch, TensorFlow, JAX, etc.
- Deep understanding and practical experience with low-level systems concepts (multithreading, networking, embedded systems, memory management).
- Experience with CUDA.
- Deep understanding of state-of-the-art machine learning techniques and models.
- Experience optimizing various machine learning architectures.
- Experience with machine learning compilers.
- Experience optimizing model inference for robotic systems deployment.
Base Salary Range
$100,000 - $300,000 USD
Apply for this job
*
indicates a required field
First Name *
Last Name *
Email *
Phone
Resume/CV *
Enter manually
Accepted file types: pdf, doc, docx, txt, rtf
Education
Degree * Select...
LinkedIn Profile
Personal Website or Portfolio
Which office(s) are you willing to work? *
Pittsburgh, PA
For undergraduate applicants, transcripts must be included in your application. Please attach below.
Accepted file types: pdf, doc, docx, txt, rtf
Why do you want to work at Skild AI? *
Tell us about two to three projects or accomplishments you’re most proud of! Whether it’s building something impactful, contributing to open-source work, launching a company, winning an award, or anything else that showcases your skills — we’re excited to hear about it! *
#J-18808-Ljbffr$320k
...Staff + Sr. Software Engineer, Cloud Inference San Francisco, CA About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly...SuggestedWork at officeVisa sponsorshipFlexible hours$187.5k - $395k
...Software Engineer, Inference Luma's mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality is critical for intelligence. To go beyond language models and build more aware, capable and useful systems, the next step...Suggested$142.2k - $204.6k
...P-1284 About This Role As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers... ...USD About Databricks Databricks is the data and AI company. More than 10,000 organizations worldwide - including...SuggestedLocal areaWorldwide- ...We are seeking a highly technical Inference Engine Engineer to optimize the performance and... ...next-generation generative and agentic AI workloads. Your work will directly power... ...Analyze performance bottlenecks across the software and hardware stack, and implement targeted...SuggestedWorldwideFlexible hours
$325k
...About the Team Our Inference team brings OpenAI's most capable research and technology to... ...alike to use and access our state-of-the-art AI models, allowing them to do things that... .... About the Role We're hiring engineers to scale and optimize OpenAI's inference...Suggested- ...About the Role We are hiring Software Engineers focused on AI Infrastructure to build the systems that enable frontier multimodal AI to operate... ...engineering - including GPU orchestration, large-scale inference systems, performance optimization, and developer...InternshipImmediate start
$295k
...About the Team Our Inference team brings OpenAI's most capable... ...access our start-of-the-art AI models, allowing them to do things... ...We are looking for an engineer who wants to take the world's... ...least 5 years of professional software engineering experience....$160k - $250k
...Senior Backend Engineer, Inference Platform San Francisco About the Role Together AI is building the Inference Platform that brings the most advanced generative... ...orchestration is a strong plus. ~ Familiarity with GPU software stacks (CUDA, Triton, NCCL) and HPC...Full timeLocal area$160k - $250k
...A pioneering AI company in San Francisco is seeking a Senior Backend Engineer for their Inference Platform. The role involves optimizing latency, developing auto-scaling systems, and collaborating with ML researchers to scale architectures. Ideal candidates will have...- ...for a Solution Architect in San Francisco to enable enterprises to implement AI workloads effectively. The role involves designing large-scale deployment architectures, solving AI inference challenges, and collaborating closely with customers' DevOps teams. Ideal candidates...Flexible hours
- Qualifications CUDA + GPU inference optimization vLLM, SGLang, or TensorRT-LLM experience KV caching, paged attention, batching, token streaming... ...plus) No degree required Company Luminal (YC S25) builds an AI compiler and serving stack that makes models 10x faster and...
- ...About Us Most AI is frozen in place - it doesn't adapt to the world. We think that... ...compute into useful intelligence - the inference services that serve LLMs at scale and the... ...you honest about both. Researchers and ML engineers will hand you workloads that barely run;...Flexible hours
- ...BASETEN Baseten powers mission‑critical inference for the world's most dynamic AI companies, like Cursor, Notion,... ...us and help build the platform engineers turn to to ship AI products. THE... ...who enjoy working across product, software development, performance engineering...Work experience placementFlexible hours
- .... Build and lead a team of engineers responsible for implementing the low-level inference stack, including kernel development... ...low-level performance-critical software such as CUDA kernels, compilers,... ...About OpenAI OpenAI is an AI research and deployment company...Work at officeRelocation package
$380k
...team focused on integrating multimodal functionalities into our AI products, ensuring they are reliable, user-friendly, and... ...societal benefit. About the Role We're looking for a GPU Inference Engineer to contribute to improvements in model serving efficiency for...Work at officeRelocation package- ...Tech Lead, Data & Inference Engineer San Francisco, California, United States About the Job Tech Lead, Data & Inference Engineer Our... ...leading talent agency with a specialized vertical in Applied AI, Machine Learning, and Data Science. We stand out as an agency...Full time
- ...Staff Technical Lead for Inference & ML Performance San Francisco fal is the generative... ...powering the next generation of AI products. We build the infrastructure, tools... ...You'll shape the future of fal's inference engine and ensure our generative models achieve...
$175k - $225k
...security. Our team is led by veteran operators and engineers, alumni of Sonos, Paypal, Tesla, Apple, and... ...firms. The Role We're looking for an AI Inference Engineer who lives at the boundary of high-performance software and physical hardware. In this role, you won'...Local areaRemote work- A tech company specializing in AI infrastructure is seeking a skilled professional to build scalable infrastructure for AI model training and inference. You will lead architectural decisions and work with core systems that power their GPU optimization platform. Candidates...
$167.2k - $209k
A leading cloud service provider is seeking a Senior Engineer 2 for their AI Inference Data Plane team. This remote role focuses on designing and developing high-scale, resilient data plane services that enhance AI-driven applications. The ideal candidate will have strong...Remote job- ...An innovative studio is seeking an AI Infrastructure Engineer to enhance their ML infrastructure for groundbreaking anime games. This role involves designing and implementing cutting-edge inference architectures to support various platforms. As part of a small, agile...Worldwide
$350k
...A leading AI research organization seeks an Infrastructure Research Engineer in San Francisco to optimize and scale systems powering large AI models. This role emphasizes enhancing inference speed, reliability, and cost-effectiveness. Ideal candidates possess a Bachelor...Visa sponsorship- Fathom is seeking a Model Performance Engineer in San Francisco to optimize the speed, cost, and reliability of its model inference stack while building fine-tuning infrastructure. The ideal candidate will have extensive experience with LLM frameworks, quantization techniques...
- A leading cloud infrastructure company is seeking a Senior Engineer 2 to join their AI Inference Optimization team. The role involves leading the technical strategy for performance architecture and addressing complex performance issues ensuring industry-leading service....Remote job
$220k
Perplexity is looking for an engineer to join their team in San Francisco. You will work on building and operating the inference engine, supporting new models, migrating GPU kernels... ...candidate has 3+ years of experience in software engineering with a focus on ML inference...- ...Full time Location Type Hybrid Department Inference Model Serving Who are we? Our mission is... ...developers and enterprises who are building AI systems to power magical experiences like... .... Cohere is a team of researchers, engineers, designers, and more, who are passionate...Full timeWork experience placementWork at officeRemote workFlexible hours
$56.25 - $137 per hour
...Join to apply for the Site Reliability Engineer - Inference role at Jobright.ai 2 days ago Be among the first 25 applicants Join to apply for... ...: Required: • 8 or more years of experience as a software reliability engineer or software engineer working on large...Full timeSummer workInternshipH1bShift work- ...Skild AI is searching for a passionate Software Engineer to enhance AI models and ensure optimal performance of robotic systems. In this role, you will develop cutting-edge AI inference processes, tackling challenges of efficiency in diverse real-world scenarios. Ideal...
$160k - $230k
...Senior Software Engineer - Together Cloud Infrastructure Together AI is building the AI Acceleration Cloud, an end-to-end platform for the full generative AI lifecycle, combining the fastest LLM inference engine with state-of-the-art AI cloud infrastructure. As...Full timeWork at officeRemote work$180k - $220k
...for agentic team chat—a workspace where AI agents and humans collaborate as peers. Our... ...AI era. We’re looking for a backend engineer to join our small team and help lead the... ...APIs, with emphasis on integrating AI/ML inference endpoints and ensuring predictable SLAs...Work at officeLocal areaImmediate startFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Software Engineer, AI Inference. Be the first to apply!
- software sales engineer San Francisco, CA
- software engineer internship remote San Francisco, CA
- IT software developer San Francisco, CA
- new grad software engineer San Francisco, CA
- software engineer staff San Francisco, CA
- integration software engineer San Francisco, CA
- machine learning software engineer San Francisco, CA
- software engineer part time San Francisco, CA
- facebook software engineer San Francisco, CA
- senior robotics software engineer San Francisco, CA


