Software Engineer, AI Inference

$100k - $300k

Skild

At Skild AI, we are building the world's first general-purpose robotic intelligence that is robust and adapts to unseen scenarios without failing. We believe massive scale through data-driven machine learning is the key to unlocking these capabilities for the widespread deployment of robots within society. Our team consists of individuals with varying levels of experience and backgrounds, from new graduates to domain experts. Relevant industry experience is important, but ultimately less so than your demonstrated abilities and attitude. We are looking for passionate individuals who are eager to explore uncharted waters and contribute to our innovative projects.

Position Overview

We are looking for a Software Engineer to work at the forefront of deploying our cutting-edge AI models, enhancing the performance and capabilities of our embodied systems. You will be responsible for optimizing AI inference processes from lightweight to billion-parameter models, ensuring our robots operate with unmatched efficiency and intelligence in real-world environments. You will work at the intersection of systems and machine learning, directly contributing to making our AI models more powerful and adaptive by ensuring consistent performance under variable and possibly unforeseen compute and hardware constraints.

Responsibilities

Develop and optimize runtime AI inference pipelines for real-world robotic deployment.
Build infrastructure, frameworks, and tooling that enable reliable integration of models into robotic systems and that support informative analysis of production models to guide architecture choices and deployment system design.
Formulate specialized optimization solutions for various inference paradigms and scenarios (autoregressive models, denoising models, hierarchical models, state machines, multi-agent systems, cloud-based inference).
Adapt optimization solutions to various compute, hardware, and networking constraints.

Preferred Qualifications

BS, MS or higher degree in Computer Science, Robotics, Engineering or a related field, or equivalent practical experience.
Minimum of 3 years of industry experience.
Proficiency in low-level systems languages (C, C++, Rust, Go), Python, and at least one deep learning library such as PyTorch, TensorFlow, or JAX.
Deep understanding and practical experience with low-level systems concepts (multithreading, networking, embedded systems, memory management).
Experience with CUDA.
Deep understanding of state-of-the-art machine learning techniques and models.
Experience optimizing various machine learning architectures.
Experience with machine learning compilers.
Experience optimizing model inference for robotic systems deployment.

Base Salary Range

$100,000 - $300,000 USD

Apply for this job


Which office(s) are you willing to work from?

Pittsburgh, PA

For undergraduate applicants, transcripts must be included in your application.

Why do you want to work at Skild AI?

Tell us about two to three projects or accomplishments you’re most proud of, whether it’s building something impactful, contributing to open-source work, launching a company, winning an award, or anything else that showcases your skills.


Vacancy posted 17 hours ago

