Software Engineer, AI Inference

$100k - $300k

Skild

At Skild AI, we are building the world's first general-purpose robotic intelligence that is robust and adapts to unseen scenarios without failing. We believe massive scale through data-driven machine learning is the key to unlocking these capabilities for the widespread deployment of robots within society. Our team consists of individuals with varied backgrounds and levels of experience, from new graduates to domain experts. Relevant industry experience is important, but ultimately less so than your demonstrated abilities and attitude. We are looking for passionate individuals who are eager to explore uncharted waters and contribute to our innovative projects.

Position Overview

We are looking for a Software Engineer to work at the forefront of deploying our cutting-edge AI models, enhancing the performance and capabilities of our embodied systems. You will be responsible for optimizing AI inference for models ranging from lightweight to billion-parameter scale, ensuring our robots operate with unmatched efficiency and intelligence in real-world environments. You will work at the intersection of systems and machine learning, directly contributing to making our AI models more powerful and adaptive by ensuring consistent performance under variable and sometimes unforeseen compute and hardware constraints.

Responsibilities
  • Develop and optimize runtime AI inference pipelines for real-world robotic deployment.
  • Build infrastructure, frameworks, and tooling that enable reliable integration of models into robotic systems and informative analysis of production models, guiding architecture choices and deployment system design.
  • Formulate specialized optimization solutions for various inference paradigms and scenarios (autoregressive models, denoising models, hierarchical models, state machines, multi-agent systems, cloud-based inference).
  • Adapt optimization solutions to various compute, hardware, and networking constraints.

Preferred Qualifications
  • BS, MS or higher degree in Computer Science, Robotics, Engineering or a related field, or equivalent practical experience.
  • Minimum of 3 years of industry experience.
  • Proficiency in low-level systems languages (C, C++, Rust, Go), Python, and at least one deep learning library such as PyTorch, TensorFlow, or JAX.
  • Deep understanding of and practical experience with low-level systems concepts (multithreading, networking, embedded systems, memory management).
  • Experience with CUDA.
  • Deep understanding of state-of-the-art machine learning techniques and models.
  • Experience optimizing various machine learning architectures.
  • Experience with machine learning compilers.
  • Experience optimizing model inference for robotic systems deployment.

Base Salary Range

$100,000 - $300,000 USD

For undergraduate applicants, transcripts must be included in your application.

Vacancy posted 17 hours ago