Backend Engineer (ML Infra) — Scale AI Training & Inference

Rockstar

A dynamic digital product studio is seeking a Backend Software Engineer (ML Infrastructure) to design and build core systems for training and deploying ML models. This early-career role involves collaborating with ML engineers and focuses on distributed training pipelines and cloud-native infrastructure. The ideal candidate has backend engineering experience, strong foundations in distributed systems, and is comfortable working in Python or Go. The position offers an opportunity to work on real ML infrastructure in a fast-paced environment in San Francisco.

Vacancy posted 5 days ago
Similar jobs that could be interesting for you
Based on the Backend Engineer (ML Infra) — Scale AI Training & Inference vacancy in San Francisco, CA
  • A leading AI technology firm in San Francisco is seeking an AI Infra Engineer to enhance their infrastructure. The successful candidate...  ...manage Slurm for distributed training. Important skills include...  ...aiming at advancements in AI and ML infrastructure.
    Training

    Perplexity

    San Francisco, CA
    5 days ago
  •  ...that is building the AI backbone for the...  ...but a full-stack backend for fine-tuning, reinforcement...  ...learning, inference, and long-term...  ...Backend Software Engineer (ML Infrastructure) to...  ...design, build, and scale the core systems that...  ...large-scale model training and deployment.... 
    Training

    Rockstar

    San Francisco, CA
    1 day ago
  • $192k - $260k

     ...world's best data and AI infrastructure...  ...deploy and manage AI/ML models - from traditional...  ...-time, low-latency inference, governance,...  ...operationalize models at scale with strong SLAs...  .... As a Staff Engineer, you'll play a critical...  ...certifications and training, and specific work... 
    Training
    Local area
    Worldwide

    Databricks

    San Francisco, CA
    5 days ago
  • Reducto, a fast-growing AI company in San Francisco, is hiring a Machine Learning Infra Engineer. This role involves building and maintaining the training and inference frameworks necessary for optimal performance. Ideal candidates should possess strong Python skills,... 
    Training

    Reducto

    San Francisco, CA
    3 days ago
  • $192k - $260k

     ...s best data and AI infrastructure platform...  ...AI model inference for open source...  ...role, no prior ML or AI experience...  ...’re looking for engineers who have owned high scale operational sensitive...  ...sensitive backend systems. A track...  ...certifications and training, and specific... 
    Training
    Local area
    Worldwide

    Menlo Ventures

    San Francisco, CA
    1 day ago
  •  ...hiring Software Engineers focused on AI Infrastructure to...  ...reliably at production scale. This role exists...  ...traditional backend engineering - including...  ..., large-scale inference systems,...  ...infrastructure supporting training and inference...  ...Familiarity with GPU-based ML workloads or... 
    Training
    Internship
    Immediate start

    SpreeAI

    San Francisco, CA
    4 days ago
  • $125k - $225k

    A leading AI infrastructure company is seeking a Senior Backend Engineer to build core backend services for a high-scale observability platform. The ideal candidate will have 5+ years of experience...  ...designing APIs specifically for ML workflows and handling real-time data... 
    Remote work

    Space Executive

    Berkeley, CA
    4 days ago
  • $150k - $300k

     ...models to the infra that enables...  ...anyone to create, train, and deploy...  ...at frontier scale, adapting...  ...serving, LLM inference optimization...  ...Experience Building ML Systems at...  .... Inference Backends: Hands‑on...  ...LLM Inference engine development and...  ...AI and RL at Prime... 
    Training
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours
    Shift work

    Prime Intellect

    San Francisco, CA
    4 days ago
  • $160k - $200k

     ...agentic models to the infra that enables anyone to create, train, and deploy them...  ...at frontier scale, adapting models...  ...sophisticated AI teams in the world...  ...jobs, scale inference workloads against...  ...with a customer's ML infrastructure...  ...customer, from ML engineers to engineering... 
    Training
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours
    Day shift

    Prime Intellect

    San Francisco, CA
    5 days ago
  • $250k

     ...International Consulting Ltd is looking for a talented ML/AI Research Engineer to join their San Francisco team. You will be...  ...and managing the infrastructure that powers training, deployment, and governance of large-scale AI systems. The ideal candidate has a strong background... 
    Training

    Alldus International Consulting Ltd

    San Francisco, CA
    2 days ago
  • $130k - $400k

     ...Backend Engineer, Marketplace Location: San Francisco...  ...-Stage / Series C AI Infrastructure...  ...representing a rapidly scaling AI infrastructure...  ...support model training, evaluation, and human...  ...with product, ML, and operations teams...  ...production model inference systems... 
    Training
    Work at office
    Relocation package

    Recruiting from Scratch

    San Francisco, CA
    5 days ago
  • $166k - $225k

     ...enabling data and AI teams to solve the...  ...business. Founded by engineers — and customer...  ...interfacing with data to scaling our services and...  ...AI agents, model training, model serving,...  ...Collaborate with platform, infra, and ML teams to deliver...  ...of experience in backend or infrastructure... 
    Training
    Remote job
    Local area
    Worldwide

    Databricks

    San Francisco, CA
    more than 2 months ago
  • $167.2k - $209k

     ...DigitalOcean is expanding its AI Infrastructure layer...  ...are seeking a Senior Engineer 2 to join our AI Inference Data Plane team. In...  ...and delivering high-scale, resilient data plane...  ...as code. AI/ML Domain Knowledge: Hands...  ...relevant conferences, training, and education. All employees... 
    Training
    Local area
    Remote work
    Worldwide
    Flexible hours

    DigitalOcean

    San Francisco, CA
    2 days ago
  • $200k - $280k

     ...of efficient inference (algorithms, architectures, engines) and post-training / RL systems....  ...at production scale. Our mandate...  ...computing for ML. Are comfortable...  ...with infra, research, and...  ...including kernel backends, speculative...  ...About Together AI Together... 
    Training
    Full time

    Together AI

    San Francisco, CA
    2 days ago
  • $146.5k

     ...search, recommendations, AI/ML systems, and the...  ...can be inconsistent, and scale amplifies every edge case...  ...partnership with Content and Infra-Security ~...  ...Content Security, ML Data Engineering, Search & Discovery,...  ...relevant education or training; and other business and... 
    Training
    Local area
    Home office
    Flexible hours

    Scribd

    San Francisco, CA
    3 days ago
  •  ...the web by building AI agents that can...  ...agent-first, from training our own models to...  ...Responsibilities: Scale infra for post-training...  ...infra for agentic inference (throughput and...  ...closely with product engineers to translate...  ...Experience with ML infrastructure (GPU... 
    Training
    Work at office
    Relocation
    Visa sponsorship

    Yutori

    San Francisco, CA
    21 days ago
  • $180k - $240k

     ...Backend Engineer - Infrastructure Los Angeles, San Francisco...  ...and challenging to scale. Our ambition is to...  ...management and scheduling ML Infrastructure: Construct the ML training infrastructure to enhance our AI researchers' productivity, and inference systems to optimize... 
    Training
    Work experience placement

    HeyGen

    San Francisco, CA
    1 day ago
  • AI Chopping Block, Inc. is looking for a core backend engineer to design and operate platform backend services for AI products in San Francisco. The ideal candidate...  ...handling system design and collaborating with product and ML research teams to enhance capabilities. This... 

    AI Chopping Block, Inc.

    San Francisco, CA
    4 days ago
  • $160k - $250k

     ...Senior Backend Engineer, Inference Platform San Francisco About the Role Together AI is building the Inference Platform that brings the...  ...video, and speech models at scale. If you get a thrill from...  ...responses. Collaborate with ML researchers to bring new model... 
    Full time
    Local area

    Together AI

    San Francisco, CA
    1 day ago
  •  ...intelligence, AI-driven optimization...  ...complexity at scale. We are backed...  ...- from ML model training through production...  ...Platform Engineering & Payments Integration...  ...and maintain inference services in Go...  ...Skills Backend / Platform...  ...systems Cloud & Infra - AWS... 
    Training
    Local area
    Shift work

    DEUNA

    San Francisco, CA
    2 days ago
  • About the Role ML Ops Engineer — Agentic AI Lab (Founding Team...  ...the model training, deployment, versioning...  ..., and inference rollout Manage hybrid...  ...custom inference backends (e.g. vLLM, TGI,...  ...engineering, or infra-focused ML roles...  ...(spot instance scaling, batch prioritization... 
    Training
    Full time

    Fabrion

    San Francisco, CA
    1 day ago
  •  ...SEE *ALL* OF OUR JOB OPENINGS! Senior Backend Engineer Seeking a Senior Backend Engineer...  ...complex distributed systems at enterprise scale. What You'll Do: Design and...  ...interface design Nice to Have AI/ML infrastructure experience or familiarity... 
    Casual work

    Three Pillars Recruiting

    San Francisco, CA
    4 days ago
  • David Joseph & Company is seeking a backend engineer in San Francisco, California. This high-ownership...  ...on building core infrastructure for an AI-native platform, emphasizing autonomy...  ...and cloud technologies, capable of scaling systems and operating independently.

    David Joseph & Company

    San Francisco, CA
    5 days ago
  •  ...Backend Engineering Role At Sesame Sesame believes in...  ...machine learning inference, scalable agentic...  ...cutting-edge applied AI. At the centre of...  ...of systems where ML models are a critical...  ...it off to the infra team to productionize...  ...tradeoffs, and scaling strategies independently... 
    Full time
    Contract work
    Flexible hours

    SESAME

    San Francisco, CA
    1 day ago
  • Spherecast is seeking a Senior Backend Engineer to design and implement backend systems that power Agnes, our AI Supply Chain Manager. This role requires hands-on experience...  ...efficiently. If you excel in managing large-scale data and systems engineering, this is your chance... 

    Spherecast

    San Francisco, CA
    3 days ago
  •  ...We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a...  ...opportunity to build and scale the infrastructure that powers...  ...Partner with ML research teams on model optimization...  ...Background in training infrastructure and RL workloads... 
    Training

    Perplexity

    San Francisco, CA
    1 day ago
  •  ...first company is seeking a Member of Technical Staff to focus on cutting-edge AI research and development. The role involves building and scaling training and inference infrastructure, designing ML kernels, and optimizing performance. Ideal candidates should have a passion... 
    Training

    Mirendil

    San Francisco, CA
    4 days ago
  • A cutting-edge AI research firm in San Francisco is seeking talent to build and optimize GPU infrastructure for large-scale model inference and training workloads. The ideal candidate will have hands-on experience with GPU systems and optimization techniques, actively contributing... 
    Training

    Reflection

    San Francisco, CA
    2 days ago
  • Dormont Manufacturing Co is seeking a Senior/Staff Backend Engineer to design and build large-scale systems for their AI platform. In this role, you will ensure reliability and performance, integrating complex workflows for real-time data processing. The ideal candidate... 

    Dormont Manufacturing Co

    San Francisco, CA
    2 days ago
  • A cutting-edge AI technology company based in San Francisco is seeking a specialist to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate... 
    Training

    Reflection AI

    San Francisco, CA
    4 days ago
