Backend Engineer (ML Infra) — Scale AI Training & Inference

Rockstar

A dynamic digital product studio is seeking a Backend Software Engineer (ML Infrastructure) to design and build core systems for training and deploying ML models. This early-career role involves collaborating with ML engineers and focuses on distributed training pipelines and cloud-native infrastructure. The ideal candidate has backend engineering experience, strong foundations in distributed systems, and is comfortable working in Python or Go. The position offers an exciting opportunity to work on real ML infrastructure in a fast-paced environment in San Francisco. #J-18808-Ljbffr Rockstar

Apply

Vacancy posted 5 days ago

Similar jobs that could be interesting for youBased on the Backend Engineer (ML Infra) — Scale AI Training & Inference in San Francisco, CA vacancy

AI Infra Engineer: Scale ML Training & Inference
A leading AI technology firm in San Francisco is seeking an AI Infra Engineer to enhance their infrastructure. The successful candidate... ...manage Slurm for distributed training. Important skills include... ...aiming at advancements in AI and ML infrastructure. #J-18808-Ljbffr...
Training
Perplexity
San Francisco, CA
5 days ago
Backend Software Engineer (ML Infra)
...that is building the AI backbone for the... ...but a full-stack backend for fine-tuning, reinforcement... ...learning, inference, and long-term... ...Backend Software Engineer (ML Infrastructure) to... ...design, build, and scale the core systems that... ...large-scale model training and deployment....
Training
Rockstar
San Francisco, CA
1 day ago
Staff Backend Software Engineer- (AI Platform)
$192k - $260k
...world's best data and AI infrastructure... ...deploy and manage AI/ML models - from traditional... ...-time, low-latency inference, governance,... ...operationalize models at scale with strong SLAs... .... As a Staff Engineer, you'll play a critical... ...certifications and training, and specific work...
Training
Local area
Worldwide
Databricks
San Francisco, CA
5 days ago
ML Infra Engineer: Scale GPU Training & Inference
Reducto, a fast-growing AI company in San Francisco, is hiring a Machine Learning Infra Engineer. This role involves building and maintaining the training and inference frameworks necessary for optimal performance. Ideal candidates should possess strong Python skills,...
Training
Reducto
San Francisco, CA
3 days ago
Staff Backend Software Engineer- (AI Platform)
$192k - $260k
...s best data and AI infrastructure platform... ...AI model inference for open source... ...role, no prior ML or AI experience... ...’re looking for engineers who have owned high scale operational sensitive... ...sensitive backend systems. A track... ...certifications and training, and specific...
Training
Local area
Worldwide
Menlo Ventures
San Francisco, CA
1 day ago
Software Engineer Intern (AI Infrastructure / Training / Inference)
...hiring Software Engineers focused on AI Infrastructure to... ...reliably at production scale. This role exists... ...traditional backend engineering - including... ..., large-scale inference systems,... ...infrastructure supporting training and inference... ...Familiarity with GPU-based ML workloads or...
Training
Internship
Immediate start
SpreeAI
San Francisco, CA
4 days ago
Senior Backend Engineer - AI/ML Infra (Remote)
$125k - $225k
A leading AI infrastructure company is seeking a Senior Backend Engineer to build core backend services for a high-scale observability platform. The ideal candidate will have 5+ years of experience... ...designing APIs specifically for ML workflows and handling real-time data...
Remote work
Space Executive
Berkeley, CA
4 days ago
Member of Technical Staff - Inference
$150k - $300k
...models to the infra that enables... ...anyone to create, train, and deploy... ...at frontier scale, adapting... ...serving, LLM inference optimization... ...Experience Building ML Systems at... .... Inference Backends: Hands‑on... ...LLM Inference engine development and... ...AI and RL at Prime...
Training
Work at office
Remote work
Visa sponsorship
Relocation package
Flexible hours
Shift work
Prime Intellect
San Francisco, CA
4 days ago
Technical Account Manager - AI Infrastructure
$160k - $200k
...agentic models to the infra that enables anyone to create, train, and deploy them... ...at frontier scale, adapting models... ...sophisticated AI teams in the world... ...jobs, scale inference workloads against... ...with a customer's ML infrastructure... ...customer, from ML engineers to engineering...
Training
Remote work
Visa sponsorship
Relocation package
Flexible hours
Day shift
Prime Intellect
San Francisco, CA
5 days ago
Founding MLOps Engineer — Scale LLMs & Secure AI Infra
$250k
...International Consulting Ltd is looking for a talented ML/AI Research Engineer to join their San Francisco team. You will be... ...and managing the infrastructure that powers training, deployment, and governance of large-scale AI systems. The ideal candidate has a strong background...
Training
Alldus International Consulting Ltd
San Francisco, CA
2 days ago
Backend Engineer, Marketplace
$130k - $400k
...Backend Engineer, Marketplace Location: San Francisco... ...-Stage / Series C AI Infrastructure... ...representing a rapidly scaling AI infrastructure... ...support model training, evaluation, and human... ...with product, ML, and operations teams... ...production model inference systems...
Training
Work at office
Relocation package
Recruiting from Scratch
San Francisco, CA
5 days ago
Senior Backend Software Engineer- (AI Platform)
$166k - $225k
...enabling data and AI teams to solve the... ...business. Founded by engineers — and customer... ...interfacing with data to scaling our services and... ...AI agents, model training, model serving,... ...Collaborate with platform, infra, and ML teams to deliver... ...of experience in backend or infrastructure...
Training
Remote job
Local area
Worldwide
Databricks
San Francisco, CA
more than 2 months ago
Senior Engineer 2: AI Inference Engine Systems
$167.2k - $209k
...DigitalOcean is expanding its AI Infrastructure layer... ...are seeking a Senior Engineer 2 to join our AI Inference Data Plane team. In... ...and delivering high-scale, resilient data plane... ...as code.AI/ML Domain Knowledge: Hands... ...relevant conferences, training, and education. All employees...
Training
Local area
Remote work
Worldwide
Flexible hours
DigitalOcean
San Francisco, CA
2 days ago
AI Researcher, Core ML (Turbo)
$200k - $280k
...of efficient inference (algorithms, architectures, engines) and post-training / RL systems.... ...at production scale. Our mandate... ...computing for ML. Are comfortable... ...with infra, research, and... ...including kernel backends, speculative... ...About Together AI Together...
Training
Full time
Together AI
San Francisco, CA
2 days ago
Senior Backend Engineer, Content Foundations
$146.5k
...search, recommendations, AI/ML systems, and the... ...can be inconsistent, and scale amplifies every edge case... ...partnership with Content and Infra-Security ~... ...Content Security, ML Data Engineering, Search & Discovery,... ...relevant education or training; and other business and...
Training
Local area
Home office
Flexible hours
Scribd
San Francisco, CA
3 days ago
AI Engineer LLM Infra
...the web by building AI agents that can... ...agent-first, from training our own models to... ...Responsibilities: Scale infra for post-training... ...infra for agentic inference (throughput and... ...closely with product engineers to translate... ...Experience with ML infrastructure (GPU...
Training
Work at office
Relocation
Visa sponsorship
Yutori
San Francisco, CA
21 days ago
Backend Engineer - Infrastructure
$180k - $240k
...Backend Engineer - Infrastructure Los Angeles, San Francisco... ...and challenging to scale. Our ambition is to... ...management and scheduling ML Infrastructure: Construct the ML training infrastructure to enhance our AI researchers' productivity, and inference systems to optimize...
Training
Work experience placement
HeyGen
San Francisco, CA
1 day ago
AI Platform Backend Engineer — Scale AI for real‑world impact
AI Chopping Block, Inc. is looking for a core backend engineer to design and operate platform backend services for AI products in San Francisco. The ideal candidate... ...handling system design and collaborating with product and ML research teams to enhance capabilities. This...
AI Chopping Block, Inc.
San Francisco, CA
4 days ago
Senior Backend Engineer, Inference Platform
$160k - $250k
...Senior Backend Engineer, Inference Platform San Francisco About the Role Together AI is building the Inference Platform that brings the... ...video, and speech models at scale. If you get a thrill from... ...responses. Collaborate with ML researchers to bring new model...
Full time
Local area
Together AI
San Francisco, CA
1 day ago
AI Platform Tech Lead
...intelligence, AI-driven optimization... ...complexity at scale. We are backed... ...- from ML model training through production... ...Platform Engineering & Payments Integration... ...and maintain inference services in Go... ...Skills Backend / Platform... ...systems Cloud & Infra - AWS...
Training
Local area
Shift work
DEUNA
San Francisco, CA
2 days ago
ML Ops Engineer — Agentic AI Lab (Founding Team)
About the Role ML Ops Engineer — Agentic AI Lab (Founding Team... ...the model training, deployment, versioning... ..., and inference rollout Manage hybrid... ...custom inference backends (e.g. vLLM, TGI,... ...engineering, or infra-focused ML roles... ...(spot instance scaling, batch prioritization...
Training
Full time
Fabrion
San Francisco, CA
1 day ago
Senior Backend Engineer - Casual AI -MarTech/AdTech
...SEE *ALL* OF OUR JOB OPENINGS! Senior Backend Engineer Seeking a Senior Backend Engineer... ...complex distributed systems at enterprise scale. What You'll Do: Design and... ...interface design Nice to Have AI/ML infrastructure experience or familiarity...
Casual work
Three Pillars Recruiting
San Francisco, CA
4 days ago
Backend Engineer - AI-Native Infra, Full Ownership
David Joseph & Company is seeking a backend engineer in San Francisco, California. This high-ownership... ...on building core infrastructure for an AI-native platform, emphasizing autonomy... ...and cloud technologies, capable of scaling systems and operating independently. #J...
David Joseph & Company
San Francisco, CA
5 days ago
Backend Engineering
...Backend Engineering Role At Sesame Sesame believes in... ...machine learning inference, scalable agentic... ...cutting-edge applied AI. At the centre of... ...of systems where ML models are a critical... ...it off to the infra team to productionize... ...tradeoffs, and scaling strategies independently...
Full time
Contract work
Flexible hours
SESAME
San Francisco, CA
1 day ago
Senior Backend Engineer — Scale Global AI Supply Chain
Spherecast is seeking a Senior Backend Engineer to design and implement backend systems that power Agnes, our AI Supply Chain Manager. This role requires hands-on experience... ...efficiently. If you excel in managing large-scale data and systems engineering, this is your chance...
Spherecast
San Francisco, CA
3 days ago
Engineering Manager (AI Inference)
...We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a... ...opportunity to build and scale the infrastructure that powers... ...Partner with ML research teams on model optimization... ...Background in training infrastructure and RL workloads...
Training
Perplexity
San Francisco, CA
1 day ago
Staff ML Systems Engineer — Frontier AI Infra
...first company is seeking a Member of Technical Staff to focus on cutting-edge AI research and development. The role involves building and scaling training and inference infrastructure, designing ML kernels, and optimizing performance. Ideal candidates should have a passion...
Training
Mirendil
San Francisco, CA
4 days ago
Staff Engineer, Mid-Training Infra for Large-Scale AI
A cutting-edge AI research firm in San Francisco is seeking talent to build and optimize GPU infrastructure for large-scale model inference and training workloads. The ideal candidate will have hands-on experience with GPU systems and optimization techniques, actively contributing...
Training
Reflection
San Francisco, CA
2 days ago
Senior Backend Engineer, AI Agents for Enterprise Scale
Dormont Manufacturing Co is seeking a Senior/Staff Backend Engineer to design and build large-scale systems for their AI platform. In this role, you will ensure reliability and performance, integrating complex workflows for real-time data processing. The ideal candidate...
Dormont Manufacturing Co
San Francisco, CA
2 days ago
Senior GPU ML Infra Engineer — Mid-Training & Inference
A cutting-edge AI technology company based in San Francisco is seeking a specialist to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate...
Training
Reflection AI
San Francisco, CA
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Backend Engineer (ML Infra) — Scale AI Training & Inference. Be the first to apply!