Staff Backend Engineer, ML Inference Systems

$192.6k - $305.6k

Unity Technologies

Mountain View, CA

Mountain View, CA, USA

Staff Backend Engineer, ML Inference Systems

Location

Mountain View, CA, USA

Department

AI & Machine Learning

Requisition ID

JOBREQ-2615964

Role description

The opportunity

Every day, we connect billions of players with the games and experiences they love.

Our Vector ( Gamer AI team sits at the heart of that mission, governing ad ranking and bidding decisions across billions of daily impressions, where large-scale machine learning and real-world impact converge at scale.

We're hiring a Staff Backend Engineer to build and operate the infrastructure those models depend on. You'll design and operate the distributed systems that power billions of daily decisions, with a focus on the performance, reliability, and scalability of inference systems.

Join us and help influence how billions of gaming experiences are discovered, monetized, and how creators are rewarded.

What you'll be doing

Design, develop, and deploy production-grade backend services and distributed systems powering large-scale online model inference at billions of daily requests
Drive technical direction of our inference platform, with a focus on low-latency, high-throughput serving infrastructure
Partner with ML engineers to ensure online serving infrastructure scales with growing model complexity and inference volumes, without compromising latency or throughput
Ensure the reliability, scalability, and efficiency of our systems in production using monitoring and observability tools like Prometheus and Grafana
Manage and optimize cloud infrastructure on GCP, orchestrating workloads with Kubernetes across a high-scale production environment
Promote and implement best practices for backend service development, testing, deployment, and monitoring (DevOps, SRE)

What we're looking for

5+ years designing, deploying, and maintaining distributed systems at scale
Expertise in Golang for building high-performance, low-latency backend infrastructure
Hands-on experience with cloud infrastructure on GCP and workload orchestration with Kubernetes
Strong grounding in monitoring and observability tooling, including Prometheus and Grafana
Experience in ad tech, recommender systems, real-time personalization, or other performance-critical domains
Familiarity with microservice architectures, containerization (Docker), and CI/CD best practices
Familiarity with machine learning platforms, workflows, and serving infrastructure

You might also have

Experience with ML inference servers like NVIDIA Triton Inference Server
Familiarity with auction mechanics or bidding systems in an ad tech context
Experience embracing AI as a strategic advantage in engineering, following established best practices for code quality and security

Additional information

Relocation support is not available for this position

Benefits

At Unity, we want our team members to thrive. We offer a wide range of benefits designed to support well-being and work-life balance.

Please note: Benefits eligibility, specific offerings, and coverage vary based on the country and employment status.

While specific benefits vary, here are some of the ways we strive to take care of our eligible team members globally: Comprehensive health, life, and disability insurance | Commute subsidy | Employee stock ownership | Competitive retirement/pension plans | Generous vacation and personal days | Support for new parents through leave and family-care programs | Office food snacks | Mental Health and Wellbeing programs and support | Employee Resource Groups | Global Employee Assistance Program | Training and development programs | Volunteering and donation matching program

Life at Unity

Unity [NYSE: U] is the world’s leading game engine, powering play for more than 3 billion consumers each month. The top mobile games in the world, the most played PC indie titles, the most innovative console games, and virtually all of the top XR and Web Games are developed, deployed, and grown in Unity. Unity also enables teams across industries like automotive, manufacturing, and healthcare to design, simulate, and collaborate in 3D — closing the gap between ideas and reality. For more information, please visit

Unity is a proud equal opportunity employer. We are committed to fostering an inclusive, innovative environment and celebrate our employees across age, race, color, ancestry, national origin, religion, disability, sex, gender identity or expression, sexual orientation, or any other protected status in accordance with applicable law. Our differences are strengths that enable us to support the growing and evolving needs of our customers, partners, and collaborators. If you have a disability that means there are preparations or accommodations we can make to help ensure you have a comfortable and positive interview experience, please fill out this form ( to let us know.

This position requires the incumbent to have a sufficient knowledge of English to have professional verbal and written exchanges in this language since the performance of the duties related to this position requires frequent and regular communication with colleagues and partners located worldwide and whose common language is English.

Headhunters and recruitment agencies may not submit resumes/CVs through this website or directly to managers. Unity does not accept unsolicited headhunter and agency resumes. Unity will not pay fees to any third-party agency or company that does not have a signed agreement with Unity.

Your privacy is important to us. Please take a moment to review our Prospect Privacy Policy ( and Applicant Privacy Policy ( . Should you have any concerns about your privacy, please contact us at View email address on click.appcast.io.

#SEN #LI-AR1

*Note: Certain locations require a good faith disclosure of the base salary range for the role. The actual salary for the successful candidate may differ based on location, experience, and other job-related factors.

Gross pay salary

$192,600—$305,600 USD

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Staff Backend Engineer, ML Inference Systems in Mountain View, CA vacancy

Senior, Staff Backend Engineer - Distributed System
...for the AI-first world. Why this role exists We need a Backend Engineer to build the systems that orchestrate GPU clusters for AI workloads. You'll create... ...or HPC cluster management experience Understanding of ML,AI workload patterns and requirements Experience with...
Suggested
Hourly pay
Full time
Work at office
Work from home
Visa sponsorship
SproutsAI
Palo Alto, CA
3 days ago
Member of Technical Staff - Backend Engineer - Data Systems and APIs
Member of Technical Staff - Backend Engineer - Data Systems and APIs About Vinci We’re building a copilot for hardware. Software engineers have powerful... ...of the platform You’ll work across the stack with ML engineers, physics researchers, and product engineers....
Suggested
Vinci4d
Palo Alto, CA
15 hours ago
Backend Engineer - Distributed Systems
Job type: Full Time · Department: Backend Engineer · Work type: On-Site About A... ...scalable, and resilient distributed systems. You’ll work closely with researchers, ML engineers, and product teams to... ..., low-latency AI model inference and data services. Partner with...
Suggested
Full time
Neara
Palo Alto, CA
3 days ago
Staff Inference ML Runtime Engineer
...Cerebras Systems builds the world's largest AI chip... ...-leading training and inference speeds and empowers machine... ...run large-scale ML applications, without... ...The Inference ML Engineering team at Cerebras Systems... ...our scalable serving backend for handling many concurrent...
Suggested
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
15 hours ago
Staff ML Engineer, Inference Platform
$185.5k - $270k
...About the Team: The ML Inference Platform is part of the AI... ...Role: We are seeking a Staff ML Infrastructure engineer to help build and scale robust... ...in designing distributed systems for ML, strong problem-solving... ...implement core platform backend software components....
Suggested
Local area
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
4 days ago
Staff Inference Systems Engineer
$180k
A cutting-edge AI firm in California is seeking engineers focused on optimizing AI model inference. Candidates should have experience with Python, Rust, and system optimizations. The role involves building reliable serving systems and contributing to innovative AI technologies...
Pantera Capital
Palo Alto, CA
15 hours ago
Staff ML Engineer — Ultra-Low-Latency Inference
A tech company in Mountain View seeks talented engineers for a role emphasizing high-performance systems, inference optimization, and model acceleration. You will thrive in ambiguity, tackle unclear problems, and design impactful solutions. The position offers a competitive...
Inworld
Mountain View, CA
3 days ago
Staff ML Engineer, Inference Platform
$195k - $298k
...assistance. About the Team The ML Inference Platform is part of the AI... ...the Role We are seeking a Staff ML Infrastructure engineer to help build and scale... ...in designing distributed systems for ML, strong problem-... ...and implement core platform backend software components....
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
15 hours ago
Senior Staff Machine Learning Engineer, Agentic Systems - Moveworks
...converse with all of their business systems through natural language to... ...with Moveworks' Reasoning Engine and natural language capabilities... ...to help build cutting edge ML infrastructure for building and... ...including distributed training and inference pipeline for large language...
Work at office
Remote work
Flexible hours
ServiceNow
Mountain View, CA
1 day ago
Backend Engineer - Data Systems & APIs for AI Platform
...technology company focused on AI and simulation is seeking a Backend Engineer to build and maintain data systems and APIs. The ideal candidate will have experience in... ...role offers the opportunity to work closely with ML engineers and researchers, with a focus on building...
Getvinci
Palo Alto, CA
15 hours ago
Backend API Engineer — Scalable AI Inference (Rust/C++)
$180k
Pantera Capital is seeking a candidate to develop backend infrastructure primarily in Rust. Responsibilities include building the xAI API for global developers and ensuring high-throughput inference systems. Ideal candidates will have expert knowledge in Rust or C++, experience...
Pantera Capital
Palo Alto, CA
15 hours ago
Backend Engineer - Infrastructure
$180k - $240k
...Backend Engineer - Infrastructure Los Angeles, San Francisco, Palo Alto... ...spearhead the development of core systems and infrastructure crucial to... ...management and scheduling ML Infrastructure: Construct the... ...' productivity, and inference systems to optimize the scalability...
Work experience placement
HeyGen
Palo Alto, CA
2 days ago
Backend Engineer for ML Evaluation & Systems
$147.4k - $272.1k
A leading technology company in Cupertino, California is looking for a Backend Engineer to develop backend systems and APIs for evaluation platforms. The role involves software architecture design, coding in Python or C++, and collaboration with cross-functional teams....
Apple Inc.
Cupertino, CA
3 days ago
Staff ML Infra Engineer: Scalable Inference Platform (Hybrid)
A leading automotive company is seeking a Staff ML Infrastructure Engineer to build robust compute platforms for machine learning workflows in Sunnyvale... ...coding skills in Go, Python or C++, and expertise in ML inference. The position offers a hybrid work model and competitive...
General Motors
Sunnyvale, CA
4 days ago
Staff Machine Learning Engineer, Prediction & Planning, System Architecture
$251k - $310k
...Staff Machine Learning Engineer, Prediction & Planning, System Architecture Waymo is an autonomous driving technology company with the mission to be the world's... ...Tackle challenging real-world problems with ML and engineering solutions. Use state of the art...
Full time
Contract work
Internship
Remote work
Waymo
Mountain View, CA
3 days ago
Staff Machine Learning Performance Engineer, Siri Runtime Systems and Interaction
$212k - $318.4k
...Staff Machine Learning Performance Engineer, Siri Runtime Systems And Interaction Apple is where individual imaginations gather... ...and optimizing our model inference stack. In this highly collaborative... ...accelerate and optimize LLMs and other ML models used by Siri. This...
Relocation
Apple
Cupertino, CA
1 day ago
Backend Engineer - Data Systems & APIs for AI Platform
A technology firm in Palo Alto is seeking a Backend Engineer to develop and maintain data generation systems and product backend APIs. This role entails working closely with machine learning engineers and researchers, along with deploying services in cloud environments...
Vinci4d
Palo Alto, CA
1 day ago
Staff Data Engineer - Vehicle Telemetry and Data Infrastructure
$190k - $220k
...Staff Data Engineer We're ALSO, an electric mobility company originally... ...intersection of data engineering and backend systems, designing pipelines that... ...to downstream analytics, ML, and visualization systems.... ...supporting real-time inference pipelines Prior Staff or...
Local area
Flexible hours
ALSO
Palo Alto, CA
2 days ago
Senior/Staff ML Engineer - Speech & NLP Systems
$155k - $207k
...Mountain View, CA, is seeking a Senior/Staff Machine Learning Engineer to lead the development of cutting-... ...enhance large-scale machine learning systems for summarization and conversation intelligence... ...extensive experience in deploying ML systems, holds a degree in a relevant...
Cacheflow
Mountain View, CA
15 hours ago
Staff ML Systems Engineer — Distributed Training at Scale
...infrastructure company in California seeks a Member of Technical Staff — Training to design and optimize large-scale distributed training systems for frontier AI models. Candidates should have 5+ years of experience in ML systems and be proficient in Python along with another...
RadixArk
Palo Alto, CA
4 days ago
Senior Inference Platform Engineer — Low-Latency, Multi-Tenant
...company in Palo Alto seeks a Senior Engineer to develop a cutting-edge inference platform supporting semantic... ...over five years of experience in backend systems and proficiency in languages like... ...or Python. You'll work alongside ML researchers to enhance infrastructure...
MongoDB
Palo Alto, CA
2 days ago
Principal Cloud Backend Engineer
...Principal Cloud Backend Engineer San Jose, California, United States... ...that powers our large-scale AI inference services, with a critical focus... ...design and implementation of the systems that not only ensure... ...strong plus. Experience in AI/ML Infrastructure: Direct experience...
Full time
Temporary work
Local area
Flexible hours
SambaNova Systems
Palo Alto, CA
2 days ago
Staff AI/ML Fullstack Engineer - AV ML Infra
$218.8k - $335.3k
...the team: The AV ML Infra team at GM... ...productivity of ML engineers, and drive the... ...AI Validation & Inference: Ensures robust model... ..., these tools and systems empower GM to tackle... ...: As a Staff AI/ML Full-Stack Engineer... ...interfaces to backend services and cloud...
Local area
Work from home
Flexible hours
General Motors
Sunnyvale, CA
2 days ago
Staff Backend Engineer - AI-Driven Cloud Systems
$197k - $266.5k
Intuit is seeking an experienced software engineer in Mountain View, California, to lead technology initiatives and drive AI integrations. The successful candidate will have over 7 years of experience in delivering enterprise-class applications and a strong proficiency...
ATX Venture Partners
Mountain View, CA
4 days ago
Staff / Principal Machine Learning Engineer, Serving - USA
...research lab of top AI researchers and engineers, developing best-in-class... ...year ago, reliably working agentic systems and sub-second multimodal inference at scale barely existed. Nobody has... ...practical experience building backend or ML systems. Who Thrives Here...
Full time
Work at office
Relocation package
Inworld AI
Mountain View, CA
2 days ago
Staff Backend Engineer
...Staff Backend Engineer At Commure, our engineering team is at the forefront of revolutionizing healthcare technology by building and scaling the systems powering our suite of healthcare products. We are looking for a talented Staff Backend Engineer to help us craft...
Work at office
Commure
Mountain View, CA
2 days ago
Staff Backend Engineer
...Commure, we're building the AI Operating System for healthcare, the foundation that... ...About the Role At Commure, our engineering team is at the forefront of revolutionizing... ...products. We are looking for a talented Staff Backend Engineer to help us craft user-...
Work at office
Immediate start
Commure Athelas
Mountain View, CA
2 days ago
Staff, Backend Engineer - Catalog
$225k - $300k
...Staff, Backend Engineer - Catalog Palo Alto, California, United States DataHub is an AI & Data Context Platform adopted by over 3,000 enterprises... ...time-to-value from their data investments, ensure AI system reliability, and implement unified governance, enabling AI...
Work at office
Remote work
Worldwide
Home office
Flexible hours
Acryl Data, Inc.
Palo Alto, CA
2 days ago
Senior/Staff Backend Engineer, CloudKitchens - Mountain View, CA
$224k - $284k
...Senior/Staff Backend Engineer Mountain View, CA About Us CloudKitchens helps restaurateurs around the world succeed in online food... ...a Backend Engineer, you'll design, implement, and optimize systems that power mission-critical applications. Your role will adapt...
Full time
Temporary work
Work at office
Flexible hours
CloudKitchens
Mountain View, CA
2 days ago
Member of Technical Staff - Inference
...seeking a Member of Technical Staff — Inference to push the limits of large... ...You will work on the core systems that serve frontier models... ...intersection of systems engineering, ML infrastructure, and performance... ..., or performance‑critical backend systems Strong expertise...
Worldwide
Flexible hours
RadixArk
Palo Alto, CA
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Backend Engineer, ML Inference Systems. Be the first to apply!