Staff Backend Engineer, ML Inference Systems
$192.6k - $305.6kUnity Technologies
Mountain View, CA, USA
Staff Backend Engineer, ML Inference Systems
Location
Mountain View, CA, USA
Department
AI & Machine Learning
Requisition ID
JOBREQ-2615964
Role description
The opportunity
Every day, we connect billions of players with the games and experiences they love.
Our Vector ( Gamer AI team sits at the heart of that mission, governing ad ranking and bidding decisions across billions of daily impressions, where large-scale machine learning and real-world impact converge at scale.
We're hiring a Staff Backend Engineer to build and operate the infrastructure those models depend on. You'll design and operate the distributed systems that power billions of daily decisions, with a focus on the performance, reliability, and scalability of inference systems.
Join us and help influence how billions of gaming experiences are discovered, monetized, and how creators are rewarded.
What you'll be doing
Design, develop, and deploy production-grade backend services and distributed systems powering large-scale online model inference at billions of daily requests
Drive technical direction of our inference platform, with a focus on low-latency, high-throughput serving infrastructure
Partner with ML engineers to ensure online serving infrastructure scales with growing model complexity and inference volumes, without compromising latency or throughput
Ensure the reliability, scalability, and efficiency of our systems in production using monitoring and observability tools like Prometheus and Grafana
Manage and optimize cloud infrastructure on GCP, orchestrating workloads with Kubernetes across a high-scale production environment
Promote and implement best practices for backend service development, testing, deployment, and monitoring (DevOps, SRE)
What we're looking for
5+ years designing, deploying, and maintaining distributed systems at scale
Expertise in Golang for building high-performance, low-latency backend infrastructure
Hands-on experience with cloud infrastructure on GCP and workload orchestration with Kubernetes
Strong grounding in monitoring and observability tooling, including Prometheus and Grafana
Experience in ad tech, recommender systems, real-time personalization, or other performance-critical domains
Familiarity with microservice architectures, containerization (Docker), and CI/CD best practices
Familiarity with machine learning platforms, workflows, and serving infrastructure
You might also have
Experience with ML inference servers like NVIDIA Triton Inference Server
Familiarity with auction mechanics or bidding systems in an ad tech context
Experience embracing AI as a strategic advantage in engineering, following established best practices for code quality and security
Additional information
- Relocation support is not available for this position
Benefits
At Unity, we want our team members to thrive. We offer a wide range of benefits designed to support well-being and work-life balance.
Please note: Benefits eligibility, specific offerings, and coverage vary based on the country and employment status.
While specific benefits vary, here are some of the ways we strive to take care of our eligible team members globally: Comprehensive health, life, and disability insurance | Commute subsidy | Employee stock ownership | Competitive retirement/pension plans | Generous vacation and personal days | Support for new parents through leave and family-care programs | Office food snacks | Mental Health and Wellbeing programs and support | Employee Resource Groups | Global Employee Assistance Program | Training and development programs | Volunteering and donation matching program
Life at Unity
Unity [NYSE: U] is the world’s leading game engine, powering play for more than 3 billion consumers each month. The top mobile games in the world, the most played PC indie titles, the most innovative console games, and virtually all of the top XR and Web Games are developed, deployed, and grown in Unity. Unity also enables teams across industries like automotive, manufacturing, and healthcare to design, simulate, and collaborate in 3D — closing the gap between ideas and reality. For more information, please visit
Unity is a proud equal opportunity employer. We are committed to fostering an inclusive, innovative environment and celebrate our employees across age, race, color, ancestry, national origin, religion, disability, sex, gender identity or expression, sexual orientation, or any other protected status in accordance with applicable law. Our differences are strengths that enable us to support the growing and evolving needs of our customers, partners, and collaborators. If you have a disability that means there are preparations or accommodations we can make to help ensure you have a comfortable and positive interview experience, please fill out this form ( to let us know.
This position requires the incumbent to have a sufficient knowledge of English to have professional verbal and written exchanges in this language since the performance of the duties related to this position requires frequent and regular communication with colleagues and partners located worldwide and whose common language is English.
Headhunters and recruitment agencies may not submit resumes/CVs through this website or directly to managers. Unity does not accept unsolicited headhunter and agency resumes. Unity will not pay fees to any third-party agency or company that does not have a signed agreement with Unity.
Your privacy is important to us. Please take a moment to review our Prospect Privacy Policy ( and Applicant Privacy Policy ( . Should you have any concerns about your privacy, please contact us at View email address on click.appcast.io.
#SEN #LI-AR1
*Note: Certain locations require a good faith disclosure of the base salary range for the role. The actual salary for the successful candidate may differ based on location, experience, and other job-related factors.
Gross pay salary
$192,600—$305,600 USD
- ...for the AI-first world. Why this role exists We need a Backend Engineer to build the systems that orchestrate GPU clusters for AI workloads. You'll create... ...or HPC cluster management experience Understanding of ML,AI workload patterns and requirements Experience with...SuggestedHourly payFull timeWork at officeWork from homeVisa sponsorship
- Member of Technical Staff - Backend Engineer - Data Systems and APIs About Vinci We’re building a copilot for hardware. Software engineers have powerful... ...of the platform You’ll work across the stack with ML engineers, physics researchers, and product engineers....Suggested
- Job type: Full Time · Department: Backend Engineer · Work type: On-Site About A... ...scalable, and resilient distributed systems. You’ll work closely with researchers, ML engineers, and product teams to... ..., low-latency AI model inference and data services. Partner with...SuggestedFull time
- ...Cerebras Systems builds the world's largest AI chip... ...-leading training and inference speeds and empowers machine... ...run large-scale ML applications, without... ...The Inference ML Engineering team at Cerebras Systems... ...our scalable serving backend for handling many concurrent...Suggested
$185.5k - $270k
...About the Team: The ML Inference Platform is part of the AI... ...Role: We are seeking a Staff ML Infrastructure engineer to help build and scale robust... ...in designing distributed systems for ML, strong problem-solving... ...implement core platform backend software components....SuggestedLocal areaWork from homeRelocation packageFlexible hours$180k
A cutting-edge AI firm in California is seeking engineers focused on optimizing AI model inference. Candidates should have experience with Python, Rust, and system optimizations. The role involves building reliable serving systems and contributing to innovative AI technologies...- A tech company in Mountain View seeks talented engineers for a role emphasizing high-performance systems, inference optimization, and model acceleration. You will thrive in ambiguity, tackle unclear problems, and design impactful solutions. The position offers a competitive...
$195k - $298k
...assistance. About the Team The ML Inference Platform is part of the AI... ...the Role We are seeking a Staff ML Infrastructure engineer to help build and scale... ...in designing distributed systems for ML, strong problem-... ...and implement core platform backend software components....Relocation packageFlexible hours- ...converse with all of their business systems through natural language to... ...with Moveworks' Reasoning Engine and natural language capabilities... ...to help build cutting edge ML infrastructure for building and... ...including distributed training and inference pipeline for large language...Work at officeRemote workFlexible hours
- ...technology company focused on AI and simulation is seeking a Backend Engineer to build and maintain data systems and APIs. The ideal candidate will have experience in... ...role offers the opportunity to work closely with ML engineers and researchers, with a focus on building...
$180k
Pantera Capital is seeking a candidate to develop backend infrastructure primarily in Rust. Responsibilities include building the xAI API for global developers and ensuring high-throughput inference systems. Ideal candidates will have expert knowledge in Rust or C++, experience...$180k - $240k
...Backend Engineer - Infrastructure Los Angeles, San Francisco, Palo Alto... ...spearhead the development of core systems and infrastructure crucial to... ...management and scheduling ML Infrastructure: Construct the... ...' productivity, and inference systems to optimize the scalability...Work experience placement$147.4k - $272.1k
A leading technology company in Cupertino, California is looking for a Backend Engineer to develop backend systems and APIs for evaluation platforms. The role involves software architecture design, coding in Python or C++, and collaboration with cross-functional teams....- A leading automotive company is seeking a Staff ML Infrastructure Engineer to build robust compute platforms for machine learning workflows in Sunnyvale... ...coding skills in Go, Python or C++, and expertise in ML inference. The position offers a hybrid work model and competitive...
$251k - $310k
...Staff Machine Learning Engineer, Prediction & Planning, System Architecture Waymo is an autonomous driving technology company with the mission to be the world's... ...Tackle challenging real-world problems with ML and engineering solutions. Use state of the art...Full timeContract workInternshipRemote work$212k - $318.4k
...Staff Machine Learning Performance Engineer, Siri Runtime Systems And Interaction Apple is where individual imaginations gather... ...and optimizing our model inference stack. In this highly collaborative... ...accelerate and optimize LLMs and other ML models used by Siri. This...Relocation- A technology firm in Palo Alto is seeking a Backend Engineer to develop and maintain data generation systems and product backend APIs. This role entails working closely with machine learning engineers and researchers, along with deploying services in cloud environments...
$190k - $220k
...Staff Data Engineer We're ALSO, an electric mobility company originally... ...intersection of data engineering and backend systems, designing pipelines that... ...to downstream analytics, ML, and visualization systems.... ...supporting real-time inference pipelines Prior Staff or...Local areaFlexible hours$155k - $207k
...Mountain View, CA, is seeking a Senior/Staff Machine Learning Engineer to lead the development of cutting-... ...enhance large-scale machine learning systems for summarization and conversation intelligence... ...extensive experience in deploying ML systems, holds a degree in a relevant...- ...infrastructure company in California seeks a Member of Technical Staff — Training to design and optimize large-scale distributed training systems for frontier AI models. Candidates should have 5+ years of experience in ML systems and be proficient in Python along with another...
- ...company in Palo Alto seeks a Senior Engineer to develop a cutting-edge inference platform supporting semantic... ...over five years of experience in backend systems and proficiency in languages like... ...or Python. You'll work alongside ML researchers to enhance infrastructure...
- ...Principal Cloud Backend Engineer San Jose, California, United States... ...that powers our large-scale AI inference services, with a critical focus... ...design and implementation of the systems that not only ensure... ...strong plus. Experience in AI/ML Infrastructure: Direct experience...Full timeTemporary workLocal areaFlexible hours
$218.8k - $335.3k
...the team: The AV ML Infra team at GM... ...productivity of ML engineers, and drive the... ...AI Validation & Inference: Ensures robust model... ..., these tools and systems empower GM to tackle... ...: As a Staff AI/ML Full-Stack Engineer... ...interfaces to backend services and cloud...Local areaWork from homeFlexible hours$197k - $266.5k
Intuit is seeking an experienced software engineer in Mountain View, California, to lead technology initiatives and drive AI integrations. The successful candidate will have over 7 years of experience in delivering enterprise-class applications and a strong proficiency...- ...research lab of top AI researchers and engineers, developing best-in-class... ...year ago, reliably working agentic systems and sub-second multimodal inference at scale barely existed. Nobody has... ...practical experience building backend or ML systems. Who Thrives Here...Full timeWork at officeRelocation package
- ...Staff Backend Engineer At Commure, our engineering team is at the forefront of revolutionizing healthcare technology by building and scaling the systems powering our suite of healthcare products. We are looking for a talented Staff Backend Engineer to help us craft...Work at office
- ...Commure, we're building the AI Operating System for healthcare, the foundation that... ...About the Role At Commure, our engineering team is at the forefront of revolutionizing... ...products. We are looking for a talented Staff Backend Engineer to help us craft user-...Work at officeImmediate start
$225k - $300k
...Staff, Backend Engineer - Catalog Palo Alto, California, United States DataHub is an AI & Data Context Platform adopted by over 3,000 enterprises... ...time-to-value from their data investments, ensure AI system reliability, and implement unified governance, enabling AI...Work at officeRemote workWorldwideHome officeFlexible hours$224k - $284k
...Senior/Staff Backend Engineer Mountain View, CA About Us CloudKitchens helps restaurateurs around the world succeed in online food... ...a Backend Engineer, you'll design, implement, and optimize systems that power mission-critical applications. Your role will adapt...Full timeTemporary workWork at officeFlexible hours- ...seeking a Member of Technical Staff — Inference to push the limits of large... ...You will work on the core systems that serve frontier models... ...intersection of systems engineering, ML infrastructure, and performance... ..., or performance‑critical backend systems Strong expertise...WorldwideFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Backend Engineer, ML Inference Systems. Be the first to apply!
- assistant engineer Mountain View, CA
- engineering aide Mountain View, CA
- staff engineer Mountain View, CA
- technology administrator Mountain View, CA
- senior staff systems engineer Mountain View, CA
- staff data engineer Mountain View, CA
- software engineer staff Mountain View, CA
- senior staff engineer Mountain View, CA
- back-end developer Mountain View, CA
- senior backend developer Mountain View, CA

