Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Backend Engineer, ML Inference Systems

$192.6k - $305.6k

Unity Technologies

Mountain View, CA, USA

Staff Backend Engineer, ML Inference Systems

Location

Mountain View, CA, USA

Department

AI & Machine Learning

Requisition ID

JOBREQ-2615964

Role description

The opportunity

Every day, we connect billions of players with the games and experiences they love.

Our Vector ( Gamer AI team sits at the heart of that mission, governing ad ranking and bidding decisions across billions of daily impressions, where large-scale machine learning and real-world impact converge at scale.

We're hiring a Staff Backend Engineer to build and operate the infrastructure those models depend on. You'll design and operate the distributed systems that power billions of daily decisions, with a focus on the performance, reliability, and scalability of inference systems.

Join us and help influence how billions of gaming experiences are discovered, monetized, and how creators are rewarded.

What you'll be doing

  • Design, develop, and deploy production-grade backend services and distributed systems powering large-scale online model inference at billions of daily requests

  • Drive technical direction of our inference platform, with a focus on low-latency, high-throughput serving infrastructure

  • Partner with ML engineers to ensure online serving infrastructure scales with growing model complexity and inference volumes, without compromising latency or throughput

  • Ensure the reliability, scalability, and efficiency of our systems in production using monitoring and observability tools like Prometheus and Grafana

  • Manage and optimize cloud infrastructure on GCP, orchestrating workloads with Kubernetes across a high-scale production environment

  • Promote and implement best practices for backend service development, testing, deployment, and monitoring (DevOps, SRE)

What we're looking for

  • 5+ years designing, deploying, and maintaining distributed systems at scale

  • Expertise in Golang for building high-performance, low-latency backend infrastructure

  • Hands-on experience with cloud infrastructure on GCP and workload orchestration with Kubernetes

  • Strong grounding in monitoring and observability tooling, including Prometheus and Grafana

  • Experience in ad tech, recommender systems, real-time personalization, or other performance-critical domains

  • Familiarity with microservice architectures, containerization (Docker), and CI/CD best practices

  • Familiarity with machine learning platforms, workflows, and serving infrastructure

You might also have

  • Experience with ML inference servers like NVIDIA Triton Inference Server

  • Familiarity with auction mechanics or bidding systems in an ad tech context

  • Experience embracing AI as a strategic advantage in engineering, following established best practices for code quality and security

Additional information

  • Relocation support is not available for this position

Benefits

At Unity, we want our team members to thrive. We offer a wide range of benefits designed to support well-being and work-life balance.

Please note: Benefits eligibility, specific offerings, and coverage vary based on the country and employment status.

While specific benefits vary, here are some of the ways we strive to take care of our eligible team members globally: Comprehensive health, life, and disability insurance | Commute subsidy | Employee stock ownership | Competitive retirement/pension plans | Generous vacation and personal days | Support for new parents through leave and family-care programs | Office food snacks | Mental Health and Wellbeing programs and support | Employee Resource Groups | Global Employee Assistance Program | Training and development programs | Volunteering and donation matching program

Life at Unity

Unity [NYSE: U] is the world’s leading game engine, powering play for more than 3 billion consumers each month. The top mobile games in the world, the most played PC indie titles, the most innovative console games, and virtually all of the top XR and Web Games are developed, deployed, and grown in Unity. Unity also enables teams across industries like automotive, manufacturing, and healthcare to design, simulate, and collaborate in 3D — closing the gap between ideas and reality. For more information, please visit

Unity is a proud equal opportunity employer. We are committed to fostering an inclusive, innovative environment and celebrate our employees across age, race, color, ancestry, national origin, religion, disability, sex, gender identity or expression, sexual orientation, or any other protected status in accordance with applicable law. Our differences are strengths that enable us to support the growing and evolving needs of our customers, partners, and collaborators. If you have a disability that means there are preparations or accommodations we can make to help ensure you have a comfortable and positive interview experience, please fill out this form ( to let us know.

This position requires the incumbent to have a sufficient knowledge of English to have professional verbal and written exchanges in this language since the performance of the duties related to this position requires frequent and regular communication with colleagues and partners located worldwide and whose common language is English.

Headhunters and recruitment agencies may not submit resumes/CVs through this website or directly to managers. Unity does not accept unsolicited headhunter and agency resumes. Unity will not pay fees to any third-party agency or company that does not have a signed agreement with Unity.

Your privacy is important to us. Please take a moment to review our Prospect Privacy Policy ( and Applicant Privacy Policy ( . Should you have any concerns about your privacy, please contact us at View email address on click.appcast.io.

#SEN #LI-AR1

*Note: Certain locations require a good faith disclosure of the base salary range for the role. The actual salary for the successful candidate may differ based on location, experience, and other job-related factors.

Gross pay salary

$192,600—$305,600 USD

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Staff Backend Engineer, ML Inference Systems in Mountain View, CA vacancy
  •  ...for the AI-first world. Why this role exists We need a Backend Engineer to build the systems that orchestrate GPU clusters for AI workloads. You'll create...  ...or HPC cluster management experience Understanding of ML,AI workload patterns and requirements Experience with... 
    Suggested
    Hourly pay
    Full time
    Work at office
    Work from home
    Visa sponsorship

    SproutsAI

    Palo Alto, CA
    3 days ago
  • Member of Technical Staff - Backend Engineer - Data Systems and APIs About Vinci We’re building a copilot for hardware. Software engineers have powerful...  ...of the platform You’ll work across the stack with ML engineers, physics researchers, and product engineers.... 
    Suggested

    Vinci4d

    Palo Alto, CA
    15 hours ago
  • Job type: Full Time · Department: Backend Engineer · Work type: On-Site About A...  ...scalable, and resilient distributed systems. You’ll work closely with researchers, ML engineers, and product teams to...  ..., low-latency AI model inference and data services. Partner with... 
    Suggested
    Full time

    Neara

    Palo Alto, CA
    3 days ago
  •  ...Cerebras Systems builds the world's largest AI chip...  ...-leading training and inference speeds and empowers machine...  ...run large-scale ML applications, without...  ...The Inference ML Engineering team at Cerebras Systems...  ...our scalable serving backend for handling many concurrent... 
    Suggested

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    15 hours ago
  • $185.5k - $270k

     ...About the Team: The ML Inference Platform is part of the AI...  ...Role: We are seeking a Staff ML Infrastructure engineer to help build and scale robust...  ...in designing distributed systems for ML, strong problem-solving...  ...implement core platform backend software components.... 
    Suggested
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    4 days ago
  • $180k

    A cutting-edge AI firm in California is seeking engineers focused on optimizing AI model inference. Candidates should have experience with Python, Rust, and system optimizations. The role involves building reliable serving systems and contributing to innovative AI technologies... 

    Pantera Capital

    Palo Alto, CA
    15 hours ago
  • A tech company in Mountain View seeks talented engineers for a role emphasizing high-performance systems, inference optimization, and model acceleration. You will thrive in ambiguity, tackle unclear problems, and design impactful solutions. The position offers a competitive... 

    Inworld

    Mountain View, CA
    3 days ago
  • $195k - $298k

     ...assistance. About the Team The ML Inference Platform is part of the AI...  ...the Role We are seeking a Staff ML Infrastructure engineer to help build and scale...  ...in designing distributed systems for ML, strong problem-...  ...and implement core platform backend software components.... 
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    15 hours ago
  •  ...converse with all of their business systems through natural language to...  ...with Moveworks' Reasoning Engine and natural language capabilities...  ...to help build cutting edge ML infrastructure for building and...  ...including distributed training and inference pipeline for large language... 
    Work at office
    Remote work
    Flexible hours

    ServiceNow

    Mountain View, CA
    1 day ago
  •  ...technology company focused on AI and simulation is seeking a Backend Engineer to build and maintain data systems and APIs. The ideal candidate will have experience in...  ...role offers the opportunity to work closely with ML engineers and researchers, with a focus on building... 

    Getvinci

    Palo Alto, CA
    15 hours ago
  • $180k

    Pantera Capital is seeking a candidate to develop backend infrastructure primarily in Rust. Responsibilities include building the xAI API for global developers and ensuring high-throughput inference systems. Ideal candidates will have expert knowledge in Rust or C++, experience... 

    Pantera Capital

    Palo Alto, CA
    15 hours ago
  • $180k - $240k

     ...Backend Engineer - Infrastructure Los Angeles, San Francisco, Palo Alto...  ...spearhead the development of core systems and infrastructure crucial to...  ...management and scheduling ML Infrastructure: Construct the...  ...' productivity, and inference systems to optimize the scalability... 
    Work experience placement

    HeyGen

    Palo Alto, CA
    2 days ago
  • $147.4k - $272.1k

    A leading technology company in Cupertino, California is looking for a Backend Engineer to develop backend systems and APIs for evaluation platforms. The role involves software architecture design, coding in Python or C++, and collaboration with cross-functional teams.... 

    Apple Inc.

    Cupertino, CA
    3 days ago
  • A leading automotive company is seeking a Staff ML Infrastructure Engineer to build robust compute platforms for machine learning workflows in Sunnyvale...  ...coding skills in Go, Python or C++, and expertise in ML inference. The position offers a hybrid work model and competitive... 

    General Motors

    Sunnyvale, CA
    4 days ago
  • $251k - $310k

     ...Staff Machine Learning Engineer, Prediction & Planning, System Architecture Waymo is an autonomous driving technology company with the mission to be the world's...  ...Tackle challenging real-world problems with ML and engineering solutions. Use state of the art... 
    Full time
    Contract work
    Internship
    Remote work

    Waymo

    Mountain View, CA
    3 days ago
  • $212k - $318.4k

     ...Staff Machine Learning Performance Engineer, Siri Runtime Systems And Interaction Apple is where individual imaginations gather...  ...and optimizing our model inference stack. In this highly collaborative...  ...accelerate and optimize LLMs and other ML models used by Siri. This... 
    Relocation

    Apple

    Cupertino, CA
    1 day ago
  • A technology firm in Palo Alto is seeking a Backend Engineer to develop and maintain data generation systems and product backend APIs. This role entails working closely with machine learning engineers and researchers, along with deploying services in cloud environments... 

    Vinci4d

    Palo Alto, CA
    1 day ago
  • $190k - $220k

     ...Staff Data Engineer We're ALSO, an electric mobility company originally...  ...intersection of data engineering and backend systems, designing pipelines that...  ...to downstream analytics, ML, and visualization systems....  ...supporting real-time inference pipelines Prior Staff or... 
    Local area
    Flexible hours

    ALSO

    Palo Alto, CA
    2 days ago
  • $155k - $207k

     ...Mountain View, CA, is seeking a Senior/Staff Machine Learning Engineer to lead the development of cutting-...  ...enhance large-scale machine learning systems for summarization and conversation intelligence...  ...extensive experience in deploying ML systems, holds a degree in a relevant... 

    Cacheflow

    Mountain View, CA
    15 hours ago
  •  ...infrastructure company in California seeks a Member of Technical Staff — Training to design and optimize large-scale distributed training systems for frontier AI models. Candidates should have 5+ years of experience in ML systems and be proficient in Python along with another... 

    RadixArk

    Palo Alto, CA
    4 days ago
  •  ...company in Palo Alto seeks a Senior Engineer to develop a cutting-edge inference platform supporting semantic...  ...over five years of experience in backend systems and proficiency in languages like...  ...or Python. You'll work alongside ML researchers to enhance infrastructure... 

    MongoDB

    Palo Alto, CA
    2 days ago
  •  ...Principal Cloud Backend Engineer San Jose, California, United States...  ...that powers our large-scale AI inference services, with a critical focus...  ...design and implementation of the systems that not only ensure...  ...strong plus. Experience in AI/ML Infrastructure: Direct experience... 
    Full time
    Temporary work
    Local area
    Flexible hours

    SambaNova Systems

    Palo Alto, CA
    2 days ago
  • $218.8k - $335.3k

     ...the team: The AV ML Infra team at GM...  ...productivity of ML engineers, and drive the...  ...AI Validation & Inference: Ensures robust model...  ..., these tools and systems empower GM to tackle...  ...: As a Staff AI/ML Full-Stack Engineer...  ...interfaces to backend services and cloud... 
    Local area
    Work from home
    Flexible hours

    General Motors

    Sunnyvale, CA
    2 days ago
  • $197k - $266.5k

    Intuit is seeking an experienced software engineer in Mountain View, California, to lead technology initiatives and drive AI integrations. The successful candidate will have over 7 years of experience in delivering enterprise-class applications and a strong proficiency... 

    ATX Venture Partners

    Mountain View, CA
    4 days ago
  •  ...research lab of top AI researchers and engineers, developing best-in-class...  ...year ago, reliably working agentic systems and sub-second multimodal inference at scale barely existed. Nobody has...  ...practical experience building backend or ML systems. Who Thrives Here... 
    Full time
    Work at office
    Relocation package

    Inworld AI

    Mountain View, CA
    2 days ago
  •  ...Staff Backend Engineer At Commure, our engineering team is at the forefront of revolutionizing healthcare technology by building and scaling the systems powering our suite of healthcare products. We are looking for a talented Staff Backend Engineer to help us craft... 
    Work at office

    Commure

    Mountain View, CA
    2 days ago
  •  ...Commure, we're building the AI Operating System for healthcare, the foundation that...  ...About the Role At Commure, our engineering team is at the forefront of revolutionizing...  ...products. We are looking for a talented Staff Backend Engineer to help us craft user-... 
    Work at office
    Immediate start

    Commure Athelas

    Mountain View, CA
    2 days ago
  • $225k - $300k

     ...Staff, Backend Engineer - Catalog Palo Alto, California, United States DataHub is an AI & Data Context Platform adopted by over 3,000 enterprises...  ...time-to-value from their data investments, ensure AI system reliability, and implement unified governance, enabling AI... 
    Work at office
    Remote work
    Worldwide
    Home office
    Flexible hours

    Acryl Data, Inc.

    Palo Alto, CA
    2 days ago
  • $224k - $284k

     ...Senior/Staff Backend Engineer Mountain View, CA About Us CloudKitchens helps restaurateurs around the world succeed in online food...  ...a Backend Engineer, you'll design, implement, and optimize systems that power mission-critical applications. Your role will adapt... 
    Full time
    Temporary work
    Work at office
    Flexible hours

    CloudKitchens

    Mountain View, CA
    2 days ago
  •  ...seeking a Member of Technical Staff — Inference to push the limits of large...  ...You will work on the core systems that serve frontier models...  ...intersection of systems engineering, ML infrastructure, and performance...  ..., or performance‑critical backend systems Strong expertise... 
    Worldwide
    Flexible hours

    RadixArk

    Palo Alto, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Backend Engineer, ML Inference Systems. Be the first to apply!