Machine Learning Engineer, Runtime & Optimization

$204k - $259k
Full-time

Waymo

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo’s fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.

The ML Platform team at Waymo provides a set of tools to support and automate the lifecycle of the machine learning workflow, including feature and experiment management, model development, optimization and monitoring. These efforts have resulted in making machine learning more accessible to teams at Waymo, including Perception, Planner, Research and Simulation.

We are looking for engineers with ML software or ML systems expertise to help us improve compute performance on both cloud and car. You'll work across the entire ML stack from the system perspective, including efficient deep learning models, model compression, and ML software (e.g. JAX, XLA, Triton, and CUDA). You will be pleasantly challenged with deploying Waymo ML models on limited computation resources. In this hybrid role, you will report to the Senior Manager of Runtime and Optimization.

You Will

  • Lead the collaboration with the world-class Waymo ML scientists in perception, planner, research and simulation. Identify opportunities in both systems and models to make ML workloads faster.
  • Lead projects from proposals through execution by developing junior engineers.
  • Analyze and improve ML system workloads on both cloud and self-driving cars.
  • Apply model optimization, efficient deep learning techniques and ML software improvements to Waymo's ML systems.

You Have

  • M.S. in CS, EE, Deep Learning or a related field
  • 2+ years of experience as a technical lead, including writing project plans, engaging with customer teams, mentoring, owning goals and execution, and reporting status.
  • 5+ years of experience developing solutions in ML systems or the ML software stack (PyTorch/JAX/TF, runtime libraries, ML compilers).
  • Deep understanding of ML system architecture, performance analysis and tools.
  • Strong Python or C++ programming skills

We prefer you have one or more of the following:

  • PhD in CS, EE, Deep Learning or a related field.
  • Familiarity with the HW architecture of ML hardware accelerators (e.g., GPU/TPU).
  • Deep knowledge of model optimization or efficient deep learning techniques for foundation models or LLMs.
  • Experience with GPU HW or TPU HW and related system software.

The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. During the hiring process, your recruiter can share more about the specific salary range for the role location or, if the role can be performed remotely, for your preferred location.

Waymo employees are also eligible to participate in Waymo’s discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements.

Salary Range

$204,000—$259,000 USD

Vacancy posted 5 days ago