Machine Learning Engineer, Runtime & Optimization

$204k - $259k

Full-time

Waymo

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo’s fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.

The ML Platform team at Waymo provides a set of tools to support and automate the lifecycle of the machine learning workflow, including feature and experiment management, model development, optimization and monitoring. These efforts have resulted in making machine learning more accessible to teams at Waymo, including Perception, Planner, Research and Simulation.

We are looking for engineers with ML software or ML systems expertise to help us improve compute performance on both cloud and car. You'll work across the entire ML stack from the system perspective, from efficient deep learning models, model compression, ML software (e.g. JAX, XLA, Triton, and CUDA), to . You will be pleasantly challenged with deploying Waymo ML models on limited computation resources. In this hybrid role, you will report to the Senior Manager of Runtime and Optimization.

You Will

Lead the collaboration with the world-class Waymo ML scientists in perception, planner, research and simulation. Identify opportunities in both systems and models to make ML workloads faster.
Lead projects from proposals through execution by developing junior engineers.
Analyze and improve ML system workloads on both cloud and self-driving cars .
Apply model optimization, efficient deep learning techniques and ML software improvements to Waymo's ML systems.

You Have

M.S. in CS, EE, Deep Learning or a related field
2+ years of experience as a technical lead, including writing project plans, engaging with customer teams, mentoring, responsible for goals & execution, reporting status.
5+ years of experience developing solutions in ML systems or ML software stack (Pytorch/JAX/TF, runtime libraries, ML compiler).
Deep understanding of ML system architecture, performance analysis and tools.
Strong Python or C++ programming skills

We prefer you have one or more of the following:

PhD in CS, EE, Deep Learning or a related field.
Familiarity with the HW architecture of ML hardware accelerators (e.g., GPU/TPU).
Deep knowledge of model optimization or efficient deep learning techniques for foundation models or LLM.
Experience with GPU HW or TPU HW and related system software.

The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remote, the specific salary range for your preferred location, during the hiring process.

Waymo employees are also eligible to participate in Waymo’s discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements.

Salary Range

$204,000—$259,000 USD

Apply

Vacancy posted 5 days ago

Similar jobs that could be interesting for youBased on the Machine Learning Engineer, Runtime & Optimization in California vacancy

ML Runtime Optimization Engineer
$159.05k - $199.3k
...the role We are looking for a software engineer with deep experience in optimizing ML models and deploying them on production‑grade embedded runtime environments. You’ll work across the... ...Experience in working with deep learning frameworks (e.g., PyTorch, JAX, ONNX,...
Suggested
Full time
For contractors
For subcontractor
Decisive Point
Sunnyvale, CA
1 day ago
Machine Learning Engineer, ML Runtime & Optimization
$140k - $250k
...support and automate the lifecycle of the AI workflow, including model development, evaluation, optimization, deployment, and monitoring. As a Machine Learning Engineer in ML Runtime & Optimization, you will be developing technologies to accelerate the training and...
Suggested
Temporary work
pony.ai
Fremont, CA
more than 2 months ago
Principal Machine Learning Engineer, Mobile AI Inference Optimization
$278.1k - $347.6k
...games run inside a modern, browser-native runtime (built on technologies such as WebGPU... ...entirely within that runtime. As our Principal Engineer for On-Device AI Inference & Systems,... ...leaves research, through export, optimization, and kernel-level tuning, to a shipped feature...
Suggested
Work at office
Worldwide
Relocation package
Unity
Mountain View, CA
5 days ago
Inference Optimization Engineer (local / edge runtime)
$170.5k - $315.49k
## Inference Optimization Engineer (local / edge runtime)Applylocations: US, California, Santa Clara: US, Oregon,... ...models run directly on the user's machine (AI PC, edge, on-prem, and beyond),... ...engines where it helps us# What you’ll learn / grow into*Curiosity is required....
Suggested
Internship
Local area
Immediate start
Shift work
Intel
Santa Clara, CA
1 day ago
Machine Learning Engineer
$150k - $200k
...collision and force guards, smoothing, and runtime monitors for safety‑critical deployment... ...‑speed robot autonomy software stack optimized for inference performance. Advance SOTA... ...PhD or MS degree in Computer Science, Machine Learning, Robotics, or equivalent technical...
Suggested
Deft AI, Inc.
San Francisco, CA
2 days ago
Machine Learning Engineer
$170.5k - $315.49k
...run directly on the user's machine (AI PC, edge, on-prem, and beyond... ...We are seeking a Machine Learning Engineer / Data Scientist to join... ...intensive tasks. Debug and optimize training runs - Profile... ...model choices interact with runtime constraints on edge hardware...
Internship
Local area
Immediate start
Shift work
Intel
Folsom, CA
5 days ago
Machine Learning Compiler Engineer
$165.2k - $223.6k
...Description The Product: Amazon's Machine Learning accelerators are at the... ...includes an ML compiler, runtime and natively integrates into... ...including silicon engineering, hardware design and verification... ...Amazon Neuron team works to optimize the performance of complex neural...
Internship
Local area
Flexible hours
Amazon
Cupertino, CA
5 days ago
Senior/Machine Learning Engineer — Performance Optimization
$260k - $330k
...have: 3+ years of solid experience building machine learning, data science, ranking, prediction, recommendation, optimization, or large-scale data systems PubMatic is the... ...: We are looking for a Machine Learning Engineer to help build and improve performance optimization...
Full time
Work at office
Remote work
PubMatic
Redwood City, CA
4 days ago
Senior Machine Learning Engineer
$110k - $300k
...data centers. Our talented team of engineers and industry-leading executives... ...positions become available. Develop, optimize, and deploy lightweight machine learning models for edge AI applications,... ...Understanding of ML compiler and runtime design. Experience working with...
TETRAMEM INC
San Jose, CA
3 days ago
Senior Machine Learning Engineer
$208k - $263.5k
...Atoms is building the machines that power the next era... ..., operate them, learn from them, and improve... ...We are roboticists, engineers, operators, and builders... ...Research, Post-Training Optimization, and Data Engineering.... ...hardware-accelerated runtimes (e.g., TensorRT, CUDA)...
Full time
Internship
Work at office
Flexible hours
ATOMS Careers page
San Francisco, CA
1 day ago
Inference Optimization ML Engineer
...generalist robotics a reality. We're looking for an Inference Optimization MLE to help build and operate the systems that make our... ...regressions across model versions Collaborate closely with research engineers to translate model innovations into optimized, deployment‑...
Rhoda AI
Mountain View, CA
2 days ago
ML Infra Engineer for Autonomy Optimization
$160.36k - $240.54k
...team in Mountain View, CA. You will design, build, and deploy components for machine learning model life cycles, enhancing Nuro's autonomy stack. Candidates should have expertise in ML optimization techniques and programming skills in Python and C++. This role offers a...
Icehouseventures
Mountain View, CA
1 day ago
On-Device ML Infrastructure Engineer (CoreML Runtime), Graphics, Games & ML
$181.1k - $318.4k
On-Device ML Infrastructure Engineer (CoreML Runtime), Graphics, Games & ML Cupertino, California, United States Machine Learning and AI Imagine being at the forefront of an evolution... ...to embedded systems, developing optimization toolkits for model compression and acceleration...
Relocation
Apple
Cupertino, CA
1 day ago
Machine Learning Performance Engineer
$160.16k - $266.93k
...prototyping, test, manufacturing, and optimization. Our ~15,000 employees create... ...in over 100 countries. Learn more about what we do. Our... ...is hiring a ML Performance Engineer to make our training and... ...optimization, kernel fusion, runtime selection Write custom CUDA...
Flexible hours
Keysight Technologies
Santa Rosa, CA
1 day ago
Senior Machine Learning Engineer, Animation Integration
...persona. Genies is looking for a Senior Machine Learning Engineer to join our Avatar Technology team,... ...will work across data, modeling, and runtime systems to ensure ML-driven animation... ...engines. Debug, profile, and optimize runtime performance across CPU, GPU,...
Full time
Work experience placement
Work at office
GENIES INC
San Francisco, CA
5 days ago
Staff Machine Learning Engineer
$215.28k - $364.32k
...Staff Machine Learning Engineer Santa Clara, CA XPENG is a leading smart technology company at... ...preparation, model training, evaluation, optimization, quantization, and deployment. You... ..., quantization accuracy drop, or runtime performance bottlenecks. Familiarity...
Full time
XPENG
Santa Clara, CA
5 days ago
Principal ML Engineer - Large Scale Training Performance Optimization
...career. THE ROLE: We are looking for a Principal Machine Learning Engineer to join our Models and Applications team. If you are excited... ...Improve the end-to-end training pipeline performance. Optimize the distributed training pipeline and algorithm to scale out...
Advanced Micro Devices , Inc.
San Jose, CA
2 days ago
Staff Machine Learning Engineer
...Staff Machine Learning Engineer As a Staff Machine Learning Engineer, you will be responsible for... ...inform research & product direction Optimize and validate models for integration... ...throughput optimization (e.g., TensorRT, ONNX Runtime, mixed precision), and building...
Work at office
Intuitive
San Carlos, CA
3 days ago
ML Optimization Engineer: Training & Inference
Generalist is seeking an expert in machine learning optimizations to join our innovative team. You will own the optimizations for both training and on-robot inference stacks, focusing on achieving step-function gains. The ideal candidate should be proficient with the latest...
Generalist
San Francisco, CA
3 days ago
Sr. ML Optimization Engineer, iCloud
$181.1k - $318.4k
...commitment to environmental sustainability and optimal resource utilization. This team plays a... ...services. As a Sr. ML Optimization Engineer, you will work at the intersection of... ...strategy, applied analytics, machine learning, and large-scale optimization. You will...
Relocation
Apple Inc.
Cupertino, CA
4 days ago
ML Infra Engineer Intern: Optimize BEV Training on GPUs
$19 - $65 per hour
PlusAI is seeking a Machine Learning Infrastructure Engineer Intern to work on high-performance kernels for BEV model training. In this role, you will... ...Triton. This internship also explores the use of LLMs to optimize code generation and performance profiling. The hourly...
Hourly pay
Internship
PlusAI
Santa Clara, CA
4 days ago
Senior Machine Learning Engineer, AI, SIML
$181.1k - $318.4k
Senior Machine Learning Engineer, AI, SIML Cupertino, California, United States Machine Learning and AI Are you passionate about Generative... ...Preferred Qualifications Experience working on AI/ML-optimized runtime stacks. Familiarity with parallelization algorithms for...
Relocation
Flexible hours
Apple Inc.
Cupertino, CA
1 day ago
Senior ML Engineer for Autonomous Ad Optimization
$160k - $240k
Tensec is looking for a Machine Learning Engineer in San Francisco to build algorithms and optimization systems that drive their autonomous decision engine. This role involves designing trading strategies, building execution layers, and deploying robust models that enhance...
Relocation package
Tensec
San Francisco, CA
5 days ago
Senior ML Engineer - Graph Optimizations & ONNX (Onsite)
Qualcomm in San Diego is looking for a Machine Learning Engineer to implement graph optimization passes and develop AI solutions. The role emphasizes collaboration with engineering teams and requires experience in ML systems and Python programming. The position offers a...
Qualcomm
San Diego, CA
5 days ago
Machine Learning Research Engineer
$2,000 per month
...Machine Learning Research Engineer Cupertino, CA Etched is building AI chips that are hard-coded for... ...performance characterization tools, and runtime abstractions by implementing... ...allow for real-time image generation Optimize model instructions and scheduling algorithms...
Work at office
Relocation package
ETCHED LLC
Cupertino, CA
2 days ago
Staff ML Engineer - Real-Time Bidding & AI Optimization
jobr.pro is seeking a Staff Machine Learning Engineer to join our Vector Bidding Science team in Mountain View, California. In this critical... ...vision and develop advanced bidding systems using AI and optimization frameworks. You will design algorithms, analyze marketplace...
Jobr
Mountain View, CA
5 days ago
Sr. Staff Engineer, Machine Learning Engineering (Quantization SW)
$178.4k - $267.6k
...Qualcomm Technologies, Inc. Job Area: Engineering Group, Engineering Group > Machine Learning Engineering General Summary:... ...design, develop and test model optimization techniques that include - but... ...like Huggingface Optimum, ONNX runtime and OpenVino. Proven hands-on experience...
Work experience placement
Work from home
Stryker
San Diego, CA
5 days ago
Director, Machine Learning Engineering
$150k - $300k
Director, Machine Learning EngineeringSkip to main contentGEICO uses cookies... ..., Machine Learning Engineering page is loaded## Director,... ...Great Careers.**Director - Runtime Intelligence & Personalization... ...environments**Observability & Optimization*** Establish observability...
Hourly pay
Temporary work
Work experience placement
Local area
GEICO
Palo Alto, CA
2 days ago
Senior/Staff ML Engineer, Performance Optimization
The Role We're looking for someone who loves optimizing model inference to join us in building the core of ComfyUI - the most complex and bleeding-edge part of our engine. You'll be working on making AI models run faster and more efficiently than anyone thought possible...
Comfy
San Francisco, CA
5 days ago
Machine Learning Infrastructure Engineer
...combines threat intelligence with machine learning, enables financial... ...scale. As a Senior Software Engineer, ML Infrastructure at TRM Labs... ...concurrent models and users. Optimize high-throughput inference.... ...stacks such as TensorRT, ONNX Runtime, vLLM, FlashAttention, and...
Worldwide
TRM Labs
San Francisco, CA
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning Engineer, Runtime & Optimization. Be the first to apply!