Staff Machine Learning Engineer - VLM/LLM Evaluation

$238k - $302k

Waymo

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver-The World's Most Experienced Driver™-to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo's fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.

The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards the goal of safely operating Waymo vehicles in dozens of cities and under all driving conditions. As part of our work, we also initiate and foster collaborations with other research teams in Alphabet. AI Foundations areas that we are currently focusing on include reinforcement learning, learning from demonstration, generative modeling, Bayesian inference, hierarchical learning, and robust evaluation.

This role follows a hybrid work schedule and you will report to a Senior Staff Software Engineer.

You will:

Work with a creative team of people who help to build the state-of-the-art Foundation Models that are used throughout Waymo's systems, both onboard autonomous vehicles and offboard in simulation
Lead the development of end-to-end evaluation systems and benchmarks for Waymo Foundation models, encompassing the entire lifecycle from pretraining and supervised fine-tuning (SFT) to reinforcement learning (RL), for evaluating the quality, safety, and realism of embodied AI agents
Partner within and across organizations to land disruptive and innovative tech in production
Implement and extend large large scale data and evaluation pipelines

You have:

Master's degree or PhD degree in Computer Science, similar technical field of study, or equivalent practical experience
5+ years of experience in ML engineering and applied Deep Learning, with a strong portfolio of shipped products or publication record
Experience with large scale distributed system
Proficient programming skills (eg: Python, C/C++)
Strong analytical and debugging skills

We prefer:

ML infra experience: training, evaluating and deploying ML models at scale
Deep learning experience, especially with generative models, e.g., LLMs/VLMs, and/or reinforcement learning
Proficiency and in-depth knowledge of the inner workings of an ML framework (e.g. Pytorch, JAX, Tensorflow)

In accordance with Washington state law, we are highlighting our comprehensive benefits package, which is available to all eligible US based employees. Benefits for this role include:

Health, dental, vision, life, disability insurance
Retirement Benefits: 401(k) with company match
Paid Time Off: 20 days of vacation per year, accruing at a rate of 6.15 hours per pay period for the first five years of employment
Sick Time: 40 hours/year (statutory, where applicable); 5 days/event (discretionary)
Maternity Leave (Short-Term Disability + Baby Bonding): 28-30 weeks
Baby Bonding Leave: 18 weeks
Holidays: 13 paid days per year

The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remote, the specific salary range for your preferred location, during the hiring process.

Waymo employees are also eligible to participate in Waymo's discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements.

Salary Range

$238,000-$302,000 USD

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the Staff Machine Learning Engineer - VLM/LLM Evaluation in Mountain View, CA vacancy

Senior ML Engineer, LLM / VLM Distillation
$213k - $263k
...Waymo AI Foundations team is to develop machine learning solutions addressing open problems in... ...inference, hierarchical learning, and robust evaluation. This role follows a hybrid work... ...smaller real-time models. Explore LLM/VLM distillation recipes to maximally transfer...
Suggested
Full time
Remote work
Waymo
Mountain View, CA
13 hours ago
Machine Learning Engineer LLM Evaluation & Automation
$60 - $70 per hour
...Overview: We are seeking a Machine Learning Engineer to join a high-impact team focused on advancing LLM evaluation, NLP, and AI-driven automation. This role centers on designing scalable evaluation frameworks, optimizing prompt strategies, and building systems that...
Suggested
Contract work
Temporary work
Remote work
3 days per week
TEKsystems
Cupertino, CA
9 days ago
Senior Staff ML Engineer, Driver Understanding and Evaluation
$281k - $356k
...Senior Staff ML Engineer, Driver Understanding and Evaluation Waymo is an autonomous driving technology company with... ...15+ U.S. states. The DUE Machine Learning team will build and operate scalable... ...models and Generative AI (LLM/VLM) solutions. These solutions will...
Suggested
Full time
Waymo
Mountain View, CA
4 days ago
Senior Machine Learning Engineer, LLM - Moveworks
...workflows, and continuously learn and adapt. Moveworks... ...Moveworks’ Reasoning Engine and natural language... ...We are looking for a Machine Learning Engineer to help... ...for building and serving LLM’s at Moveworks. This... ...language models(LLM), model evaluation and monitoring...
Suggested
Full time
Work at office
Remote work
Flexible hours
Servicenow
Mountain View, CA
13 hours ago
Founding Machine learning Engineer - Evaluation
...Senior ML Engineer Medical Imaging Evaluation & AI Reliability About the Role: My client is building evaluation and evidence infrastructure... ...Required Qualifications: Strong experience in machine learning for medical imaging (radiology, pathology, cardiology...
Suggested
Shift work
Established Search
Sunnyvale, CA
3 days ago
Senior Machine Learning Engineer, Driver Understanding and Evaluation
$204k - $259k
...states. The Driver Understanding and Evaluation (DUE) team at Waymo is developing rich... ...of the Waymo Driver. The DUE Machine Learning team will build and operate scalable machine... ...looking for researchers and software engineers who are passionate about developing...
Full time
Waymo
Mountain View, CA
3 days ago
Machine Learning Engineer (Infra), Driver Understanding and Evaluation
$170k - $216k
...Machine Learning Engineer (Infra), Driver Understanding and Evaluation Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building...
Full time
Waymo
Mountain View, CA
2 days ago
Senior ML Evaluation Engineer — Autonomous Driving
NVIDIA Gruppe is seeking a skilled professional to develop learned evaluation pipelines in Santa Clara, CA. The role requires a deep understanding of LLM/VLM frameworks and solid software engineering skills in Python and C++. Successful candidates will have a track record...
NVIDIA Gruppe
Santa Clara, CA
13 hours ago
Senior ML Evaluation Engineer - Autonomous Vehicles
$184k - $287.5k
...Artificial Intelligence, Deep Learning and Autonomous Vehicles.... ...doing: Design and build learned evaluation pipelines that assess... ...Computer Science, Computer Engineering, or a related technical field... ...Hands‑on experience building LLM/VLM-based pipelines — fine‑tuning...
NVIDIA Gruppe
Santa Clara, CA
13 hours ago
Senior/Staff Online Mapping Machine Learning Engineer
$140k - $230k
...and advancing the state-of-the-art of machine learning (ML) for perception, prediction, and motion... ...with other software and hardware engineers and researchers to tackle some of the... ...processing, model training, ablation studies, evaluation, deployment, inference optimization....
Full time
Temporary work
Flexible hours
Woven By Toyota
Palo Alto, CA
13 hours ago
Senior Machine Learning Engineer II - LLM
$200k - $275k
...What You Will Do We are looking for a Machine Learning Engineer to help build cutting edge ML infrastructure for building and serving LLM’s at Moveworks. This role will be... ...for large language models(LLM), model evaluation and monitoring framework, LLM latency optimization...
Full time
Moveworks
Mountain View, CA
more than 2 months ago
Senior ML Engineer — Healthcare AI Safety & Evaluation
Hippocratic AI is seeking an experienced ML Engineer in Palo Alto to design and build evaluation frameworks for large language models. The role involves developing... ...engineering skills in Python and experience in LLM evaluation. Join a team focused on transforming healthcare...
Hippocratic AI
Palo Alto, CA
3 days ago
R&D AI Software Engineer / End-to-End Machine Learning Engineer / RAG and LLM
...with the fastest data processing engine on the market, Pathway enables... ...backbone who are able to prototype, evaluate, improve, and productionize end-to-end Machine Learning projects with enterprise data.... ...in a way which would boost LLM accuracy in inference & training...
Remote job
Permanent employment
Full time
Contract work
Immediate start
Pathway
Palo Alto, CA
13 hours ago
Machine Learning Research Engineer/Scientist
...on the things they value most. As a Machine Learning Research Engineer, you will work on the software and... ...and data curation to model training, evaluation, and on-robot deployment Collaborate... ...-scale model training pipelines (LLM/VLM/VLA) At Sunday Robotics, we’re building...
Sunday Robotics
Mountain View, CA
1 day ago
Sr Machine Learning Engineer, Tech Lead - Autograder Systems, Evaluation
$181.1k - $318.4k
Sr Machine Learning Engineer, Tech Lead — Autograder Systems, Evaluation Cupertino, California, United States Machine Learning and AI... ...methods such as reward modeling, LLM-as-judge, preference learning, and... ..., with a strong focus on LLM or VLM systems. Deep expertise in...
Relocation package
Apple Inc.
Cupertino, CA
3 days ago
Founding Machine Learning Engineer [32934]
...We're hiring our Founding Machine Learning Engineer (MLE) with expertise in Agent Development and Time... ...grade systems that combine the power of LLM-powered agents with time-series... ...adaptive strategies Develop and refine evaluation frameworks for agents, ensuring reliability...
Visa sponsorship
Stealth Startup
Sunnyvale, CA
13 hours ago
Senior Machine Learning Engineer
$194k - $214k
...highly customer-centric Senior ML Engineer who will join our cross-... ...solutions, including DL- and LLM-based approaches, that reliably... .... Experience with deep learning in a production setting, understanding... ...and salary opportunities are evaluated through our interview process...
Instrumental Inc
Palo Alto, CA
2 days ago
Machine Learning Engineer - Agentic AI
$147.4k - $272.1k
...Machine Learning Engineer - Agentic AI The VCV organization has pioneered human-centric, real-time... ...handling. Develop infrastructure for evaluating and improving agentic system... ...environments. Strong proficiency with LLM-assisted coding, including using AI tools...
Relocation
Apple
Sunnyvale, CA
2 days ago
Machine Learning Engineer, Search Quality
$140k - $265k
...SaaS connectors, flexible LLM choice, and robust APIs... ...Glean is looking for engineers to help build the world... ...question-answering, evaluation, and experimentation. We... ...more junior engineers, or learn from battle-tested ones... ...systems involving machine learning ~ Strong analytical...
Work at office
Home office
Flexible hours
3 days per week
Glean.info
Mountain View, CA
1 day ago
Machine Learning Engineer, Enterprise Brain
$200k - $300k
...enterprise SaaS connectors, flexible LLM choice, and robust APIs, Glean gives organizations... ...the Role: Glean is seeking a few Machine Learning engineers who want to focus on a combination of... .... Lead development of scalable evaluation, benchmarking, and optimization loops....
Work at office
Home office
Flexible hours
Glean.info
Mountain View, CA
1 day ago
Senior Machine Learning Engineer - Fine-Tuning and On-device AI
$120k - $215k
...Senior Machine Learning Engineer – Fine-Tuning and On-device AI Palo Alto, CA Who We Are HP... ...generation and annotation. Define evaluation metrics. Perform evaluations and analyze... ...learning, including at least 3 years in LLM fine-tuning. ~ Proficiency in Python...
Full time
Temporary work
Local area
Flexible hours
HP IQ
Palo Alto, CA
4 days ago
Senior Machine Learning Engineer
$230k - $265k
...alongside industry-veteran scientists and engineers. As a Senior Machine Learning Engineer, you’ll bring your strong... ...large-scale SID / ASR / NLP / LLM systems that power mission-critical... ...ML infrastructure, data pipelines, evaluation frameworks, and deployment...
Permanent employment
Otter.ai
Mountain View, CA
13 hours ago
Senior Machine Learning Engineer
...Evaluation Engineer Evaluation is the bottleneck in healthcare AI — you can't ship what you can't measure. You'll build the systems that tell... ..., synthetic data pipelines, automated benchmarks, and LLM-as-judge systems. This is a high-leverage engineering role where...
Hippocratic AI
Palo Alto, CA
4 days ago
Machine Learning Engineer
...Role Overview: As a Machine Learning Engineer, you will play a central role in translating cutting... ...training, debugging, and performance evaluation. Fine-tune large language models (LLMs... ...optimization, deep understanding of LLM serving optimizations for LLMs/VLMs....
Nace AI
Palo Alto, CA
13 hours ago
Machine Learning Engineer - LLM
...Description We are seeking a highly capable and dynamic Machine Learning Engineer to work cross-functional in building out a GenAI system in support... ...mining models with an emphasis on large language models (LLM) or large multimodal models (LMM). ~ Masters in Machine...
Apple
Cupertino, CA
4 days ago
Machine Learning Engineer
...data scientists, and engineers, tackling the most fundamental... ...computing in deep learning, driving impactful... .... The Role As a Machine Learning Engineer at the... ..., post-training, evaluation and so on, especially... ...Hands-on experience with LLM algorithms, such as Supervised...
Worldwide
Visa sponsorship
Institute of Foundation Models
Sunnyvale, CA
22 days ago
Machine Learning Engineer
...Job Description Job Description Machine Learning Engineer This is an opportunity with an early... ...and level up the team's knowledge of LLM training and infrastructure About you... ...engineering skills and know that infrastructure and evaluation work are critical....
Work at office
Amiri Recruiting
Mountain View, CA
12 days ago
Machine Learning Infrastructure Engineer
...Machine Learning Infrastructure Engineer At Mind Robotics, we're building generalized physical AI—robotic systems capable of dexterous, adaptive, and... ...-scale ML training Deep understanding of how modern LLM/VLM systems are trained and scaled Proven experience...
Mind Robotics
Palo Alto, CA
1 day ago
Staff Machine Learning Engineer, Search Ranking
$229k - $343k
...Staff Machine Learning Engineer Snap Inc is a technology company. We believe the camera presents the greatest... ...improvement Design robust offline evaluation, online experimentation, and model... ...systems, ads ranking, generative AI, LLM-based ranking, and retrieval-...
Work experience placement
Live in
Work at office
Local area
Snap
Palo Alto, CA
8 days ago
Senior Staff Machine Learning Engineer
$214k - $289.5k
...Overview Come join Intuit as a Senior Staff Machine Learning Engineer (MLE). Senior Staff MLEs deliver... ..., and continuous improvement. Evaluate and integrate transformative technologies... ...-augmented generation (RAG), and LLM fine-tuning pipelines to accelerate product...
Local area
Intuit
Mountain View, CA
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Machine Learning Engineer - VLM/LLM Evaluation. Be the first to apply!