Senior Machine Learning Engineer - VLM/LLM Evaluation

$204k - $259k

Waymo

Senior Machine Learning Engineer – VLM/LLM Evaluation

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo's fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.

The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards the goal of safely operating Waymo vehicles in dozens of cities and under all driving conditions. As part of our work, we also initiate and foster collaborations with other research teams in Alphabet. AI Foundations areas that we are currently focusing on include reinforcement learning, learning from demonstration, generative modeling, Bayesian inference, hierarchical learning, and robust evaluation.

This role follows a hybrid work schedule and you will report to a Senior Staff Software Engineer.

You will:

Work with a creative team of people who help to build the state-of-the-art Foundation Models that are used throughout Waymo's systems, both onboard autonomous vehicles and offboard in simulation
Drive the development or significantly contribute to end-to-end evaluation systems and benchmarks for Waymo Foundation models, encompassing the entire life-cycle from pre-training and supervised fine-tuning (SFT) to reinforcement learning (RL), for evaluating the quality, safety, and realism of embodied AI agents
Partner with cross-functional teams within the organization to land innovative tech in production
Implement and extend large scale data and evaluation pipelines.

You have:

Bachelor or Master's degree in Computer Science, similar technical field of study, or equivalent practical experience
Experience in ML engineering and applied Deep Learning
Experience with large scale distributed system
Proficient programming skills (eg: Python, C/C++)

We prefer:

ML infra experience: training, evaluating and deploying ML models at scale
Deep learning experience, especially with generative models, e.g., LLMs/VLMs, and/or reinforcement learning
Proficiency and in-depth knowledge of the inner workings of an ML framework (e.g. Pytorch, JAX, Tensorflow)

In accordance with Washington state law, we are highlighting our comprehensive benefits package, which is available to all eligible US based employees. Benefits for this role include:

Health, dental, vision, life, disability insurance
Retirement Benefits: 401(k) with company match
Paid Time Off: 20 days of vacation per year, accruing at a rate of 6.15 hours per pay period for the first five years of employment
Sick Time: 40 hours/year (statutory, where applicable); 5 days/event (discretionary)
Maternity Leave (Short-Term Disability + Baby Bonding): 28-30 weeks
Baby Bonding Leave: 18 weeks
Holidays: 13 paid days per year

The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remote, the specific salary range for your preferred location, during the hiring process.

Waymo employees are also eligible to participate in Waymo's discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements.

Salary Range $204,000—$259,000 USD

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the Senior Machine Learning Engineer - VLM/LLM Evaluation in San Francisco, CA vacancy

Senior Machine Learning Engineer, Perception LLM/VLM
$204k - $259k
...builds the system which learns the spatial-... ...sensors, enabling engineers like you to (1) develop... ...for cutting-edge VLM foundation models.... ...Develop and rigorously evaluate metrics and... ...years of experience in Machine Learning, with a focus... ...model development (LLM, VLM, or similar...
Senior
Full time
Remote work
Waymo
San Francisco, CA
5 days ago
Machine Learning Engineer, Model Evaluations (Speech LLM) - San Francisco
$200k - $365k
...and privacy protection. To learn more about Plaud, please... .... Possess strong software engineering skills (especially in Python)... ...systems, data pipelines, or evaluation harnesses that can run at scale... ...good" looks like for a Speech LLM, translating capabilities (like...
Suggested
Full time
Work at office
Worldwide
Plaud
San Francisco, CA
4 days ago
Senior Machine Learning Engineer - Model Evaluations, Public Sector
$240.45k - $300.3k
...Senior Machine Learning Engineer - Model Evaluations, Public Sector San Francisco, CA; St. Louis, MO; New York, NY; Washington, DC Senior Machine Learning... ..., robustness, and safety metrics, including LLM-judge–based evaluations. Design test datasets and...
Senior
Full time
Scale AI
San Francisco, CA
1 day ago
ML Engineer LLM Evaluation
...frontier research for their next generation of LLM products. Join us if you: Wish to... .... Responsibilities Own LLM evaluation processes and methods with a focus on... ...abrupt shift in focus. You must be able to learn, implement, and extend state-of-the-art research...
Suggested
Local area
Shift work
Dynamo AI
San Francisco, CA
1 day ago
Remote Senior Python Engineer - LLM Evaluation (US-based)
...pipelines, plus top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality, multimodality, and... ...pedigree. Project Overview: As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking...
Senior
For contractors
Remote work
Flexible hours
Turing
San Francisco, CA
2 days ago
Machine Learning Engineer, LLM Evals & Observability
$200k - $300k
...connectors, flexible LLM choice, and robust APIs... ...reliably better over time: evaluation pipelines, quality... ..., and the tooling engineers use to understand what... ...evaluation, reinforcement learning from human feedback,... ...large systems involving machine learning. ~...
Home office
Flexible hours
3 days per week
Glean.info
San Francisco, CA
4 days ago
Senior Machine Learning Engineer, RL Environments
...ML Engineers Preference Model is building automated... ...and build reinforcement learning environments to safely... ...specifically on machine learning research and... ...conducting experiments and evaluations, delivering your work... ...that powers frontier LLM capability. Note: This...
Senior
Visa sponsorship
Relocation package
Preference Model
San Francisco, CA
1 day ago
Senior Machine Learning Engineer
...Senior Machine Learning Engineer Location: San Francisco About Hum.ai Hum.ai is building planetary... ...large foundation models (beyond just LLM fine-tuning). This role is focused... ...Shaping benchmark design and model evaluation frameworks Building agentic AI...
Senior
Work experience placement
Remote work
Humai
San Francisco, CA
4 days ago
Senior Machine Learning Engineer
...Senior Machine Learning Engineer Oway Software Engineering San Francisco, CA, USA Supply chain... ...and iterate on Juno AI, our autonomous LLM-based dispatch agent, including... ...to get to know the team. Technical evaluation, details disclosed after step 2....
Senior
Immediate start
Ritual Capital
San Francisco, CA
1 day ago
Senior Machine Learning Engineer
$225k - $325k
...high-ownership role for ML engineers who want to build production... ...constraints. As a Founding Senior Machine Learning Engineer at Retell, you'll... ...language models and audio models, evaluate them with rigorous... ...Technical Interview (45 min) : LLM theory specific coding...
Senior
H1b
Work at office
Retell AI
San Francisco, CA
3 days ago
Senior Machine Learning Engineer, Computer Vision/VLM
$204k - $259k
...serving as the foundation for training and validating the AV stack. We are an advanced ML and engineering team that leverages state-of-the-art computer vision, deep learning, and generative AI to automatically analyze driving logs, generate rich scene understanding,...
Senior
Full time
Remote work
Waymo
San Francisco, CA
4 days ago
Senior Machine Learning Engineer, Multimodal AI
...is the place. The Role As a Senior Machine Learning Engineer, you will build the intelligence layer... ...across LLMs, OCR pipelines, voice AI, evaluation systems, and backend production... ...structured facts and decisions. Design LLM-powered extraction, classification, validation...
Senior
Work at office
Hike Medical
San Francisco, CA
1 day ago
Senior Machine Learning Engineer, Voice AI
$200k - $260k
...Senior Machine Learning Engineer, Voice AI San Francisco About the Role Together AI is building the... ...-on with inference engines like TRT-LLM and SGLang to optimize how we serve models... ...'s infrastructure. Build quality evaluation frameworks that guide model selection...
Senior
Full time
Together AI
San Francisco, CA
5 days ago
Senior Machine Learning Engineer, MLOps West Coast
$131.4k - $235.95k
...tools for making buildings, machines, and even the latest... ...people in the world. As a Senior Machine Learning Engineer focused on Machine Learning... ...closely with researchers, evaluation engineers, and product... ...running production ML or LLM inference services, including...
Senior
For contractors
Remote work
Autodesk
San Francisco, CA
2 days ago
Senior Machine Learning Engineer
$175k - $250k
...Machine Learning Engineer Kiddom is a groundbreaking educational platform that... ...insight engine. Develop evaluation-first development workflows... ...frameworks and advanced LLM architectures. Experience... ...location, prior experience, seniority, and demonstrated role related...
Senior
Permanent employment
Full time
Local area
Flexible hours
Kiddom
San Francisco, CA
1 day ago
Senior Machine Learning Engineer, Animation Modeling
...AI persona. Genies is looking for a Senior Machine Learning Engineer to join our Avatar Technology team,... ...including data processing, training, evaluation, optimization, and deployment. Develop... .... Collaborate with Behavior and LLM teams to connect motion systems with...
Senior
Full time
Work experience placement
Work at office
GENIES INC
San Francisco, CA
1 day ago
Senior Machine Learning Engineer, Animation Integration
...AI persona. Genies is looking for a Senior Machine Learning Engineer to join our Avatar Technology team,... ...data) for training, fine-tuning, and evaluation. Build data pipelines for extraction... .... Collaborate with Behavior and LLM teams to integrate predictive motion...
Senior
Full time
Work experience placement
Work at office
GENIES INC
San Francisco, CA
1 day ago
Senior ML Engineer - Real-World AI Evaluations
Arena Intelligence, Inc. in San Francisco is seeking a Senior Machine Learning Engineer to enhance AI model evaluation systems. You will work on data pipelines, inference APIs, and new evaluation methods. The ideal candidate possesses strong programming skills, experience...
Senior
Arena Intelligence, Inc.
San Francisco, CA
5 days ago
Senior Machine Learning Engineer, AI Agent
$200k - $275k
...seeking a highly motivated and passionate Senior Machine Learning Engineer to join our core AI team in San... ...rigorous quantitative benchmarks to evaluate and continuously improve agentic systems... ...cost control. Fine-tune and train LLM/ML models when off-the-shelf options...
Senior
Full time
Work experience placement
Work at office
Local area
Flexible hours
GENIES INC
San Francisco, CA
1 day ago
Machine Learning Engineer: LLM Interpretability & Systems
...Fortune 500. By bridging the gap between LLM capabilities and domain-specific... ...improve its fundamentals?" CTGT's Senior Machine Learning Engineer will operate deep within the model stack... ...in model output. Build the evaluation and deployment loops needed to ship changes...
CTGT
San Francisco, CA
1 day ago
Senior/Staff Machine Learning Engineer, General Agents, Enterprise GenAI
$264.8k - $331k
...customers. About the Role As a Senior/Staff Machine Learning Engineer (MLE) on the General Agents team,... ...lifecycle-from model and system design to evaluation, deployment, and iteration-bridging... ...-to-end agent systems that combine LLM reasoning, tool use, memory, and...
Senior
Full time
Scale AI
San Francisco, CA
4 days ago
Senior Machine Learning Engineer
$240k - $260k
...technical background with deep machine learning expertise shaped by hands-on... ...: framing, experimentation, evaluation, deployment, and iteration.... ...value the craft of software engineering and bring a thoughtful... ...developing extensible code with LLM coding assistants Job Perks...
Senior
Full time
Temporary work
Work at office
Local area
Flexible hours
Vsco
San Francisco, CA
2 hours ago
Senior Staff Machine Learning Engineer, Post Training
Senior Staff Machine Learning Engineer, Post Training Remote - USA Airbnb was born in 2007 when two hosts welcomed three... ..., ML services and tools including LLM fine‑tuning, alignment and optimization, RAG/Search, LLM evaluation and testing automation, feedback‑based...
Senior
Work experience placement
Remote work
airbnb, Inc.
San Francisco, CA
5 days ago
Senior Staff ML Engineer, Data & Evaluation (Remote)
Airbnb, Inc. is hiring a Senior Staff Machine Learning Engineer, focusing on driving evaluation strategies and data infrastructure for CSxAI initiatives. This role requires a PhD in a relevant field, extensive experience in ML/AI systems, and strong leadership in technical...
Senior
Remote job
airbnb, Inc.
San Francisco, CA
2 days ago
Senior Data Scientist / Machine Learning Engineer - Listing Quality
$192k - $264k
...the power of tech, data, and machine learning to connect this thriving... ...to help retailers find and evaluate products on Faire. You will... ...including product, design, engineering, analytics, and operations,... ...listings. Use deep learning, LLM fine tuning, and human-in-the...
Senior
Full time
Work experience placement
Work at office
Local area
Remote work
Monday to Friday
Flexible hours
2 days per week
Faire
San Francisco, CA
2 hours ago
Senior Staff Machine Learning Engineer, Data & Eval
Senior Staff Machine Learning Engineer, Data & Eval United States AI and ML are at the heart of the Airbnb product. From Trust... ...models, ML services, and tools including LLM fine‑tuning and optimization, RAG/Search, LLM evaluation and testing automation, feedback‑based...
Senior
Work experience placement
Casual work
Live in
Work at office
Remote work
airbnb, Inc.
San Francisco, CA
5 days ago
Gentoro | Senior ML Engineer
...the Role We are looking for a visionary Senior ML Engineer who will bridge the gap between high-level architecture... ...of ML, specifically training or fine-tuning LLM models, embeddings; building clustering models; utilizing evaluation frameworks to quantify performance ~...
Senior
Shift work
Palm Venture Studios
San Francisco, CA
23 days ago
Senior AI/ML Engineer LLM & Agent Stack
...Senior AI/ML Engineer — LLM & Agent Stack Every production AI system, whether it's powering customer support, writing code, analyzing financial data, or diagnosing medical conditions, needs the same foundational infrastructure. A way to route between models. A way to...
Senior
TrueFoundry
San Francisco, CA
1 day ago
Senior AI/ML Engineer - LLM Pipelines & RAG (SF Onsite)
..., located in San Francisco, is looking to enhance its Silicon Valley engineering team with skilled professionals in AI research and product development. The role involves building production-grade LLM pipelines and integrating AI features into data pipelines. The ideal...
Senior
HopHR
San Francisco, CA
3 days ago
Senior Machine Learning Engineer
$206k - $308k
...Description The Enterprise Machine Learning team drives organizational value... ...As a Machine Learning Engineer, you will serve as a technical... ...Opportunity to develop and scale of LLM and deep learning solutions... ...may be used to screen or evaluate applications for this...
Senior
Full time
Work at office
Local area
Remote work
2 days per week
Zendesk
San Francisco, CA
19 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Machine Learning Engineer - VLM/LLM Evaluation. Be the first to apply!