Senior Machine Learning Engineer - VLM/LLM Evaluation
$204k - $259kWaymo
Senior Machine Learning Engineer – VLM/LLM Evaluation
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo's fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.
The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards the goal of safely operating Waymo vehicles in dozens of cities and under all driving conditions. As part of our work, we also initiate and foster collaborations with other research teams in Alphabet. AI Foundations areas that we are currently focusing on include reinforcement learning, learning from demonstration, generative modeling, Bayesian inference, hierarchical learning, and robust evaluation.
This role follows a hybrid work schedule and you will report to a Senior Staff Software Engineer.
You will:
- Work with a creative team of people who help to build the state-of-the-art Foundation Models that are used throughout Waymo's systems, both onboard autonomous vehicles and offboard in simulation
- Drive the development or significantly contribute to end-to-end evaluation systems and benchmarks for Waymo Foundation models, encompassing the entire life-cycle from pre-training and supervised fine-tuning (SFT) to reinforcement learning (RL), for evaluating the quality, safety, and realism of embodied AI agents
- Partner with cross-functional teams within the organization to land innovative tech in production
- Implement and extend large scale data and evaluation pipelines.
You have:
- Bachelor or Master's degree in Computer Science, similar technical field of study, or equivalent practical experience
- Experience in ML engineering and applied Deep Learning
- Experience with large scale distributed system
- Proficient programming skills (eg: Python, C/C++)
We prefer:
- ML infra experience: training, evaluating and deploying ML models at scale
- Deep learning experience, especially with generative models, e.g., LLMs/VLMs, and/or reinforcement learning
- Proficiency and in-depth knowledge of the inner workings of an ML framework (e.g. Pytorch, JAX, Tensorflow)
In accordance with Washington state law, we are highlighting our comprehensive benefits package, which is available to all eligible US based employees. Benefits for this role include:
- Health, dental, vision, life, disability insurance
- Retirement Benefits: 401(k) with company match
- Paid Time Off: 20 days of vacation per year, accruing at a rate of 6.15 hours per pay period for the first five years of employment
- Sick Time: 40 hours/year (statutory, where applicable); 5 days/event (discretionary)
- Maternity Leave (Short-Term Disability + Baby Bonding): 28-30 weeks
- Baby Bonding Leave: 18 weeks
- Holidays: 13 paid days per year
The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remote, the specific salary range for your preferred location, during the hiring process.
Waymo employees are also eligible to participate in Waymo's discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements.
Salary Range $204,000—$259,000 USD
$204k - $259k
...builds the system which learns the spatial-... ...sensors, enabling engineers like you to (1) develop... ...for cutting-edge VLM foundation models.... ...Develop and rigorously evaluate metrics and... ...years of experience in Machine Learning, with a focus... ...model development (LLM, VLM, or similar...SeniorFull timeRemote work$200k - $365k
...and privacy protection. To learn more about Plaud, please... .... Possess strong software engineering skills (especially in Python)... ...systems, data pipelines, or evaluation harnesses that can run at scale... ...good" looks like for a Speech LLM, translating capabilities (like...SuggestedFull timeWork at officeWorldwide$240.45k - $300.3k
...Senior Machine Learning Engineer - Model Evaluations, Public Sector San Francisco, CA; St. Louis, MO; New York, NY; Washington, DC Senior Machine Learning... ..., robustness, and safety metrics, including LLM-judge–based evaluations. Design test datasets and...SeniorFull time- ...frontier research for their next generation of LLM products. Join us if you: Wish to... .... Responsibilities Own LLM evaluation processes and methods with a focus on... ...abrupt shift in focus. You must be able to learn, implement, and extend state-of-the-art research...SuggestedLocal areaShift work
- ...pipelines, plus top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality, multimodality, and... ...pedigree. Project Overview: As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking...SeniorFor contractorsRemote workFlexible hours
$200k - $300k
...connectors, flexible LLM choice, and robust APIs... ...reliably better over time: evaluation pipelines, quality... ..., and the tooling engineers use to understand what... ...evaluation, reinforcement learning from human feedback,... ...large systems involving machine learning. ~...Home officeFlexible hours3 days per week- ...ML Engineers Preference Model is building automated... ...and build reinforcement learning environments to safely... ...specifically on machine learning research and... ...conducting experiments and evaluations, delivering your work... ...that powers frontier LLM capability. Note: This...SeniorVisa sponsorshipRelocation package
- ...Senior Machine Learning Engineer Location: San Francisco About Hum.ai Hum.ai is building planetary... ...large foundation models (beyond just LLM fine-tuning). This role is focused... ...Shaping benchmark design and model evaluation frameworks Building agentic AI...SeniorWork experience placementRemote work
- ...Senior Machine Learning Engineer Oway Software Engineering San Francisco, CA, USA Supply chain... ...and iterate on Juno AI, our autonomous LLM-based dispatch agent, including... ...to get to know the team. Technical evaluation, details disclosed after step 2....SeniorImmediate start
$225k - $325k
...high-ownership role for ML engineers who want to build production... ...constraints. As a Founding Senior Machine Learning Engineer at Retell, you'll... ...language models and audio models, evaluate them with rigorous... ...Technical Interview (45 min) : LLM theory specific coding...SeniorH1bWork at office$204k - $259k
...serving as the foundation for training and validating the AV stack. We are an advanced ML and engineering team that leverages state-of-the-art computer vision, deep learning, and generative AI to automatically analyze driving logs, generate rich scene understanding,...SeniorFull timeRemote work- ...is the place. The Role As a Senior Machine Learning Engineer, you will build the intelligence layer... ...across LLMs, OCR pipelines, voice AI, evaluation systems, and backend production... ...structured facts and decisions. Design LLM-powered extraction, classification, validation...SeniorWork at office
$200k - $260k
...Senior Machine Learning Engineer, Voice AI San Francisco About the Role Together AI is building the... ...-on with inference engines like TRT-LLM and SGLang to optimize how we serve models... ...'s infrastructure. Build quality evaluation frameworks that guide model selection...SeniorFull time$131.4k - $235.95k
...tools for making buildings, machines, and even the latest... ...people in the world. As a Senior Machine Learning Engineer focused on Machine Learning... ...closely with researchers, evaluation engineers, and product... ...running production ML or LLM inference services, including...SeniorFor contractorsRemote work$175k - $250k
...Machine Learning Engineer Kiddom is a groundbreaking educational platform that... ...insight engine. Develop evaluation-first development workflows... ...frameworks and advanced LLM architectures. Experience... ...location, prior experience, seniority, and demonstrated role related...SeniorPermanent employmentFull timeLocal areaFlexible hours- ...AI persona. Genies is looking for a Senior Machine Learning Engineer to join our Avatar Technology team,... ...including data processing, training, evaluation, optimization, and deployment. Develop... .... Collaborate with Behavior and LLM teams to connect motion systems with...SeniorFull timeWork experience placementWork at office
- ...AI persona. Genies is looking for a Senior Machine Learning Engineer to join our Avatar Technology team,... ...data) for training, fine-tuning, and evaluation. Build data pipelines for extraction... .... Collaborate with Behavior and LLM teams to integrate predictive motion...SeniorFull timeWork experience placementWork at office
- Arena Intelligence, Inc. in San Francisco is seeking a Senior Machine Learning Engineer to enhance AI model evaluation systems. You will work on data pipelines, inference APIs, and new evaluation methods. The ideal candidate possesses strong programming skills, experience...Senior
$200k - $275k
...seeking a highly motivated and passionate Senior Machine Learning Engineer to join our core AI team in San... ...rigorous quantitative benchmarks to evaluate and continuously improve agentic systems... ...cost control. Fine-tune and train LLM/ML models when off-the-shelf options...SeniorFull timeWork experience placementWork at officeLocal areaFlexible hours- ...Fortune 500. By bridging the gap between LLM capabilities and domain-specific... ...improve its fundamentals?" CTGT's Senior Machine Learning Engineer will operate deep within the model stack... ...in model output. Build the evaluation and deployment loops needed to ship changes...
$264.8k - $331k
...customers. About the Role As a Senior/Staff Machine Learning Engineer (MLE) on the General Agents team,... ...lifecycle-from model and system design to evaluation, deployment, and iteration-bridging... ...-to-end agent systems that combine LLM reasoning, tool use, memory, and...SeniorFull time$240k - $260k
...technical background with deep machine learning expertise shaped by hands-on... ...: framing, experimentation, evaluation, deployment, and iteration.... ...value the craft of software engineering and bring a thoughtful... ...developing extensible code with LLM coding assistants Job Perks...SeniorFull timeTemporary workWork at officeLocal areaFlexible hours- Senior Staff Machine Learning Engineer, Post Training Remote - USA Airbnb was born in 2007 when two hosts welcomed three... ..., ML services and tools including LLM fine‑tuning, alignment and optimization, RAG/Search, LLM evaluation and testing automation, feedback‑based...SeniorWork experience placementRemote work
- Airbnb, Inc. is hiring a Senior Staff Machine Learning Engineer, focusing on driving evaluation strategies and data infrastructure for CSxAI initiatives. This role requires a PhD in a relevant field, extensive experience in ML/AI systems, and strong leadership in technical...SeniorRemote job
$192k - $264k
...the power of tech, data, and machine learning to connect this thriving... ...to help retailers find and evaluate products on Faire. You will... ...including product, design, engineering, analytics, and operations,... ...listings. Use deep learning, LLM fine tuning, and human-in-the...SeniorFull timeWork experience placementWork at officeLocal areaRemote workMonday to FridayFlexible hours2 days per week- Senior Staff Machine Learning Engineer, Data & Eval United States AI and ML are at the heart of the Airbnb product. From Trust... ...models, ML services, and tools including LLM fine‑tuning and optimization, RAG/Search, LLM evaluation and testing automation, feedback‑based...SeniorWork experience placementCasual workLive inWork at officeRemote work
- ...the Role We are looking for a visionary Senior ML Engineer who will bridge the gap between high-level architecture... ...of ML, specifically training or fine-tuning LLM models, embeddings; building clustering models; utilizing evaluation frameworks to quantify performance ~...SeniorShift work
- ...Senior AI/ML Engineer — LLM & Agent Stack Every production AI system, whether it's powering customer support, writing code, analyzing financial data, or diagnosing medical conditions, needs the same foundational infrastructure. A way to route between models. A way to...Senior
- ..., located in San Francisco, is looking to enhance its Silicon Valley engineering team with skilled professionals in AI research and product development. The role involves building production-grade LLM pipelines and integrating AI features into data pipelines. The ideal...Senior
$206k - $308k
...Description The Enterprise Machine Learning team drives organizational value... ...As a Machine Learning Engineer, you will serve as a technical... ...Opportunity to develop and scale of LLM and deep learning solutions... ...may be used to screen or evaluate applications for this...SeniorFull timeWork at officeLocal areaRemote work2 days per week
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Machine Learning Engineer - VLM/LLM Evaluation. Be the first to apply!
- machine learning ai engineer San Francisco, CA
- machine learning engineer San Francisco, CA
- entry level machine learning engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- machine learning software engineer San Francisco, CA
- ai ml engineer San Francisco, CA
- senior ml engineer San Francisco, CA
- graduate machine learning engineer San Francisco, CA
- computer vision machine learning engineer San Francisco, CA
- data scientist machine learning engineer San Francisco, CA



