Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Machine Learning Engineer - VLM/LLM Evaluation

$204k - $259k

Waymo

Senior Machine Learning Engineer – VLM/LLM Evaluation

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo's fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.

The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards the goal of safely operating Waymo vehicles in dozens of cities and under all driving conditions. As part of our work, we also initiate and foster collaborations with other research teams in Alphabet. AI Foundations areas that we are currently focusing on include reinforcement learning, learning from demonstration, generative modeling, Bayesian inference, hierarchical learning, and robust evaluation.

This role follows a hybrid work schedule and you will report to a Senior Staff Software Engineer.

You will:

  • Work with a creative team of people who help to build the state-of-the-art Foundation Models that are used throughout Waymo's systems, both onboard autonomous vehicles and offboard in simulation
  • Drive the development or significantly contribute to end-to-end evaluation systems and benchmarks for Waymo Foundation models, encompassing the entire life-cycle from pre-training and supervised fine-tuning (SFT) to reinforcement learning (RL), for evaluating the quality, safety, and realism of embodied AI agents
  • Partner with cross-functional teams within the organization to land innovative tech in production
  • Implement and extend large scale data and evaluation pipelines.

You have:

  • Bachelor or Master's degree in Computer Science, similar technical field of study, or equivalent practical experience
  • Experience in ML engineering and applied Deep Learning
  • Experience with large scale distributed system
  • Proficient programming skills (eg: Python, C/C++)

We prefer:

  • ML infra experience: training, evaluating and deploying ML models at scale
  • Deep learning experience, especially with generative models, e.g., LLMs/VLMs, and/or reinforcement learning
  • Proficiency and in-depth knowledge of the inner workings of an ML framework (e.g. Pytorch, JAX, Tensorflow)

In accordance with Washington state law, we are highlighting our comprehensive benefits package, which is available to all eligible US based employees. Benefits for this role include:

  • Health, dental, vision, life, disability insurance
  • Retirement Benefits: 401(k) with company match
  • Paid Time Off: 20 days of vacation per year, accruing at a rate of 6.15 hours per pay period for the first five years of employment
  • Sick Time: 40 hours/year (statutory, where applicable); 5 days/event (discretionary)
  • Maternity Leave (Short-Term Disability + Baby Bonding): 28-30 weeks
  • Baby Bonding Leave: 18 weeks
  • Holidays: 13 paid days per year

The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remote, the specific salary range for your preferred location, during the hiring process.

Waymo employees are also eligible to participate in Waymo's discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements.

Salary Range $204,000—$259,000 USD

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Senior Machine Learning Engineer - VLM/LLM Evaluation in San Francisco, CA vacancy
  • $204k - $259k

     ...builds the system which learns the spatial-...  ...sensors, enabling engineers like you to (1) develop...  ...for cutting-edge VLM foundation models....  ...Develop and rigorously evaluate metrics and...  ...years of experience in Machine Learning, with a focus...  ...model development (LLM, VLM, or similar... 
    Senior
    Full time
    Remote work

    Waymo

    San Francisco, CA
    5 days ago
  • $200k - $365k

     ...and privacy protection. To learn more about Plaud, please...  .... Possess strong software engineering skills (especially in Python)...  ...systems, data pipelines, or evaluation harnesses that can run at scale...  ...good" looks like for a Speech LLM, translating capabilities (like... 
    Suggested
    Full time
    Work at office
    Worldwide

    Plaud

    San Francisco, CA
    4 days ago
  • $240.45k - $300.3k

     ...Senior Machine Learning Engineer - Model Evaluations, Public Sector San Francisco, CA; St. Louis, MO; New York, NY; Washington, DC Senior Machine Learning...  ..., robustness, and safety metrics, including LLM-judge–based evaluations. Design test datasets and... 
    Senior
    Full time

    Scale AI

    San Francisco, CA
    1 day ago
  •  ...frontier research for their next generation of LLM products. Join us if you: Wish to...  .... Responsibilities Own LLM evaluation processes and methods with a focus on...  ...abrupt shift in focus. You must be able to learn, implement, and extend state-of-the-art research... 
    Suggested
    Local area
    Shift work

    Dynamo AI

    San Francisco, CA
    1 day ago
  •  ...pipelines, plus top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality, multimodality, and...  ...pedigree. Project Overview: As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking... 
    Senior
    For contractors
    Remote work
    Flexible hours

    Turing

    San Francisco, CA
    2 days ago
  • $200k - $300k

     ...connectors, flexible LLM choice, and robust APIs...  ...reliably better over time: evaluation pipelines, quality...  ..., and the tooling engineers use to understand what...  ...evaluation, reinforcement learning from human feedback,...  ...large systems involving machine learning. ~... 
    Home office
    Flexible hours
    3 days per week

    Glean.info

    San Francisco, CA
    4 days ago
  •  ...ML Engineers Preference Model is building automated...  ...and build reinforcement learning environments to safely...  ...specifically on machine learning research and...  ...conducting experiments and evaluations, delivering your work...  ...that powers frontier LLM capability. Note: This... 
    Senior
    Visa sponsorship
    Relocation package

    Preference Model

    San Francisco, CA
    1 day ago
  •  ...Senior Machine Learning Engineer Location: San Francisco About Hum.ai Hum.ai is building planetary...  ...large foundation models (beyond just LLM fine-tuning). This role is focused...  ...Shaping benchmark design and model evaluation frameworks Building agentic AI... 
    Senior
    Work experience placement
    Remote work

    Humai

    San Francisco, CA
    4 days ago
  •  ...Senior Machine Learning Engineer Oway Software Engineering San Francisco, CA, USA Supply chain...  ...and iterate on Juno AI, our autonomous LLM-based dispatch agent, including...  ...to get to know the team. Technical evaluation, details disclosed after step 2.... 
    Senior
    Immediate start

    Ritual Capital

    San Francisco, CA
    1 day ago
  • $225k - $325k

     ...high-ownership role for ML engineers who want to build production...  ...constraints. As a Founding Senior Machine Learning Engineer at Retell, you'll...  ...language models and audio models, evaluate them with rigorous...  ...Technical Interview (45 min) : LLM theory specific coding... 
    Senior
    H1b
    Work at office

    Retell AI

    San Francisco, CA
    3 days ago
  • $204k - $259k

     ...serving as the foundation for training and validating the AV stack. We are an advanced ML and engineering team that leverages state-of-the-art computer vision, deep learning, and generative AI to automatically analyze driving logs, generate rich scene understanding,... 
    Senior
    Full time
    Remote work

    Waymo

    San Francisco, CA
    4 days ago
  •  ...is the place. The Role As a Senior Machine Learning Engineer, you will build the intelligence layer...  ...across LLMs, OCR pipelines, voice AI, evaluation systems, and backend production...  ...structured facts and decisions. Design LLM-powered extraction, classification, validation... 
    Senior
    Work at office

    Hike Medical

    San Francisco, CA
    1 day ago
  • $200k - $260k

     ...Senior Machine Learning Engineer, Voice AI San Francisco About the Role Together AI is building the...  ...-on with inference engines like TRT-LLM and SGLang to optimize how we serve models...  ...'s infrastructure. Build quality evaluation frameworks that guide model selection... 
    Senior
    Full time

    Together AI

    San Francisco, CA
    5 days ago
  • $131.4k - $235.95k

     ...tools for making buildings, machines, and even the latest...  ...people in the world. As a Senior Machine Learning Engineer focused on Machine Learning...  ...closely with researchers, evaluation engineers, and product...  ...running production ML or LLM inference services, including... 
    Senior
    For contractors
    Remote work

    Autodesk

    San Francisco, CA
    2 days ago
  • $175k - $250k

     ...Machine Learning Engineer Kiddom is a groundbreaking educational platform that...  ...insight engine. Develop evaluation-first development workflows...  ...frameworks and advanced LLM architectures. Experience...  ...location, prior experience, seniority, and demonstrated role related... 
    Senior
    Permanent employment
    Full time
    Local area
    Flexible hours

    Kiddom

    San Francisco, CA
    1 day ago
  •  ...AI persona. Genies is looking for a Senior Machine Learning Engineer to join our Avatar Technology team,...  ...including data processing, training, evaluation, optimization, and deployment. Develop...  .... Collaborate with Behavior and LLM teams to connect motion systems with... 
    Senior
    Full time
    Work experience placement
    Work at office

    GENIES INC

    San Francisco, CA
    1 day ago
  •  ...AI persona. Genies is looking for a Senior Machine Learning Engineer to join our Avatar Technology team,...  ...data) for training, fine-tuning, and evaluation. Build data pipelines for extraction...  .... Collaborate with Behavior and LLM teams to integrate predictive motion... 
    Senior
    Full time
    Work experience placement
    Work at office

    GENIES INC

    San Francisco, CA
    1 day ago
  • Arena Intelligence, Inc. in San Francisco is seeking a Senior Machine Learning Engineer to enhance AI model evaluation systems. You will work on data pipelines, inference APIs, and new evaluation methods. The ideal candidate possesses strong programming skills, experience... 
    Senior

    Arena Intelligence, Inc.

    San Francisco, CA
    5 days ago
  • $200k - $275k

     ...seeking a highly motivated and passionate Senior Machine Learning Engineer to join our core AI team in San...  ...rigorous quantitative benchmarks to evaluate and continuously improve agentic systems...  ...cost control. Fine-tune and train LLM/ML models when off-the-shelf options... 
    Senior
    Full time
    Work experience placement
    Work at office
    Local area
    Flexible hours

    GENIES INC

    San Francisco, CA
    1 day ago
  •  ...Fortune 500. By bridging the gap between LLM capabilities and domain-specific...  ...improve its fundamentals?" CTGT's Senior Machine Learning Engineer will operate deep within the model stack...  ...in model output. Build the evaluation and deployment loops needed to ship changes... 

    CTGT

    San Francisco, CA
    1 day ago
  • $264.8k - $331k

     ...customers. About the Role As a Senior/Staff Machine Learning Engineer (MLE) on the General Agents team,...  ...lifecycle-from model and system design to evaluation, deployment, and iteration-bridging...  ...-to-end agent systems that combine LLM reasoning, tool use, memory, and... 
    Senior
    Full time

    Scale AI

    San Francisco, CA
    4 days ago
  • $240k - $260k

     ...technical background with deep machine learning expertise shaped by hands-on...  ...: framing, experimentation, evaluation, deployment, and iteration....  ...value the craft of software engineering and bring a thoughtful...  ...developing extensible code with LLM coding assistants Job Perks... 
    Senior
    Full time
    Temporary work
    Work at office
    Local area
    Flexible hours

    Vsco

    San Francisco, CA
    2 hours ago
  • Senior Staff Machine Learning Engineer, Post Training Remote - USA Airbnb was born in 2007 when two hosts welcomed three...  ..., ML services and tools including LLM fine‑tuning, alignment and optimization, RAG/Search, LLM evaluation and testing automation, feedback‑based... 
    Senior
    Work experience placement
    Remote work

    airbnb, Inc.

    San Francisco, CA
    5 days ago
  • Airbnb, Inc. is hiring a Senior Staff Machine Learning Engineer, focusing on driving evaluation strategies and data infrastructure for CSxAI initiatives. This role requires a PhD in a relevant field, extensive experience in ML/AI systems, and strong leadership in technical... 
    Senior
    Remote job

    airbnb, Inc.

    San Francisco, CA
    2 days ago
  • $192k - $264k

     ...the power of tech, data, and machine learning to connect this thriving...  ...to help retailers find and evaluate products on Faire. You will...  ...including product, design, engineering, analytics, and operations,...  ...listings. Use deep learning, LLM fine tuning, and human-in-the... 
    Senior
    Full time
    Work experience placement
    Work at office
    Local area
    Remote work
    Monday to Friday
    Flexible hours
    2 days per week

    Faire

    San Francisco, CA
    2 hours ago
  • Senior Staff Machine Learning Engineer, Data & Eval United States AI and ML are at the heart of the Airbnb product. From Trust...  ...models, ML services, and tools including LLM fine‑tuning and optimization, RAG/Search, LLM evaluation and testing automation, feedback‑based... 
    Senior
    Work experience placement
    Casual work
    Live in
    Work at office
    Remote work

    airbnb, Inc.

    San Francisco, CA
    5 days ago
  •  ...the Role We are looking for a visionary Senior ML Engineer who will bridge the gap between high-level architecture...  ...of ML, specifically training or fine-tuning LLM models, embeddings; building clustering models; utilizing evaluation frameworks to quantify performance ~... 
    Senior
    Shift work

    Palm Venture Studios

    San Francisco, CA
    23 days ago
  •  ...Senior AI/ML Engineer — LLM & Agent Stack Every production AI system, whether it's powering customer support, writing code, analyzing financial data, or diagnosing medical conditions, needs the same foundational infrastructure. A way to route between models. A way to... 
    Senior

    TrueFoundry

    San Francisco, CA
    1 day ago
  •  ..., located in San Francisco, is looking to enhance its Silicon Valley engineering team with skilled professionals in AI research and product development. The role involves building production-grade LLM pipelines and integrating AI features into data pipelines. The ideal... 
    Senior

    HopHR

    San Francisco, CA
    3 days ago
  • $206k - $308k

     ...Description The Enterprise Machine Learning team drives organizational value...  ...As a Machine Learning Engineer, you will serve as a technical...  ...Opportunity to develop and scale of LLM and deep learning solutions...  ...may be used to screen or evaluate applications for this... 
    Senior
    Full time
    Work at office
    Local area
    Remote work
    2 days per week

    Zendesk

    San Francisco, CA
    19 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Machine Learning Engineer - VLM/LLM Evaluation. Be the first to apply!