Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Machine Learning Engineer - VLM/LLM Evaluation

$238k - $302k

Waymo

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver-The World's Most Experienced Driver-to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo's fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.

The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards the goal of safely operating Waymo vehicles in dozens of cities and under all driving conditions. As part of our work, we also initiate and foster collaborations with other research teams in Alphabet. AI Foundations areas that we are currently focusing on include reinforcement learning, learning from demonstration, generative modeling, Bayesian inference, hierarchical learning, and robust evaluation.

This role follows a hybrid work schedule and you will report to a Senior Staff Software Engineer.

You will:

  • Work with a creative team of people who help to build the state-of-the-art Foundation Models that are used throughout Waymo's systems, both onboard autonomous vehicles and offboard in simulation
  • Lead the development of end-to-end evaluation systems and benchmarks for Waymo Foundation models, encompassing the entire lifecycle from pretraining and supervised fine-tuning (SFT) to reinforcement learning (RL), for evaluating the quality, safety, and realism of embodied AI agents
  • Partner within and across organizations to land disruptive and innovative tech in production
  • Implement and extend large large scale data and evaluation pipelines

You have:

  • Master's degree or PhD degree in Computer Science, similar technical field of study, or equivalent practical experience
  • 5+ years of experience in ML engineering and applied Deep Learning, with a strong portfolio of shipped products or publication record
  • Experience with large scale distributed system
  • Proficient programming skills (eg: Python, C/C++)
  • Strong analytical and debugging skills

We prefer:

  • ML infra experience: training, evaluating and deploying ML models at scale
  • Deep learning experience, especially with generative models, e.g., LLMs/VLMs, and/or reinforcement learning
  • Proficiency and in-depth knowledge of the inner workings of an ML framework (e.g. Pytorch, JAX, Tensorflow)

The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remote, the specific salary range for your preferred location, during the hiring process.

Waymo employees are also eligible to participate in Waymo's discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements.

Salary Range $238,000$302,000 USD
Required
Preferred
Job Industries
  • Other
Vacancy posted 7 hours ago
Similar jobs that could be interesting for youBased on the Staff Machine Learning Engineer - VLM/LLM Evaluation in Mountain View, CA vacancy
  • $204k - $259k

     ...builds the system which learns the spatial-...  ...sensors, enabling engineers like you to (1) develop...  ...for cutting-edge VLM foundation models....  ...Develop and rigorously evaluate metrics and...  ...years of experience in Machine Learning, with a focus...  ...model development (LLM, VLM, or similar... 
    Suggested
    Full time
    Remote work

    Waymo

    Mountain View, CA
    6 hours ago
  • $213k - $263k

     ...Waymo AI Foundations team is to develop machine learning solutions addressing open problems in...  ...inference, hierarchical learning, and robust evaluation. This role follows a hybrid work...  ...smaller real-time models. Explore LLM/VLM distillation recipes to maximally transfer... 
    Suggested
    Full time
    Remote work

    Waymo

    Mountain View, CA
    6 hours ago
  • $238k - $302k

     ...The Driver Understanding and Evaluation (DUE) team at Waymo is...  ...the Waymo Driver. The DUE Machine Learning team will build and operate...  ...for researchers and software engineers who are passionate about developing...  ...Design and build Gen AI LLM/VLM solutions for self driving car... 
    Suggested
    Full time

    Waymo

    Mountain View, CA
    7 hours ago
  • $60 - $70 per hour

     ...Overview: We are seeking a Machine Learning Engineer to join a high-impact team focused on advancing LLM evaluation, NLP, and AI-driven automation. This role centers on designing scalable evaluation frameworks, optimizing prompt strategies, and building systems that... 
    Suggested
    Contract work
    Temporary work
    Remote work
    3 days per week

    TEKsystems

    Cupertino, CA
    5 days ago
  • $200k - $300k

     ...connectors, flexible LLM choice, and robust APIs...  ...reliably better over time: evaluation pipelines, quality...  ..., and the tooling engineers use to understand what...  ...evaluation, reinforcement learning from human feedback,...  ...large systems involving machine learning. ~... 
    Suggested
    Home office
    Flexible hours
    3 days per week

    Glean.info

    Mountain View, CA
    3 days ago
  • $281k - $356k

     ...+ U.S. states. The DUE Machine Learning team will build and operate...  ...tools, improve and speed up the evaluation and onboard developer...  ...for researchers and software engineers who are passionate about developing...  ...models and Generative AI (LLM/VLM) solutions. These solutions... 
    Full time

    Waymo

    Mountain View, CA
    6 hours ago
  •  ...workflows, and continuously learn and adapt. Moveworks...  ...Moveworks’ Reasoning Engine and natural language...  ...We are looking for a Machine Learning Engineer to help...  ...for building and serving LLM’s at Moveworks. This...  ...language models(LLM), model evaluation and monitoring... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Servicenow

    Mountain View, CA
    23 hours ago
  • $120k - $235k

     ...innovative companies to build strong engineering teams ready for what's next....  ...role How developers were evaluated previously was whether they...  ...you will do Build LLM-powered evaluation pipelines...  ...bonus, and equity. Want to learn more about HackerRank? Check... 
    Shift work

    HackerRank

    Santa Clara, CA
    1 day ago
  •  ...Senior ML Engineer Medical Imaging Evaluation & AI Reliability About the Role: My client is building evaluation and evidence infrastructure...  ...Required Qualifications: Strong experience in machine learning for medical imaging (radiology, pathology, cardiology... 
    Shift work

    Established Search

    Sunnyvale, CA
    1 day ago
  • $213k - $263k

     ...across 15+ U.S. states. The DUE Machine Learning team will build and operate scalable machine...  ...tools, improve and speed up the evaluation and onboard developer journeys. It will...  ...looking for researchers and software engineers who are passionate about developing machine... 
    Full time

    Waymo

    Mountain View, CA
    7 hours ago
  • $204k - $259k

     ...states. The Driver Understanding and Evaluation (DUE) team at Waymo is developing rich...  ...of the Waymo Driver. The DUE Machine Learning team will build and operate scalable machine...  ...looking for researchers and software engineers who are passionate about developing... 
    Full time

    Waymo

    Mountain View, CA
    7 hours ago
  • $251k - $310k

     ...We are the software engineering team responsible...  ...future-looking deep-learning-based explorations....  ...Analyze, finetune, and evaluate model performance...  ...driving and machine learning, and be able...  ...Large Language Models (LLM) or Vision Language Models (VLM), prompt engineering... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    7 hours ago
  • $140k - $230k

     ...and advancing the state-of-the-art of machine learning (ML) for perception, prediction, and motion...  ...with other software and hardware engineers and researchers to tackle some of the...  ...processing, model training, ablation studies, evaluation, deployment, inference optimization.... 
    Full time
    Temporary work
    Flexible hours

    Woven By Toyota

    Palo Alto, CA
    23 hours ago
  •  ...join a centralized evaluation organization and define...  ..., and ML systems engineering. You will work closely...  ...as reward modeling, LLM-as-judge, preference learning, and calibration...  ...in Computer Science, Machine Learning, Artificial...  ...strong focus on LLM or VLM systems.\nDeep... 

    Apple

    Cupertino, CA
    23 hours ago
  •  ...save time, and enhance learning and creativity. Our...  ...seeking a Applied AI Engineer to facilitate the adoption...  ...on prompting, evaluation, and fine-tuning, and...  ...tuning, state-of-the-art LLM applications, and contributing...  ...algorithms underlying machine learning and LLMs •... 
    Work at office
    Visa sponsorship

    Mistral AI

    Palo Alto, CA
    2 days ago
  • $204k - $259k

     ...serving as the foundation for training and validating the AV stack. We are an advanced ML and engineering team that leverages state-of-the-art computer vision, deep learning, and generative AI to automatically analyze driving logs, generate rich scene understanding,... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    6 hours ago
  • $196k - $221k

     ...alongside industry-veteran scientists and engineers. As a Machine Learning Engineer, you'll bring your strong...  ...large-scale SID / ASR / NLP / LLM systems that power mission-critical product...  ...ML infrastructure, data pipelines, evaluation frameworks, and deployment workflows... 
    Permanent employment

    Otter.ai

    Mountain View, CA
    1 day ago
  • $120k - $215k

     ...Senior Machine Learning Engineer – Fine-Tuning and On-device AI Palo Alto, CA Who We Are HP...  ...generation and annotation. Define evaluation metrics. Perform evaluations and analyze...  ...learning, including at least 3 years in LLM fine-tuning. ~ Proficiency in Python... 
    Full time
    Temporary work
    Local area
    Flexible hours

    HP IQ

    Palo Alto, CA
    23 hours ago
  • $140k - $265k

     ...SaaS connectors, flexible LLM choice, and robust APIs...  ...Glean is looking for engineers to help build the world...  ...question-answering, evaluation, and experimentation. We...  ...more junior engineers, or learn from battle-tested ones...  ...systems involving machine learning ~ Strong analytical... 
    Work at office
    Home office
    Flexible hours
    3 days per week

    Glean.info

    Mountain View, CA
    2 days ago
  • $200k - $300k

     ...enterprise SaaS connectors, flexible LLM choice, and robust APIs, Glean gives organizations...  ...the Role: Glean is seeking a few Machine Learning engineers who want to focus on a combination of...  .... Lead development of scalable evaluation, benchmarking, and optimization loops.... 
    Work at office
    Home office
    Flexible hours

    Glean.info

    Mountain View, CA
    2 days ago
  • $147.4k - $272.1k

     ...Machine Learning Engineer - Agentic AI The VCV organization has pioneered human-centric, real-time...  ...handling. Develop infrastructure for evaluating and improving agentic system...  ...environments. Strong proficiency with LLM-assisted coding, including using AI tools... 
    Relocation

    Apple

    Sunnyvale, CA
    3 days ago
  • $194k - $214k

     ...highly customer-centric Senior ML Engineer who will join our cross-...  ...solutions, including DL- and LLM-based approaches, that reliably...  .... Experience with deep learning in a production setting, understanding...  ...and salary opportunities are evaluated through our interview process... 

    Instrumental Inc

    Palo Alto, CA
    3 days ago
  • $150k

     ...data scientists, and engineers, tackling the most fundamental...  ...computing in deep learning, driving impactful...  ...The Role As a Machine Learning Engineer at the...  ...training, post-training, evaluation and so on, especially...  ...Hands-on experience with LLM algorithms, such as... 
    Worldwide
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    23 hours ago
  •  ...Role Overview: As a Machine Learning Engineer, you will play a central role in translating cutting...  ...training, debugging, and performance evaluation. Fine-tune large language models (LLMs...  ...optimization, deep understanding of LLM serving optimizations for LLMs/VLMs.... 

    Nace AI

    Palo Alto, CA
    1 day ago
  • $181.1k - $272.1k

     ...Machine Learning Engineer - LLM Imagine what you could do here. At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish... 
    Relocation

    Apple

    Cupertino, CA
    23 hours ago
  •  ...Evaluation Engineer Evaluation is the bottleneck in healthcare AI — you can't ship what you can't measure. You'll build the systems that tell...  ..., synthetic data pipelines, automated benchmarks, and LLM-as-judge systems. This is a high-leverage engineering role where... 

    Hippocratic AI

    Palo Alto, CA
    23 hours ago
  • $200k - $275k

     ...What You Will Do We are looking for a Machine Learning Engineer to help build cutting edge ML infrastructure for building and serving LLM’s  at Moveworks. This role will be...  ...for large language models(LLM), model evaluation and monitoring framework, LLM latency optimization... 
    Full time

    Moveworks

    Mountain View, CA
    more than 2 months ago
  • $229k - $343k

     ...Staff Machine Learning Engineer Snap Inc is a technology company. We believe the camera presents the greatest...  ...improvement Design robust offline evaluation, online experimentation, and model...  ...systems, ads ranking, generative AI, LLM-based ranking, and retrieval-... 
    Work experience placement
    Live in
    Work at office
    Local area

    Snap

    Palo Alto, CA
    5 days ago
  •  ...Machine Learning Infrastructure Engineer At Mind Robotics, we're building generalized physical AI—robotic systems capable of dexterous, adaptive, and...  ...-scale ML training Deep understanding of how modern LLM/VLM systems are trained and scaled Proven experience... 

    Mind Robotics

    Palo Alto, CA
    2 days ago
  • $214k - $289.5k

     ...Overview Come join Intuit as a Senior Staff Machine Learning Engineer (MLE). Senior Staff MLEs deliver...  ..., and continuous improvement. Evaluate and integrate transformative technologies...  ...-augmented generation (RAG), and LLM fine-tuning pipelines to accelerate product... 
    Local area

    Intuit

    Mountain View, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Machine Learning Engineer - VLM/LLM Evaluation. Be the first to apply!