Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Machine Learning Engineer - VLM/LLM Evaluation

$238k - $302k

Waymo

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver-The World's Most Experienced Driver-to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo's fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.

The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards the goal of safely operating Waymo vehicles in dozens of cities and under all driving conditions. As part of our work, we also initiate and foster collaborations with other research teams in Alphabet. AI Foundations areas that we are currently focusing on include reinforcement learning, learning from demonstration, generative modeling, Bayesian inference, hierarchical learning, and robust evaluation.

This role follows a hybrid work schedule and you will report to a Senior Staff Software Engineer.

You will:

  • Work with a creative team of people who help to build the state-of-the-art Foundation Models that are used throughout Waymo's systems, both onboard autonomous vehicles and offboard in simulation
  • Lead the development of end-to-end evaluation systems and benchmarks for Waymo Foundation models, encompassing the entire lifecycle from pretraining and supervised fine-tuning (SFT) to reinforcement learning (RL), for evaluating the quality, safety, and realism of embodied AI agents
  • Partner within and across organizations to land disruptive and innovative tech in production
  • Implement and extend large large scale data and evaluation pipelines

You have:

  • Master's degree or PhD degree in Computer Science, similar technical field of study, or equivalent practical experience
  • 5+ years of experience in ML engineering and applied Deep Learning, with a strong portfolio of shipped products or publication record
  • Experience with large scale distributed system
  • Proficient programming skills (eg: Python, C/C++)
  • Strong analytical and debugging skills

We prefer:

  • ML infra experience: training, evaluating and deploying ML models at scale
  • Deep learning experience, especially with generative models, e.g., LLMs/VLMs, and/or reinforcement learning
  • Proficiency and in-depth knowledge of the inner workings of an ML framework (e.g. Pytorch, JAX, Tensorflow)

The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remote, the specific salary range for your preferred location, during the hiring process.

Waymo employees are also eligible to participate in Waymo's discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements.

Salary Range $238,000$302,000 USD
Required
Preferred
Job Industries
  • Other
Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Staff Machine Learning Engineer - VLM/LLM Evaluation in New York, NY vacancy
  • $238k - $302k

     ...Staff Machine Learning Engineer – VLM/LLM Evaluation Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver... 
    Suggested
    Full time
    Remote work

    Waymo

    New York, NY
    5 days ago
  • $240.45k - $300.3k

     ...Senior Machine Learning Engineer - Model Evaluations, Public Sector The Public Sector ML team at Scale deploys advanced AI systems-including LLMs, agentic...  ...performance, robustness, and safety metrics, including LLM-judge-based evaluations. Design test datasets and... 
    Suggested
    Full time

    Scale AI

    New York, NY
    1 day ago
  •  ...your unique tastes, our Taste AI engine sifts through the noise to find the...  ...applications. Role Overview As a Machine Learning Engineer reporting to the LLM Research Lead, you will operate at...  ...explainability Experiment with and evaluate modern ML approaches (transformers... 
    Suggested
    Remote work
    Flexible hours

    Medium

    New York, NY
    1 day ago
  • $176.17k - $251.67k

     ...Machine Learning Engineering Manager - LLM Serving (Remote - US) We are currently looking for a Machine Learning Engineering Manager - LLM Serving & Infrastructure...  ...for multiple LLM models, supporting batch, offline evaluation, and real‑time inference. Oversee the development and... 
    Suggested
    Remote work
    Flexible hours

    Jobgether

    New York, NY
    3 days ago
  •  ...Machine Learning / AI Engineering Internship (LLM Focus) During this internship, you will play a pivotal role in accelerating a fast-moving project encompassing...  ...focuses on continuing the development of an LLM+VLM tool that supports both multi modal content and external... 
    Suggested
    Internship
    Local area

    A One Consulting

    New York, NY
    3 days ago
  • $30 - $50 per hour

     ...A tech company specializing in AI research is looking for a STEM Research Engineer to enhance applied AI/ML workflows including LLM training and dataset development. This remote, full-time position requires mid-senior experience in STEM research and strong Python skills... 
    Hourly pay
    Full time
    Remote work

    Rex USA

    New York, NY
    3 days ago
  • $30 - $50 per hour

     ...technology company in the United States is seeking remote, full-time engineers with STEM backgrounds. Candidates will work on AI/ML systems...  ...computer vision tasks. Strong Python skills and hands-on machine learning experience are essential. Competitive hourly compensation... 
    Hourly pay
    Full time
    Remote work

    Rex USA

    New York, NY
    3 days ago
  •  ...urgently looking to hire ML Engineer – Newark, NJ (100% Remote) ....  ...hands-on experience with RAG, LLM fine-tuning, and agentic...  ...Design, build, and deploy robust machine learning models and Gen AI...  ...through to model training, evaluation, and MLOps integration. Conduct... 
    Contract work
    Immediate start
    Remote work

    Iris Software

    New York, NY
    3 days ago
  •  ...ll own the full lifecycle of machine learning at ServiceUp, from...  ...closely with product, design, and engineering to identify where ML can create...  ...Define success metrics and evaluation frameworks for each model. Productionize...  ...experience. Practical LLM experience for explanation... 
    Remote work
    Home office
    Flexible hours

    ServiceUp Inc.

    New York, NY
    3 days ago
  • $130k - $165k

     ...Overview Machine Learning Engineer, AI Platform As a Machine Learning Engineer, you will design...  ...problem discovery, through prototyping, evaluation, hardening, and production deployment....  ...Python. Experience with LLM APIs, agentic frameworks (LangChain, Strands... 
    Permanent employment
    Full time
    Work at office
    Remote work

    HealthEdge

    New York, NY
    5 days ago
  •  ...A leading technology firm in the United Kingdom is seeking a machine learning engineer to design evaluation suites and assess AI-generated solutions. The role requires strong experience in machine learning engineering and the ability to work independently. Ideal candidates... 

    Crossing Hurdles

    New York, NY
    3 days ago
  • $150k - $180k

     ...Overview We are looking for a Senior Machine Learning Engineer to drive the design and development of...  ...guidance, and design collaboration. Design, evaluate, and implement GenAI systems and...  ...prompt engineering, RAG systems, and LLM evaluation. Experience with LLM orchestration... 
    Flexible hours

    Perfect Path

    New York, NY
    3 days ago
  • $184.15k

     ...Overview Explore top remote machine learning engineer jobs and find flexible roles such as llm engineer, nlp engineer, computer vision engineer, ai engineer, ai research...  ...-world impact in adversarial testing and model evaluation. Senior Machine Learning Engineer developing AI... 
    Remote work
    Flexible hours

    Kickstart Remote

    New York, NY
    3 days ago
  •  ...A leading AI technology company is seeking a Senior Machine Learning Engineer to enhance their speech recognition and NLP systems. This role is pivotal in developing evaluation frameworks and improving model accuracy and performance. The ideal candidate will have extensive... 
    Remote work

    Cresta

    New York, NY
    3 days ago
  • $180k - $280k

     ...we're building the AI to fix it. As a Machine Learning Engineer at Finch, you'll own the full lifecycle...  ...agents, browser agents, OCR pipelines, and LLM-powered workflows that work reliably in production. Design rigorous evaluation frameworks and feedback loops to... 
    Work at office
    Remote work
    Flexible hours
    1 day per week

    Finch Legal Inc

    New York, NY
    2 days ago
  •  ...Machine Learning Engineer (AI Data Trainer) About the Role What if your machine learning expertise could directly...  ...experience with data annotation, data labeling, or AI evaluation workflows Familiarity with LLM behavior, prompt engineering, or AI training... 
    Hourly pay
    Ongoing contract
    Contract work
    Freelance
    Remote work
    Flexible hours

    Alignerr

    New York, NY
    3 days ago
  • $184.05k - $262.93k

     ...Design and ship production-grade machine learning systems powering conversational and...  ...step tool orchestration Create evaluation frameworks, including LLM-as-judge pipelines, to measure quality...  ...Partner closely with product, engineering, and design to deliver seamless, user... 
    Work from home
    Flexible hours

    Spotify

    New York, NY
    2 days ago
  • $165k - $260k

     ...Senior Machine Learning Engineer - Search AI, BLAW/BTAX/BGOV Location New York Business Area...  ...indexing, retrieval, re-ranking, and LLM-based answer generation Personalize...  ...trustworthy content Design and implement evaluation frameworks that leverage implicit and... 
    Temporary work
    For contractors
    Work experience placement

    Bloomberg

    New York, NY
    2 days ago
  • $184.05k - $262.93k

     ...is a team of about a hundred AI/ML Engineers, Applied Research Scientists,...  ...and monitoring Design and build evaluation tooling (including LLM-as-judge frameworks and dataset analysis...  ...experience building and shipping machine learning models end-to-end You have a... 
    Work from home
    Flexible hours

    Spotify

    New York, NY
    3 days ago
  • $205k - $270k

     ..., and it's at Cresta. About the role: Machine Learning Engineers at Cresta work across several high-impact...  ...time. This track requires strong pre-LLM ML foundations, deep expertise in LLMs...  .... Agent & System Quality: Design evaluation frameworks and improve the reliability... 
    Contract work
    For contractors
    For subcontractor
    Work at office
    Remote work
    Home office
    Flexible hours

    Cresta

    New York, NY
    3 days ago
  • $180k - $220k

     ...Intelligence WEBSITE : TITLE: Sr. Machine Learning Engineer LOCATION: New York City or London...  ...and trust. You’ll work hands-on with LLM and other ML Models, helping scale...  ...What You’ll Do: Design, train, and evaluate ML models for document classification,... 
    Local area
    Remote work
    Work from home
    Home office
    Flexible hours

    Canoe Intelligence

    New York, NY
    2 days ago
  • $118k - $176k

     ...Visits, March 2025) Day to Day The Machine Learning Engineer I role partners closely with business...  .... Work spans classical ML through LLM systems. You improve search and retrieval...  ...Excellent understanding of model evaluation techniques, feature engineering, experiment... 
    Work experience placement
    Local area

    Indeed

    New York, NY
    2 days ago
  • $184.05k - $262.93k

     ...Machine Learning Engineer The Rewards team in Personalization (PZN) is defining the next generation...  ...formation metrics—that directly shape the LLM-powered recommendation experience....  ...Contribute to designing, scaling/building, evaluating, integrating, shipping, and refining... 
    Work from home
    Flexible hours

    Spotify

    New York, NY
    1 day ago
  • $120k - $135k

     ...: A Place for Mom is seeking a Senior Machine Learning Engineer to design, build, and scale production...  ...will focus on developing advanced ML and LLM-powered applications that leverage...  ...systems, including prompt libraries, evaluation frameworks, and architectural decisions... 
    Work experience placement
    Work at office
    Remote work

    A Place for Mom

    New York, NY
    1 day ago
  • $175k - $200k

     ..., join us. About the role As a Senior Machine Learning Engineer for Accompany Health, you will help us...  ...effectively through model development and evaluation. Design and implement scalable Machine...  ...Developing and implementing modern LLM models and transformers and deploying... 
    Full time
    Local area
    Remote work
    Night shift

    Accompany Health

    New York, NY
    3 days ago
  • $184.05k - $262.93k

     ...innovation and cutting-edge machine learning to bring entirely new experiences...  ...'ll Do Design, build, evaluate, and ship agentic based...  ...science, product management, and engineering to build new product...  ...Experience with production LLM scale based systems is a plus... 
    Flexible hours

    Spotify

    New York, NY
    2 days ago
  • $144k - $192k

     ...framework, powers this discovery. As a Machine Learning Engineer on the Data Mining team, your mission...  .../JAX; comfortable training models and evaluating them with standard metrics. Strong proficiency...  ..., chain-of-thought models, or LLM-based planning. Background in autonomous... 
    Remote work

    Motional AD Inc.

    New York, NY
    3 days ago
  • $266k - $372.4k

     ...information, visit We’re looking for a Senior Staff Machine Learning Engineer to lead Reddit’s next-generation user...  ...(embeddings, tags, attributes, LLM-based user profile), how they are...  ...models: you consider data, training, evaluation, serving, and adoption as a cohesive... 
    For contractors
    Work experience placement
    Immediate start
    Remote work
    Flexible hours
    Shift work

    Reddit

    New York, NY
    3 days ago
  • $300k - $350k

     ...Senior Staff/Technical Lead Machine Learning Engineer Senior Staff/Technical Lead Machine Learning Engineer This range is provided by Harnham. Your actual...  ...product strategies. Lead experimentation and model evaluation in a fast-paced, data-rich environment. Contribute to... 
    Full time
    Internship
    Remote work

    Harnham

    New York, NY
    3 days ago
  • $230k - $322k

     ...partner teams. We are looking for a Staff Machine Learning Engineer who will lead the Commercial Content...  ...modeling and systems craft. Develop evaluation systems and quality monitoring systems...  ...reliable serving at Reddit scale. Drive LLM and modern ML best practices within... 
    Remote work

    Reddit

    New York, NY
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Machine Learning Engineer - VLM/LLM Evaluation. Be the first to apply!