Staff Machine Learning Engineer - VLM/LLM Evaluation
$238k - $302k
Waymo
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver, The World's Most Experienced Driver, to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo's fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.
The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards the goal of safely operating Waymo vehicles in dozens of cities and under all driving conditions. As part of our work, we also initiate and foster collaborations with other research teams in Alphabet. AI Foundations areas that we are currently focusing on include reinforcement learning, learning from demonstration, generative modeling, Bayesian inference, hierarchical learning, and robust evaluation.
This role follows a hybrid work schedule and you will report to a Senior Staff Software Engineer.
You will:
- Work with a creative team of people who help to build the state-of-the-art Foundation Models that are used throughout Waymo's systems, both onboard autonomous vehicles and offboard in simulation
- Lead the development of end-to-end evaluation systems and benchmarks for Waymo Foundation models, encompassing the entire lifecycle from pretraining and supervised fine-tuning (SFT) to reinforcement learning (RL), for evaluating the quality, safety, and realism of embodied AI agents
- Partner within and across organizations to land disruptive and innovative tech in production
- Implement and extend large-scale data and evaluation pipelines
You have:
- Master's degree or PhD degree in Computer Science, similar technical field of study, or equivalent practical experience
- 5+ years of experience in ML engineering and applied Deep Learning, with a strong portfolio of shipped products or publication record
- Experience with large-scale distributed systems
- Proficient programming skills (e.g., Python, C/C++)
- Strong analytical and debugging skills
We prefer:
- ML infra experience: training, evaluating and deploying ML models at scale
- Deep learning experience, especially with generative models, e.g., LLMs/VLMs, and/or reinforcement learning
- Proficiency and in-depth knowledge of the inner workings of an ML framework (e.g., PyTorch, JAX, TensorFlow)
The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. During the hiring process, your recruiter can share more about the specific salary range for the role location or, if the role can be performed remotely, for your preferred location.
Waymo employees are also eligible to participate in Waymo's discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements.
Salary Range: $238,000 - $302,000 USD


