Research Scientist, RL for Autonomous Planning & World Modeling
$204k - $259kWaymo
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver-The World's Most Experienced Driver-to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo's fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.
The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards the goal of safely operating Waymo vehicles in dozens of cities and under all driving conditions. As part of our work, we also initiate and foster collaborations with other research teams in Alphabet. AI Foundations areas that we are currently focusing on include reinforcement learning, learning from demonstration, generative modeling, Bayesian inference, hierarchical learning, and robust evaluation.
In this hybrid role, you will report to a Principal Scientist.
You will:
- Participate in Waymo's Foundation World Model post-training and evaluation
- Research and develop cutting edge RL and Distillation techniques for Autonomous Vehicle Trajectory Planning
- Integrate emerging research from the broader AI community into Waymo's internal RL infrastructure, conducting rigorous ablations to identify and scale the most promising methods
- Partner with engineering and research teams across Waymo to share recipes, techniques, and post-training best practices to accelerate our collective know-how
You have:
- PhD or Masters in Computer Science, Machine Learning, Robotics, or a similar technical field; with 3+ years of industry or post-doc research experience in Reinforcement Learning or Foundation Models
- Demonstration of original contributions to the field through high-impact publications (ArXiv, peer-reviewed conferences like NeurIPS/ICLR/CVPR), technical blog posts, or significant open-source contributions
- Proficiency in implementing model training flows in a scalable, distributed and performant manner such as Data parallel, FSDP and other sharding approaches
- A willingness to work with complexity of globally distributed inference infrastructure
We prefer:
- PhD in Computer Science, Machine Learning, or Robotics, with a research focus on Reinforcement Learning, Foundation Models, or Multi-Modal learning
- Extensive experience designing and deploying Reinforcement Learning infrastructure, specifically for on-policy learning or alignment with human preferences
- A consistent history of original contributions to the AI community, evidenced by first-author publications at top-tier venues (e.g., NeurIPS, ICLR, ICRA) or maintaining significant open-source ML projects
- Experience with large scale (many-machine) training infrastructure and techniques for inference with large models such as model sharding/tensor-parallel
In accordance with Washington state law, we are highlighting our comprehensive benefits package, which is available to all eligible US based employees. Benefits for this role include:
- Health, dental, vision, life, disability insurance
- Retirement Benefits: 401(k) with company match
- Paid Time Off: 20 days of vacation per year, accruing at a rate of 6.15 hours per pay period for the first five years of employment
- Sick Time: 40 hours/year (statutory, where applicable); 5 days/event (discretionary)
- Maternity Leave (Short-Term Disability + Baby Bonding): 28-30 weeks
- Baby Bonding Leave: 18 weeks
- Holidays: 13 paid days per year
Please note that Waymo may not be able to employ remotely in all locations. Please speak with your recruiter about your preferred location for remote work when you begin the interview process
The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remote, the specific salary range for your preferred location, during the hiring process.
Waymo employees are also eligible to participate in Waymo's discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements.
Salary Range $204,000$259,000 USDRequired
Preferred
Job Industries
- Other
$213k - $263k
...Waymo is an autonomous driving technology company... ...mission to be the world's most trusted... ...collaborations with other research teams in Alphabet.... ..., generative modeling, Bayesian... ...a Staff Research Scientist / Tech Lead Manager... ...equity incentive plan, and generous Company...SuggestedFull timeTemporary workRemote work$302.4k - $378k
...applications, and autonomous vehicles. With our... ...building upon our prior model evaluation work... ..., part of Scale's Research organization,... ...environments and RL reward signals, benchmarking... ...across real-world scenarios and... ...in LLM agents and planning algorithms but also...SuggestedFull time- ...and exploit probabilistic generative models that exploit to Achira's foundation... ...sampling. Why Achira Join a world-class team of researchers, scientists, and engineers unifying... ...learning / Bayesian optimization or RL-style acquisition for proposal selection...Suggested
- ...in New York seeks innovative individuals eager to shape the future of robotics. The role involves architecting large-scale video/world models and transforming generative models into controllable simulators. You will play a crucial role in ensuring robots achieve near-...Suggested
- ...generative, representational, and simulation models for molecules and materials - building... ...models that make the atomistic world learnable, predictable, and designable.... ...world-class, interdisciplinary team of ML researchers, physicists, chemists, and engineers reimagining...Suggested
$50 per hour
...quality human data in the world. Over 35,000 AI developers, researchers, and organizations use... ...for people with Research Scientist experience to work as... ...evaluate cutting‐edge AI models from some of the world's... ...and global organisation planning. Prolific's Candidate Privacy...Self employmentWork from homeFlexible hours- ...that is accelerating scientific research & development. We are at the... ...development, enhancing reasoning and planning throughout the development... ...data acquisition, model evaluation, pre- and post-training... ...problems and a commitment to doing world-class research. Domain knowledge...
- ...company that is accelerating scientific research & development. We are at the forefront of... ...AI Researcher specializing in generative models for materials discovery at Radical AI, you... ...related problems and a commitment to doing world-class research. Proficiency in Python...
- ...We build frontier foundation models that power intelligent experiences... ...to hard problems where the research and the product are... ...try to apply learning to real-world use cases, and this is also where... ...of world-class engineers and scientists to tackle some of the most challenging...
- ...Protege is looking for a Machine Learning Researcher to improve AI systems by focusing on data... ...and environments that enhance AI model performance, working closely with engineering... ...in machine learning and an understanding of RL and agentic systems. #J-18808-Ljbffr...
$216k - $270k
...Scale Labs, Research Scientist - Safety Post Training As the leading data... ...capabilities and safeguarding AI models and systems. Building on this... ...with post-training and RL techniques such as RLHF, DPO,... ...reliable AI systems for the world's most important decisions. Our...Full time$66k - $94k
...Precision Medicine Group is looking for an Associate Research Scientist to join their Real World Evidence team. This fully remote position involves conducting real-world studies in healthcare, making critical recommendations based on data analyses. The ideal candidate...Remote work$200k - $240k
...SentiLink is backed by world-class investors... ...As a Senior Applied ML Scientist at SentiLink, you will... ...build our core products: models that identify fraudsters... ...risk. As an experienced researcher you will be relied... ...end” DS development: Planning, fleshing out success...Work experience placementLive inWork at officeRemote workHome officeFlexible hours$90k - $120k
...Research Scientist – Complex Systems & Network Modeling (5720) Location: Remote (CST/EST only) Job Code: 5720 # of Openings: 1 Bennett Aerospace Inc, a subsidiary of Three Saints Bay, LLC and a Federal Government Contractor industry leader, is hiring a remote Research...For contractorsLocal areaRemote work$120k - $145k
...Computational Materials Scientist to join us in our... ...computational tools to industrial research and development. Who... ...experience with real‑world molecular and materials... ...implement molecular modeling solutions for various industries... ...Management team also plans a myriad of fun company...Work at officeFlexible hours$141.1k - $262.1k
...generations to come. Creating a world where we all have more time... ...and development. Roche's Research and Early Development organisations... ...data and novel computational models to drive impact. Seamless... ...Intelligence (AI) to assist our scientists in both pRED and gRED to...Work experience placementLocal areaWorldwideRelocation package$128k - $185k
...-end reserving and predictive modeling capabilities. This role will maintain... ...involves a lot of thinking, research, and creativity.... ...exercise is designed to reflect real-world challenges relevant to the role... .... Perks/Benefits: Health Care Plan (Medical, Dental & Vision) Retirement...Work visa$228.7k - $309.4k
...leading a team of applied scientists working across GenAI and predictive modeling. You will shape how... ...participate in organizational planning, hiring, mentorship,... ...through hands-on research and development in areas... ...scale models for real-world talent applications •...Local areaFlexible hours$197.27k - $267.04k
...systems the physical world runs on: factories, power... ...of machine learning research, real world data, and... ...As Principal Applied Scientist, you lead the science... ...experimentation, and into models that run in production... ...time series, control, planning, or evaluation...Local area$220k - $325k
...defense applications, and autonomous vehicles. With our recent Series... ...Building on our history of model evaluation with enterprise... ...at the forefront of AI research and real-world implementation, with a strong... ...understanding of LLMs, planning algorithms, and novel approaches...Full timeShift work$150k - $250k
...company partnering with the world's most ambitious institutions... ...social organizations. We research and deploy technologies that... ...knowledge graphs, mixed-initiative planning, etc. Excited about making... ...~ Access to stateoftheart models, generous usage of modern AI...3 days per week$168.8k - $241.2k
...the brave - product designers, researchers, content designers,... ...define UX metrics and telemetry plans, then synthesize data into narratives... ...automating workflows. Role model this practice for others.... ...across the physical and digital worlds. These solutions provide customers...Full timeTemporary workLocal areaFlexible hours$83.6k - $115k
...Collaborate with Economics and Strategy Research teams, Chief Risk Officer,... ...Financial Officer, Financial Planning teams, and other corporate... ...with and guide quantitative model development teams for the development... ...businesses and many of the world's most prominent corporate,...Work experience placementWork at officeVisa sponsorship$73.5k - $89k
...nimble and strategic engagement models, providing advertising and... ...marketing expertise to provide world-class Google marketing... ...innovation. Napkyn's team of data scientists, marketing experts and systems... ...analytics tools, actively planning and launching campaigns across...Full timeHome officeFlexible hours- ...on sustainability, land‑use planning, and environmental decision‑making... ...role where your real‑world knowledge directly influences... ...Identify gaps between theoretical models and real‑world environmental... ...cutting‑edge AI projects with top research labs — and make a real...Hourly payOngoing contractContract workFreelanceRemote workFlexible hours
- Achira is seeking a Generative AI Researcher to create cutting-edge foundation simulation models that unify deep learning with atomistic physics. You will design models that ensure effective understanding, generation, and simulation of molecular systems. The ideal candidate...
$220k - $325k
...AI, defense applications, and autonomous vehicles. With our recent... .... Building on our history of model evaluation with enterprise and... ...skilled at rapidly interpreting research literature and turning new... ...reliable AI systems for the world's most important decisions. Our...Full time$60 - $80 per hour
...building the largest foundation model in oncology and pairing it with proprietary... ...center. We're hiring an Applied Scientist to lead the pretraining and post... ...with oncologists and clinical researchers to ensure the model reflects real world medical reasoning. Your work will...Hourly payInternshipWork at office3 days per week$76.3k - $103.1k
...iconic story. TOMMY HILFIGER is one of the world's most recognized global lifestyle brands... ...strategies Support MMM (Marketing Mix Modeling) & MTA (Multi-Touch Attribution) data... ..., insight gathering, and measurement plans with various stakeholders Develop and...Local areaFlexible hours- ...measurable goals, deliver robust models, and operate them reliably in... ...time in learning, researching, and experimenting with new innovations... ..., multi-agent orchestration, planning, grounding, and safety... ...prompts/tools ~ Experience with RL/bandits, preference...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Research Scientist, RL for Autonomous Planning & World Modeling. Be the first to apply!
- principal applied scientist New York, NY
- image scientist New York, NY
- scientist New York, NY
- research scientist machine learning deep learning New York, NY
- deep learning scientist New York, NY
- senior principal scientist New York, NY
- machine learning scientist New York, NY
- bioanalytical scientist New York, NY
- scientist immunology New York, NY
- safety scientist New York, NY


