Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Lead Engineer, Reinforcement Learning & Scenario Generation

$190k - $230k

Serve Robotics

At Serve Robotics, we're reimagining how things move in cities. Our personable sidewalk robot is our vision for the future. It's designed to take deliveries away from congested streets, make deliveries available to more people, and benefit local businesses.


The Serve fleet has been delighting merchants, customers, and pedestrians along the way in Los Angeles, Miami, Dallas, Atlanta and Chicago while doing commercial deliveries. We're looking for talented individuals who will grow robotic deliveries from surprising novelty to efficient ubiquity.

Who We Are

We are tech industry veterans in software, hardware, and design who are pooling our skills to build the future we want to live in. We are solving real-world problems leveraging robotics, machine learning and computer vision, among other disciplines, with a mindful eye towards the end-to-end user experience. Our team is agile, diverse, and driven. We believe that the best way to solve complicated dynamic problems is collaboratively and respectfully.

The Lead Engineer, RL Scaling & Procedural Scenario Generation is responsible for building scalable training pipelines and generating high-fidelity synthetic scenarios. This role designs procedural simulation environments, creates diverse long-tail edge cases, and optimizes RL systems to train robust foundational models. This role sits at the intersection of simulation , machine learning , distributed systems , and content generation and has a high impact on how quickly and safely agents learn in simulation.

Responsibilities
  • Develop RL algorithms that can help with terrain intelligence and social navigation behaviors.
  • Design, build, and optimize large-scale RL training pipelines (distributed compute, GPU clusters, containerized workflows).
  • Implement curriculum learning, domain randomization, and multi-agent RL strategies.
  • Optimize RL model performance, sample efficiency, and stability across thousands to millions of simulation steps.
  • Build automated tools for experiment orchestration, rollout collection, and metrics visualization.
  • Develop procedural generation pipelines for synthetic environments, agents, and dynamic behaviors.
  • Build tools to generate long-tail scenarios, sudden appearance of objects, traffic behaviors, rare events, and environmental variations.
  • Create systems for configuration, validation, and scoring of generated scenarios.
  • Collaborate with autonomy, ML, and safety teams to map real-world failures into repeatable synthetic simulation cases.
  • Design APIs to connect RL agents, scenario generators, planners, and environment simulators.
  • Debug and optimize simulation performance (real-time speed, determinism, reproducibility).
  • Work with 3D assets, traffic models, mapping systems (e.g., Isaac Sim, CARLA, Unity, Gazebo).
  • Partner with autonomy, data, and modeling teams to define training objectives and scenario requirements.
  • Translate real-world logs and edge cases into parameterized procedural content.
  • Document tools, frameworks, and workflows for internal users.
Qualifications
  • Master's degree in Robotics, AI, Computer Science, Mathematics, or a related field.
  • 7+ years of professional experience with shipping transformer based AI models handling complex navigation or manipulation tasks in AV or robotics solutions at scale in the real world.
  • 3+ years technical leadership/architecture experience
  • Strong experience with Reinforcement Learning (PPO, SAC, A3C, DQN, multi-agent RL, or equivalents).
  • Hands-on experience with distributed training frameworks (Ray RLlib, Accelerate, PyTorch Distributed, Kubernetes, or similar).
  • Proficiency in Python and C++ for performance-critical simulation or graphics pipelines.
  • Experience building or modifying simulation environments (Isaac Sim, Unity, Unreal, CARLA, Gazebo, MuJoCo or custom engines).
  • Experience with procedural generation (noise functions, rule-based systems, agent scripts, behavior trees).
  • Experience with GPU compute, containers, and cloud infrastructure.
What Make You Stand Out
  • Background in generative AI (diffusion, LLMs) for scenario synthesis or environment creation.
  • Experience with traffic simulation (SUMO) or sensor simulation (LiDAR, camera pipelines).
  • Knowledge of CUDA, graphics engines, physics modeling, or rendering.

* Please note: The base salary range listed in this job description reflects compensation for candidates based in the San Francisco Bay Area. We are also open to qualified talent working remotely across the:


United States - Base salary range (U.S. - all locations): $190k - $230k USD

Canada - Base salary range (Canada - all locations): $160k - $190k CAD
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Lead Engineer, Reinforcement Learning & Scenario Generation in United States vacancy
  • $191.52k - $212.8k

     ...LE POSTE VOTRE PROFIL Lead Engineer, Machine Learning Publiée le 11.05.2026 Sephora...  ...our customers - building the next generation of intelligent, AI-powered experiences...  ...learning, recommendation systems, reinforcement learning, and model evaluation methodologies... 
    Suggested
    Permanent employment
    Full time

    LVMH

    San Francisco, CA
    3 days ago
  • $130k - $250k

     ...Conversion Company, Launch Potato is a leading connector of advertisers to...  ...YOUR ROLE Lead a team of ML engineers building next-generation personalization and optimization systems...  ...optimization Knowledge of reinforcement learning for personalization Compensation... 
    Suggested
    Remote job
    Full time

    Launch Potato

    Remote
    16 hours ago
  •  ...detail-oriented team members, Outsmart is already seeing significant traction, generating 1-5 million weekly organic views on social media. About the role The Faculty Lead & Learning Engineer - Sciences is Outsmart's designated faculty member for undergraduate... 
    Suggested
    Full time
    Work at office
    Remote work

    Outsmart

    Lehi, UT
    1 day ago
  •  ...Lead Software Engineer - Machine Learning Location: Hybrid Model (5 days in month), Boston MA, Durham NC, New Jersey Duration: Long term The...  ...Value You Deliver Designing & developing a feature generation & store framework that promotes sharing of data/... 
    Suggested
    Shift work

    Samprasoft

    Durham, NC
    3 days ago
  • $135k - $300k

     ...Machine Learning Engineer Ema is building the world's leading Agentic AI platform to transform enterprise productivity...  ..., reasoning, dialog and code-generation systems. # Implement advanced...  ...as Transformer-based models, reinforcement learning, ensemble learning, and... 
    Suggested

    EMA

    San Francisco, CA
    1 day ago
  • $151k - $297k

     ...most critical challenges: leveraging Generative AI to transform rigid, legacy applications...  ...looking for an experienced Senior Lead Software Engineer with a strong background in database...  ...attitude, and a drive to continue learning Excellent verbal and written communication... 
    Local area

    I did my part and supported the Regular Toilet

    California, MO
    3 days ago
  • $200k - $240k

     ...Lead Machine Learning Engineer Interos.ai is defining the category of supply chain risk intelligence, building the world's most trusted and transparent...  ...on experience with vector databases, retrieval-augmented generation (RAG), and fine-tuning LLMs. Experience with fine-... 
    Remote work
    Flexible hours

    Interos Inc

    United States
    1 day ago
  •  ...Lead Machine Learning Engineer Janea Systems (USA) is a dynamic team of the best & brightest software engineering specialists and solutions innovators...  ...will also have: Experience working with LLMs, generative AI systems, or deep learning architectures. Experience... 
    Full time
    Work at office
    Remote work
    Work from home
    Flexible hours

    Janea Systems

    United States
    1 day ago
  • $220k - $270k

     ...Lead Machine Learning Engineer Remote, USA May Mobility is transforming cities through autonomous technology to create a safer, greener, more...  ...Direct experience architecting & training VLA, MMLM, or Generative World Models for commercial-scale applications ~... 
    Work at office
    Immediate start
    Remote work
    Flexible hours

    May Mobility

    United States
    16 hours ago
  •  ...Senior/Lead Machine Learning Engineer Innovate at the frontier of AI as a Senior Machine Learning Engineer at Exadel. Design intelligent systems...  ...build, insure, and replace vehicles to power the next generation of transportation. Its platform delivers advanced mobile,... 
    Contract work
    Work at office
    Local area
    Remote work

    Exadel

    United States
    1 day ago
  • $171.6k - $230.1k

     ...Technology is a global organization of engineers, product developers, designers,...  ...for Disney's industry-leading ad technology and products - driving...  ...across multiple machine learning areas with primary focus on specialization in generative AI applications, including generative... 

    The Walt Disney Company

    Seattle, WA
    1 day ago
  •  ...Senior Lead Machine Learning Engineer, Entry & Re-engagement We're on a mission to unleash the power of content… you in? We've got the brands...  ...engineering, specifically in ranking, retrieval, or reinforcement learning. Cold Start Experience: Proven experience building... 
    Immediate start

    Paramount Global Services

    United States
    7 days ago
  • $229.9k - $262.4k

     ...Senior Lead Machine Learning Engineer (Intelligent Foundations and Experiences) As a Capital One Machine Learning Engineer (MLE) , you'll be...  ...Python ~2+ years of experience with Retrieval Augmented Generation (RAG) ~ Experience staying abreast of latest ML research... 
    Full time
    Part time
    Internship
    Local area

    Capital One

    McLean, VA
    2 days ago
  •  ...Lead Machine Learning Engineer Bulgaria, Hungary, Poland Why Join Exadel We're an AI-first global tech company with 25+ years of engineering...  ...build, insure, and replace vehicles to power the next generation of transportation. Its platform delivers advanced mobile,... 
    Contract work
    Work at office
    Local area
    Remote work

    Exadel

    Poland, ME
    3 days ago
  • $171.6k - $230.1k

     ...Lead Machine Learning Engineer Technology is at the heart of Disney's past, present, and future. Disney Entertainment and ESPN Product & Technology...  ...related evaluation methodologies ~ Retrieval-augmented generation (RAG) architectures Experience building systems... 

    The Walt Disney Studios

    Seattle, WA
    2 days ago
  •  ...and make a difference with us. The Generative and Agentic AI Engineering Department works across the full...  ...through briefings and/or white papers Lead large projects and/or small...  ...Ability to apply state-of-the-art machine learning techniques to data and natural language... 
    Work experience placement
    Internship
    Local area

    The MITRE Corporation

    McLean, VA
    1 day ago
  • $179.7k - $225k

     ...Lead Machine Learning Engineer This is not a remote role. You must be in the local area or willing to relocate. The cross-media measurement...  ...of groundbreaking storytelling, captivating audiences and generating critical acclaim. In 2023, Disney Entertainment... 
    Local area
    Relocation

    The Walt Disney Studios

    New York, NY
    7 days ago
  •  ...manufacture, and market a portfolio of leading aesthetics brands and...  ..., Data Scientists, Data Engineers, Software Engineers, Business...  ...teams) to build data and Machine Learning products Take ownership of...  ...Models (LLMs), other generative AI modalities, and how they are... 
    Local area
    Remote work

    AbbVie

    United States
    2 days ago
  •  ...collaboration, and high standards. Our engineers, product leaders, and go-to-...  .... Responsibilities: Lead the team in: research, design...  ...and deploy cutting-edge deep learning models across all Eightfold...  ...of agent-based modeling, reinforcement learning, and autonomous systems... 
    Work experience placement
    Work at office
    Remote work
    Flexible hours
    3 days per week

    Eightfold LLC

    Santa Clara, CA
    7 days ago
  • $181.5k - $283.8k

     ...that scale across therapeutic areas. As a Generative AI Engineer, you will design, build, and operate the...  ...Leadership Network (VLN), Women's Initiative for Leading at Lilly (WILL), enAble (for people with disabilities). Learn more about all of our groups. Actual... 
    Full time
    Flexible hours

    Eli Lilly

    San Francisco, CA
    4 days ago
  •  ...- and make a difference with us. Generative and Agentic AI Engineering Department The Generative and Agentic...  ...briefings and/or white papers Lead large projects and/or small...  ...Ability to apply state-of-the-art machine learning techniques to data and natural language... 
    Work experience placement

    Navstar

    McLean, VA
    2 days ago
  • $197.3k - $225.1k

     ...Lead Machine Learning Engineer (Gen AI, Python, Go, AWS) As a Capital One Machine Learning Engineer (MLE) on the GenAI Workflows Serving team,...  ...team dedicated to designing, building, and productionizing Generative AI applications and Agentic Workflow systems at massive... 
    Full time
    Part time
    Internship
    H1b
    Local area

    Capital One

    McLean, VA
    3 days ago
  • A leading financial institution located in New York is seeking a Lead, Generative AI Engineering to spearhead the delivery of enterprise-grade generative AI products. This hands...  ...discipline and significant experience in machine learning and technical leadership. A competitive... 

    JPMorgan Chase

    New York, NY
    3 days ago
  •  ...AI-powered visual and spatial engine. Backed by the Harvard...  ...Innovation Labs, we’re creating next-generation intelligent systems that...  ..., and experiment tracking Lead research experiments, benchmarking...  ..., or multi-view learning Familiarity with node-based... 
    Full time

    Edensign

    Boston, MA
    1 day ago
  • $240k - $260k

     ...intent beyond legacy search engines. Today, our Native...  ...exclusively integrated across next-generation platforms where discovery...  ...seeking an experienced Lead Machine Learning Engineer passionate about...  ..., deep learning, and reinforcement learning techniques. Develop... 
    Summer work
    Work at office

    adMarketplace

    New York, NY
    a month ago
  •  ...fastest, and also allow us to learn as much about the brain as possible...  ...role As a Machine Learning Lead at Nudge, you will drive the development of next-generation ML and imaging systems at the...  ...Partner closely with mechanical engineers, electrical engineers,... 

    Nudge

    San Francisco, CA
    18 days ago
  • $270k - $307k

     ...Lead Machine Learning R&D Engineer CoStar Group is a leading global provider of commercial and residential real estate information, analytics,...  ..., point cloud processing, mesh processing), image/video generation models, 2D/3D semantic understanding, depth estimation.... 
    Remote work

    CoStar Group

    United States
    2 days ago
  •  ...Machine Learning Engineer Gen is a global company dedicated to powering Digital Freedom through its trusted consumer brands including...  ...financial empowerment and cyber safety for the first digital generations, and today we deliver award-winning cybersecurity, online privacy... 
    Remote work
    Flexible hours

    Gen Digital

    United States
    4 days ago
  • $165k - $215k

     ...consumers with the world’s leading brands through data-driven content...  ...through data, machine learning, and relentless optimization...  ...Are you a Machine Learning Engineer with a passion for bridging...  ...cutting-edge ML approaches (e.g., reinforcement learning, deep learning for... 
    Full time
    Remote work

    Launch Potato

    West Palm Beach, FL
    16 hours ago
  •  ...Machine Learning Engineer We are seeking a highly skilled and communicative Machine Learning Engineer (8+ years of experience) to join...  ...will be responsible for designing, building, and deploying Generative AI models (LLMs, Diffusion Models) to solve complex business... 

    Saxon Global

    Washington DC
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead Engineer, Reinforcement Learning & Scenario Generation. Be the first to apply!