Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Machine Learning Engineer, RL Environments

Preference Model

ML Engineers

Preference Model is building automated ML research engineering. Existing frontier models are brittle when applied to real-world ML tasks. The present bottleneck is the lack of high-quality RL training environments. Our first step is to build RL environments that reflect real-world complexity, with diverse tasks and robust reward functions.

Our founding team has previous experience on Anthropic's data team building data infrastructure, and datasets behind Claude. We are partnering with leading AI labs to push AI closer to achieving its transformative potential.

About the Role

We're hiring experienced ML Engineers to design and build reinforcement learning environments to safely advance model capabilities specifically on machine learning research and engineering tasks to do the work of an MLE at a frontier lab.

This role blends research and engineering. It will require you to both develop novel approaches and realize them in code. Your work will include designing and implementing RL environments, conducting experiments and evaluations, delivering your work into production training runs, and collaborating with other researchers and engineers.

You'll join a small, high-ownership team and contribute directly to the data layer that powers frontier LLM capability.

Note: This role is only for experienced ML Engineers. We have separate openings for New Grads, and for Interns.

What You Will Do:
  • Design and build RL environments and reward functions that produce clean, learnable signals for frontier models on ML research and engineering tasks.

  • Build deep expertise across the frontier of ML research, training, and inference infrastructure.

  • Collaborate with others to brainstorm and create new ideas and tools to improve the environment building process.

What We Are Looking For (Qualifications):
  • You have strong ML fundamentals and broad research interests. You read many papers or tutorials, understand topics deeply and have the creativity to translate them into RLVR problems.

  • Proficiency in Python and systems programming and at least one of PyTorch or JAX

  • Problem solvers who take ownership and drives solutions end-to-end

  • Passion for staying current with the rapidly evolving ML infrastructure landscape

  • Ability to meet throughput expectations and respond quickly to feedback

About You
  • Expert knowledge in an active DL/ML research area, with publications or public code to show for it. Research experience (PhD, MS) is a big plus.

  • Deep understanding of transformer internals, training/inference of modern LLMs, experience with inference libraries (vLLM, SGLang, etc)

  • Strong expertise in kernel development (CUDA, Triton, Pallas)

  • You have built complex interactive RL environments

What We Offer:
  • Competitive cash and equity compensation (>90th percentile)

  • Ownership and autonomy in a fast moving startup environment

  • Opportunity to work with top machine learning engineers

  • Health, vision, dental, benefits

  • 401K match

  • Lunch provided everyday onsite

  • Weekly snack orders

  • Visa sponsorship & relocation support available

We value diverse perspectives and experiences. If you're excited about this role but don't check every box, we still encourage you to apply.

Note: We utilize AI note-taking during our interview sessions to ensure we capture all answers and details accurately. Candidates are allowed to use AI note-takers as well, however, no other AI tools are permitted during any live interviews.

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Senior Machine Learning Engineer, RL Environments in San Francisco, CA vacancy
  •  ...Model is building automated ML research engineering. Existing frontier models are...  ...bottleneck is the lack of high-quality RL training environments. Our first step is to build RL environments...  ...~ Opportunity to work with top machine learning engineers Competitive cash and... 
    Suggested
    Full time
    Internship
    Remote work

    Preference Model

    San Francisco, CA
    3 days ago
  • Preference Model, Inc. is seeking a Senior Software Engineer to design and develop RL environments that enhance AI models. This role involves working with complex tasks and mentoring newer engineers. Candidates should have deep software engineering experience, proficiency... 
    Senior
    Visa sponsorship

    Preference Model, Inc.

    San Francisco, CA
    3 days ago
  •  ...Senior Software Engineer AI models have gotten good at narrow coding tasks but still fail at...  ...a Senior Software Engineer on the RL Environments team, you will build the environments...  ...Opportunity to work with top machine learning engineers Health, vision, dental... 
    Senior
    Visa sponsorship
    Relocation package

    Preference Model

    San Francisco, CA
    3 days ago
  •  ...the ground up—and we're looking for a Senior+ Machine Learning Engineer to help make it autonomous. We're...  ...equipment uptime simultaneously. The environment is noisy and non-stationary: wastewater...  ...realistic simulators to pretrain RL control algorithms, and work with subject... 
    Senior
    Immediate start
    Shift work

    Mariana Minerals

    San Francisco, CA
    1 day ago
  • $204k - $259k

     ...stack. We are an advanced ML and engineering team that leverages state-of-...  ...-art computer vision, deep learning, and generative AI to...  ...and Reinforcement Learning (RL) techniques to create a "data...  ...projects in a fast-paced R&D environment. We prefer: PhD in... 
    Senior
    Full time
    Remote work

    Waymo

    San Francisco, CA
    4 days ago
  • $204k - $259k

     ...Senior Machine Learning Engineer – VLM/LLM Evaluation Waymo is an autonomous driving technology company with the mission to be the world's most trusted...  ...supervised fine-tuning (SFT) to reinforcement learning (RL), for evaluating the quality, safety, and realism of... 
    Senior
    Full time
    Temporary work
    Remote work

    Waymo

    San Francisco, CA
    2 days ago
  • $186.1k - $300.55k

     ...you'll do We are looking for a Senior Machine Learning Engineer to redefine how we operate our global...  ...Systems and Reinforcement Learning (RL) loops work in tandem with Large Language...  ...monitoring in production Kubernetes environments Collaborate with Applied... 
    Senior
    Contract work
    Work at office
    Local area
    Remote work
    2 days per week

    DocuSign

    San Francisco, CA
    3 days ago
  • $240.45k - $300.3k

     ...Senior Machine Learning Engineer - Model Evaluations, Public Sector San Francisco, CA; St. Louis, MO;...  ...pipelines—into mission-critical government environments. We build evaluation frameworks that...  ...LLM evaluation, CV robustness, or RL validation. Knowledge of... 
    Senior
    Full time

    Scale AI

    San Francisco, CA
    1 day ago
  • A forward-thinking AI company seeks experienced ML engineers to build distributed training infrastructure. This role involves designing...  ...and Ray, ensuring performance and reliability in large-scale environments. The ideal candidates will possess strong coding skills in... 
    Senior

    Preference Model, Inc.

    San Francisco, CA
    2 days ago
  •  ...Machine Learning Systems Engineer, RL Engineering San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission...  ...system that regularly launches training jobs in a test environment so that we can quickly detect problems in the training... 
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    1 day ago
  • $242k - $290k

    Senior Machine Learning Engineer - ML Agents And Planning Develop and deploy transformer-based RL/imitation models to plan realistic driving behaviors. Location: San Francisco Bay Area Compensation: $242,000 - 290,000 USD / year Job Tags: Software About The Role Machine... 
    Senior
    Temporary work
    Relocation package

    jobs.frontdoordefense.com - Jobboard

    San Francisco, CA
    5 days ago
  •  ...work is in reinforcement learning: realistic, end-to-end environments where models can be evaluated...  .... We’re hiring a Senior Product Manager to own the...  ...Today, building a single RL environment is a substantial...  ...steps across operators and engineers, and depends on tribal... 
    Senior
    Full time
    Work at office
    Immediate start
    Remote work
    Flexible hours

    Handshake

    San Francisco, CA
    5 days ago
  • $150k - $265k

     ...about the fruit they are seeing. We are looking for a Senior Machine Learning Engineer to build creative, practical, and robust solutions to...  ...independently, learn quickly, and operate in a dynamic environment ~ Enthusiasm for taking on multiple roles and... 
    Senior
    Full time
    Work at office
    Flexible hours
    Weekend work

    Orchard Robotics

    San Francisco, CA
    a month ago
  • Handshake is looking for a Senior Product Manager in San Francisco to lead the development of the Environment Factory platform for creating reinforcement learning environments. You will drive the product roadmap, focusing on tooling and quality assurance while collaborating... 
    Senior
    Flexible hours

    Handshake

    San Francisco, CA
    1 day ago
  • Handshake is seeking a Senior Software Engineer to build the Reinforcement Learning Environments (RLE) platform. This role involves developing scalable systems for AI models and requires at least 6 years of experience in backend and distributed systems. Proficiency in ReactJS... 
    Senior
    Flexible hours

    Handshake

    San Francisco, CA
    3 days ago
  • $180k - $270k

     ...Senior Machine Learning Engineer – Generative AI & Avatar Animation | Los Angeles/San Francisco I’m hiring right now for an AI avatar technology...  ...: ~ Based in Los Angeles/San Francisco ~ Hybrid environment with strong in-person collaboration ~$180K-$270K base... 
    Senior

    Acceler8 Talent

    San Francisco, CA
    3 days ago
  • $204k - $259k

     ...Senior Machine Learning Engineer, Simulation Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The... 
    Senior
    Work experience placement

    Waymo

    San Francisco, CA
    3 days ago
  •  ...Senior Machine Learning Engineer, Computer Vision, HD Map and SLAM Houston, TX or SF Bay Area preferred At Bot Auto, we are revolutionizing...  ...strong motivation to work independently in a fast paced environment while collaborating with other teams on more complex and... 
    Senior

    Bot Auto

    San Francisco, CA
    3 days ago
  •  ...with both traditional infrastructure such as load balancers, NLB, etc., as well as very different infrastructure around inference engines and GPU loads. This is a role that will inherently require deep experience with inference engines. Contributions to vLLM, SGLang... 
    Senior

    Morph Inc.

    San Francisco, CA
    2 days ago
  •  ...Senior Machine Learning Engineer Location: San Francisco About Hum.ai Hum.ai is building planetary superintelligence. Backed by top funds, we've raised $10M+ and are now heads down building. Join us at the cutting edge, where we're scaling generative... 
    Senior
    Work experience placement
    Remote work

    Humai

    San Francisco, CA
    4 days ago
  •  ...Senior Machine Learning Engineer Oway Software Engineering San Francisco, CA, USA Supply chain is the circulatory system of the global economy. Every product on every shelf, every component in every factory, every meal in every restaurant, moved on a truck... 
    Senior
    Immediate start

    Ritual Capital

    San Francisco, CA
    1 day ago
  • $204k - $259k

     ...team builds the system which learns the spatial-temporal representation...  ...meanings of the surrounding environment of the autonomously driving...  ...set of sensors, enabling engineers like you to (1) develop methods...  ...~5+ years of experience in Machine Learning, with a focus on large... 
    Senior
    Full time
    Remote work

    Waymo

    San Francisco, CA
    5 days ago
  • $212k - $318.4k

     ...Senior Machine Learning Engineer (Search) Apple Maps and the thousands of applications it empowers are being used by millions every single day! As a fundamental tool for human activity, Maps technology is evolving and new techniques are emerging. We are looking for... 
    Senior
    Local area
    Relocation

    Apple

    San Francisco, CA
    4 days ago
  •  ...technical discussions, stay proactive in learning, and apply new technologies to solve...  ...problems. Track cutting-edge research in machine learning and incorporate advanced...  ...’s degree in Computer Science, Software Engineering, AI, Mathematics, Physics, or a closely... 
    Senior
    Work experience placement

    Mintegral

    San Francisco, CA
    9 hours ago
  • $160k - $250k

     ...Senior Machine Learning Engineer In order to execute our vision, we need to grow our team of best-in-class machine learning engineers. We are looking for developers who are excited about staying at the forefront of deep learning technology, prototyping state-of-the... 
    Senior

    Hive

    San Francisco, CA
    1 day ago
  •  ...and healthcare, this is the place. The Role As a Senior Machine Learning Engineer, you will bridge the gap between raw computer vision and...  ...team to deploy models into our AWS/serverless environment, ensuring clinical-grade accuracy at scale. Ideal... 
    Senior
    Work at office

    Hike Medical

    San Francisco, CA
    5 days ago
  •  ...Machine Learning Engineer We are looking for a Machine Learning Engineer to join the growing AI and Machine Learning team at Strava. This...  ...testing. Have built backend production services on cloud environments like AWS, using languages like (but not limited to)... 
    Senior
    Work at office
    Worldwide
    Flexible hours
    3 days per week

    Strava

    San Francisco, CA
    4 days ago
  • $175k - $250k

     ...Machine Learning Engineer Kiddom is a groundbreaking educational platform that promotes student...  ...deploying ML/AI systems in production environments that drive measurable business...  ...geographic location, prior experience, seniority, and demonstrated role related ability... 
    Senior
    Permanent employment
    Full time
    Local area
    Flexible hours

    Kiddom

    San Francisco, CA
    1 day ago
  • $200k - $400k

     ..., and we are seeking an innovative strategic engineer to help us scale. Role Overview The Senior Machine Learning Engineer will play a central role in designing...  ...of project outcomes in a fast-paced, startup environment. Qualifications & Experience ~6+ years... 
    Senior
    Work experience placement

    Troveo AI Inc

    San Francisco, CA
    1 day ago
  • $225k - $325k

     ...THE ROLE This is a hands-on, high-ownership role for ML engineers who want to build production models that actually ship, and perform under real-world constraints. As a Founding Senior Machine Learning Engineer at Retell, you'll work across the ML stack to power human... 
    Senior
    H1b
    Work at office

    Retell AI

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Machine Learning Engineer, RL Environments. Be the first to apply!