Senior Machine Learning Engineer, RL Environments
Preference Model
ML Engineers
Preference Model is building automated ML research engineering. Existing frontier models are brittle when applied to real-world ML tasks. The present bottleneck is the lack of high-quality RL training environments. Our first step is to build RL environments that reflect real-world complexity, with diverse tasks and robust reward functions.
Our founding team has previous experience on Anthropic's data team building data infrastructure, and datasets behind Claude. We are partnering with leading AI labs to push AI closer to achieving its transformative potential.
About the Role
We're hiring experienced ML Engineers to design and build reinforcement learning environments to safely advance model capabilities specifically on machine learning research and engineering tasks to do the work of an MLE at a frontier lab.
This role blends research and engineering. It will require you to both develop novel approaches and realize them in code. Your work will include designing and implementing RL environments, conducting experiments and evaluations, delivering your work into production training runs, and collaborating with other researchers and engineers.
You'll join a small, high-ownership team and contribute directly to the data layer that powers frontier LLM capability.
Note: This role is only for experienced ML Engineers. We have separate openings for New Grads, and for Interns.
What You Will Do:
Design and build RL environments and reward functions that produce clean, learnable signals for frontier models on ML research and engineering tasks.
Build deep expertise across the frontier of ML research, training, and inference infrastructure.
Collaborate with others to brainstorm and create new ideas and tools to improve the environment building process.
What We Are Looking For (Qualifications):
You have strong ML fundamentals and broad research interests. You read many papers or tutorials, understand topics deeply and have the creativity to translate them into RLVR problems.
Proficiency in Python and systems programming and at least one of PyTorch or JAX
Problem solvers who take ownership and drives solutions end-to-end
Passion for staying current with the rapidly evolving ML infrastructure landscape
Ability to meet throughput expectations and respond quickly to feedback
About You
Expert knowledge in an active DL/ML research area, with publications or public code to show for it. Research experience (PhD, MS) is a big plus.
Deep understanding of transformer internals, training/inference of modern LLMs, experience with inference libraries (vLLM, SGLang, etc)
Strong expertise in kernel development (CUDA, Triton, Pallas)
You have built complex interactive RL environments
What We Offer:
Competitive cash and equity compensation (>90th percentile)
Ownership and autonomy in a fast moving startup environment
Opportunity to work with top machine learning engineers
Health, vision, dental, benefits
401K match
Lunch provided everyday onsite
Weekly snack orders
Visa sponsorship & relocation support available
We value diverse perspectives and experiences. If you're excited about this role but don't check every box, we still encourage you to apply.
Note: We utilize AI note-taking during our interview sessions to ensure we capture all answers and details accurately. Candidates are allowed to use AI note-takers as well, however, no other AI tools are permitted during any live interviews.
- ...Model is building automated ML research engineering. Existing frontier models are... ...bottleneck is the lack of high-quality RL training environments. Our first step is to build RL environments... ...~ Opportunity to work with top machine learning engineers Competitive cash and...SuggestedFull timeInternshipRemote work
- Preference Model, Inc. is seeking a Senior Software Engineer to design and develop RL environments that enhance AI models. This role involves working with complex tasks and mentoring newer engineers. Candidates should have deep software engineering experience, proficiency...SeniorVisa sponsorship
- ...Senior Software Engineer AI models have gotten good at narrow coding tasks but still fail at... ...a Senior Software Engineer on the RL Environments team, you will build the environments... ...Opportunity to work with top machine learning engineers Health, vision, dental...SeniorVisa sponsorshipRelocation package
- ...the ground up—and we're looking for a Senior+ Machine Learning Engineer to help make it autonomous. We're... ...equipment uptime simultaneously. The environment is noisy and non-stationary: wastewater... ...realistic simulators to pretrain RL control algorithms, and work with subject...SeniorImmediate startShift work
$204k - $259k
...stack. We are an advanced ML and engineering team that leverages state-of-... ...-art computer vision, deep learning, and generative AI to... ...and Reinforcement Learning (RL) techniques to create a "data... ...projects in a fast-paced R&D environment. We prefer: PhD in...SeniorFull timeRemote work$204k - $259k
...Senior Machine Learning Engineer – VLM/LLM Evaluation Waymo is an autonomous driving technology company with the mission to be the world's most trusted... ...supervised fine-tuning (SFT) to reinforcement learning (RL), for evaluating the quality, safety, and realism of...SeniorFull timeTemporary workRemote work$186.1k - $300.55k
...you'll do We are looking for a Senior Machine Learning Engineer to redefine how we operate our global... ...Systems and Reinforcement Learning (RL) loops work in tandem with Large Language... ...monitoring in production Kubernetes environments Collaborate with Applied...SeniorContract workWork at officeLocal areaRemote work2 days per week$240.45k - $300.3k
...Senior Machine Learning Engineer - Model Evaluations, Public Sector San Francisco, CA; St. Louis, MO;... ...pipelines—into mission-critical government environments. We build evaluation frameworks that... ...LLM evaluation, CV robustness, or RL validation. Knowledge of...SeniorFull time- A forward-thinking AI company seeks experienced ML engineers to build distributed training infrastructure. This role involves designing... ...and Ray, ensuring performance and reliability in large-scale environments. The ideal candidates will possess strong coding skills in...Senior
- ...Machine Learning Systems Engineer, RL Engineering San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission... ...system that regularly launches training jobs in a test environment so that we can quickly detect problems in the training...Work at officeVisa sponsorshipFlexible hours
$242k - $290k
Senior Machine Learning Engineer - ML Agents And Planning Develop and deploy transformer-based RL/imitation models to plan realistic driving behaviors. Location: San Francisco Bay Area Compensation: $242,000 - 290,000 USD / year Job Tags: Software About The Role Machine...SeniorTemporary workRelocation package- ...work is in reinforcement learning: realistic, end-to-end environments where models can be evaluated... .... We’re hiring a Senior Product Manager to own the... ...Today, building a single RL environment is a substantial... ...steps across operators and engineers, and depends on tribal...SeniorFull timeWork at officeImmediate startRemote workFlexible hours
$150k - $265k
...about the fruit they are seeing. We are looking for a Senior Machine Learning Engineer to build creative, practical, and robust solutions to... ...independently, learn quickly, and operate in a dynamic environment ~ Enthusiasm for taking on multiple roles and...SeniorFull timeWork at officeFlexible hoursWeekend work- Handshake is looking for a Senior Product Manager in San Francisco to lead the development of the Environment Factory platform for creating reinforcement learning environments. You will drive the product roadmap, focusing on tooling and quality assurance while collaborating...SeniorFlexible hours
- Handshake is seeking a Senior Software Engineer to build the Reinforcement Learning Environments (RLE) platform. This role involves developing scalable systems for AI models and requires at least 6 years of experience in backend and distributed systems. Proficiency in ReactJS...SeniorFlexible hours
$180k - $270k
...Senior Machine Learning Engineer – Generative AI & Avatar Animation | Los Angeles/San Francisco I’m hiring right now for an AI avatar technology... ...: ~ Based in Los Angeles/San Francisco ~ Hybrid environment with strong in-person collaboration ~$180K-$270K base...Senior$204k - $259k
...Senior Machine Learning Engineer, Simulation Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The...SeniorWork experience placement- ...Senior Machine Learning Engineer, Computer Vision, HD Map and SLAM Houston, TX or SF Bay Area preferred At Bot Auto, we are revolutionizing... ...strong motivation to work independently in a fast paced environment while collaborating with other teams on more complex and...Senior
- ...with both traditional infrastructure such as load balancers, NLB, etc., as well as very different infrastructure around inference engines and GPU loads. This is a role that will inherently require deep experience with inference engines. Contributions to vLLM, SGLang...Senior
- ...Senior Machine Learning Engineer Location: San Francisco About Hum.ai Hum.ai is building planetary superintelligence. Backed by top funds, we've raised $10M+ and are now heads down building. Join us at the cutting edge, where we're scaling generative...SeniorWork experience placementRemote work
- ...Senior Machine Learning Engineer Oway Software Engineering San Francisco, CA, USA Supply chain is the circulatory system of the global economy. Every product on every shelf, every component in every factory, every meal in every restaurant, moved on a truck...SeniorImmediate start
$204k - $259k
...team builds the system which learns the spatial-temporal representation... ...meanings of the surrounding environment of the autonomously driving... ...set of sensors, enabling engineers like you to (1) develop methods... ...~5+ years of experience in Machine Learning, with a focus on large...SeniorFull timeRemote work$212k - $318.4k
...Senior Machine Learning Engineer (Search) Apple Maps and the thousands of applications it empowers are being used by millions every single day! As a fundamental tool for human activity, Maps technology is evolving and new techniques are emerging. We are looking for...SeniorLocal areaRelocation- ...technical discussions, stay proactive in learning, and apply new technologies to solve... ...problems. Track cutting-edge research in machine learning and incorporate advanced... ...’s degree in Computer Science, Software Engineering, AI, Mathematics, Physics, or a closely...SeniorWork experience placement
$160k - $250k
...Senior Machine Learning Engineer In order to execute our vision, we need to grow our team of best-in-class machine learning engineers. We are looking for developers who are excited about staying at the forefront of deep learning technology, prototyping state-of-the...Senior- ...and healthcare, this is the place. The Role As a Senior Machine Learning Engineer, you will bridge the gap between raw computer vision and... ...team to deploy models into our AWS/serverless environment, ensuring clinical-grade accuracy at scale. Ideal...SeniorWork at office
- ...Machine Learning Engineer We are looking for a Machine Learning Engineer to join the growing AI and Machine Learning team at Strava. This... ...testing. Have built backend production services on cloud environments like AWS, using languages like (but not limited to)...SeniorWork at officeWorldwideFlexible hours3 days per week
$175k - $250k
...Machine Learning Engineer Kiddom is a groundbreaking educational platform that promotes student... ...deploying ML/AI systems in production environments that drive measurable business... ...geographic location, prior experience, seniority, and demonstrated role related ability...SeniorPermanent employmentFull timeLocal areaFlexible hours$200k - $400k
..., and we are seeking an innovative strategic engineer to help us scale. Role Overview The Senior Machine Learning Engineer will play a central role in designing... ...of project outcomes in a fast-paced, startup environment. Qualifications & Experience ~6+ years...SeniorWork experience placement$225k - $325k
...THE ROLE This is a hands-on, high-ownership role for ML engineers who want to build production models that actually ship, and perform under real-world constraints. As a Founding Senior Machine Learning Engineer at Retell, you'll work across the ML stack to power human...SeniorH1bWork at office
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Machine Learning Engineer, RL Environments. Be the first to apply!
- machine learning ai engineer San Francisco, CA
- machine learning engineer San Francisco, CA
- entry level machine learning engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- machine learning software engineer San Francisco, CA
- ai ml engineer San Francisco, CA
- senior ml engineer San Francisco, CA
- graduate machine learning engineer San Francisco, CA
- computer vision machine learning engineer San Francisco, CA
- data scientist machine learning engineer San Francisco, CA


