Software Engineer - RL Environments

$200k

AfterQuery

About AfterQuery

AfterQuery builds the training data and evaluation infrastructure that frontier AI labs use to make their models better. We work with the world's leading labs to design high-signal datasets and run rigorous evaluations that go beyond static benchmarks. We are a small, early team (post-Series A) where individual contributors have a direct impact on how the next generation of models learns and improves.

The Role

As a SWE (Environments), you will design the datasets and evaluation rubrics that directly influence how frontier models learn. You'll work hands-on with research teams at top AI labs, experimenting with data collection strategies, diagnosing model failure modes, and developing the metrics that determine whether a model is actually improving. You'll go from hypothesis to live experiment quickly, and your output will feed directly into model training runs at scale.

Day to day, you will design data slices that expose meaningful failure modes across domains like finance, code, and enterprise workflows. You will build and refine reward signals for RLHF and RLVR pipelines. You will develop quantitative frameworks for measuring dataset quality, diversity, and downstream impact on alignment and capability. You will partner with lab research teams to translate their training objectives into concrete data and evaluation specifications.

What You'll Do

Design data slices and explore data shapes that expose meaningful model failure modes across domains like finance, code, and enterprise workflows
Build and refine evaluation rubrics and reward signals for RLHF and RLVR training pipelines
Model annotator behavior and run experiments to improve different model capabilities
Develop quantitative frameworks for measuring dataset quality, diversity, and downstream impact on model alignment and capability
Create and manage both real-world and synthetic data pipelines
Partner with lab research teams to translate their training objectives into concrete data and evaluation specifications

What We're Looking For

1-4 years of experience
Major plus if you've worked or interned at an RL environment company, or at an AI safety or benchmarking org such as METR or Artificial Analysis
Genuine obsession with how data structure, selection, and quality drive model behavior
Ability to design lightweight experiments, move fast, and extract actionable insights from messy results
Former founders and early engineers at early-stage startups are a plus. We don't filter on pedigree. We want people who can demonstrate they work hard, learn fast, and care deeply about getting the details right.

Compensation Structure

$200k base + profit share (around 150% of base) + competitive equity

Vacancy posted 1 day ago
