Software Engineer - RL Environments

$200k

AfterQuery

About AfterQuery

AfterQuery is an applied research lab curating data solutions for foundation model development.

We serve every frontier AI lab with the mission of delivering the best data to power the best models. In doing so, we can make expertise that once took a lifetime to build available to anyone who needs it. Our customers are the ones building the foundation models themselves and our work sits directly in the loop of how those systems improve.

This is a rare opportunity to join a company at a defining moment in AI. Since raising our $30M Series A at a $300M valuation, AfterQuery has grown to well over a $100M revenue run rate.

We're based in San Francisco and backed by leading investors including Altos Ventures, BoxGroup, and Y Combinator, along with angels from Google DeepMind, OpenAI, Anthropic, Meta Superintelligence Labs, and Microsoft AI.

The Role

As a SWE (Environments), you will design the datasets and evaluation rubrics that directly influence how frontier models learn. You'll work hands-on with research teams at top AI labs, experimenting with data collection strategies, diagnosing model failure modes, and developing the metrics that determine whether a model is actually improving. You'll go from hypothesis to live experiment quickly, and your output will feed directly into model training runs at scale.

Day to day, you will design data slices that expose meaningful failure modes across domains like finance, code, and enterprise workflows. You will build and refine reward signals for RLHF and RLVR pipelines. You will develop quantitative frameworks for measuring dataset quality, diversity, and downstream impact on alignment and capability. You will partner with lab research teams to translate their training objectives into concrete data and evaluation specifications.

What You'll Do
  • Design data slices and explore data shapes that expose meaningful model failure modes across domains like finance, code, and enterprise workflows
  • Build and refine evaluation rubrics and reward signals for RLHF and RLVR training pipelines
  • Model annotator behavior and run experiments to improve different model capabilities
  • Develop quantitative frameworks for measuring dataset quality, diversity, and downstream impact on model alignment and capability
  • Create and manage both real-world and synthetic data pipelines
  • Partner with lab research teams to translate their training objectives into concrete data and evaluation specifications
What We're Looking For
  • 1-4 years of experience
  • Major plus if you've worked or interned at an RL environment company, or at an AI safety or benchmarking org such as METR or Artificial Analysis
  • Genuine obsession with how data structure, selection, and quality drive model behavior
  • Ability to design lightweight experiments, move fast, and extract actionable insights from messy results
  • Former founders and early engineers at early-stage startups are a plus. We don't filter on pedigree. We want people who can demonstrate they work hard, learn fast, and care deeply about getting the details right.

Compensation Structure:

$200k base + profit share (around 150% of base) + competitive equity
Vacancy posted 3 days ago