RL Environment Software Engineer
talentpluto
Location: San Francisco, CA Work Model: Onsite Industry: Applied AI / AI research data Compensation: $180K-$220K base, ~$400K+ OTE (uncapped profit share) About the Company Our partner is a fast-growing applied AI research lab that builds high-quality reinforcement-learning environments and agents sold to the world's leading AI labs. In under two years they have scaled to a nine-figure revenue run rate and grown their team severalfold in a matter of months, backed by leading venture investors. Quality is their core differentiator, and they are rapidly expanding into new domains. The Opportunity As an RL Environment Software Engineer, you will sit at the intersection of research engineering and traditional software engineering, building the environments that simulate real-world workflows and the agents that automate them. This is forward-looking work, you will help research and predict what high-quality environments the frontier will need next, then build them from the ground up. You will join a brand-new RL team being assembled with exceptional talent, with a clear path to grow alongside it as the function scales into industry pods. Responsibilities Design and build high-quality RL environments that simulate real working environments end to end. Develop agents for the tasks within those environments and iterate until they are efficient and production-ready. Partner with the research team to scope which environments to build and why, staying ahead of future demand rather than only meeting present needs. Own the backend and infrastructure layers that make environments reliable and scalable. Help set engineering standards for a zero-to-one team as the RL function grows. Requirements Strong machine-learning engineers who code heavily and build systems from scratch, with strong intuition for reinforcement learning. Proficiency across a modern stack, Node.js and Python on the backend and React/TypeScript on the frontend, with strong Kubernetes and Docker skills. Comfort operating in a fast-paced startup environment with high ownership and long hours. A track record of meaningful tenure and impact at previous companies. Reinforcement-learning experience or an RL research background is a strong plus, though not required. Bachelor's degree in computer science or a related technical field, or equivalent practical experience.
- A technology company specializing in AI is seeking interns to assist in creating reinforcement learning environments. The role requires strong Python coding skills and offers opportunities for internships in winter, spring, and summer 2026. No prior experience in machine...SuggestedSummer workInternship
- AI Talent Now in San Francisco is seeking a skilled RL Environment Engineer to design impactful datasets that influence the learning of frontier AI models. You will collaborate with elite research teams at leading AI labs to refine reinforcement learning environments and...Suggested
- A company specializing in AI training data is seeking a Reinforcement Learning Environment Engineer to design and build MLE/SWE environments. This remote contractor position requires strong Python skills, hands-on LLM experience, and the ability to operate independently...SuggestedRemote jobFull timeFor contractors
- A tech company specializing in AI model training in San Francisco is seeking a Software Engineer to design and build reinforcement learning tasks. You will oversee the full lifecycle of these tasks, from ideation to evaluation, focusing on meaningful capability gaps in...Suggested
- Reflection in San Francisco is looking for a strong software engineer to design and optimize training infrastructure for AI models. You will collaborate with researchers to create scalable systems and ensure they are computationally efficient. The ideal candidate has experience...Suggested
$190k - $230k
Serve Robotics is hiring a Lead Engineer to focus on RL Scaling & Procedural Scenario Generation in San Francisco. Your role will involve developing... .... You will contribute to the design of simulation environments and collaborate with teams to generate dynamic training scenarios...- ...that can operate in complex environments, reducing human risk in conflict... ...lookout for extraordinary engineers and scientists to join our team... ...between hardware and software. Apply advanced techniques such... ...development of our in‑house RL training pipelines and tooling...
- AI Talent Now in San Francisco is looking for an engineer to take ownership of projects that innovate AI evaluation techniques. You... ...This role involves a variety of tasks, from prototyping new RL environments to enhancing existing systems, providing an exciting balance...
- ...group of committed researchers, engineers, policy experts, and business... ...AI systems. About the RL Teams Our Reinforcement Learning... ..., test, debug, and ship real software — end to end, on real codebases... .... You'll design RL environments and coding tasks, build the reward...Visa sponsorship
$350k
Menlo Ventures is looking for a Research Engineer for their Code RL team. This role focuses on advancing AI models' coding capabilities while ensuring safety and performance in output. The successful candidate should have expertise in machine learning frameworks and a...Work at officeFlexible hours- ...Digital Space LLC in San Francisco is hiring a Research Engineer for the Code RL team. This position aims to enhance AI models' software capabilities using deep Python expertise. The role involves designing RL environments, managing training experiments, and ensuring code...
$350k
...growing group of committed researchers, engineers, policy experts, and business leaders working... ...build beneficial AI systems. About the RL Teams Our Reinforcement Learning teams... ...Invent, design and implement RL environments and evaluations. Conduct experiments and...Work at officeVisa sponsorshipFlexible hours- Reinforcement Learning Environment Engineer RL Environments; MLE; LLM Tasks; Difficulty Distribution; Remote Contractor; PST Overlap (≥4h); Advanced English (C1/C2); We’re hiring RL Environments Engineers to design and build MLE/SWE environments that deliver high-quality...Full timeFor contractorsRemote workRelocation
- Software Engineer - Full Stack (Frontend Focus) Location Team What we're looking for We’re looking... ...for our AI training, fine-tuning, and RL platforms. Collaborate with product,... ...package. A collaborative and inclusive work environment. Access to cutting-edge technology and...
- ...exciting Reinforcement Learning (RL) Gym project . You will... ...collaborate with researchers and engineers, and deliver high-quality... ...experimentation workflows and simulation environments. Collaborate with ML... .... Solid understanding of software engineering best practices (testing...Contract workFor contractorsRemote work
$180k - $280k
...streams into high-fidelity simulated environments that generate the training signal... ...training loop, not a wrapper. Our engineers work at the core of the RL pipeline — building the systems that... ...customers, real usage. We already ship software that leading AI teams depend on...Full timeRelocation package$320k
...growing group of committed researchers, engineers, policy experts, and business leaders... ...without engineering support. Construct RL environments to improve Claude’s safety investigation... ...Qualifications 6+ years of industry software engineering experience. Expertise in building...Work at officeVisa sponsorshipFlexible hours- ...London offices. About the Role We’re looking for a strong engineer who can build agentic products that scale. You will work with... ...We own Mercor’s evaluation system & annotation platform for RL environments and tasks. We build harnesses, agents, verifiers, and the end...Work at officeRelocation package
- ...Mechanize builds reinforcement learning environments that frontier AI labs use to train and... ...the complex, judgment-heavy parts of software engineering. We build the environments that expose... ...'ll design, build, and quality-assure RL tasks. Each task is a self-contained...
- ...exceptional talent with industry-leading organizations across IT, Engineering, Financial Services & Fintech, and Manufacturing & Robotics.... ...is not a narrow role. One week you might prototype a new RL environment from a research paper, the next you’ll be deploying...
- ...in revenue About the Role We’re hiring a Senior Software Engineer to build our Reinforcement Learning Environments (RLE) platform—the interactive systems where frontier... ...systems at scale. Nice to Have Experience with RL training infrastructure, simulation systems, or...Full timeFreelanceInternshipWork at officeFlexible hours
$350k
...growing group of committed researchers, engineers, policy experts, and business leaders working... ...Knowledge Work team builds the training environments and evaluations that make Claude... ...scale. Experience building or operating RL environments, agent harnesses, or LLM evaluation...Visa sponsorshipShift work$350k
Mirendil is seeking research engineers in San Francisco to build the post-training stack for frontier reasoning models. You will engage in designing experiments and iterating on reinforcement learning methodologies, focusing on scalable infrastructure for large-scale AI...$310k
About the Team The RL and Reasoning team drives the core reasoning paradigm and has created... ...at scale. About the Role As a Research Engineer/Research Scientist at OpenAI, you will... ...-paced, dynamic, and technically complex environment where rapid iteration is key. You're...Work at officeRelocation package$180k - $220k
...and get work done in the AI era. We’re looking for a backend engineer to join our small team and help lead the effort to prepare the... ..., server-side language (e.g. Go, TypeScript) in production environments ~2+ years working with SQL (and NoSQL) databases at both application...Work at officeLocal areaImmediate startFlexible hours- ...improve our product's UI/UX. Qualifications ~3+ years of software engineering experience building production-grade frontend systems. ~... ...diversity and are committed to creating an inclusive environment for all employees, free from discrimination based on race,...
$180k - $270k
A leading data extraction company is seeking a Research Engineer focused on reinforcement learning in San Francisco or Remote. In this full... ...infrastructures, fine-tune models, and bridge classical RL and modern agent systems. Ideal candidates have 3+ years in applied...Full timeRemote work- ...project. The ideal candidate will have over 5 years of experience in Python and 3+ years with FastAPI, demonstrating strong software engineering practices. This fully remote position requires a commitment of at least 20 hours per week, with a contract duration of 3 months...Remote jobContract work
- ...manipulation, long-horizon reliability, safety and more. Combining rigorous research with high-quality engineering across evaluation, data, training, RL environments and shared infrastructures, we aim to create reliable and practical computer-using agents. About the Role...Work at officeRelocation package
- A pioneering technology company in San Francisco is hiring an AI/ML Engineer to develop and implement advanced models using deep reinforcement learning. The successful candidate will work on real-world systems, conducting experiments and analysis to improve model performance...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to RL Environment Software Engineer. Be the first to apply!
- environmental engineer San Francisco, CA
- software engineer amazon San Francisco, CA
- experienced software developer San Francisco, CA
- federal - software developer San Francisco, CA
- software developer internship San Francisco, CA
- senior software engineer San Francisco, CA
- software developer fintech San Francisco, CA
- part time software developer remote San Francisco, CA
- software developer intern San Francisco, CA
- software data engineer San Francisco, CA

