SWE (RL Environments) "Reinforcement Learning"

AI Talent Now

San Francisco, United States | Posted on 06/15/2026

AI Talent Now, LLC is a powerhouse in direct‑hire talent acquisition, connecting exceptional talent with industry‑leading organizations across IT, Engineering, Financial Services & Fintech, and Manufacturing & Robotics. Headquartered in Atlanta, Georgia, we serve clients and candidates nationwide.

Job Description

AI Talent Now Job #ZR 72

About us

This dynamic company is helping push the frontier of LLMs and AI Agents through novel datasets and experimentation. We build the most complex infrastructure that powers frontier data creation for agentic and hard‑reasoning workflows. Working with all five leading AI labs, we are becoming the go‑to partner for data infrastructure for YC companies. Our sharp hockey‑stick growth and talent density come from a founding team with backgrounds in top IB and quant firms.

We build the training data and evaluation infrastructure that frontier AI labs use to improve their models. Working with the world's leading labs, we design high‑signal datasets and run rigorous evaluations that go beyond static benchmarks. At a small, early post‑Series A team, individual contributors directly influence how the next generation of models learn and improve.

As an RL Environment Engineer you will design datasets that directly influence how frontier models learn and work hands‑on with research teams at top AI labs.

Responsibilities

Design and develop datasets that shape frontier model learning.
Collaborate with research teams at leading AI labs to build and refine reinforcement learning environments.
Implement high‑quality benchmarks and evaluate model performance beyond static tests.

Qualifications

Recent graduates from top schools focused on excellence and depth; extensive track record not required.
First‑author publications at top venues such as NeurIPS or ICML are highly desirable.
Open to profiles from data companies with benchmarking experience.
Ideal candidates have created iconic benchmarks and possess experience in supervised fine‑tuning (SFT) or reinforcement learning (RL).

Experience

1–6 years of experience as a software engineer.
Explicit experience building reinforcement learning environments.
Signals of excellence in software engineering (e.g., work at an RL company, top VC‑backed startup, strong side projects, quant or hedge‑fund background, or founding/early‑stage startup engineer).
CS degree from a top‑30 school (US/CA/EUR only).
Strong full‑stack skills with depth in Python, TypeScript, and other backend languages.
Developed quantitative frameworks for measuring dataset quality/diversity.
Bias for action and execution; willing to tackle difficult and tedious work.

Compensation and Logistics

Base salary: $150k–$250k, with significant bonus potential based on performance.
Bonuses are uncapped and can substantially increase total compensation.
Visa sponsorships are possible (experience in H1B and other types).

Work Arrangement

Flexible working hours; most team members in the office from noon to midnight.
Work environment encourages results over strict hours.

EEO Statement

We are an equal opportunity employer and welcome applications from all qualified individuals regardless of race, color, religion, sex, gender identity, sexual orientation, national origin, disability, or veteran status.

Vacancy posted 4 days ago
