SWE (RL Environments) "Reinforcement Learning"
AI Talent Now
San Francisco, United States | Posted on 06/15/2026

AI Talent Now, LLC is a powerhouse in direct‑hire talent acquisition, connecting exceptional talent with industry‑leading organizations across IT, Engineering, Financial Services & Fintech, and Manufacturing & Robotics. Headquartered in Atlanta, Georgia, we serve clients and candidates nationwide.

Job Description
AI Talent Now Job #ZR 72

About us
This dynamic company is helping push the frontier of LLMs and AI Agents through novel datasets and experimentation. We build the most complex infrastructure that powers frontier data creation for agentic and hard‑reasoning workflows. Working with all five leading AI labs, we are becoming the go‑to partner for data infrastructure for YC companies. Our sharp hockey‑stick growth and talent density come from a founding team with backgrounds in top IB and quant firms.

We build the training data and evaluation infrastructure that frontier AI labs use to improve their models. Working with the world's leading labs, we design high‑signal datasets and run rigorous evaluations that go beyond static benchmarks. On a small, early post‑Series A team, individual contributors directly influence how the next generation of models learn and improve.

As an RL Environment Engineer, you will design datasets that directly influence how frontier models learn, and you will work hands‑on with research teams at top AI labs.

Responsibilities
- Design and develop datasets that shape frontier model learning.
- Collaborate with research teams at leading AI labs to build and refine reinforcement learning environments.
- Implement high‑quality benchmarks and evaluate model performance beyond static tests.

Qualifications
- Recent graduates from top schools focused on excellence and depth; extensive track record not required.
- First‑author publications at top venues such as NeurIPS or ICML are highly desirable.
- Open to profiles from data companies with benchmarking experience.
- Ideal candidates have created iconic benchmarks and possess experience in supervised fine‑tuning (SFT) or reinforcement learning (RL).

Experience
- 1–6 years of experience as a software engineer.
- Explicit experience building reinforcement learning environments.
- Signals of excellence in software engineering (e.g., work at an RL company, top VC‑backed startup, strong side projects, quant or hedge‑fund background, or founding/early‑stage startup engineer).
- CS degree from a top‑30 school (US/CA/EUR only).
- Strong full‑stack skills with depth in Python, TypeScript, and other backend languages.
- Developed quantitative frameworks for measuring dataset quality/diversity.
- Bias for action and execution; willing to tackle difficult and tedious work.

Compensation and Logistics
- Base salary: $150k–$250k, with significant bonus potential based on performance.
- Bonuses are uncapped and can substantially increase total compensation.
- Visa sponsorships are possible (experience in H1B and other types).

Work Arrangement
- Flexible working hours; most team members are in the office from noon to midnight.
- Work environment encourages results over strict hours.

EEO Statement
We are an equal opportunity employer and welcome applications from all qualified individuals regardless of race, color, religion, sex, gender identity, sexual orientation, national origin, disability, or veteran status.

