Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

SWE (RL Environments) "Reinforcement Learning"

AI Talent Now

San Francisco, United States | Posted on 06/15/2026 AI Talent Now, LLC is a powerhouse in direct‑hire talent acquisition, connecting exceptional talent with industry‑leading organizations across IT, Engineering, Financial Services & Fintech, and Manufacturing & Robotics. Headquartered in Atlanta, Georgia, we serve clients and candidates nationwide. Job Description AI Talent Now Job #ZR 72 About us This dynamic company is helping push the frontier of LLMs and AI Agents through novel datasets and experimentation. We build the most complex infrastructure that powers frontier data creation for agentic and hard‑reasoning workflows. Working with all five leading AI labs, we are becoming the go‑to partner for data infrastructure for YC companies. Our sharp hockey‑stick growth and talent density come from a founding team with backgrounds in top IB and quant firms. We build the training data and evaluation infrastructure that frontier AI labs use to improve their models. Working with the world's leading labs, we design high‑signal datasets and run rigorous evaluations that go beyond static benchmarks. At a small, early post‑Series A team, individual contributors directly influence how the next generation of models learn and improve. As an RL Environment Engineer you will design datasets that directly influence how frontier models learn and work hands‑on with research teams at top AI labs. Responsibilities Design and develop datasets that shape frontier model learning. Collaborate with research teams at leading AI labs to build and refine reinforcement learning environments. Implement high‑quality benchmarks and evaluate model performance beyond static tests. Qualifications Recent graduates from top schools focused on excellence and depth; extensive track record not required. First‑author publications at top venues such as NeurIPS or ICML are highly desirable. Open to profiles from data companies with benchmarking experience. Ideal candidates have created iconic benchmarks and possess experience in supervised fine‑tuning (SFT) or reinforcement learning (RL). Experience 1–6 years of experience as a software engineer. Explicit experience building reinforcement learning environments. Signals of excellence in software engineering (e.g., work at an RL company, top VC‑backed startup, strong side projects, quant or hedge‑fund background, or founding/early‑stage startup engineer). CS degree from a top‑30 school (US/CA/EUR only). Strong full‑stack skills with depth in Python, Typescript, and other backend languages. Developed quantitative frameworks for measuring dataset quality/diversity. Bias for action and execution; willing to tackle difficult and tedious work. Compensation and Logistics Base salary: $150k–$250k, with significant bonus potential based on performance. Bonuses are uncapped and can substantially increase total compensation. Visa sponsorships are possible (experience in H1B and other types). Work Arrangement Flexible working hours; most team members in the office from noon to midnight. Work environment encourages results over strict hours. EEO Statement We are an equal opportunity employer and welcome applications from all qualified individuals regardless of race, color, religion, sex, gender identity, sexual orientation, national origin, disability, or veteran status. #J-18808-Ljbffr AI Talent Now

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the SWE (RL Environments) "Reinforcement Learning" in San Francisco, CA vacancy
  • Reinforcement Learning Environment Engineer RL Environments; MLE; LLM Tasks; Difficulty Distribution; Remote Contractor; PST Overlap (≥4h); Advanced English...  ...RL Environments Engineers to design and build MLE/SWE environments that deliver high-quality, diverse tasks... 
    Suggested
    Full time
    For contractors
    Remote work
    Relocation

    Open Data Science

    San Francisco, CA
    3 days ago
  •  ...Role We’re hiring a Senior Engineering Manager to lead our Reinforcement Learning Environments (RLE) team - the group building the interactive sandboxes...  ...Lead, hire, and develop a high-performing team building RL environments and the platform behind them Own the RLE... 
    Suggested
    Work at office
    Remote work
    Flexible hours

    Handshake

    San Francisco, CA
    3 days ago
  • $350k

    Mirendil, based in San Francisco, is seeking a research engineer to develop and optimize data systems supporting reinforcement learning. The role involves constructing data pipelines, ensuring model training efficiency, and collaborating with cross-functional teams. Ideal... 
    Suggested

    Mirendil

    San Francisco, CA
    2 days ago
  • A company specializing in AI training data is seeking a Reinforcement Learning Environment Engineer to design and build MLE/SWE environments. This remote contractor position requires strong Python skills, hands-on LLM experience, and the ability to operate independently... 
    Suggested
    Remote job
    Full time
    For contractors

    Open Data Science

    San Francisco, CA
    3 days ago
  •  ...Now in San Francisco is seeking a skilled RL Environment Engineer to design impactful datasets that influence the learning of frontier AI models. You will collaborate...  ...research teams at leading AI labs to refine reinforcement learning environments and create comprehensive... 
    Suggested

    AI Talent Now

    San Francisco, CA
    12 hours ago
  • Handshake in San Francisco is seeking a Senior Engineering Manager to lead their Reinforcement Learning Environments team. You will be responsible for managing a team of engineers, driving architectural decisions, and owning the project roadmap while working in an in-office... 
    Full time
    Work at office
    Flexible hours

    Handshake

    San Francisco, CA
    3 days ago
  • Handshake is hiring a Senior Software Engineer in San Francisco to develop their Reinforcement Learning Environments platform. In this high-ownership role, you will build and scale systems for training AI models, ensuring reliability and data quality. Ideal candidates have... 

    Handshake

    San Francisco, CA
    4 days ago
  •  ...growing applied AI research lab that builds high-quality reinforcement-learning environments and agents sold to the world's leading AI labs. In under...  ...expanding into new domains. The Opportunity As an RL Environment Software Engineer, you will sit at the intersection... 
    Full time

    talentpluto

    San Francisco, CA
    3 days ago
  •  ...models forward. A growing share of that work is in reinforcement learning: realistic, end-to-end environments where models can be evaluated and trained against real...  ...a repeatable factory. Today, building a single RL environment is a substantial cross-team effort involving... 
    Full time
    Work at office
    Immediate start
    Remote work
    Flexible hours

    Handshake

    San Francisco, CA
    3 days ago
  • $192.6k - $344.85k

     ...Research Manager/Scientist, Reinforcement LearningApplylocations: San Francisco...  ...Manager Reinforcement Learning** at Autodesk Research, you will...  ...the best of an academic environment with product-guided research....  ...with strengths across *ML*, RL, evaluation, and human-centered... 
    Remote work

    Autodesk, Inc.

    San Francisco, CA
    12 hours ago
  • A technology company specializing in AI is seeking interns to assist in creating reinforcement learning environments. The role requires strong Python coding skills and offers opportunities for internships in winter, spring, and summer 2026. No prior experience in machine... 
    Summer work
    Internship

    Mechanize, Inc.

    San Francisco, CA
    1 day ago
  • $350k

     ...work spans areas such as model training, reinforcement learning, reasoning systems, and infrastructure...  ...build the data systems and execution environments that power reinforcement learning at...  ...collection pipelines for complex, long‑horizon RL tasks. Build robust systems to... 

    Mirendil

    San Francisco, CA
    2 days ago
  • Pantograph in San Francisco is looking for research scientists who specialize in reinforcement learning and multimodal representation learning. Ideal candidates should have experience with large GPU clusters and comfortable working with Kubernetes. You will work alongside... 

    Pantograph

    San Francisco, CA
    12 hours ago
  • $300k

    Vmax is seeking a Member of Technical Staff to develop new reinforcement learning algorithms for post-training language models in San Francisco. The successful candidate will own a research agenda, establish baselines for evaluating efficiency, and collaborate with peers... 

    Vmax

    San Francisco, CA
    3 days ago
  • Handshake is looking for a Senior Product Manager in San Francisco to lead the development of the Environment Factory platform for creating reinforcement learning environments. You will drive the product roadmap, focusing on tooling and quality assurance while collaborating... 
    Flexible hours

    Handshake

    San Francisco, CA
    4 days ago
  • Preference Model in San Francisco is seeking experienced Security/Cybersecurity Engineers to design and build reinforcement learning environments. You will work on solving real-world cybersecurity challenges, ensuring frontier AI models can understand and handle them effectively... 

    Preference Model

    San Francisco, CA
    2 days ago
  • Preference Model seeks experienced Machine Learning Engineers for its Low Level / Kernels Capabilities team. This role focuses on crafting reinforcement learning environments requiring strong low-level programming skills in C/C++/CUDA. Candidates will have ownership of... 

    Preference Model

    San Francisco, CA
    3 days ago
  • Preference Model is seeking a Member of Technical Staff - Software Engineer to develop software engineering environments for AI models. This role combines research and engineering, requiring deep software engineering expertise and proficiency in Python. You will have ownership... 

    Preference Model

    San Francisco, CA
    2 days ago
  • Preference Model in San Francisco is seeking experienced Machine Learning Engineers to design and build reinforcement learning environments aimed at advancing model capabilities. The role blends research and engineering, requiring ownership of environment design and implementation... 

    Preference Model

    San Francisco, CA
    2 days ago
  • Preference Model is hiring new graduate Machine Learning Engineers to design and build reinforcement learning environments. You will blend research and engineering roles in a dynamic startup environment that values diverse perspectives. The ideal candidate should possess... 

    Preference Model

    San Francisco, CA
    2 days ago
  • A tech company specializing in AI model training in San Francisco is seeking a Software Engineer to design and build reinforcement learning tasks. You will oversee the full lifecycle of these tasks, from ideation to evaluation, focusing on meaningful capability gaps in... 

    Mechanize, Inc.

    San Francisco, CA
    12 hours ago
  •  ...strategies, vendor relationships, and collaborate closely with domain experts. The ideal candidate will possess experience with reinforcement learning and be eager to contribute to applied research that enhances AI systems across sectors like finance and healthcare. #J-188... 

    United States Digital Space LLC

    San Francisco, CA
    1 day ago
  • $150k - $250k

     ...more dexterous behaviors in unstructured environments ● Research and implement state-of-the-art robot learning policies, including reinforcement learning and imitation learning-based...  ...tech stacks with imitation learning or RL-based methods ● Background in real-time... 

    Deft AI, Inc.

    San Francisco, CA
    12 hours ago
  • $325k - $405k

     ...Software Engineer to lead our simulation environments initiative, setting the vision for...  ...and synthetic‑data generation for robot learning. Have led projects that bridge simulation...  ...distributed compute jobs, or large‑scale reinforcement‑learning workloads. Have experience... 
    Work at office
    Local area
    Relocation package
    Flexible hours

    Dormont Manufacturing Company

    San Francisco, CA
    1 day ago
  •  ...teams, and ensure code quality through reviews and documentation. This role involves a variety of tasks, from prototyping new RL environments to enhancing existing systems, providing an exciting balance of challenges in the fast-paced AI sector. #J-18808-Ljbffr AI Talent... 

    AI Talent Now

    San Francisco, CA
    12 hours ago
  •  ...get made. We build the data, environments, graders, training methods, and...  ...in this role include GDPval, SWE‑bench Verified, MLE‑bench, PaperBench...  ...Create ambitious RL environments to push our models...  ...technical fundamentals in machine learning, software engineering, systems... 

    United States Digital Space LLC

    San Francisco, CA
    12 hours ago
  • $310k

    About the Team The RL and Reasoning team drives the core reasoning paradigm and...  ...They focus on pushing the boundaries of reinforcement learning research, building next-generation...  ...paced, dynamic, and technically complex environment where rapid iteration is key. You're comfortable... 
    Work at office
    Relocation package

    Slope

    San Francisco, CA
    2 days ago
  • $350k

     ...to build infrastructure for frontier reasoning models at their San Francisco location. This role focuses on large-scale reinforcement learning (RL) model training and requires a solid understanding of engineering principles. The ideal candidate will design reliable training... 

    Mirendil

    San Francisco, CA
    2 days ago
  •  ...equipment. sort recyclable materials. maintain a clean and safe work environment. follow safety and environmental regulations. respond to...  ...supply of waste containers. assist with billing and documentation. learn company waste services and pricing. work closely with finance... 

    TradeJobsWorkForce

    San Francisco, CA
    3 days ago
  •  ...advanced control algorithms focusing on Reinforcement Learning to enhance robotic systems both in...  ...developing control algorithms, training RL policies, and analyzing telemetry data...  ...-Body Control, and robotic simulation environments is essential. #J-18808-Ljbffr Hyphen... 

    Hyphen Connect Limited

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to SWE (RL Environments) "Reinforcement Learning". Be the first to apply!