Member of Technical Staff (RL Research & Environments)

$200k

Magic Inc

Member of Technical Staff, RL Research & Environments
Posted Feb 28, 2026 | Full-time | Advanced (5-10 yrs)

Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal.

About The Role

As a Software Engineer on the RL Research & Environments team, you will design and operate the data, evaluation, and environment systems that improve model capabilities after pre-training. This role focuses on post-training: identifying capability gaps, building targeted datasets, designing reward signals, and running iterative training loops that measurably improve user‑facing behavior. You will own the infrastructure and experimental workflows that connect product priorities to concrete capability gains.

Magic’s long‑context models introduce distinct post‑training challenges: long‑horizon reasoning, sustained coherence over extended trajectories, context‑use quality, and tool‑augmented behavior. You will build systems that expose failure modes, generate high‑signal training data, and enable rapid RL iteration at scale.

This role can evolve into ownership of major capability areas, deeper RL systems work, or broader influence over post‑training strategy as Magic scales long‑context model performance and reliability.

What you’ll work on
  • Design and build post‑training datasets using synthetic generation, targeted data collection, and self‑play
  • Implement filtering, scoring, and mixture strategies for RL and post‑training corpora
  • Build and maintain evaluation frameworks that surface long‑context failure modes
  • Design reward signals and training environments for targeted capability improvements
  • Run ablations across data sources, reward designs, and long‑horizon task structures
  • Improve reliability and observability of post‑training data and environment pipelines
  • Collaborate closely with Product and Research to translate capability goals into measurable iteration cycles

What we’re looking for
  • Strong software engineering fundamentals
  • Experience building or operating large‑scale data or ML systems
  • Ability to design and interpret experiments that measure model behavior changes
  • Comfort working at the intersection of ML, data systems, and infrastructure
  • Strong attention to data quality and evaluation rigor
  • Track record of owning experimental or production systems end‑to‑end

Compensation, benefits, and perks (US)
  • Annual salary range: between $200K - $550K based on experience
  • Equity is a significant part of total compensation, in addition to salary
  • 401(k) plan with 6% salary matching
  • Generous health, dental and vision insurance for you and your dependents
  • Unlimited paid time off
  • Visa sponsorship and relocation stipend to bring you to SF, if possible
  • A small, fast‑paced, highly focused team

Our culture
  • Integrity. Words and actions should be aligned
  • Hands‑on. At Magic, everyone is building
  • Teamwork. We move as one team, not N individuals
  • Focus. Safely deploy AGI. Everything else is noise
  • Quality. Magic should feel like magic

Compensation Range: $200K - $550K

Vacancy posted 11 hours ago
Similar jobs that could be interesting for you
Based on the Member of Technical Staff (RL Research & Environments) in San Francisco, CA vacancy
  • $180k

     ...Member Of Technical Staff - RL Infrastructure Palo Alto, CA xAI's mission is to create AI systems...  ...to increase the productivity of researchers and engineers. Typical problems...  ...we design an efficient and robust environment for the agent to perform actions in... 
    Suggested
    Temporary work

    xAI

    San Francisco, CA
    1 day ago
  •  ...Job Description As a Member of Technical Staff (Research) at Trajectory, you will design and build the post‑training stack that lets our customers...  ...or serving large language models Experience building RL environments or evaluation suites for LLMs Proficiency in PyTorch,... 
    Suggested

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    1 day ago
  • Job Description As a Member of Technical Staff (Research) at Trajectory, you will design and build the post‑training stack that lets our customers...  ...or serving large language models Experience building RL environments or evaluation suites for LLMs Proficiency in PyTorch,... 
    Suggested

    Trajectory

    San Francisco, CA
    2 days ago
  • $300k

    Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise.... 
    Suggested
    Work at office
    Local area

    Vmax

    San Francisco, CA
    5 days ago
  • $300k

    Member of Technical Staff - RL Algorithms About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans...  .... Collaborate with researchers working on environments, evals, interpretability, reward modeling,... 
    Suggested
    Work at office
    Local area
    Shift work

    Vmax

    San Francisco, CA
    5 days ago
  • $150k

     ...We're a new research lab in San Francisco, currently...  ...reinforcement learning (RL) to solve reasoning...  ...generalizing agents to physical environments. Our work builds on...  ...The Product Manager - Technical role for the AGI...  ...employees, supervisors, and staff; adhere to standards... 
    Local area

    Amazon

    San Francisco, CA
    3 days ago
  •  ...and help define the infrastructure layer for the future of AI. About the role Gimlet Labs is seeking a Member of Technical Staff focused on AI research. As an AI Researcher, you will be evaluating and implementing techniques to drive performance and... 

    Gimlet Labs

    San Francisco, CA
    5 days ago
  • $150k

     ...'s Frontier AI & Robotics (FAR) team is seeking a Member of Technical Staff to drive foundational research and build intelligent robotic systems from the ground...  ...algorithms perform robustly in dynamic real-world environments. You'll have the freedom to pursue ambitious... 
    Local area

    Amazon

    San Francisco, CA
    3 days ago
  • Member of Technical Staff - Research Scientist Patronus AI is a frontier lab developing simulation research and...  ...standard for robust, high quality environments. Your work will inform how frontier...  ...new training methods for efficient RL. Develop state-of-the-art systems for... 

    Patronus AI, Inc.

    San Francisco, CA
    3 days ago
  • $150k

     ...’s Frontier AI & Robotics (FAR) team is seeking a Member of Technical Staff to drive foundational research and build intelligent robotic systems from the ground...  ...physical robotic platforms in dynamic real‑world environments. You’ll have access to Amazon’s computational... 

    Amazon Science

    San Francisco, CA
    1 day ago
  • Member of Technical Staff, Pretraining Science Member of Technical Staff, Pre-Training Science Location...  ..., model architecture, and numerics research to the challenges of biology. We are...  ...distributed or high-performance computing environments. Excellent written and verbal... 
    Full time

    Radical Numerics

    San Francisco, CA
    4 days ago
  • $148.5k - $223.9k

     ...of Salesforce. Salesforce AI Research is looking for a Machine Learning...  ...customers. With your strong technical competence, strategic...  ...such as LLM, pre/post‑training, RL, agentic system. Prioritizes...  ..., comfortable in fast‑pacing environment. Unleash Your Potential When... 

    salesforce.com, inc.

    San Francisco, CA
    4 days ago
  • $150k - $350k

    Member of Technical Staff, Applied Research — Sieve Location: San Francisco, CA (Onsite) Compensation: $150,000 - $350,000 base + 0.05% - 0.4% equity Visa Sponsorship: H-1B, O-1, OPT supported Experience Level: 2+ years Employment Type: Full-Time Headcount: 4 open seats... 
    Full time
    H1b
    Visa sponsorship

    David Joseph & Company

    San Francisco, CA
    5 days ago
  • Member of Technical Staff - Computational Biologist Valthos | Posted Mar 3 Full-time Negotiable Advanced (5-10 yrs) Computational Biologist...  ...Contribute to shaping and executing the Valthos-wide research and development roadmap Identify large-scale biological datasets... 
    Full time
    Work at office

    Valthos

    San Francisco, CA
    2 days ago
  • $200k - $400k

     ...generative agents based on real humans. Our research pioneered the field of AI-based...  ...Rauch. About the Role As a Member of Technical Staff (MTS) in Research, you will work across...  ...the final deployment in a production environment. Nice to Haves Academic & Technical... 
    Flexible hours

    Simile

    San Francisco, CA
    2 days ago
  • $148.5k - $223.9k

     ...future of Salesforce. Salesforce AI Research is looking for a Machine Learning Engineer...  ...with customers. With your strong technical competence, strategic thinking and customer...  ...to action, comfortable in fast-pacing environment. Competitive coding winner (ACM-... 

    Salesforce.Com Inc

    San Francisco, CA
    4 days ago
  • $200k

     ...promising path to safe AGI lies in automating research and code generation to improve models...  ...-scale pre-training, domain-specific RL, ultra-long context, and inference-...  ...'s most important decisions. As a Member of Technical Staff on Evals, you will build both the... 
    Visa sponsorship
    Relocation package

    Magic Inc

    San Francisco, CA
    4 days ago
  •  ...people who deeply understand what's possible Massive leverage - the systems you build will multiply the output of every data team member and every stakeholder who needs data Direct impact - small team, no layers of approval. Idea to shipped system in days, not... 

    Perplexity

    San Francisco, CA
    3 days ago
  •  ...having proprietary data to post-train on and unique usage to RL on possess strong opinions about how AI agents and interfaces...  ...feel equally strong obligations to both 1) choose good and 2) to win think that this role should be renamed "member of tomo staff"... 
    Immediate start

    Tomo

    San Francisco, CA
    3 days ago
  •  ...Researcher And Research Engineer Role We are looking for exceptional researchers and research engineers to design and build the next...  ...design, statistical analysis, and evaluation frameworks Technical skills: Proficiency in Python for research and experimentation... 
    Work experience placement
    Relocation
    Relocation package

    Vals AI

    San Francisco, CA
    1 day ago
  • $150k - $300k

     ...and pair it with the full RL post-training stack: environments, secure sandboxes,...  ...async RL trainer. We enable researchers, startups, and enterprises...  ...the jobs. Core Technical Responsibilities Hosted...  ...development and encourage team members to contribute to the... 
    Work at office
    Local area
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime Intellect

    San Francisco, CA
    1 day ago
  •  ...re looking for someone highly technical (our current team includes 3...  ...Listen Labs is an AI-powered research platform that helps teams uncover...  ...time off to recharge, and an environment that values balance and trust. Room to Grow: As an early member of the team, you’ll have the... 
    Flexible hours

    Listen Labs

    San Francisco, CA
    5 days ago
  •  ...session accuracy with millions of real tool calls as baseline data SFT on our agentic traces and RL models on top of our agentic harness and app sandboxes Qualifications research you can independently execute against the research goals you can prototype and test... 

    Composio

    San Francisco, CA
    1 day ago
  •  ...even nation states. Our team of AI researchers and company builders come from...  ...powers our models. This includes RL training loops, distributed GPU systems...  .... You thrive in high-agency environments and enjoy solving hard technical problems. What We Offer: We... 
    Relocation package

    Reflection AI

    San Francisco, CA
    1 day ago
  •  ...infrastructure for massive, petabyte-scale, multimodal datasets Rapidly iterate on experiments and ablations Stay up-to-date on research to bring new ideas to work What we’re looking for We value a relentless approach to problem-solving, rapid execution, and the ability... 

    Kindredventures

    San Francisco, CA
    2 days ago
  • $150k - $300k

     ...and pair it with the full rl post-training stack: environments, secure sandboxes,...  ...async RL trainer. We enable researchers, startups and enterprises...  ...RL training stack. Core Technical Responsibilities LLM Serving...  ...development and encourage team members to contribute to the... 
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours
    Shift work

    Prime-Intellect

    San Francisco, CA
    3 days ago
  •  ...sit on top of our leading RL infrastructure. You’ll work...  ...move quickly in a zero-to-one environment Preferred Experience...  ...Mosaic, Adept, and Windsurf RL researchers, raised $15M led by...  ...building our team of founding Members of Technical Staff to design the frontier of continually... 

    Trajectory

    San Francisco, CA
    3 days ago
  • $150k - $300k

     ...and pair it with the full rl post-training stack: environments, secure sandboxes,...  ...async RL trainer. We enable researchers, startups and enterprises...  ...reliable at scale. Core Technical Responsibilities Infrastructure...  ...and encourage team members to contribute to the broader... 
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime Intellect, Inc.

    San Francisco, CA
    5 days ago
  • About the Role As a Deployed Research Engineer at Sieve, you’ll work on highly specific dataset problems for frontier AI labs and build...  ...or external teams to translate ambiguous needs into concrete technical systems Strong Python developer with hands‑on experience in... 

    Sieve

    San Francisco, CA
    1 day ago
  • $150k - $350k

    About Us Sieve is the only AI research lab exclusively focused on video data. We combine exabyte-scale video infrastructure, novel video understanding techniques, and dozens of data sources to develop datasets that push the frontier of video modeling. Video makes up 80... 

    Sieve

    San Francisco, CA
    2 days ago
