Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Member of Technical Staff, RL Research & Environments

$200k

Magic

Member of Technical Staff, RL Research & Environments Posted Feb 28, 2026 | Full-time | Advanced (5-10 yrs) Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal. About The Role As a Software Engineer on the RL Research & Environments team, you will design and operate the data, evaluation, and environment systems that improve model capabilities after pre-training. This role focuses on post-training: identifying capability gaps, building targeted datasets, designing reward signals, and running iterative training loops that measurably improve user‑facing behavior. You will own the infrastructure and experimental workflows that connect product priorities to concrete capability gains. Magic’s long‑context models introduce distinct post‑training challenges: long‑horizon reasoning, sustained coherence over extended trajectories, context‑use quality, and tool‑augmented behavior. You will build systems that expose failure modes, generate high‑signal training data, and enable rapid RL iteration at scale. This role can evolve into ownership of major capability areas, deeper RL systems work, or broader influence over post‑training strategy as Magic scales long‑context model performance and reliability. What you’ll work on Design and build post‑training datasets using synthetic generation, targeted data collection, and self‑play Implement filtering, scoring, and mixture strategies for RL and post‑training corpora Build and maintain evaluation frameworks that surface long‑context failure modes Design reward signals and training environments for targeted capability improvements Run ablations across data sources, reward designs, and long‑horizon task structures Improve reliability and observability of post‑training data and environment pipelines Collaborate closely with Product and Research to translate capability goals into measurable iteration cycles What we’re looking for Strong software engineering fundamentals Experience building or operating large‑scale data or ML systems Ability to design and interpret experiments that measure model behavior changes Comfort working at the intersection of ML, data systems, and infrastructure Strong attention to data quality and evaluation rigor Track record of owning experimental or production systems end‑to‑end Compensation, benefits, and perks (US) Annual salary range: between $200K - $550K based on experience Equity is a significant part of total compensation, in addition to salary 401(k) plan with 6% salary matching Generous health, dental and vision insurance for you and your dependents Unlimited paid time off Visa sponsorship and relocation stipend to bring you to SF, if possible A small, fast‑paced, highly focused team Our culture Integrity. Words and actions should be aligned Hands‑on. At Magic, everyone is building Teamwork. We move as one team, not N individuals Focus. Safely deploy AGI. Everything else is noise Quality. Magic should feel like magic Compensation Range: $200K - $550K #J-18808-Ljbffr Magic

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Member of Technical Staff, RL Research & Environments in San Francisco, CA vacancy
  • $200k - $275k

    Founding Member of Technical Staff (Research / Post-Training) Applied AI / RL | San Francisco (onsite) | $200k-$275k + 0.25-0.50% equity DeepRec is partnered with...  ...core platform focuses on building high‑quality RL environments that simulate tasks across investment banking,... 
    Suggested
    Full time
    Visa sponsorship
    Relocation package

    DeepRec.ai

    San Francisco, CA
    3 days ago
  • $300k

    Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise.... 
    Suggested
    Work at office
    Local area

    Vmax

    San Francisco, CA
    4 days ago
  • $300k

    Member of Technical Staff - RL Algorithms About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans...  .... Collaborate with researchers working on environments, evals, interpretability, reward modeling,... 
    Suggested
    Work at office
    Local area
    Shift work

    Vmax

    San Francisco, CA
    4 days ago
  • $180k

    Member of Technical Staff - RL Infrastructure About xAI xAI’s mission is to create AI systems that can...  ...to increase the productivity of researchers and engineers. Typical problems you...  ...we design an efficient and robust environment for the agent to perform actions in... 
    Suggested
    Temporary work

    xAI

    San Francisco, CA
    2 days ago
  • $300k

     ...Vmax Vmax is an applied research lab developing AI...  ...engineers who can turn RL research ideas into working...  ...systems, evals, environment and rewards. You will...  ...across RL projects. Own technically ambiguous projects end...  ...other technical team members. Clear written and verbal... 
    Suggested
    Full time
    Work at office
    Local area

    Vmax

    San Francisco, CA
    4 days ago
  •  ...real production workloads built to scale to gigawatt-class AI datacenters. Mission Gimlet Labs is seeking an Member of Staff focused on AI Research (Intern). As an AI Researcher (Intern), you will be evaluating and implementing techniques to drive performance and... 
    Internship

    Gimlet Labs

    San Francisco, CA
    3 days ago
  • Member of Technical Staff - Research Scientist Patronus AI is a frontier lab developing simulation research and...  ...standard for robust, high quality environments. Your work will inform how frontier...  ...new training methods for efficient RL. Develop state-of-the-art systems for... 

    Patronus AI, Inc.

    San Francisco, CA
    2 days ago
  • Member of Technical Staff, Pretraining Science Member of Technical Staff, Pre-Training Science Location...  ..., model architecture, and numerics research to the challenges of biology. We are...  ...distributed or high-performance computing environments. Excellent written and verbal... 
    Full time

    Radical Numerics

    San Francisco, CA
    3 days ago
  • $150k - $350k

    Member of Technical Staff, Applied Research — Sieve Location: San Francisco, CA (Onsite) Compensation: $150,000 - $350,000 base + 0.05% - 0.4% equity Visa Sponsorship: H-1B, O-1, OPT supported Experience Level: 2+ years Employment Type: Full-Time Headcount: 4 open seats... 
    Full time
    H1b
    Visa sponsorship

    David Joseph & Company

    San Francisco, CA
    4 days ago
  • $225k

     ...believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably...  ...approach combines frontier‑scale pre‑training, domain‑specific RL, ultra‑long context, and inference‑time compute to achieve this... 
    Relocation
    Visa sponsorship

    Magic

    San Francisco, CA
    4 days ago
  • $150k

     ...’s Frontier AI & Robotics (FAR) team is seeking a Member of Technical Staff to drive foundational research and build intelligent robotic systems from the ground...  ..., ensuring robust performance in production environments Collaborate with platform and hardware teams to ensure... 
    Local area

    Amazon Science

    San Francisco, CA
    5 days ago
  • $150k

     ...’s Frontier AI & Robotics (FAR) team is seeking a Member of Technical Staff to drive foundational research and build intelligent robotic systems from the ground...  ...physical robotic platforms in dynamic real‑world environments. You’ll have access to Amazon’s computational... 

    Amazon Science

    San Francisco, CA
    5 days ago
  • $245k

     ...foundation model, bridging research with real‑world impact to solve...  ...both virtual and physical environments. Key Job Responsibilities You...  ...objectives, mentoring team members, and hiring top talent. Basic...  ...and insights to both technical and non‑technical audiences,... 

    Amazon Science

    San Francisco, CA
    2 days ago
  • Member of Technical Staff - Computational Biologist Valthos | Posted Mar 3 Full-time Negotiable Advanced (5-10 yrs) Computational Biologist...  ...Contribute to shaping and executing the Valthos-wide research and development roadmap Identify large-scale biological datasets... 
    Full time
    Work at office

    Valthos

    San Francisco, CA
    1 day ago
  •  ...people who deeply understand what's possible Massive leverage - the systems you build will multiply the output of every data team member and every stakeholder who needs data Direct impact - small team, no layers of approval. Idea to shipped system in days, not... 

    Perplexity

    San Francisco, CA
    2 days ago
  •  ...session accuracy with millions of real tool calls as baseline data SFT on our agentic traces and RL models on top of our agentic harness and app sandboxes Qualifications research you can independently execute against the research goals you can prototype and test... 

    Composio

    San Francisco, CA
    5 days ago
  •  ...simulate, and reason over 3D environments for embodied AI, robotics...  ...are looking for exceptional research engineers and applied...  ...We're looking for a Member of Technical Staff - Embodied Agents to help...  ...Policy optimization RL and imitation learning Agent... 

    Moonlake AI

    San Francisco, CA
    2 days ago
  • About the Role We are looking for exceptional researchers and research engineers to design and build the next generation of AI benchmarks...  ...design, statistical analysis, and evaluation frameworks Technical skills : Proficiency in Python for research and experimentation... 
    Work experience placement
    Relocation
    Relocation package

    PetsApp

    San Francisco, CA
    3 days ago
  •  ...precedents to copy from. About the Role Members of Technical Staff (MTS) are the senior engineers who...  ...expertise. You do not need to be an ML researcher. You need to be the kind of engineer who...  ...observability. Background in offline RL, contextual bandits, or sequential... 

    Beacon Software

    San Francisco, CA
    4 days ago
  • $150k - $300k

     ...and pair it with the full RL post‑training stack: environments, secure sandboxes,...  ...async RL trainer. We enable researchers, startups and enterprises...  ...RL training stack. Core Technical Responsibilities LLM Serving...  ...development and encourage team members to contribute to the... 
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours
    Shift work

    Prime Intellect

    San Francisco, CA
    3 days ago
  • $300k

    Member of Technical Staff - Mechanistic Interpretability About V max V max is an applied research lab developing AI capable of open-ended learning. We...  ...representations evolve during RL and post-training, and use...  ...tools, and RL environments. Define and pursue a high-... 
    Work at office
    Local area

    Vmax

    San Francisco, CA
    2 days ago
  • Member of Technical Staff - Post‑Training Join to apply for the Member of Technical...  ...states. Our team of AI researchers and company builders come from...  ...data, reward modeling, or RL techniques. Evidence of...  ...paced, high‑agency startup environment; bias toward action and clarity... 
    Full time
    Relocation package

    Reflection AI

    San Francisco, CA
    3 days ago
  •  ...customers. Cohere is a team of researchers, engineers, designers, and...  ...remote-friendly! As a Member of Technical Staff, you will: Design and write...  ...both on the SFT and the RL regime. Research, implement...  ...to create an inclusive work environment for all. We welcome applicants... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Jaide Health

    San Francisco, CA
    5 days ago
  • $148.5k - $223.9k

     .... Responsibilities You will work with research scientists, software engineers, product...  ...systems with customers. With your strong technical competence, strategic thinking and...  ...to action, comfortable in fast‑pacing environment. For this role, we mainly look for BS/... 

    Salesforce

    San Francisco, CA
    4 days ago
  • $150k - $220k

    # Founding Member of Technical Staff (MTS)Bay Area, CAFull-time$150k-$220k + equity##...  ...team combines deep AI/ML research leadership at Google...  ...ownership in a fast, evolving environment.## What You'll Do* •Build backend...  ...already* •Experience with RL, reward modeling, LLM evals... 

    VizopsAI

    San Francisco, CA
    1 day ago
  • $150k - $350k

    About Us Sieve is the only AI research lab exclusively focused on video data. We combine exabyte-scale video infrastructure, novel video understanding techniques, and dozens of data sources to develop datasets that push the frontier of video modeling. Video makes up 80... 

    Sieve

    San Francisco, CA
    1 day ago
  •  ...infrastructure for massive, petabyte-scale, multimodal datasets Rapidly iterate on experiments and ablations Stay up-to-date on research to bring new ideas to work What we’re looking for We value a relentless approach to problem-solving, rapid execution, and the ability... 

    Kindredventures

    San Francisco, CA
    1 day ago
  • We're seeking an exceptional AI researcher/engineer to join our team in developing the ego...  ...system for autonomous gameplay in 3D environments. This role combines cutting‑edge research...  ..., with the ability to explain complex technical concepts to both technical and non‑technical... 

    EGO AI

    San Francisco, CA
    5 days ago
  • $150k - $300k

     ...and pair it with the full rl post-training stack: environments, secure sandboxes,...  ...async RL trainer. We enable researchers, startups and enterprises...  ...reliable at scale. Core Technical Responsibilities Infrastructure...  ...and encourage team members to contribute to the broader... 
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime Intellect, Inc.

    San Francisco, CA
    4 days ago
  • About the Role As a Deployed Research Engineer at Sieve, you’ll work on highly specific dataset problems for frontier AI labs and build...  ...or external teams to translate ambiguous needs into concrete technical systems Strong Python developer with hands‑on experience in... 

    Sieve

    San Francisco, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff, RL Research & Environments. Be the first to apply!