Member of Technical Staff (RL Research & Environments)
$200kMagic Inc
Member of Technical Staff, RL Research & Environments Posted Feb 28, 2026 | Full-time | Advanced (5-10 yrs) Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal. About The Role As a Software Engineer on the RL Research & Environments team, you will design and operate the data, evaluation, and environment systems that improve model capabilities after pre-training. This role focuses on post-training: identifying capability gaps, building targeted datasets, designing reward signals, and running iterative training loops that measurably improve user‑facing behavior. You will own the infrastructure and experimental workflows that connect product priorities to concrete capability gains. Magic’s long‑context models introduce distinct post‑training challenges: long‑horizon reasoning, sustained coherence over extended trajectories, context‑use quality, and tool‑augmented behavior. You will build systems that expose failure modes, generate high‑signal training data, and enable rapid RL iteration at scale. This role can evolve into ownership of major capability areas, deeper RL systems work, or broader influence over post‑training strategy as Magic scales long‑context model performance and reliability. What you’ll work on Design and build post‑training datasets using synthetic generation, targeted data collection, and self‑play Implement filtering, scoring, and mixture strategies for RL and post‑training corpora Build and maintain evaluation frameworks that surface long‑context failure modes Design reward signals and training environments for targeted capability improvements Run ablations across data sources, reward designs, and long‑horizon task structures Improve reliability and observability of post‑training data and environment pipelines Collaborate closely with Product and Research to translate capability goals into measurable iteration cycles What we’re looking for Strong software engineering fundamentals Experience building or operating large‑scale data or ML systems Ability to design and interpret experiments that measure model behavior changes Comfort working at the intersection of ML, data systems, and infrastructure Strong attention to data quality and evaluation rigor Track record of owning experimental or production systems end‑to‑end Compensation, benefits, and perks (US) Annual salary range: between $200K - $550K based on experience Equity is a significant part of total compensation, in addition to salary 401(k) plan with 6% salary matching Generous health, dental and vision insurance for you and your dependents Unlimited paid time off Visa sponsorship and relocation stipend to bring you to SF, if possible A small, fast‑paced, highly focused team Our culture Integrity. Words and actions should be aligned Hands‑on. At Magic, everyone is building Teamwork. We move as one team, not N individuals Focus. Safely deploy AGI. Everything else is noise Quality. Magic should feel like magic Compensation Range: $200K - $550K #J-18808-Ljbffr
$180k
...Member Of Technical Staff - RL Infrastructure Palo Alto, CA xAI's mission is to create AI systems... ...to increase the productivity of researchers and engineers. Typical problems... ...we design an efficient and robust environment for the agent to perform actions in...SuggestedTemporary work- ...Job Description As a Member of Technical Staff (Research) at Trajectory, you will design and build the post‑training stack that lets our customers... ...or serving large language models Experience building RL environments or evaluation suites for LLMs Proficiency in PyTorch,...Suggested
- Job Description As a Member of Technical Staff (Research) at Trajectory, you will design and build the post‑training stack that lets our customers... ...or serving large language models Experience building RL environments or evaluation suites for LLMs Proficiency in PyTorch,...Suggested
$300k
Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise....SuggestedWork at officeLocal area$300k
Member of Technical Staff - RL Algorithms About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans... .... Collaborate with researchers working on environments, evals, interpretability, reward modeling,...SuggestedWork at officeLocal areaShift work$150k
...We're a new research lab in San Francisco, currently... ...reinforcement learning (RL) to solve reasoning... ...generalizing agents to physical environments. Our work builds on... ...The Product Manager - Technical role for the AGI... ...employees, supervisors, and staff; adhere to standards...Local area- ...and help define the infrastructure layer for the future of AI. About the role Gimlet Labs is seeking an Member of Technical Staff focused on AI research. As an AI Researcher, you will be evaluating and implementing techniques to drive performance and...
$150k
...'s Frontier AI & Robotics (FAR) team is seeking a Member of Technical Staff to drive foundational research and build intelligent robotic systems from the ground... ...algorithms perform robustly in dynamic real-world environments. You'll?have the freedom to pursue ambitious...Local area- Member of Technical Staff - Research Scientist Patronus AI is a frontier lab developing simulation research and... ...standard for robust, high quality environments. Your work will inform how frontier... ...new training methods for efficient RL. Develop state-of-the-art systems for...
$150k
...’s Frontier AI & Robotics (FAR) team is seeking a Member of Technical Staff to drive foundational research and build intelligent robotic systems from the ground... ...physical robotic platforms in dynamic real‑world environments. You’ll have access to Amazon’s computational...- Member of Technical Staff, Pretraining Science Member of Technical Staff, Pre-Training Science Location... ..., model architecture, and numerics research to the challenges of biology. We are... ...distributed or high-performance computing environments. Excellent written and verbal...Full time
$148.5k - $223.9k
...of Salesforce. Salesforce AI Research is looking for a Machine Learning... ...customers. With your strong technical competence, strategic... ...such as LLM, pre/post‑training, RL, agentic system. Prioritizes... ..., comfortable in fast‑pacing environment. Unleash Your Potential When...$150k - $350k
Member of Technical Staff, Applied Research — Sieve Location: San Francisco, CA (Onsite) Compensation: $150,000 - $350,000 base + 0.05% - 0.4% equity Visa Sponsorship: H-1B, O-1, OPT supported Experience Level: 2+ years Employment Type: Full-Time Headcount: 4 open seats...Full timeH1bVisa sponsorship- Member of Technical Staff - Computational Biologist Valthos | Posted Mar 3 Full-time Negotiable Advanced (5-10 yrs) Computational Biologist... ...Contribute to shaping and executing the Valthos-wide research and development roadmap Identify large-scale biological datasets...Full timeWork at office
$200k - $400k
...generative agents based on real humans. Our research pioneered the field of AI-based... ...Rauch. About the Role As a Member of Technical Staff (MTS) in Research, you will work across... ...the final deployment in a production environment. Nice to Haves Academic & Technical...Flexible hours$148.5k - $223.9k
...future of Salesforce. Salesforce AI Research is looking for a Machine Learning Engineer... ...with customers. With your strong technical competence, strategic thinking and customer... ...to action, comfortable in fast-pacing environment. Competitive coding winner (ACM-...$200k
...promising path to safe AGI lies in automating research and code generation to improve models... ...-scale pre-training, domain-specific RL, ultra-long context, and inference-... ...'s most important decisions. As a Member of Technical Staff on Evals, you will build both the...Visa sponsorshipRelocation package- ...people who deeply understand what's possible Massive leverage - the systems you build will multiply the output of every data team member and every stakeholder who needs data Direct impact - small team, no layers of approval. Idea to shipped system in days, not...
- ...having proprietary data to post-train on and unique usage to RL on possess strong opinions about how AI agents and interfaces... ...feel equally strong obligations to both 1) choose good and 2) to win think that this role should be renamed "member of tomo staff"...Immediate start
- ...Researcher And Research Engineer Role We are looking for exceptional researchers and research engineers to design and build the next... ...design, statistical analysis, and evaluation frameworks Technical skills: Proficiency in Python for research and experimentation...Work experience placementRelocationRelocation package
$150k - $300k
...and pair it with the full RL post-training stack: environments, secure sandboxes,... ...async RL trainer. We enable researchers, startups, and enterprises... ...the jobs. Core Technical Responsibilities Hosted... ...development and encourage team members to contribute to the...Work at officeLocal areaRemote workVisa sponsorshipRelocation packageFlexible hours- ...re looking for someone highly technical (our current team includes 3... ...Listen Labs is an AI-powered research platform that helps teams uncover... ...time off to recharge, and an environment that values balance and trust. Room to Grow: As an early member of the team, you’ll have the...Flexible hours
- ...session accuracy with millions of real tool calls as baseline data SFT on our agentic traces and RL models on top of our agentic harness and app sandboxes Qualifications research you can independently execute against the research goals you can prototype and test...
- ...even nation states. Our team of AI researchers and company builders come from... ...powers our models. This includes RL training loops, distributed GPU systems... .... You thrive in high-agency environments and enjoy solving hard technical problems. What We Offer: We...Relocation package
- ...infrastructure for massive, petabyte-scale, multimodal datasets Rapidly iterate on experiments and ablations Stay up-to-date on research to bring new ideas to work What we’re looking for We value a relentless approach to problem-solving, rapid execution, and the ability...
$150k - $300k
...and pair it with the full rl post-training stack: environments, secure sandboxes,... ...async RL trainer. We enable researchers, startups and enterprises... ...RL training stack. Core Technical Responsibilities LLM Serving... ...development and encourage team members to contribute to the...Work at officeRemote workVisa sponsorshipRelocation packageFlexible hoursShift work- ...sit on top of our leading RL infrastructure. You’ll work... ...move quickly in a zero-to-one environment Preferred Experience... ...Mosaic, Adept, and Windsurf RL researchers, raised $15M led by... ...building our team of founding Members of Technical Staff to design the frontier of continually...
$150k - $300k
...and pair it with the full rl post-training stack: environments, secure sandboxes,... ...async RL trainer. We enable researchers, startups and enterprises... ...reliable at scale. Core Technical Responsibilities Infrastructure... ...and encourage team members to contribute to the broader...Work at officeRemote workVisa sponsorshipRelocation packageFlexible hours- About the Role As a Deployed Research Engineer at Sieve, you’ll work on highly specific dataset problems for frontier AI labs and build... ...or external teams to translate ambiguous needs into concrete technical systems Strong Python developer with hands‑on experience in...
$150k - $350k
About Us Sieve is the only AI research lab exclusively focused on video data. We combine exabyte-scale video infrastructure, novel video understanding techniques, and dozens of data sources to develop datasets that push the frontier of video modeling. Video makes up 80...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Member of Technical Staff (RL Research & Environments). Be the first to apply!
- technical support associate San Francisco, CA
- decision support analyst San Francisco, CA
- desktop support analyst San Francisco, CA
- senior technical analyst San Francisco, CA
- user support analyst San Francisco, CA
- customer support technician San Francisco, CA
- technical support analyst San Francisco, CA
- support analyst San Francisco, CA
- tech assistant San Francisco, CA
- technical support specialist San Francisco, CA

