Member of Technical Staff, RL Research & Environments
Magic | $200k
Posted Feb 28, 2026 | Full-time | Advanced (5-10 yrs)

Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal.

About The Role

As a Software Engineer on the RL Research & Environments team, you will design and operate the data, evaluation, and environment systems that improve model capabilities after pre-training. This role focuses on post-training: identifying capability gaps, building targeted datasets, designing reward signals, and running iterative training loops that measurably improve user‑facing behavior. You will own the infrastructure and experimental workflows that connect product priorities to concrete capability gains.

Magic’s long‑context models introduce distinct post‑training challenges: long‑horizon reasoning, sustained coherence over extended trajectories, context‑use quality, and tool‑augmented behavior. You will build systems that expose failure modes, generate high‑signal training data, and enable rapid RL iteration at scale.

This role can evolve into ownership of major capability areas, deeper RL systems work, or broader influence over post‑training strategy as Magic scales long‑context model performance and reliability.
What you’ll work on

- Design and build post‑training datasets using synthetic generation, targeted data collection, and self‑play
- Implement filtering, scoring, and mixture strategies for RL and post‑training corpora
- Build and maintain evaluation frameworks that surface long‑context failure modes
- Design reward signals and training environments for targeted capability improvements
- Run ablations across data sources, reward designs, and long‑horizon task structures
- Improve reliability and observability of post‑training data and environment pipelines
- Collaborate closely with Product and Research to translate capability goals into measurable iteration cycles

What we’re looking for

- Strong software engineering fundamentals
- Experience building or operating large‑scale data or ML systems
- Ability to design and interpret experiments that measure model behavior changes
- Comfort working at the intersection of ML, data systems, and infrastructure
- Strong attention to data quality and evaluation rigor
- Track record of owning experimental or production systems end‑to‑end

Compensation, benefits, and perks (US)

- Annual salary range: $200K - $550K, based on experience
- Equity is a significant part of total compensation, in addition to salary
- 401(k) plan with 6% salary matching
- Generous health, dental and vision insurance for you and your dependents
- Unlimited paid time off
- Visa sponsorship and relocation stipend to bring you to SF, if possible
- A small, fast‑paced, highly focused team

Our culture

- Integrity. Words and actions should be aligned
- Hands‑on. At Magic, everyone is building
- Teamwork. We move as one team, not N individuals
- Focus. Safely deploy AGI. Everything else is noise
- Quality. Magic should feel like magic
$200k - $275k
Founding Member of Technical Staff (Research / Post-Training) Applied AI / RL | San Francisco (onsite) | $200k-$275k + 0.25-0.50% equity DeepRec is partnered with... ...core platform focuses on building high‑quality RL environments that simulate tasks across investment banking,...
Suggested | Full time | Visa sponsorship | Relocation package

$300k
Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise....
Suggested | Work at office | Local area

$300k
Member of Technical Staff - RL Algorithms About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans... .... Collaborate with researchers working on environments, evals, interpretability, reward modeling,...
Suggested | Work at office | Local area | Shift work

$180k
Member of Technical Staff - RL Infrastructure About xAI xAI’s mission is to create AI systems that can... ...to increase the productivity of researchers and engineers. Typical problems you... ...we design an efficient and robust environment for the agent to perform actions in...
Suggested | Temporary work

$300k
...Vmax Vmax is an applied research lab developing AI... ...engineers who can turn RL research ideas into working... ...systems, evals, environment and rewards. You will... ...across RL projects. Own technically ambiguous projects end... ...other technical team members. Clear written and verbal...
Suggested | Full time | Work at office | Local area

- ...real production workloads built to scale to gigawatt-class AI datacenters. Mission Gimlet Labs is seeking a Member of Staff focused on AI Research (Intern). As an AI Researcher (Intern), you will be evaluating and implementing techniques to drive performance and...
Internship

- Member of Technical Staff - Research Scientist Patronus AI is a frontier lab developing simulation research and... ...standard for robust, high quality environments. Your work will inform how frontier... ...new training methods for efficient RL. Develop state-of-the-art systems for...

- Member of Technical Staff, Pretraining Science Member of Technical Staff, Pre-Training Science Location... ..., model architecture, and numerics research to the challenges of biology. We are... ...distributed or high-performance computing environments. Excellent written and verbal...
Full time
$150k - $350k
Member of Technical Staff, Applied Research — Sieve Location: San Francisco, CA (Onsite) Compensation: $150,000 - $350,000 base + 0.05% - 0.4% equity Visa Sponsorship: H-1B, O-1, OPT supported Experience Level: 2+ years Employment Type: Full-Time Headcount: 4 open seats...
Full time | H1b | Visa sponsorship

$225k
...believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably... ...approach combines frontier‑scale pre‑training, domain‑specific RL, ultra‑long context, and inference‑time compute to achieve this...
Relocation | Visa sponsorship

$150k
...’s Frontier AI & Robotics (FAR) team is seeking a Member of Technical Staff to drive foundational research and build intelligent robotic systems from the ground... ..., ensuring robust performance in production environments Collaborate with platform and hardware teams to ensure...
Local area

$150k
...’s Frontier AI & Robotics (FAR) team is seeking a Member of Technical Staff to drive foundational research and build intelligent robotic systems from the ground... ...physical robotic platforms in dynamic real‑world environments. You’ll have access to Amazon’s computational...

$245k
...foundation model, bridging research with real‑world impact to solve... ...both virtual and physical environments. Key Job Responsibilities You... ...objectives, mentoring team members, and hiring top talent. Basic... ...and insights to both technical and non‑technical audiences,...

- Member of Technical Staff - Computational Biologist Valthos | Posted Mar 3 Full-time Negotiable Advanced (5-10 yrs) Computational Biologist... ...Contribute to shaping and executing the Valthos-wide research and development roadmap Identify large-scale biological datasets...
Full time | Work at office

- ...people who deeply understand what's possible Massive leverage - the systems you build will multiply the output of every data team member and every stakeholder who needs data Direct impact - small team, no layers of approval. Idea to shipped system in days, not...

- ...session accuracy with millions of real tool calls as baseline data SFT on our agentic traces and RL models on top of our agentic harness and app sandboxes Qualifications research you can independently execute against the research goals you can prototype and test...

- ...simulate, and reason over 3D environments for embodied AI, robotics... ...are looking for exceptional research engineers and applied... ...We're looking for a Member of Technical Staff - Embodied Agents to help... ...Policy optimization RL and imitation learning Agent...

- About the Role We are looking for exceptional researchers and research engineers to design and build the next generation of AI benchmarks... ...design, statistical analysis, and evaluation frameworks Technical skills : Proficiency in Python for research and experimentation...
Work experience placement | Relocation | Relocation package

- ...precedents to copy from. About the Role Members of Technical Staff (MTS) are the senior engineers who... ...expertise. You do not need to be an ML researcher. You need to be the kind of engineer who... ...observability. Background in offline RL, contextual bandits, or sequential...
$150k - $300k
...and pair it with the full RL post‑training stack: environments, secure sandboxes,... ...async RL trainer. We enable researchers, startups and enterprises... ...RL training stack. Core Technical Responsibilities LLM Serving... ...development and encourage team members to contribute to the...
Work at office | Remote work | Visa sponsorship | Relocation package | Flexible hours | Shift work

$300k
Member of Technical Staff - Mechanistic Interpretability About V max V max is an applied research lab developing AI capable of open-ended learning. We... ...representations evolve during RL and post-training, and use... ...tools, and RL environments. Define and pursue a high-...
Work at office | Local area

- Member of Technical Staff - Post‑Training Join to apply for the Member of Technical... ...states. Our team of AI researchers and company builders come from... ...data, reward modeling, or RL techniques. Evidence of... ...paced, high‑agency startup environment; bias toward action and clarity...
Full time | Relocation package

- ...customers. Cohere is a team of researchers, engineers, designers, and... ...remote-friendly! As a Member of Technical Staff, you will: Design and write... ...both on the SFT and the RL regime. Research, implement... ...to create an inclusive work environment for all. We welcome applicants...
Full time | Work at office | Remote work | Flexible hours
$148.5k - $223.9k
.... Responsibilities You will work with research scientists, software engineers, product... ...systems with customers. With your strong technical competence, strategic thinking and... ...to action, comfortable in a fast‑paced environment. For this role, we mainly look for BS/...

$150k - $220k
Founding Member of Technical Staff (MTS) | Bay Area, CA | Full-time | $150k-$220k + equity... ...team combines deep AI/ML research leadership at Google... ...ownership in a fast, evolving environment. What You'll Do: Build backend... ...already. Experience with RL, reward modeling, LLM evals...

$150k - $350k
About Us Sieve is the only AI research lab exclusively focused on video data. We combine exabyte-scale video infrastructure, novel video understanding techniques, and dozens of data sources to develop datasets that push the frontier of video modeling. Video makes up 80...

- ...infrastructure for massive, petabyte-scale, multimodal datasets Rapidly iterate on experiments and ablations Stay up-to-date on research to bring new ideas to work What we’re looking for We value a relentless approach to problem-solving, rapid execution, and the ability...

- We're seeking an exceptional AI researcher/engineer to join our team in developing the ego... ...system for autonomous gameplay in 3D environments. This role combines cutting‑edge research... ..., with the ability to explain complex technical concepts to both technical and non‑technical...
$150k - $300k
...and pair it with the full rl post-training stack: environments, secure sandboxes,... ...async RL trainer. We enable researchers, startups and enterprises... ...reliable at scale. Core Technical Responsibilities Infrastructure... ...and encourage team members to contribute to the broader...
Work at office | Remote work | Visa sponsorship | Relocation package | Flexible hours

- About the Role As a Deployed Research Engineer at Sieve, you’ll work on highly specific dataset problems for frontier AI labs and build... ...or external teams to translate ambiguous needs into concrete technical systems Strong Python developer with hands‑on experience in...

