Member of Technical Staff - RL Algorithms
$300kVmax
About Vmax Vmax is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise. About the role RL has become the de-facto method of post-training LLMs. We are limited by the sample efficiency of the current policy gradient algorithms in use today, and are looking for a talented researcher to weave together pre-LLM and post-LLM approaches to learning from experience. Responsibilities Develop new RL algorithms for post-training language models. Adapt ideas from pre-LLM reinforcement learning, such as model-based RL, temporal abstraction, and value-based learning, to modern LLM and agentic settings. Establish empirical baselines and evaluation protocols for measuring sample efficiency, robustness, generalization, and reward exploitation in LLM RL. Analyze failure modes of RL-trained models, including reward hacking, mode collapse, over-optimization, exploration failures, and distribution shift. Collaborate with researchers working on environments, evals, interpretability, reward modeling, and infrastructure to turn algorithmic ideas into reliable training systems. Own and develop a research agenda within Vmax, from identifying promising directions to executing experiments and communicating results. Minimum Requirements PhD or equivalent experience in machine learning, reinforcement learning, or a closely related field. Track record of research excellence, as demonstrated by publications, open source work, deployed AI systems, or other substantial technical contributions. Deep understanding of modern machine learning, especially reinforcement learning, representation learning, and large language models. Strong familiarity with LLM post-training methods. Experience designing and running rigorous ML experiments, including ablations, baselines, evaluation design, and failure analysis. Experience with large-scale ML infrastructure, distributed training, experiment tracking, data pipelines, and debugging unstable training runs. Expertise with Python and at least one major ML framework such as PyTorch or JAX. Ability to work independently on open-ended research problems and turn ambiguous ideas into concrete experimental programs. Nice to have Experience developing new RL algorithms or improving existing ones in domains such as robotics, games, simulated control, language models, or agents. Experience with LLM pre-training. Strong understanding of reward modeling, verifiers, process supervision, outcome supervision, or automated evaluation systems. Demonstrated software engineering ability Strong communication skills, especially the ability to explain algorithmic ideas, empirical results, and research implications to both technical and non-technical audiences Role specific location policy This role is based in our San Francisco office; for exceptional candidates we are willing to consider a hybrid arrangement Compensation The expected salary range for this position is $300,000 - $500,000 USD
$300k
Member of Technical Staff - RL Algorithms About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise. About...SuggestedWork at officeLocal areaShift work$300k
...exceptional ML engineers who can turn RL research ideas into working... ...whether failures come from algorithms, rewards, data,... ...experimentation across RL projects. Own technically ambiguous projects end to end... ...by other technical team members. Clear written and verbal communication...SuggestedFull timeWork at officeLocal area$180k
...Member Of Technical Staff - RL Infrastructure Palo Alto, CA xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence...SuggestedTemporary work$300k
Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise....SuggestedWork at officeLocal area$200k
Member of Technical Staff, RL Research & Environments Posted Feb 28, 2026 | Full-time | Advanced (5-10 yrs) Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI...SuggestedFull timeRelocationVisa sponsorship$225k
...improve models and solve alignment more reliably than humans can alone. Our approach combines frontier‑scale pre‑training, domain‑specific RL, ultra‑long context, and inference‑time compute to achieve this goal. About The Role As a Software Engineer on the Inference & RL...RelocationVisa sponsorship$150k - $300k
...plane and pair it with the full rl post-training stack:... ...our RL training stack. Core Technical Responsibilities LLM Serving... ...Design placement and scheduling algorithms for heterogeneous accelerators... ...development and encourage team members to contribute to the broader...Work at officeRemote workVisa sponsorshipRelocation packageFlexible hoursShift work$150k - $300k
...plane and pair it with the full RL post-training stack:... ...runs the jobs. Core Technical Responsibilities Hosted Training... ...(new model architectures, RL algorithms, modes) Technical Requirements... ...and encourage team members to contribute to the broader...Work at officeLocal areaRemote workVisa sponsorshipRelocation packageFlexible hours- Member of Technical Staff - Post‑Training Join to apply for the Member of Technical Staff - Post‑Training... ...reward models, reinforcement learning algorithms, and inference‑time scaling techniques... ...through data, reward modeling, or RL techniques. Evidence of owning ambitious...Full timeRelocation package
- ...having proprietary data to post-train on and unique usage to RL on possess strong opinions about how AI agents and interfaces... ...feel equally strong obligations to both 1) choose good and 2) to win think that this role should be renamed "member of tomo staff"...Immediate start
- ...requirements, and very few precedents to copy from. About the Role Members of Technical Staff (MTS) are the senior engineers who build the platform that... ..., including evals and observability. Background in offline RL, contextual bandits, or sequential decision‑making (for the...
- ...utilise, and a feedback loop tight enough to discover new algorithms autonomously. The world's most capable AI models are being... ...ours at the frontier of science. Role Overview As a Member of Technical Staff you will shape Conductor's core offerings: AI software that...
- ...the core product interfaces that sit on top of our leading RL infrastructure. You’ll work directly with the founders and... ...their core agent products. We’re building our team of founding Members of Technical Staff to design the frontier of continually learning systems. #J-...
$150k - $220k
# Founding Member of Technical Staff (MTS)Bay Area, CAFull-time$150k-$220k + equity## About UsVizopsAI is the secure runtime for custom enterprise... ...don't expect people to know them already* •Experience with RL, reward modeling, LLM evals, or agent stacks (retrievers, tool...$110k - $350k
...Technology is a firm based in San Francisco that develops algorithms, models, and software systems to solve complex problems... ...the largest companies in the world. About the role As a Member of the Technical Staff, you will join a team with deep expertise in machine learning...Work at officeRelocation package$300k
Member of Technical Staff - Mechanistic Interpretability About V max V max is an applied research lab developing AI capable of open-ended learning... ...hacking. Investigate how internal representations evolve during RL and post-training, and use these insights to improve...Work at officeLocal area$150k
...Amazon's Frontier AI & Robotics (FAR) team is seeking a Member of Technical Staff to drive foundational research and build intelligent robotic... ...collaborating with platform teams to ensure your models and algorithms perform robustly in dynamic real-world environments. You'...Local area- ...Member Of Technical Staff Atomic is the leading venture studio for company creation, partnering with extraordinary founders to launch businesses... ...AI: Experienced in prompting and familiarity with Evals and RL We are focused on building a diverse and inclusive...Local area
$200k - $350k
...Member of ML Technical Staff Title of Role: Member of ML Technical Staff Location: San Francisco, onsite Company Stage of Funding... ...in a team setting. ~ Familiarity with machine learning algorithms and data structures. Preferred Experience with...Work at officeVisa sponsorship$200k - $275k
Founding Member of Technical Staff (Research / Post-Training) Applied AI / RL | San Francisco (onsite) | $200k-$275k + 0.25-0.50% equity DeepRec is partnered with a YC‑backed (S25), seed‑stage applied AI and data company working at the cutting edge of reinforcement learning...Full timeVisa sponsorshipRelocation package- Member of Technical Staff - Applied Research Patronus AI is a frontier lab developing simulation research and infrastructure to accelerate progress... ...behavior analysis and new training methods for efficient RL Drive novel research in RL and environment design, including...
- Member of Technical Staff, Pretraining Science Member of Technical Staff, Pre-Training Science Location: SF Bay Area or Tokyo, Japan Type: Full... ...biological modalities and tasks. Work on architecture, algorithms, and optimization. Evaluate ideas in model design, optimization...Full time
- ...The Role We are seeking a highly motivated and experienced Member of Technical Staff, Computational Biology to join our dynamic R&D team. In... ...massive datasets. Innovate & Create Tools: Develop novel algorithms, computational approaches, and internal tools to solve unique...
$150k - $300k
...control plane and pair it with the full rl post-training stack: environments, secure... ...fast, robust, and reliable at scale. Core Technical Responsibilities Infrastructure... ...believe in open development and encourage team members to contribute to the broader AI community...Work at officeRemote workVisa sponsorshipRelocation packageFlexible hours- ...and New York but also embrace being remote-friendly! As a Member of Technical Staff, you will: Design and write high-performant and scalable software... ...and results of our training cycles both on the SFT and the RL regime. Research, implement, and experiment with ideas on...Full timeWork at officeRemote workFlexible hours
$150k
Description Amazon’s Frontier AI & Robotics (FAR) team is seeking a Member of Technical Staff to drive foundational research and build intelligent... ...and controls engineering teams to ensure your models and algorithms perform robustly on physical robotic platforms in dynamic...- Member of Technical Staff, Hardware Security Modules At Anchorage Digital, we are building the world’s most advanced digital asset platform for... ...systems. You have a deep understanding of common crypto algorithms (RSA, ECDSA, BLS, etc.), code signing, network security, authentication...Flexible hours
- ...frontier of interactive AI. The Role We're looking for a Member of Technical Staff - Embodied Agents to help build general-purpose agents... ...Discrete action systems Policy optimization RL and imitation learning Agent-environment interaction systems...
- ...designing, building, and scaling core infrastructure that powers a high-volume data platform for AI applications. We are looking for team members who love building enabling systems that empower our engineers and power our rapidly growing product. We're looking for folks...Work at office
- ...Member of Technical Staff, Product TL;DR: Listen is building the human layer of AI. We're Sequoia-backed, raised $100M, and our customers include Anthropic, Google, and Cursor. We're hiring engineers who can build a complex AI-native product on a small team of former...Flexible hoursShift work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Member of Technical Staff - RL Algorithms. Be the first to apply!
- technical support assistant San Francisco, CA
- technical analyst San Francisco, CA
- end user support technician San Francisco, CA
- IT assistant San Francisco, CA
- help desk assistant San Francisco, CA
- IT support technician San Francisco, CA
- operations support technician San Francisco, CA
- desktop support analyst San Francisco, CA
- support analyst San Francisco, CA
- technical associate San Francisco, CA

