Member of Technical Staff - RL Algorithms

$300k

Full-time

Vmax

About Vmax Vmax is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise. About the role RL has become the de-facto method of post-training LLMs. We are limited by the sample efficiency of the current policy gradient algorithms in use today, and are looking for a talented researcher to weave together pre-LLM and post-LLM approaches to learning from experience. Responsibilities Develop new RL algorithms for post-training language models. Adapt ideas from pre-LLM reinforcement learning, such as model-based RL, temporal abstraction, and value-based learning, to modern LLM and agentic settings. Establish empirical baselines and evaluation protocols for measuring sample efficiency, robustness, generalization, and reward exploitation in LLM RL. Analyze failure modes of RL-trained models, including reward hacking, mode collapse, over-optimization, exploration failures, and distribution shift. Collaborate with researchers working on environments, evals, interpretability, reward modeling, and infrastructure to turn algorithmic ideas into reliable training systems. Own and develop a research agenda within Vmax, from identifying promising directions to executing experiments and communicating results. Minimum Requirements PhD or equivalent experience in machine learning, reinforcement learning, or a closely related field. Track record of research excellence, as demonstrated by publications, open source work, deployed AI systems, or other substantial technical contributions. Deep understanding of modern machine learning, especially reinforcement learning, representation learning, and large language models. Strong familiarity with LLM post-training methods. Experience designing and running rigorous ML experiments, including ablations, baselines, evaluation design, and failure analysis. Experience with large-scale ML infrastructure, distributed training, experiment tracking, data pipelines, and debugging unstable training runs. Expertise with Python and at least one major ML framework such as PyTorch or JAX. Ability to work independently on open-ended research problems and turn ambiguous ideas into concrete experimental programs. Nice to have Experience developing new RL algorithms or improving existing ones in domains such as robotics, games, simulated control, language models, or agents. Experience with LLM pre-training. Strong understanding of reward modeling, verifiers, process supervision, outcome supervision, or automated evaluation systems. Demonstrated software engineering ability Strong communication skills, especially the ability to explain algorithmic ideas, empirical results, and research implications to both technical and non-technical audiences Role specific location policy This role is based in our San Francisco office; for exceptional candidates we are willing to consider a hybrid arrangement Compensation The expected salary range for this position is $300,000 - $500,000 USD

Apply

Vacancy posted 11 hours ago

Similar jobs that could be interesting for youBased on the Member of Technical Staff - RL Algorithms in San Francisco, CA vacancy

Member of Technical Staff - RL Algorithms
$300k
Member of Technical Staff - RL Algorithms About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise. About...
Suggested
Work at office
Local area
Shift work
Vmax
San Francisco, CA
5 days ago
Member of Technical Staff - Applied RL
$300k
...exceptional ML engineers who can turn RL research ideas into working... ...whether failures come from algorithms, rewards, data,... ...experimentation across RL projects. Own technically ambiguous projects end to end... ...by other technical team members. Clear written and verbal communication...
Suggested
Full time
Work at office
Local area
Vmax
San Francisco, CA
11 hours ago
Member of Technical Staff - RL Infrastructure
$180k
...Member Of Technical Staff - RL Infrastructure Palo Alto, CA xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence...
Suggested
Temporary work
Xai
San Francisco, CA
17 hours ago
Member of Technical Staff - RL Infrastructure
$300k
Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise....
Suggested
Work at office
Local area
Vmax
San Francisco, CA
5 days ago
Member of Technical Staff, RL Research & Environments
$200k
Member of Technical Staff, RL Research & Environments Posted Feb 28, 2026 | Full-time | Advanced (5-10 yrs) Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI...
Suggested
Full time
Relocation
Visa sponsorship
Magic
San Francisco, CA
1 day ago
Member of Technical Staff, Inference & RL Systems
$225k
...improve models and solve alignment more reliably than humans can alone. Our approach combines frontier‑scale pre‑training, domain‑specific RL, ultra‑long context, and inference‑time compute to achieve this goal. About The Role As a Software Engineer on the Inference & RL...
Relocation
Visa sponsorship
Magic
San Francisco, CA
5 days ago
Member of Technical Staff - Inference
$150k - $300k
...plane and pair it with the full rl post-training stack:... ...our RL training stack. Core Technical Responsibilities LLM Serving... ...Design placement and scheduling algorithms for heterogeneous accelerators... ...development and encourage team members to contribute to the broader...
Work at office
Remote work
Visa sponsorship
Relocation package
Flexible hours
Shift work
Prime-Intellect
San Francisco, CA
3 days ago
Member of Technical Staff - Training Platform
$150k - $300k
...plane and pair it with the full RL post-training stack:... ...runs the jobs. Core Technical Responsibilities Hosted Training... ...(new model architectures, RL algorithms, modes) Technical Requirements... ...and encourage team members to contribute to the broader...
Work at office
Local area
Remote work
Visa sponsorship
Relocation package
Flexible hours
Prime Intellect
San Francisco, CA
16 hours ago
Member of Technical Staff - Post-Training
Member of Technical Staff - Post‑Training Join to apply for the Member of Technical Staff - Post‑Training... ...reward models, reinforcement learning algorithms, and inference‑time scaling techniques... ...through data, reward modeling, or RL techniques. Evidence of owning ambitious...
Full time
Relocation package
Reflection AI
San Francisco, CA
4 days ago
Member of Technical Staff
...having proprietary data to post-train on and unique usage to RL on possess strong opinions about how AI agents and interfaces... ...feel equally strong obligations to both 1) choose good and 2) to win think that this role should be renamed "member of tomo staff"...
Immediate start
Tomo
San Francisco, CA
3 days ago
Member of Technical Staff
...requirements, and very few precedents to copy from. About the Role Members of Technical Staff (MTS) are the senior engineers who build the platform that... ..., including evals and observability. Background in offline RL, contextual bandits, or sequential decision‑making (for the...
Beacon Software
San Francisco, CA
5 days ago
Member of Technical Staff
...utilise, and a feedback loop tight enough to discover new algorithms autonomously. The world's most capable AI models are being... ...ours at the frontier of science. Role Overview As a Member of Technical Staff you will shape Conductor's core offerings: AI software that...
Conductor Quantum, Inc.
San Francisco, CA
1 day ago
Member of Technical Staff - Generalist
...the core product interfaces that sit on top of our leading RL infrastructure. You’ll work directly with the founders and... ...their core agent products. We’re building our team of founding Members of Technical Staff to design the frontier of continually learning systems. #J-...
Trajectory
San Francisco, CA
3 days ago
Founding Member of Technical Staff (MTS)
$150k - $220k
# Founding Member of Technical Staff (MTS)Bay Area, CAFull-time$150k-$220k + equity## About UsVizopsAI is the secure runtime for custom enterprise... ...don't expect people to know them already* •Experience with RL, reward modeling, LLM evals, or agent stacks (retrievers, tool...
VizopsAI
San Francisco, CA
2 days ago
Member of the Technical Staff
$110k - $350k
...Technology is a firm based in San Francisco that develops algorithms, models, and software systems to solve complex problems... ...the largest companies in the world. About the role As a Member of the Technical Staff, you will join a team with deep expertise in machine learning...
Work at office
Relocation package
Kiso Technology
San Francisco, CA
5 days ago
Member of Technical Staff - Mechanistic Interpretability
$300k
Member of Technical Staff - Mechanistic Interpretability About V max V max is an applied research lab developing AI capable of open-ended learning... ...hacking. Investigate how internal representations evolve during RL and post-training, and use these insights to improve...
Work at office
Local area
Vmax
San Francisco, CA
3 days ago
Member of Technical Staff - Science, Frontier AI & Robotics (FAR)
$150k
...Amazon's Frontier AI & Robotics (FAR) team is seeking a Member of Technical Staff to drive foundational research and build intelligent robotic... ...collaborating with platform teams to ensure your models and algorithms perform robustly in dynamic real-world environments. You'...
Local area
Amazon
San Francisco, CA
3 days ago
Senior Member of Technical Staff
...Member Of Technical Staff Atomic is the leading venture studio for company creation, partnering with extraordinary founders to launch businesses... ...AI: Experienced in prompting and familiarity with Evals and RL We are focused on building a diverse and inclusive...
Local area
Atomic VC
San Francisco, CA
4 days ago
Member of ML Technical Staff
$200k - $350k
...Member of ML Technical Staff Title of Role: Member of ML Technical Staff Location: San Francisco, onsite Company Stage of Funding... ...in a team setting. ~ Familiarity with machine learning algorithms and data structures. Preferred Experience with...
Work at office
Visa sponsorship
Recruiting from Scratch
San Francisco, CA
1 day ago
Founding Member of Technical Staff (Research/Post-training)
$200k - $275k
Founding Member of Technical Staff (Research / Post-Training) Applied AI / RL | San Francisco (onsite) | $200k-$275k + 0.25-0.50% equity DeepRec is partnered with a YC‑backed (S25), seed‑stage applied AI and data company working at the cutting edge of reinforcement learning...
Full time
Visa sponsorship
Relocation package
DeepRec.ai
San Francisco, CA
4 days ago
Member of Technical Staff - Applied Research
Member of Technical Staff - Applied Research Patronus AI is a frontier lab developing simulation research and infrastructure to accelerate progress... ...behavior analysis and new training methods for efficient RL Drive novel research in RL and environment design, including...
Patronus AI, Inc.
San Francisco, CA
2 days ago
Member of Technical Staff, Pretraining Science
Member of Technical Staff, Pretraining Science Member of Technical Staff, Pre-Training Science Location: SF Bay Area or Tokyo, Japan Type: Full... ...biological modalities and tasks. Work on architecture, algorithms, and optimization. Evaluate ideas in model design, optimization...
Full time
Radical Numerics
San Francisco, CA
4 days ago
Member of Technical Staff, Computational Biology
...The Role We are seeking a highly motivated and experienced Member of Technical Staff, Computational Biology to join our dynamic R&D team. In... ...massive datasets. Innovate & Create Tools: Develop novel algorithms, computational approaches, and internal tools to solve unique...
Algen Biotechnologies
San Francisco, CA
2 days ago
Member of Technical Staff - Sandbox Platform
$150k - $300k
...control plane and pair it with the full rl post-training stack: environments, secure... ...fast, robust, and reliable at scale. Core Technical Responsibilities Infrastructure... ...believe in open development and encourage team members to contribute to the broader AI community...
Work at office
Remote work
Visa sponsorship
Relocation package
Flexible hours
Prime Intellect, Inc.
San Francisco, CA
5 days ago
Member of Technical Staff, Post-Training
...and New York but also embrace being remote-friendly! As a Member of Technical Staff, you will: Design and write high-performant and scalable software... ...and results of our training cycles both on the SFT and the RL regime. Research, implement, and experiment with ideas on...
Full time
Work at office
Remote work
Flexible hours
Jaide Health
San Francisco, CA
1 day ago
Member of Technical Staff - Hardware Science, Frontier AI & Robotics (FAR)
$150k
Description Amazon’s Frontier AI & Robotics (FAR) team is seeking a Member of Technical Staff to drive foundational research and build intelligent... ...and controls engineering teams to ensure your models and algorithms perform robustly on physical robotic platforms in dynamic...
Amazon Science
San Francisco, CA
1 day ago
Member of Technical Staff, Hardware Security Modules
Member of Technical Staff, Hardware Security Modules At Anchorage Digital, we are building the world’s most advanced digital asset platform for... ...systems. You have a deep understanding of common crypto algorithms (RSA, ECDSA, BLS, etc.), code signing, network security, authentication...
Flexible hours
Crypto Pro Network
San Francisco, CA
2 days ago
Member of Technical Staff - Embodied Agents
...frontier of interactive AI. The Role We're looking for a Member of Technical Staff - Embodied Agents to help build general-purpose agents... ...Discrete action systems Policy optimization RL and imitation learning Agent-environment interaction systems...
Moonlake AI
San Francisco, CA
3 days ago
Member of Technical Staff, Infrastructure
...designing, building, and scaling core infrastructure that powers a high-volume data platform for AI applications. We are looking for team members who love building enabling systems that empower our engineers and power our rapidly growing product. We're looking for folks...
Work at office
LlamaIndex
San Francisco, CA
4 days ago
Member of Technical Staff, Product
...Member of Technical Staff, Product TL;DR: Listen is building the human layer of AI. We're Sequoia-backed, raised $100M, and our customers include Anthropic, Google, and Cursor. We're hiring engineers who can build a complex AI-native product on a small team of former...
Flexible hours
Shift work
Listen Labs
San Francisco, CA
5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff - RL Algorithms. Be the first to apply!