Member of Technical Staff - RL Algorithms

$300k
Full-time

Vmax

About Vmax

Vmax is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise.

About the role

RL has become the de-facto method of post-training LLMs. We are limited by the sample efficiency of the current policy gradient algorithms in use today, and are looking for a talented researcher to weave together pre-LLM and post-LLM approaches to learning from experience.

Responsibilities

  • Develop new RL algorithms for post-training language models.
  • Adapt ideas from pre-LLM reinforcement learning, such as model-based RL, temporal abstraction, and value-based learning, to modern LLM and agentic settings.
  • Establish empirical baselines and evaluation protocols for measuring sample efficiency, robustness, generalization, and reward exploitation in LLM RL.
  • Analyze failure modes of RL-trained models, including reward hacking, mode collapse, over-optimization, exploration failures, and distribution shift.
  • Collaborate with researchers working on environments, evals, interpretability, reward modeling, and infrastructure to turn algorithmic ideas into reliable training systems.
  • Own and develop a research agenda within Vmax, from identifying promising directions to executing experiments and communicating results.

Minimum Requirements

  • PhD or equivalent experience in machine learning, reinforcement learning, or a closely related field.
  • Track record of research excellence, as demonstrated by publications, open source work, deployed AI systems, or other substantial technical contributions.
  • Deep understanding of modern machine learning, especially reinforcement learning, representation learning, and large language models.
  • Strong familiarity with LLM post-training methods.
  • Experience designing and running rigorous ML experiments, including ablations, baselines, evaluation design, and failure analysis.
  • Experience with large-scale ML infrastructure, distributed training, experiment tracking, data pipelines, and debugging unstable training runs.
  • Expertise with Python and at least one major ML framework such as PyTorch or JAX.
  • Ability to work independently on open-ended research problems and turn ambiguous ideas into concrete experimental programs.

Nice to have

  • Experience developing new RL algorithms or improving existing ones in domains such as robotics, games, simulated control, language models, or agents.
  • Experience with LLM pre-training.
  • Strong understanding of reward modeling, verifiers, process supervision, outcome supervision, or automated evaluation systems.
  • Demonstrated software engineering ability.
  • Strong communication skills, especially the ability to explain algorithmic ideas, empirical results, and research implications to both technical and non-technical audiences.

Role specific location policy

This role is based in our San Francisco office; for exceptional candidates we are willing to consider a hybrid arrangement.

Compensation

The expected salary range for this position is $300,000 - $500,000 USD.

Vacancy posted 11 hours ago
Similar jobs that could be interesting for you
Based on the Member of Technical Staff - RL Algorithms in San Francisco, CA vacancy
  • $300k

    Member of Technical Staff - RL Algorithms About Vmax Vmax is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise. About... 
    Suggested
    Work at office
    Local area
    Shift work

    Vmax

    San Francisco, CA
    5 days ago
  • $300k

     ...exceptional ML engineers who can turn RL research ideas into working...  ...whether failures come from algorithms, rewards, data,...  ...experimentation across RL projects. Own technically ambiguous projects end to end...  ...by other technical team members. Clear written and verbal communication... 
    Suggested
    Full time
    Work at office
    Local area

    Vmax

    San Francisco, CA
    11 hours ago
  • $180k

     ...Member Of Technical Staff - RL Infrastructure Palo Alto, CA xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence... 
    Suggested
    Temporary work

    xAI

    San Francisco, CA
    17 hours ago
  • $300k

    Member of Technical Staff - RL Infrastructure About Vmax Vmax is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise.... 
    Suggested
    Work at office
    Local area

    Vmax

    San Francisco, CA
    5 days ago
  • $200k

    Member of Technical Staff, RL Research & Environments Posted Feb 28, 2026 | Full-time | Advanced (5-10 yrs) Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI... 
    Suggested
    Full time
    Relocation
    Visa sponsorship

    Magic

    San Francisco, CA
    1 day ago
  • $225k

     ...improve models and solve alignment more reliably than humans can alone. Our approach combines frontier‑scale pre‑training, domain‑specific RL, ultra‑long context, and inference‑time compute to achieve this goal. About The Role As a Software Engineer on the Inference & RL... 
    Relocation
    Visa sponsorship

    Magic

    San Francisco, CA
    5 days ago
  • $150k - $300k

     ...plane and pair it with the full rl post-training stack:...  ...our RL training stack. Core Technical Responsibilities LLM Serving...  ...Design placement and scheduling algorithms for heterogeneous accelerators...  ...development and encourage team members to contribute to the broader... 
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours
    Shift work

    Prime-Intellect

    San Francisco, CA
    3 days ago
  • $150k - $300k

     ...plane and pair it with the full RL post-training stack:...  ...runs the jobs. Core Technical Responsibilities Hosted Training...  ...(new model architectures, RL algorithms, modes) Technical Requirements...  ...and encourage team members to contribute to the broader... 
    Work at office
    Local area
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime Intellect

    San Francisco, CA
    16 hours ago
  • Member of Technical Staff - Post‑Training Join to apply for the Member of Technical Staff - Post‑Training...  ...reward models, reinforcement learning algorithms, and inference‑time scaling techniques...  ...through data, reward modeling, or RL techniques. Evidence of owning ambitious... 
    Full time
    Relocation package

    Reflection AI

    San Francisco, CA
    4 days ago
  •  ...having proprietary data to post-train on and unique usage to RL on possess strong opinions about how AI agents and interfaces...  ...feel equally strong obligations to both 1) choose good and 2) to win think that this role should be renamed "member of tomo staff"... 
    Immediate start

    Tomo

    San Francisco, CA
    3 days ago
  •  ...requirements, and very few precedents to copy from. About the Role Members of Technical Staff (MTS) are the senior engineers who build the platform that...  ..., including evals and observability. Background in offline RL, contextual bandits, or sequential decision‑making (for the... 

    Beacon Software

    San Francisco, CA
    5 days ago
  •  ...utilise, and a feedback loop tight enough to discover new algorithms autonomously. The world's most capable AI models are being...  ...ours at the frontier of science. Role Overview As a Member of Technical Staff you will shape Conductor's core offerings: AI software that... 

    Conductor Quantum, Inc.

    San Francisco, CA
    1 day ago
  •  ...the core product interfaces that sit on top of our leading RL infrastructure. You’ll work directly with the founders and...  ...their core agent products. We’re building our team of founding Members of Technical Staff to design the frontier of continually learning systems. #J-... 

    Trajectory

    San Francisco, CA
    3 days ago
  • $150k - $220k

    Founding Member of Technical Staff (MTS) | Bay Area, CA | Full-time | $150k-$220k + equity | About Us: VizopsAI is the secure runtime for custom enterprise...  ...don't expect people to know them already* • Experience with RL, reward modeling, LLM evals, or agent stacks (retrievers, tool... 

    VizopsAI

    San Francisco, CA
    2 days ago
  • $110k - $350k

     ...Technology is a firm based in San Francisco that develops algorithms, models, and software systems to solve complex problems...  ...the largest companies in the world. About the role As a Member of the Technical Staff, you will join a team with deep expertise in machine learning... 
    Work at office
    Relocation package

    Kiso Technology

    San Francisco, CA
    5 days ago
  • $300k

    Member of Technical Staff - Mechanistic Interpretability About Vmax Vmax is an applied research lab developing AI capable of open-ended learning...  ...hacking. Investigate how internal representations evolve during RL and post-training, and use these insights to improve... 
    Work at office
    Local area

    Vmax

    San Francisco, CA
    3 days ago
  • $150k

     ...Amazon's Frontier AI & Robotics (FAR) team is seeking a Member of Technical Staff to drive foundational research and build intelligent robotic...  ...collaborating with platform teams to ensure your models and algorithms perform robustly in dynamic real-world environments. You'... 
    Local area

    Amazon

    San Francisco, CA
    3 days ago
  •  ...Member Of Technical Staff Atomic is the leading venture studio for company creation, partnering with extraordinary founders to launch businesses...  ...AI: Experienced in prompting and familiarity with Evals and RL We are focused on building a diverse and inclusive... 
    Local area

    Atomic VC

    San Francisco, CA
    4 days ago
  • $200k - $350k

     ...Member of ML Technical Staff Title of Role: Member of ML Technical Staff Location: San Francisco, onsite Company Stage of Funding...  ...in a team setting. ~ Familiarity with machine learning algorithms and data structures. Preferred Experience with... 
    Work at office
    Visa sponsorship

    Recruiting from Scratch

    San Francisco, CA
    1 day ago
  • $200k - $275k

    Founding Member of Technical Staff (Research / Post-Training) Applied AI / RL | San Francisco (onsite) | $200k-$275k + 0.25-0.50% equity DeepRec is partnered with a YC‑backed (S25), seed‑stage applied AI and data company working at the cutting edge of reinforcement learning... 
    Full time
    Visa sponsorship
    Relocation package

    DeepRec.ai

    San Francisco, CA
    4 days ago
  • Member of Technical Staff - Applied Research Patronus AI is a frontier lab developing simulation research and infrastructure to accelerate progress...  ...behavior analysis and new training methods for efficient RL Drive novel research in RL and environment design, including... 

    Patronus AI, Inc.

    San Francisco, CA
    2 days ago
  • Member of Technical Staff, Pretraining Science Member of Technical Staff, Pre-Training Science Location: SF Bay Area or Tokyo, Japan Type: Full...  ...biological modalities and tasks. Work on architecture, algorithms, and optimization. Evaluate ideas in model design, optimization... 
    Full time

    Radical Numerics

    San Francisco, CA
    4 days ago
  •  ...The Role We are seeking a highly motivated and experienced Member of Technical Staff, Computational Biology to join our dynamic R&D team. In...  ...massive datasets. Innovate & Create Tools: Develop novel algorithms, computational approaches, and internal tools to solve unique... 

    Algen Biotechnologies

    San Francisco, CA
    2 days ago
  • $150k - $300k

     ...control plane and pair it with the full rl post-training stack: environments, secure...  ...fast, robust, and reliable at scale. Core Technical Responsibilities Infrastructure...  ...believe in open development and encourage team members to contribute to the broader AI community... 
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime Intellect, Inc.

    San Francisco, CA
    5 days ago
  •  ...and New York but also embrace being remote-friendly! As a Member of Technical Staff, you will: Design and write high-performant and scalable software...  ...and results of our training cycles both on the SFT and the RL regime. Research, implement, and experiment with ideas on... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Jaide Health

    San Francisco, CA
    1 day ago
  • $150k

    Description Amazon’s Frontier AI & Robotics (FAR) team is seeking a Member of Technical Staff to drive foundational research and build intelligent...  ...and controls engineering teams to ensure your models and algorithms perform robustly on physical robotic platforms in dynamic... 

    Amazon Science

    San Francisco, CA
    1 day ago
  • Member of Technical Staff, Hardware Security Modules At Anchorage Digital, we are building the world’s most advanced digital asset platform for...  ...systems. You have a deep understanding of common crypto algorithms (RSA, ECDSA, BLS, etc.), code signing, network security, authentication... 
    Flexible hours

    Crypto Pro Network

    San Francisco, CA
    2 days ago
  •  ...frontier of interactive AI. The Role We're looking for a Member of Technical Staff - Embodied Agents to help build general-purpose agents...  ...Discrete action systems Policy optimization RL and imitation learning Agent-environment interaction systems... 

    Moonlake AI

    San Francisco, CA
    3 days ago
  •  ...designing, building, and scaling core infrastructure that powers a high-volume data platform for AI applications. We are looking for team members who love building enabling systems that empower our engineers and power our rapidly growing product. We're looking for folks... 
    Work at office

    LlamaIndex

    San Francisco, CA
    4 days ago
  •  ...Member of Technical Staff, Product TL;DR: Listen is building the human layer of AI. We're Sequoia-backed, raised $100M, and our customers include Anthropic, Google, and Cursor. We're hiring engineers who can build a complex AI-native product on a small team of former... 
    Flexible hours
    Shift work

    Listen Labs

    San Francisco, CA
    5 days ago
