Member of Technical Staff - Applied RL

$250k

VMAX LLC

V max is an applied research lab working at the frontier of reinforcement learning (RL). We are building new techniques for leveraging RL with Large Language Models (LLMs). Our research contributes directly to our RL platform, which automates the engineering involved in converting data and evals into RL environments.

About the role

Your objective will be to rapidly deliver bespoke environments and agents for our customers. You will translate customer needs into RL environments and then post-train agents within them. You will also shape our product and research directions, helping us productize our research and make RL more widely accessible.

Responsibilities
  • Build RL environments for our customers
  • Post-train LLM-based agents on domain-specific tasks
  • Productize Vmax research: apply environment generation and automated RL research to improve our customers' agents
Role Requirements
  • Experience post-training LLMs
  • Software engineering experience beyond research projects
  • Ability to independently build post-training data and training pipelines
Nice to have
  • Research experience in RL
  • Open source contributions to RL frameworks
Role-specific location policy
  • This role is based in our San Francisco office. For exceptional candidates, we are willing to consider a hybrid arrangement.
Compensation

The expected salary range for this position is $250,000 to $450,000 USD.

Vacancy posted 3 days ago