Member of Technical Staff - Reinforcement Learning

$185k - $255k

Optimized, Inc.

Optimized deploys AI agents into the most critical supply chains in the world: the operations of Fortune 500 manufacturers and government organizations. Those agents have to make sound decisions across long, multi-step workflows where a single misstep is costly, and we need a reinforcement learning researcher to teach them how.

As a Member of Technical Staff on reinforcement learning, you'll own RL and post-training: the reward models, training loops, and evaluations that turn raw model capability into reliable long-horizon decision-making. You'll ground your work in real deployment data and ship it into production, closing the loop between how agents learn and how they perform in the field.

Compensation range for this role is $185,000 - $255,000 + equity.

What you'll do
  • Train agents to act: You'll design and run RL and post-training pipelines that improve how our agents plan and execute multi-step work.
  • Build reward models: You'll define and train the reward signals that capture what a good supply chain decision actually looks like.
  • Evaluate long-horizon behavior: You'll build evals that measure agent reliability across long, high-stakes workflows, not just single turns.
  • Ground learning in reality: You'll use real deployment data and feedback to close the gap between simulation and production.
  • Ship research to production: You'll work with engineers to bring training breakthroughs into the live agent platform.

What you'll bring
  • A PhD or equivalent research experience in RL, ML, or a related field
  • Hands-on experience with reinforcement learning, post-training, or RLHF for LLMs
  • Comfort building research prototypes in Python and iterating quickly
  • An understanding of reward modeling, policy optimization, and evaluation of sequential decision-making
  • A focus on real-world impact, with a record of driving research through to production
  • Enthusiasm for applying AI to complex, messy, real-world optimization problems
Vacancy posted 3 days ago