Member of Technical Staff - Reinforcement Learning
$185k - $255kOptimized, Inc.
Member of Technical Staff - Reinforcement Learning Optimized deploys AI agents into the most critical supply chains in the world: the operations of Fortune 500 manufacturers and government organizations. Those agents have to make sound decisions across long, multi-step workflows where a single misstep is costly, and we need a reinforcement learning researcher to teach them how. As a Member of Technical Staff on reinforcement learning, you'll own RL and post-training: the reward models, training loops, and evaluations that turn raw model capability into reliable long-horizon decision-making. You'll ground your work in real deployment data and ship it into production, closing the loop between how agents learn and how they perform in the field. Compensation range for this role is $185,000 - $255,000 + equity. What you'll do • Train agents to act: You'll design and run RL and post-training pipelines that improve how our agents plan and execute multi-step work. • Build reward models: You'll define and train the reward signals that capture what a good supply chain decision actually looks like. • Evaluate long-horizon behavior: You'll build evals that measure agent reliability across long, high-stakes workflows, not just single turns. • Ground learning in reality: You'll use real deployment data and feedback to close the gap between simulation and production. • Ship research to production: You'll work with engineers to bring training breakthroughs into the live agent platform. What you'll bring • Have a PhD or equivalent research experience in RL, ML, or a related field • Have hands-on experience with reinforcement learning, post-training, or RLHF for LLMs • Are comfortable building research prototypes in Python and iterating quickly • Understand reward modeling, policy optimization, and evaluation of sequential decision-making • Care about real-world impact, and you have driven research through to production • Are excited about applying AI to complex, messy, real-world optimization problems #J-18808-Ljbffr
- ...We’re looking for candidates with experience building reinforcement learning-based LLM training pipelines. As Part Of Our Founding Team You May Train reinforcement learning-based LLMs to solve tasks in the domain of materials science, chemical engineering, and engineering...Suggested
- ...leading AI labs to push AI closer to achieving its transformative potential. About the Role We’re hiring Machine Learning Engineers to design and build reinforcement learning environments to safely advance model capabilities specifically on machine learning research and...SuggestedVisa sponsorshipRelocation package
$150k - $300k
...startups and enterprises to run end‑to‑end reinforcement learning at frontier scale, adapting models to... ...into our RL training stack. Core Technical Responsibilities LLM Serving Multi‑tenant... ...open development and encourage team members to contribute to the broader AI community...SuggestedWork at officeRemote workVisa sponsorshipRelocation packageFlexible hoursShift work$300k
...Member of Technical Staff - Mechanistic Interpretability About V max V max is an applied research... ...developing AI capable of open-ended learning. We are building systems to exceed humans... ...interpretability to enhance reinforcement learning by generating intrinsic rewards...SuggestedWork at officeLocal area- ...frontier of science. Role Overview As a Member of Technical Staff you will shape Conductor's core... ...applications. Data Analysis and Machine Learning Develop Python pipelines to analyse... ...such as Bayesian optimisation and reinforcement learning to solve calibration challenges...Suggested
$300k
...Member of Technical Staff - RL Algorithms About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by... ...models. Adapt ideas from pre-LLM reinforcement learning, such as model-based...Work at officeLocal areaShift work$200k
...many of the company's most important decisions. As a Member of Technical Staff on Evals, you will build both the platform and the evaluations... ...validate eval tasks for pre-training, post-training, reinforcement learning, inference, and product systems Develop infrastructure...Visa sponsorshipRelocation package$160k - $240k
...are backed by Quiet Capital and are looking for early engineers to join us in San Francisco. The Role We are looking for a Machine Learning Engineer to build the models, optimization systems and algorithms that drive our autonomous decision engine. You will not just...Full timeImmediate startRelocationRelocation package$150k - $300k
...startups and enterprises to run end-to-end reinforcement learning at frontier scale, adapting models to... ..., robust, and reliable at scale. Core Technical Responsibilities Infrastructure... ...in open development and encourage team members to contribute to the broader AI community...Work at officeRemote workVisa sponsorshipRelocation packageFlexible hours- ...Compute and algorithms are rapidly commoditizing, but reinforcement learning data remains the bottleneck. Plato is changing that by... ...evaluation, and iteration feel like one seamless system. As a Member of Technical Staff, Infrastructure / DevOps, you will own the systems that...
- ...Member of Technical Staff, ML Systems Mirendil Mirendil is a tech-first company focused on solving core bottlenecks that unlock step-change... ...-to-end. Our work spans areas such as model training, reinforcement learning, reasoning systems, and infrastructure for large-scale...
- ...that turns problems at home into problems for the world. The Role Two Dots is hiring a Machine Learning Engineer for a low-headcount, high-impact role focused on technically difficult applied ML problems in housing verification, underwriting, fraud detection, and document...
- ...Member of Technical Staff - Applied Research Patronus AI is a frontier lab developing simulation research and infrastructure to accelerate... ...and improved. You will work at the intersection of reinforcement learning, simulations and scalable oversight, building systems that...
$150k
...We are seeking a Member of Technical Staff Simulation Engineer to join our AI robotics research team developing... ...to enable training large-scale machine learning models. Key job responsibilities Develop simulations for reinforcement learning, closed-loop simulations and...InternshipLocal area$150k - $300k
...startups, and enterprises to run end-to-end reinforcement learning at frontier scale, adapting models to... ...that runs the jobs. Core Technical Responsibilities Hosted Training... ...in open development and encourage team members to contribute to the broader AI community...Work at officeLocal areaRemote workVisa sponsorshipRelocation packageFlexible hours- ...Translate cutting‑edge research into production‑ready machine learning systems Design, build, and deploy end‑to‑end ML models and pipelines... ...the world. Our founding team, along with many of our team members, has contributed to many of the breakthroughs in AI over the past...H1bRemote workVisa sponsorship
- Member of Technical Staff - Post‑Training Join to apply for the Member of Technical Staff - Post‑Training role at Reflection AI . Our Mission... .... Develop data generation pipelines, reward models, reinforcement learning algorithms, and inference‑time scaling techniques....Full timeRelocation package
$150k - $250k
...Build scalable data processing and machine learning systems Solve min-max problems: how... ...You have a track record in a technical domain, e.g., machine learning, computer... ...Experience with applying deep learning, reinforcement learning, unsupervised learning, and other...Work at officeFlexible hours- ...re looking for engineers and scientists to design, optimize, and maintain the core systems that enable scalable, efficient reinforcement learning for large models. This role sits at the intersection of research and large-scale systems engineering: you'll wear many hats...Immediate startFlexible hours
- ...Moonlake is hiring a Member of Technical Staff — Diffusion Models to design and train advanced multimodal generative systems. This role focuses... .... The ideal candidate should possess strong machine learning research fundamentals and experience in generative models....
- ...What we are looking for? Seeking a Member of Technical Staff - Backend with 5+ years of experience. We are looking for an exceptional builder... ...crucial in building the core systems that deploy our machine learning capabilities and driving architectural choices for the...Work experience placement
$227.5k - $401k
...motivated individuals who tackle unique technical challenges at scale and solve them as... ...the financial technology sector. As a Member of Technical Staff, you will operate with a high degree... ...or fixing critical performance issues. Learn and Lead : connect with the broader AI...Work at officeImmediate startRelocationFlexible hours- ...Pixeltable Inc. Member of Technical Staff San Francisco, CA·Full time Apply for Member of Technical Staff As a founding member of the engineering... ...designed to simplify and accelerate the entire machine learning lifecycle. We're a team of passionate engineers who believe...Full timePart timeWork at officeWork from homeFlexible hours2 days per week
- ...Porto, Portugal; Singapore; and Sioux Falls, South Dakota. Learn more at anchorage.com, on X @Anchorage, and on LinkedIn... ...will be the primary driver of the system architecture, technical direction and each team member’s technical skill development. At Anchorage Digital, we...
- ...your work will define what cutting edge means. We're hiring Members of Technical Staff to design the evaluations that set the standard for how AI... ...the subject matter than consulting allows. AI and Machine Learning — backgrounds include ML Engineer, ML Researcher, AI...
- ...generation to improve model capabilities and alignment more reliably than humans can alone. Our technical approach combines frontier-scale pre-training, domain-specific reinforcement learning, ultra-long context, and inference-time compute. These systems create unique...Work at officeVisa sponsorshipRelocation packageFlexible hours
$150k
...Hiring: Member of Technical Staff (AI Systems / Full-Stack Engineering) only U.S. Citizen or Green Card Holder Required Compensation: $150,000+... ...autonomous AI. Their platform connects fragmented factory systems, learns how operations function, and deploys AI agents that...Immediate start- ...will reshape how people discover and buy online. Role As a Member of Technical Staff, you will ship core systems, set engineering culture, and move... ...and cost. Launch a product graph and ranking pipeline that learns from real outcomes. Nice to have Shipped data products in...Work at office
- ...Intelligence Build environments for training embodied agents Explore reinforcement learning and multimodal reasoning What We're Looking For Strong... ...publication record or impactful open-source contributions Technical Skills Deep learning frameworks (PyTorch, JAX, etc.)...
$300k
...Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise. About...Work at officeLocal area
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Member of Technical Staff - Reinforcement Learning. Be the first to apply!
- remote support technician San Francisco, CA
- personal computer support technician San Francisco, CA
- customer support analyst San Francisco, CA
- systems support technician San Francisco, CA
- help desk administrator San Francisco, CA
- decision support analyst San Francisco, CA
- technical support assistant San Francisco, CA
- technical analyst San Francisco, CA
- technical assistant San Francisco, CA
- IT support technician San Francisco, CA

