Member of Technical Staff - Applied RL
$250kVMAX LLC
V max is an applied research lab working at the frontier of reinforcement learning (RL). We are building new techniques for leveraging RL with Large Language Models (LLMs). Our research contributes directly to our RL platform, which automates the engineering involved in converting data and evals into RL environments.
About the role
Your objective will be to rapidly deliver bespoke environments and agents for our customers. It will be your responsibility to translate customer needs into bespoke environments and then post-train agents within these environments. You will also shape our product and research directions, helping us productize our research and make RL more widely accessible.
Responsibilities
- Build RL environments for our customers
- Post train LLM-based agents on domain specific tasks
- Productizing Vmax research - apply environment generation and automated RL research to improve our customers' agents
Role Requirements
- Experience post-training LLMs
- Software engineering experience beyond research projects
- Can independently build post-training data and training pipelines
Nice to have
- Research experience in RL
- Open source contributions to RL frameworks
Role specific location policy
- This role is based in our San Francisco office; For exceptional candidates we are willing to consider a hybrid arrangement
Compensation
The expected salary range for this position is $250,000 - $450,000 USD
#J-18808-Ljbffr$300k
Member of Technical Staff - RL Algorithms About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise. About...SuggestedWork at officeLocal areaShift work$300k
Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise. About...SuggestedWork at officeLocal area- Member of Technical Staff - Applied Research Patronus AI is a frontier lab developing simulation research and infrastructure to accelerate progress toward... ...behavior analysis and new training methods for efficient RL Drive novel research in RL and environment design,...Suggested
$180k
...Member Of Technical Staff - RL Infrastructure Palo Alto, CA About XAI XAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering...SuggestedTemporary work- ...Member Of Technical Staff – Applied AI, Fullstack Stuut is transforming accounts receivable for B2B companies—making collections smarter and faster for companies that have historically relied on manual processes that are labor intensive and costly. Our platform is gaining...SuggestedFull timeFlexible hours
- ...Member Of Technical Staff – Applied AI, Frontend Stuut is transforming accounts receivable for B2B companies—making collections smarter and faster for companies that have historically relied on manual processes that are labor intensive and costly. Our platform is gaining...Full timeFlexible hours
$180k - $300k
...Model is building automated ML research engineering. Existing frontier models are brittle when applied to real-world ML tasks. The present bottleneck is the lack of high-quality RL training environments. Our first step is to build RL environments that reflect real-world...Visa sponsorshipRelocation package- ...models and real-world deployment. You'll own applied post-training work for VLMs end-to-end... ...between customer needs and internal technical teams, and push back when needed. The Work... ...using SFT, preference alignment, and/or RL. Strong intuition for visual data quality...
- ...and real-world deployment. You'll own the applied post-training work that adapts LFM2.5-... ...parameters, not blockers. The Work Act as the technical owner for enterprise audio post-training... ...(SFT, preference alignment, and/or RL). Experience with data generation and evaluation...
$200k
Member of Technical Staff, RL Research & Environments Posted Feb 28, 2026 | Full-time | Advanced (5-10 yrs) Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI...Full timeRelocationVisa sponsorship- Member of Technical Staff, Applied AI The opportunity We are looking for a Member of Technical Staff with deep expertise in generative modelling to work at the interface between our frontier models and the customers who depend on them. You will join an interdisciplinary...Flexible hours
$250k
...Member of Technical Staff - Open Endedness About V max V max is an applied research lab working at the frontier of reinforcement learning (RL). We are building new techniques for leveraging RL with Large Language Models (LLMs). Our research contributes directly to...Work at office- ...only our kindest and smartest friends. Apply for the MTS role if you: have... ...data to post-train on and unique usage to RL on possess strong opinions about how... ...both 1) choose good and 2) to win think that this role should be renamed "member of tomo staff"...Immediate start
$150k - $300k
...plane and pair it with the full rl post-training stack:... ...into our RL training stack. Core Technical Responsibilities LLM Serving Multi... ...memory bandwidth and transport; apply techniques such as... ...development and encourage team members to contribute to the broader AI...Work at officeRemote workVisa sponsorshipRelocation packageFlexible hoursShift work$300k
Member of Technical Staff - Mechanistic Interpretability About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed... ...how internal representations evolve during RL and post-training, and use these insights to...Work at officeLocal area- ...few precedents to copy from. About the Role Members of Technical Staff (MTS) are the senior engineers who build... ...and observability. Background in offline RL, contextual bandits, or sequential decision‑making (for the applied research workstreams). Open‑source contributions...
- ...Member of Technical Staff, Post-Training Join to apply for the Member of Technical Staff, Post-Training role at Cohere . Get AI-powered advice on this... ...results of our training cycles both on the SFT and the RL regime. Research, implement, and experiment with...Full timeWork at officeRemote workFlexible hours
$150k - $300k
...plane and pair it with the full RL post-training stack:... ...runs the jobs. Core Technical Responsibilities Hosted Training... ...development and encourage team members to contribute to the broader AI... ...help shape the future of AI? Apply now and join us in our mission...Work at officeLocal areaRemote workVisa sponsorshipRelocation packageFlexible hours- Member of Technical Staff - Post‑Training Join to apply for the Member of Technical Staff - Post‑Training role at Reflection AI . Our Mission Reflection’s mission... ...model behavior through data, reward modeling, or RL techniques. Evidence of owning ambitious research or...Full timeRelocation package
- ...also embrace being remote-friendly! As a Member of Technical Staff, you will: Design and write high-... ...training cycles both on the SFT and the RL regime. Research, implement, and experiment... ...of attributes. Even if none of these apply to you, but you believe you will...Full timeWork at officeRemote workFlexible hours
- ...help us get there. The Opportunity This is a rare chance to own applied post-training work end-to-end for text workloads, adapting... ...the world’s largest enterprise customers. You will act as the technical bridge between customer requirements and model delivery. You will...
$150k - $350k
...our Series A last year from Tier 1 firms such as Matrix Partners, Swift Ventures, Y Combinator, and AI Grant. About The Role As an applied research engineer at Sieve, you’ll build high performance building blocks and large‑scale pipelines to understand video with high...- Introduction Plato is an applied research lab building the foundational infrastructure... ...are not designed for long-running RL environments, persistent agent workspaces... ...feel like one seamless system. As a Member of Technical Staff, Infrastructure / DevOps, you will own...
$150k - $300k
...plane and pair it with the full rl post-training stack:... ...and reliable at scale. Core Technical Responsibilities Infrastructure... ...development and encourage team members to contribute to the broader AI... ...help shape the future of AI? Apply now and join us in our mission...Full timeWork at officeRemote workVisa sponsorshipRelocation packageFlexible hours$100k - $300k
...AI Cyber Taskforce Engineer Cogent is an Applied AI Lab building the next generation of AI agents for cybersecurity. AI has fundamentally... ...at Cogent Onboard, support and uplevel future team members Mentor and grow future junior team members Actively contribute...- ...for exceptional research engineers and applied researchers to help push the frontier... ...The Role We're looking for a Member of Technical Staff - Embodied Agents to help build general... ...action systems Policy optimization RL and imitation learning Agent-...
- ...Building environments for AI agents , Tools for automating common workflows , Improving alerts, metrics and error handling on large scale RL jobs , Refactoring existing agent, data, eval, training frameworks for better modularity , Designing operation procedures and coding...
- Member of Technical Staff - Applied AI Engineer Valthos | Posted Mar 3 Full-time Negotiable Advanced (5-10 yrs) Valthos Inc. Valthos is an applied biological intelligence company. We build and deploy software and biological AI systems to safeguard humanity. Applied...Full timeWork at office
$120k - $220k
Get AI-powered advice on this job and more exclusive features. This role is provided by Anthrogen. Your actual pay will be based on your skills and experience – talk with your recruiter to learn more. Base pay range $120,000.00/yr - $220,000.00/yr Responsibilities...Full timeRelocation$130k - $240k
...domain expertise with frontier technology. The Role Being a Member of Technical Staff at SketchPro means the problem in front of you will keep... ...on one hard technical area. What stays constant: you'll apply state-of-the-art LLMs and VLMs to agentic workflows for 3D...Work at officeShift work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Member of Technical Staff - Applied RL. Be the first to apply!
- IT assistant San Francisco, CA
- desktop support analyst San Francisco, CA
- senior IT support technician San Francisco, CA
- personal computer support technician San Francisco, CA
- technical analyst San Francisco, CA
- customer support technician San Francisco, CA
- tech assistant San Francisco, CA
- technical support assistant San Francisco, CA
- customer support analyst San Francisco, CA
- remote (work from home) technical support representative San Francisco, CA


