Member of Technical Staff - RL Research (Experienced)

$300k

Nuance Labs, Inc.

About Nuance Labs

Nuance Labs is building photorealistic, real-time AI avatars with emotional intelligence: a full-duplex audiovisual system that can listen, speak, react, interrupt, and respond like a real person.

We're a Series A company ($60M raised) backed by Lightspeed, Accel, South Park Commons, NVentures, and Define Ventures, with PhDs from MIT, UW, Oxford, CMU, and Johns Hopkins, and industry experience from Apple, Meta, Amazon AGI, and Discord. The team is small, the work is real, and the problems are unsolved.
How Nuance Differentiates

Most conversational AI avatars today are hacks - a face slapped on a speech-to-speech pipeline, stuck in the uncanny valley: emotionless, mechanical, one-turn-at-a-time. Current systems take 2-5 seconds to respond; natural conversation requires sub-500ms. That's a 10x improvement, and it demands rethinking the entire stack.

That rethinking starts with full-duplex: an AI that listens and speaks simultaneously, perceives emotion in real time, and responds with a face that actually reflects it. It's an extremely hard problem, and we're developing foundation models designed for it from the ground up.

About the Role

We're looking for a deeply technical Member of Technical Staff to own RL and post-training for large-scale omni models. This posting is aimed at experienced researchers and engineers who've operated at a senior to senior-staff level at big tech or a leading research lab. Everyone at Nuance is MTS - we don't run title ladders - but we're hiring people who have already done this work at scale.

This role is broader than a traditional RL algorithm role. You will be expected to understand modern post-training methods and build the infrastructure needed to run them at scale. The work spans RL method development, rollout generation, reward modeling, policy optimization, evaluation, data feedback loops, serving, observability, and distributed execution.

You will build Nuance's RL/post-training stack from 0→1 and scale it from 1→10. That means turning rapidly evolving research ideas into reliable training systems: defining the abstractions, choosing or modifying frameworks, wiring together rollout workers and trainers, building reward/evaluation loops, debugging failure modes, and making the system fast enough for researchers to iterate.

For Nuance, post-training is not limited to text. Our models are omni from the ground up: audio, video, language, and real-time full-duplex interaction. We need RL and post-training methods that improve interactive behavior, timing, interruption, emotional response, audiovisual coherence, and real-time conversational quality.

This is a high-ownership role with direct impact on how Nuance models improve after pretraining.
What You'll Own

Build Nuance's RL/post-training stack from 0→1: rollout generation, policy optimization, reward/reference model serving, data feedback loops, evaluation, checkpointing, observability, and debugging.
Develop and scale post-training methods such as PPO, GRPO, DPO, rejection sampling, RLHF/RLAIF, online RL, and model-based data improvement.
Design the systems abstractions that connect research ideas to production-scale RL runs: trainers, rollout workers, reward models, evaluators, data queues, experience buffers, and checkpoint promotion.
Build evaluation and feedback loops for omni behavior: turn-taking, interruption, timing, emotional response, audiovisual coherence, instruction following, and real-time interaction quality.
Optimize the end-to-end post-training loop across rollout throughput, serving latency, GPU utilization, policy update efficiency, queueing, checkpoint overhead, and research iteration speed.
Evolve the platform as algorithms, model architectures, reward definitions, data sources, and evaluation methods change.

What We're Looking For

Significant hands-on experience with RL, RLHF, RLAIF, post-training, alignment, or large-scale fine-tuning for modern foundation models.
Deep understanding of RL/post-training methods: policy optimization, reward modeling, preference optimization, rejection sampling, KL control, evaluation, and data feedback loops.
A track record reasoning about model behavior and training dynamics: reward hacking, unstable rewards, distribution shift, stale policies, mode collapse, over-optimization, noisy preferences, and evaluation mismatch.
Proven experience building or operating RL/post-training pipelines at scale with frameworks such as verl, ms-swift, OpenRLHF, or equivalent internal systems, including integration with rollout serving systems such as vLLM.
Experience with large-scale training or inference systems, including rollout generation, model serving, batching, queueing, GPU utilization, checkpointing, and debugging.
Understanding of omni post-training for real-time audio-video-language interaction: temporal alignment, interruption, emotional response, and multimodal evaluation.
Strong software engineering fundamentals, curiosity, and adaptability to new RL algorithms, model architectures, serving systems, evaluation methods, and research ideas.

Bonus Points

Prior 0→1 experience building post-training systems, RL pipelines, agent training systems, evaluation platforms, or large-scale model improvement loops.
Experience with PPO, GRPO, DPO, online RL, RLHF/RLAIF, reward modeling, preference data, synthetic data generation, or model-based data improvement.
Experience with omni or multimodal post-training for audio-video-language models, especially long-context or real-time interactive systems.
Experience scaling mixed training/inference workloads across large GPU clusters.
Experience with adjacent areas such as distributed pretraining, data infrastructure, inference serving, simulation, human/AI feedback collection, or evaluation infrastructure.
Publications or substantial open-source contributions in RL, post-training, alignment, evaluation, ML systems, or model behavior.

Compensation

$300,000 - $500,000 base salary, plus meaningful equity. We think long-term ownership matters and structure equity accordingly.

Logistics

Location: In-person in Seattle, five days a week - we believe in the compounding value of working shoulder-to-shoulder.
Visa sponsorship: We sponsor visas (O-1, H-1B, green card) from day one.
AI-native tooling: Do your best work with the best tools, including unlimited tokens.

Benefits

Health: HSA plan with ~$2,000 in annual company contributions - roughly 2x what most big tech companies put in.
Time off: 15 days of PTO plus public holidays, and we close the office for a full week at year-end.
Food: Lunch, drinks, and snacks on us every workday - the small thing that quietly makes the day better.
Commuter benefits: We help cover the cost of getting to the office.
401(k): In the works.

Nuance Labs is an equal opportunity employer. We believe diverse teams build better AI.

Apply

Vacancy posted 13 hours ago

Similar jobs that could be interesting for youBased on the Member of Technical Staff - RL Research (Experienced) in Seattle, WA vacancy

Member of Technical Staff - Pretraining Infra (Experienced)
$300k - $400k
...the Role We're looking for a deeply technical MTS to own distributed training infrastructure... ...This role sits at the intersection of research, systems, and GPU-scale execution -... ...with adjacent infrastructure areas such as RL/post-training, data infrastructure, synthetic...
Suggested
H1b
Work at office
Visa sponsorship
Nuance Labs, Inc.
Seattle, WA
12 hours ago
Member of Technical Staff - Research Fellow
$200k - $250k
...ground up. ABOUT THE ROLE The Nuance Research Fellowship is a 3-month engagement for... ...: pretraining, post-training, RL, evaluation, data, multimodal modeling... ...together whether to convert to a full-time Member of Technical Staff role. Fellows who convert step into...
Suggested
Full time
Temporary work
Internship
H1b
Work at office
Visa sponsorship
Nuance Labs
Seattle, WA
2 days ago
Member of Technical Staff - Model Optimization and Inference (Experienced)
$250k - $350k
...frameworks at scale and have strong opinions about where they fall short. This posting is aimed at experienced engineers and researchers who've operated at a senior to senior-staff level at big tech, a leading AI lab, or a high-traffic inference team. Everyone at Nuance is...
Suggested
H1b
Work at office
Visa sponsorship
Nuance Labs, Inc.
Seattle, WA
13 hours ago
Member of Technical Staff - Imagine Model
$180k
...Member Of Technical Staff - Imagine Model Palo Alto, CA; Seattle, WA About XAI XAI's mission is to create AI systems that can accurately... ...understanding, controllable/long-horizon synthesis, agentic planning, RL training, and world simulation (including audio integration...
Suggested
Temporary work
Xai
Seattle, WA
3 days ago
Member of Technical Staff, FAR (Frontier AI & Robotics)
$150k
...Member Of Technical Staff Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll work alongside world-... ...novel algorithms that bridge the gap between state-of-the-art research and real-world deployment at Amazon scale. In this role,...
Suggested
Full time
Temporary work
Seasonal work
Local area
Amazon Technologies, Inc.
Seattle, WA
4 days ago
Senior Member Technical Staff (JoinOCI-SDE)
$79.2k - $178.1k
...the Oracle Cloud to provide the broadest, most comprehensive cloud in the industry. Responsibilities As a Senior Member of Technical Staff, you will own the software design and development for major components of Oracle's Cloud Infrastructure. You should be both...
Temporary work
Worldwide
Flexible hours
Oracle
Seattle, WA
18 hours ago
Principal Member Technical Staff
$96.8k - $223.4k
...shares our enthusiasm for innovation and excellence. As a valued member of our software engineering division in Seattle, you'll have... ...exceptional customer experiences. Collaborate and lead technical discussions across multiple teams to ensure seamless integrations...
Temporary work
Remote work
Flexible hours
Oracle
Seattle, WA
4 days ago
Member of Technical Staff — ML Data Infra
$200k - $300k
Member of Technical Staff — ML Data Infra Seattle, Washington About Nuance Labs Nuance Labs is building photorealistic, real-time AI avatars... ...and maintainable — and you're just as comfortable taking a researcher's messy processing script and turning it into something that...
Nuance Labs
Seattle, WA
3 days ago
Member of Technical Staff (AI-Powered EdTech)
$120k - $150k
# Member of Technical Staff (AI-Powered EdTech)Colleague AI$120K - $1600KKirkland, WA, USSeniorAI/ML EngineerInterested in this AI/ML Engineer... ...enhancing **teaching, learning, and school management** through research\-backed and AI\-driven technology.As we continue scaling...
Permanent employment
Full time
Flexible hours
AI Pulse
Kirkland, WA
3 days ago
Senior Member Technical Staff (JoinOCI-SDE)
...systems challenges, and help deliver the foundation for OCI’s most performant compute services. Responsibilities As a Senior Member of Technical Staff, you will own the software design and development for major components of Oracle’s Cloud Infrastructure. You should be a...
Temporary work
Worldwide
Flexible hours
Oracle
Seattle, WA
3 days ago
Sr. Full Stack Member of Technical Staff
$159.75k - $255.6k
Sr. Full Stack Member of Technical Staff Seattle, Washington, United States Join Axon and be a Force for Good. At Axon, we’re on a mission to... ...Critical and Robotics platforms. This role goes beyond pure research. You will operate across the full stack, from data, models...
Work at office
Axon
Seattle, WA
18 hours ago
Member of Technical Staff - Model Training
$180k
...Member Of Technical Staff - Model Training Austin, TX; New York, NY; Palo Alto, CA; Seattle, WA About XAI XAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly...
Temporary work
Xai
Seattle, WA
3 days ago
Principal Member of Technical Staff
Senior Software Developer OCI Compute is looking for strong Senior Software Developers with a strong cloud/distributed systems/microservices background to take on the challenge of engineering Compute Infrastructure solutions and build services for Large Scale Compute...
Flexible hours
Oracle
Seattle, WA
3 days ago
Senior Member of Technical Staff
$79.2k - $209.5k
...growing fast, still at an early stage, and working on ambitious new initiatives. An engineer at any level can have a significant technical and business impact. The ideal software engineer candidate for this team is a proficient programmer who has large breadth of knowledge...
Temporary work
Local area
Flexible hours
Oracle
Seattle, WA
9 hours ago
Member of Technical Staff (Rust, Search & Database Engines)
....ai OSS project Your first month - ramp up and contribute meaningful improvements to the Spice.ai OSS project 30-60 days - take technical and engineering ownership of an entire feature area. 60-90 days - propose and execute on aligned technical strategy going forward...
Spice AI
Bellevue, WA
2 days ago
FinTech Support Analyst - Drive Member Success
...Seattle is seeking a skilled Support Analyst to provide exceptional support for members using their AI-powered solutions. The ideal candidate will have 3-5 years of experience in a technical role, with expertise in financial services technology. Responsibilities include...
Range
Seattle, WA
4 days ago
Senior Member Technical Staff (JoinOCI-SDE)
Join Oracle Cloud Infrastructure’s Compute team to design, build, and scale the next generation of bare-metal provisioning systems powering millions of servers worldwide. As a senior engineer, you will develop highly reliable and secure infrastructure, tackle complex distributed...
Worldwide
Flexible hours
Ll Oefentherapie
Seattle, WA
18 hours ago
Member of Technical Staff - Media
$180k
Job Description Job Description About xAI xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization ...
Temporary work
xAI
Seattle, WA
22 days ago
Member of Technical Staff, Mobile Engineer
...shared below is a general expectation for the function as posted; we are also open to considering candidates who may be more or less experienced than outlined in the job description. In this case, we will communicate any updates in the expected salary range. Lastly, the...
Remote work
runwayml.com
Seattle, WA
9 hours ago
Member of Technical Staff - Model Training
$180k
Job Description Job Description About xAI xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization ...
Temporary work
xAI
Seattle, WA
18 days ago
Engineering Technical Specialist (Experienced or Senior)
$85k - $123.05k
...Boeing Test and Evaluation (BT&E) is seeking an Engineering Technical Specialist to join our team in Seattle, WA or Berkeley, MO, to... ...currently hiring for a broad range of experience levels including Experienced and Senior level Engineering Technical Specialists. Position...
Permanent employment
Work experience placement
Currently hiring
Relocation
Visa sponsorship
Work visa
Relocation package
Flexible hours
Shift work
Day shift
The Boeing Company
Seattle, WA
1 day ago
Experienced Engineering Technical Specialist
$90.95k - $123.05k
Experienced Engineering Technical Specialist Overview Join The Boeing Company’s Boeing Test & Evaluation (BT&E) team in Seattle, WA as an Engineering Technical Specialist within the Metrology and Test Equipment Services (MATES) Infrastructure Organization. The role focuses...
Work experience placement
Flexible hours
Boeing
Seattle, WA
18 hours ago
Engineering Technical Specialist (Experienced or Senior)
$85k - $123.05k
Overview Engineering Technical Specialist (Experienced or Senior) Boeing Test and Evaluation (BT&E) is seeking an Engineering Technical Specialist to join our team in Seattle, WA or Berkeley, MO. The role focuses on developing innovative solutions while adhering to the...
Work experience placement
Relocation package
Flexible hours
Shift work
Boeing
Seattle, WA
1 day ago
Member of Technical Staff — Model Optimization and Inference (New Grad)
$200k - $300k
...optimization, kernel-level acceleration, batching strategies. You’ve worked with vLLM, SGLang, or similar frameworks (through coursework, research, internships, or open-source) and have opinions about where they fall short. This posting is aimed at early-career engineers...
Full time
Internship
H1b
Work at office
Visa sponsorship
Nuance Labs
Seattle, WA
2 days ago
Support Analyst
...eliminate financial complexity for our members. We're transforming wealth management through... ...for our members. You'll troubleshoot technical issues, collaborate with cross-... ...of defense on technical issues they are experiencing Figure out the member's problem quickly...
Work at office
Relocation
Monday to Friday
Range
Bellevue, WA
1 day ago
Aviation Technical Systems Specialist
...Aviation Technical Systems Specialist (Runway Safety Specialist) Serves as an Aviation Technical Systems Specialist (Runway Safety... ...of runway safety policies. Provides guidance to less experienced members of the runway safety team on the process of technically reviewing...
Local area
Department of Transportation
Des Moines, WA
2 days ago
Senior Technical Specialist - Avionics Packaging
$114.46k - $160.24k
...avionics equipment for Lunar missions. As a member of the highly collaborative and fast‑... ...tooling Participate in or provide technical guidance for hardware installations Report... ...integration skills using 3D CAD software Experienced with Geometric Dimensioning and Tolerancing...
Permanent employment
Temporary work
Local area
Flexible hours
Blue Origin
Seattle, WA
9 hours ago
Fleet Technical/Network Support Rep-Ship Rider III-NAS-Multiple Locations
...the Fleet NavMPS user as an embedded member of the Carrier Airwing Staff Provide NavMPS-specific... ...platform training and guidance to less experienced team members and team members from... ...all applicable NavMPS assets Provide technical advice, subject matter expertise, recommendations...
Work at office
Local area
Remote work
Vectrus, Inc
Seattle, WA
3 days ago
Support Engineering Support Maintenance Analyst 2
...Preferred Qualifications Ability to apply existing skills, learn post‑production kit planning and contribute with minimum supervision. Technical skills: read, understand, and interpret engineering drawings and specifications. Excellent verbal and written communication...
Contract work
Work experience placement
Shift work
Cts Technology Solutions, Inc.
Seattle, WA
3 days ago
Support Engineering Maintenance Analyst 2
$37 per hour
...Preferred Qualifications Ability to apply existing skills, learn post-production kit planning and contribute with minimum supervision. Technical Skills: Ability to read, understand, and interpret engineering drawings and specifications. Communication: Excellent verbal and...
Hourly pay
Contract work
Work experience placement
Boeing Future of Flight
Seattle, WA
18 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff - RL Research (Experienced). Be the first to apply!