Member of Technical Staff — RL Research (Experienced)

$300k

Nuance Labs

Seattle, Washington

About Nuance Labs

Nuance Labs is building photorealistic, real-time AI avatars with emotional intelligence: a full-duplex audiovisual system that can listen, speak, react, interrupt, and respond like a real person. We're a research company, with PhDs from MIT, UW, Oxford, CMU, and Johns Hopkins, and industry experience from Apple, Meta, Amazon AGI, and Discord. The team is small, the work is real, and the problems are unsolved.

How Nuance Differentiates

Most conversational AI avatars today are hacks: a face slapped on a speech‑to‑speech pipeline, stuck in the uncanny valley, emotionless, mechanical, one‑turn‑at‑a‑time. Current systems take 2–5 seconds to respond; natural conversation requires sub‑500ms. That's a 10x improvement, and it demands rethinking the entire stack.

That rethinking starts with full‑duplex: an AI that listens and speaks simultaneously, perceives emotion in real time, and responds with a face that actually reflects it. It's an extremely hard problem, and we're developing foundation models designed for it from the ground up.

About the Role

We’re looking for a deeply technical Member of Technical Staff to own RL and post‑training for large‑scale omni models. This posting is aimed at experienced researchers and engineers who’ve operated at a senior to senior‑staff level at big tech or a leading research lab. Everyone at Nuance is MTS (we don’t run title ladders), but we’re hiring people who have already done this work at scale.

This role is broader than a traditional RL algorithm role. You will be expected to understand modern post‑training methods and build the infrastructure needed to run them at scale. The work spans RL method development, rollout generation, reward modeling, policy optimization, evaluation, data feedback loops, serving, observability, and distributed execution.

You will build Nuance’s RL/post‑training stack from 0→1 and scale it from 1→10. That means turning rapidly evolving research ideas into reliable training systems: defining the abstractions, choosing or modifying frameworks, wiring together rollout workers and trainers, building reward/evaluation loops, debugging failure modes, and making the system fast enough for researchers to iterate.

For Nuance, post‑training is not limited to text. Our models are omni from the ground up: audio, video, language, and real‑time full‑duplex interaction. We need RL and post‑training methods that improve interactive behavior, timing, interruption, emotional response, audiovisual coherence, and real‑time conversational quality.

This is a high‑ownership role with direct impact on how Nuance models improve after pretraining.

What You’ll Own

Build Nuance’s RL/post‑training stack from 0→1: rollout generation, policy optimization, reward/reference model serving, data feedback loops, evaluation, checkpointing, observability, and debugging.
Develop and scale post‑training methods such as PPO, GRPO, DPO, rejection sampling, RLHF/RLAIF, online RL, and model‑based data improvement.
Design the systems abstractions that connect research ideas to production‑scale RL runs: trainers, rollout workers, reward models, evaluators, data queues, experience buffers, and checkpoint promotion.
Build evaluation and feedback loops for omni behavior: turn‑taking, interruption, timing, emotional response, audiovisual coherence, instruction following, and real‑time interaction quality.
Optimize the end‑to‑end post‑training loop across rollout throughput, serving latency, GPU utilization, policy update efficiency, queueing, checkpoint overhead, and research iteration speed.
Evolve the platform as algorithms, model architectures, reward definitions, data sources, and evaluation methods change.

What We’re Looking For

Significant hands‑on experience with RL, RLHF, RLAIF, post‑training, alignment, or large‑scale fine‑tuning for modern foundation models.
Deep understanding of RL/post‑training methods: policy optimization, reward modeling, preference optimization, rejection sampling, KL control, evaluation, and data feedback loops.
A track record of reasoning about model behavior and training dynamics: reward hacking, unstable rewards, distribution shift, stale policies, mode collapse, over‑optimization, noisy preferences, and evaluation mismatch.
Proven experience building or operating RL/post‑training pipelines at scale with frameworks such as verl, ms‑swift, OpenRLHF, or equivalent internal systems, including integration with rollout serving systems such as vLLM.
Experience with large‑scale training or inference systems, including rollout generation, model serving, batching, queueing, GPU utilization, checkpointing, and debugging.
Understanding of omni post‑training for real‑time audio‑video‑language interaction: temporal alignment, interruption, emotional response, and multimodal evaluation.
Strong software engineering fundamentals, curiosity, and adaptability to new RL algorithms, model architectures, serving systems, evaluation methods, and research ideas.

Bonus Points

Prior 0→1 experience building post‑training systems, RL pipelines, agent training systems, evaluation platforms, or large‑scale model improvement loops.
Experience with PPO, GRPO, DPO, online RL, RLHF/RLAIF, reward modeling, preference data, synthetic data generation, or model‑based data improvement.
Experience with omni or multimodal post‑training for audio‑video‑language models, especially long‑context or real‑time interactive systems.
Experience scaling mixed training/inference workloads across large GPU clusters.
Experience with adjacent areas such as distributed pretraining, data infrastructure, inference serving, simulation, human/AI feedback collection, or evaluation infrastructure.
Publications or substantial open‑source contributions in RL, post‑training, alignment, evaluation, ML systems, or model behavior.

Compensation

$300,000 – $500,000 base salary, plus meaningful equity. We think long‑term ownership matters and structure equity accordingly.

Logistics

Location: In‑person in Seattle, five days a week. We believe in the compounding value of working shoulder‑to‑shoulder.
Visa sponsorship: We sponsor visas (O-1, H-1B, green card) from day one.
AI-native tooling: Do your best work with the best tools, including unlimited tokens.
Health: HSA plan with ~$2,000 in annual company contributions, roughly 2x what most big tech companies put in.
Time off: 15 days of PTO plus public holidays, and we close the office for a full week at year‑end.
Food: Lunch, drinks, and snacks on us every workday, the small thing that quietly makes the day better.
Commuter benefits: We help cover the cost of getting to the office.
401(k): In the works.

Nuance Labs is an equal opportunity employer. We believe diverse teams build better AI.

Vacancy posted 2 days ago

