Member of Technical Staff — RL Research (Experienced)

$300k

Nuance Labs

Seattle, Washington

About Nuance Labs

Nuance Labs is building photorealistic, real-time AI avatars with emotional intelligence: a full-duplex audiovisual system that can listen, speak, react, interrupt, and respond like a real person. We're a research company, with PhDs from MIT, UW, Oxford, CMU, and Johns Hopkins, and industry experience from Apple, Meta, Amazon AGI, and Discord. The team is small, the work is real, and the problems are unsolved.

How Nuance Differentiates

Most conversational AI avatars today are hacks: a face slapped on a speech-to-speech pipeline, stuck in the uncanny valley, emotionless, mechanical, one-turn-at-a-time. Current systems take 2–5 seconds to respond; natural conversation requires sub-500ms. That's up to a 10x improvement, and it demands rethinking the entire stack.

That rethinking starts with full-duplex: an AI that listens and speaks simultaneously, perceives emotion in real time, and responds with a face that actually reflects it. It's an extremely hard problem, and we're developing foundation models designed for it from the ground up.

About the Role

We're looking for a deeply technical Member of Technical Staff to own RL and post-training for large-scale omni models. This posting is aimed at experienced researchers and engineers who've operated at a senior to senior-staff level at big tech or a leading research lab. Everyone at Nuance is MTS (we don't run title ladders), but we're hiring people who have already done this work at scale.

This role is broader than a traditional RL algorithm role. You will be expected to understand modern post-training methods and build the infrastructure needed to run them at scale. The work spans RL method development, rollout generation, reward modeling, policy optimization, evaluation, data feedback loops, serving, observability, and distributed execution.

You will build Nuance's RL/post-training stack from 0→1 and scale it from 1→10. That means turning rapidly evolving research ideas into reliable training systems: defining the abstractions, choosing or modifying frameworks, wiring together rollout workers and trainers, building reward/evaluation loops, debugging failure modes, and making the system fast enough for researchers to iterate.

For Nuance, post-training is not limited to text. Our models are omni from the ground up: audio, video, language, and real-time full-duplex interaction. We need RL and post-training methods that improve interactive behavior, timing, interruption, emotional response, audiovisual coherence, and real-time conversational quality.

This is a high-ownership role with direct impact on how Nuance models improve after pretraining.

What You'll Own

  • Build Nuance's RL/post-training stack from 0→1: rollout generation, policy optimization, reward/reference model serving, data feedback loops, evaluation, checkpointing, observability, and debugging.
  • Develop and scale post-training methods such as PPO, GRPO, DPO, rejection sampling, RLHF/RLAIF, online RL, and model-based data improvement.
  • Design the systems abstractions that connect research ideas to production-scale RL runs: trainers, rollout workers, reward models, evaluators, data queues, experience buffers, and checkpoint promotion.
  • Build evaluation and feedback loops for omni behavior: turn-taking, interruption, timing, emotional response, audiovisual coherence, instruction following, and real-time interaction quality.
  • Optimize the end-to-end post-training loop across rollout throughput, serving latency, GPU utilization, policy update efficiency, queueing, checkpoint overhead, and research iteration speed.
  • Evolve the platform as algorithms, model architectures, reward definitions, data sources, and evaluation methods change.

What We're Looking For

  • Significant hands-on experience with RL, RLHF, RLAIF, post-training, alignment, or large-scale fine-tuning for modern foundation models.
  • Deep understanding of RL/post-training methods: policy optimization, reward modeling, preference optimization, rejection sampling, KL control, evaluation, and data feedback loops.
  • A track record of reasoning about model behavior and training dynamics: reward hacking, unstable rewards, distribution shift, stale policies, mode collapse, over-optimization, noisy preferences, and evaluation mismatch.
  • Proven experience building or operating RL/post-training pipelines at scale with frameworks such as verl, ms-swift, OpenRLHF, or equivalent internal systems, including integration with rollout serving systems such as vLLM.
  • Experience with large-scale training or inference systems, including rollout generation, model serving, batching, queueing, GPU utilization, checkpointing, and debugging.
  • Understanding of omni post-training for real-time audio-video-language interaction: temporal alignment, interruption, emotional response, and multimodal evaluation.
  • Strong software engineering fundamentals, curiosity, and adaptability to new RL algorithms, model architectures, serving systems, evaluation methods, and research ideas.

Bonus Points

  • Prior 0→1 experience building post-training systems, RL pipelines, agent training systems, evaluation platforms, or large-scale model improvement loops.
  • Experience with PPO, GRPO, DPO, online RL, RLHF/RLAIF, reward modeling, preference data, synthetic data generation, or model-based data improvement.
  • Experience with omni or multimodal post-training for audio-video-language models, especially long-context or real-time interactive systems.
  • Experience scaling mixed training/inference workloads across large GPU clusters.
  • Experience with adjacent areas such as distributed pretraining, data infrastructure, inference serving, simulation, human/AI feedback collection, or evaluation infrastructure.
  • Publications or substantial open-source contributions in RL, post-training, alignment, evaluation, ML systems, or model behavior.

Compensation

$300,000 – $500,000 base salary, plus meaningful equity. We think long-term ownership matters and structure equity accordingly.

Logistics

  • Location: In-person in Seattle, five days a week. We believe in the compounding value of working shoulder-to-shoulder.
  • Visa sponsorship: We sponsor visas (O-1, H-1B, green card) from day one.
  • AI-native tooling: Do your best work with the best tools, including unlimited tokens.
  • Health: HSA plan with ~$2,000 in annual company contributions, roughly 2x what most big tech companies put in.
  • Time off: 15 days of PTO plus public holidays, and we close the office for a full week at year-end.
  • Food: Lunch, drinks, and snacks on us every workday. It's the small thing that quietly makes the day better.
  • Commuter benefits: We help cover the cost of getting to the office.
  • 401(k): In the works.

Nuance Labs is an equal opportunity employer. We believe diverse teams build better AI.

Vacancy posted 1 day ago