Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Research Engineer - Audio & Speech Models

Zyphra

Job Description

Job Description

Zyphra is an artificial intelligence company based in San Francisco, California.

The Role:

As a Research Engineer - Audio & Speech Models , you will be a core contributor on Zyphra’s Audio Team, building the next generation of open-source autoencoders, ASR, TTS, SSL, and speech-to-speech models. You will be deeply involved in the entire model training process, from data gathering and processing to designing novel architectures and training methodologies.

You’ll Work Across:
  • Large-scale audio training runs

  • Performance optimization of our training stack

  • Audio dataset collection, processing, and evaluation

  • Architecture and training methodology ablations and improvements

What We're Looking For / Requirements:
  • Strong research taste and intuition. The ability to work through a research project from conception to execution to write-up.

  • Strong implementation and prototyping ability (can take an idea from conception to experimentation quickly)

  • The ability to work well with others in a high-paced research setting

  • Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale

Qualifications / Additional Skills:
  • Expertise and intuition for training models in the audio domain, including text-to-speech, ASR, speech-to-speech, speech-emotion-recognition, or other models

  • Experience in training audio autoencoders

  • Understanding of signal processing, especially of audio signals

  • Experience with diffusion models, consistency models, or GANs

  • Experience with training on large-scale (multi-node) GPU clusters

  • Strong grasp of proper experimental methodology for running rigorous ablations and other hypothesis testing

  • Understanding of and interest in large-scale, highly parallel data processing pipelines

  • Proficiency with PyTorch and Python

  • Experience contributing to large pre-existing codebases and rapidly getting up to speed

  • Previously published machine learning research in well-respected venues

  • Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Mathematics, Physics, Machine Learning)

Why Work at Zyphra:
  • Our research methodology is grounded in methodical, step-by-step approaches to ambitious goals. Both deep research and engineering excellence are equally valued

  • We strongly value new and crazy ideas and are very willing to bet big on new ideas

  • We move as quickly as we can; we aim to minimize the bar to impact as low as possible

  • We all enjoy what we do and love discussing AI

Benefits and Perks:
  • Comprehensive medical, dental, vision, and FSA plans

  • Competitive compensation and 401(k) plan

  • Relocation and immigration support on a case-by-case basis

  • In-office snacks and meals provided

  • Unlimited PTO and company holidays

  • In-person team in San Francisco with a collaborative, high-energy environment

Vacancy posted 25 days ago
Similar jobs that could be interesting for youBased on the Research Engineer - Audio & Speech Models in San Francisco, CA vacancy
  • $295k

    Research Engineer - Speech & Realtime Models B2B Applications - San Francisco About the Team OpenAI is at the forefront of artificial intelligence, driving innovation and shaping the future with cutting‑edge research. Our mission is to ensure that AI's benefits reach... 
    Suggested
    Internship

    OpenAI

    San Francisco, CA
    2 days ago
  • $180k - $270k

     ...automated metrics that researchers and leadership can...  ...Possess strong software engineering skills (especially in...  ...at scale against live model checkpoints. Can deeply...  ...good" looks like for a Speech LLM, translating...  ...PESQ, etc) and modern audio evaluation frameworks... 
    Audio
    Full time
    Work at office
    Worldwide

    Plaud

    San Francisco, CA
    4 days ago
  •  ...Today, not even the best models can continuously...  ...a year-long stream of audio, video and text—1B text...  ...innovation and systems engineering paired with a design‑minded...  ...Build evaluations of speech models, both via...  ...scalable systems that bridge research and production.... 
    Audio
    Work at office
    Relocation package

    Cartesia

    San Francisco, CA
    17 hours ago
  • $200k - $400k

     ...About the Team Read more about the research team\'s work here: The Research team develops the model and decision-making stack that...  ...the Role As a Voice Research Engineer, you’ll lead the development of...  ...initiatives that advance speech understanding, naturalness, turn... 
    Suggested
    Full time
    Work at office
    Local area

    Decagon

    San Francisco, CA
    17 hours ago
  • Gravity Engineering Services Pvt Ltd. is seeking a Research Engineer for its Applied Voice Team in San Francisco. This role focuses on designing and building advanced speech models, applying cutting-edge AI research to practical applications. Collaboration with cross-functional... 
    Suggested

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    1 day ago
  •  ...Applied Scientist / Research Engineer — Speech AI   HIGHLIGHTS Location:  San Francisco, CA OR...  ...person will work across applied research, model development, training infrastructure,...  ...with modern machine learning for audio, speech, and language, and who enjoys... 
    Audio
    Remote work

    GTN Technical Staffing

    San Francisco, CA
    2 days ago
  • About the Role As an applied research engineer at Sieve, you’ll build high performance building...  ...will be working in the computer vision, audio processing, and text processing domains...  ...fit if you’re comfortable working with models + APIs and squeezing every drop of performance... 
    Audio

    Sieve

    San Francisco, CA
    3 days ago
  •  ...Human Archive Human Archive is a research lab backed by Y Combinator focused on modeling human embodied intelligence. Humans...  .... The Opportunity As a Research Engineer, you’ll work on multimodal...  ...capture, IMUs, tactile sensing, audio, and wearable systems, and study... 
    Audio
    Shift work

    Human Archive

    San Francisco, CA
    4 days ago
  • $295k

    OpenAI in San Francisco is searching for a Research Engineer specializing in Speech & Realtime Models. In this role, you will design and implement advanced machine learning models that address real-world challenges and create AI-powered products. The ideal candidate holds... 

    OpenAI

    San Francisco, CA
    17 hours ago
  • $180k - $270k

     ...specialist to join their SpeechLLM lab in San Francisco. This role involves building advanced audio and speech models and includes responsibilities related to research and engineering. Successful candidates will earn a competitive salary between $180K to $270K plus bonuses... 
    Audio
    Work at office

    Plaud

    San Francisco, CA
    4 days ago
  •  ...Description Zyphra is an artificial intelligence company based in San Francisco, California. The Role: As a Research Engineer - Brain Computer Interface Models , you will be a core contributor to Zyphra’s BCI work, building the next generation of open-source EEG and... 
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    6 days ago
  • $155k - $269k

     ...driving stack is powered by Waabi World, which delivers realistic, scalable, controllable, and efficient simulation. As a Research Engineer in the World Models team, you will develop algorithms and productionize the next generation of World Models that can reason about... 
    Full time
    Work at office
    Work from home
    Flexible hours

    Waabi

    San Francisco, CA
    19 days ago
  • $100k - $120k

     ...infrastructure to power the next generation of robotic foundation models. Data is crucial to improving these models, yet robotic...  ...Robotics' world models. Responsibilities Lead a team of researchers and engineers. Develop more efficient ways to generate high quality... 

    Coda Robotics

    San Francisco, CA
    17 hours ago
  • A generative modeling company in San Francisco is seeking a Research Engineer to lead pre-training and post-training of action-conditioned world models. You will design and implement methodologies in collaboration with industrial partners, contributing directly to publications... 
    Work at office

    Hedra

    San Francisco, CA
    2 days ago
  • An innovative AI company based in San Francisco is seeking a Research Engineer focused on Brain-Computer Interface models. You will contribute to building open-source EEG models and support all stages of development from data collection to integration in real-world applications... 

    Zyphra

    San Francisco, CA
    3 days ago
  • A pioneering generative modeling company in San Francisco is seeking a Research Engineer for their Physical AI team. This role focuses on designing pre-training and post-training pipelines for AI models, collaborating with industrial partners, and evaluating model performance... 
    Work at office

    Hedra, Inc

    San Francisco, CA
    17 hours ago
  • $220k - $280k

     ...About the role In your role as Senior Research Engineer, you'll be at the heart of building the...  ...next generation of generative video and audio technology. Your work will push the...  ...shipping cutting-edge video generation models, you'll help redefine how hundreds of millions... 
    Audio
    Work at office
    Local area
    Flexible hours

    black.ai

    San Francisco, CA
    2 days ago
  •  ...Job Description Zyphra is an artificial intelligence company based in San Francisco, California. The Role: As a Research Engineer - Model Architectures , you will be a core contributor to Zyphra’s AI Architecture Research Team. This will involve designing and... 
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    18 days ago
  •  ...Job Description Zyphra is an artificial intelligence company based in San Francisco, California. The Role: As a Research Engineer - Language Model Pre-Training , you'll shape our language model roadmap through end-to-end pretraining development. You will work... 
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    18 days ago
  • Zyphra, an AI company in San Francisco, seeks a Research Engineer - Language Model Pre-Training to develop their language model roadmap. Candidates should have strong engineering skills and excel in machine learning, collaborating closely with their pretraining team. The... 

    Zyphra

    San Francisco, CA
    17 hours ago
  • $320k

    Anthropic in New York City is seeking a Research Engineer to develop evaluations for Claude’s capabilities. The ideal candidate should have...  ...results during training runs. The role offers a hybrid work model and competitive compensation ranging from $320,000 to $485,00... 
    Remote job

    Menlo Ventures

    San Francisco, CA
    2 days ago
  • $180k

     ...highly motivated, and focused on engineering excellence. This organization...  .... All engineers and researchers are expected to have strong communication...  ...to build generative models that can accurately simulate...  ..., with a particular focus on audio and visual data. Develop and... 
    Audio
    Work at office
    Local area
    Relocation

    xAI

    San Francisco, CA
    more than 2 months ago
  • $315k

     ...as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to...  ...beneficial AI systems. About the role Anthropic's production models undergo sophisticated post-training processes to enhance their... 
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    2 days ago
  • A leading AI research firm in San Francisco is seeking a Research Engineer specializing in Model Architectures. You will design and rigorously test innovative model architectures, improving core modeling capabilities and collaborating closely with pre-training teams. Candidates... 

    Zyphra

    San Francisco, CA
    1 day ago
  • $200k

     ...proven track record of building and training large-scale audio or speech models from the ground up, whether that involves unified...  ...audio architectures. Love living at the intersection of research and engineering, eager to design novel sequence modeling architectures... 
    Audio
    Full time
    Work at office
    Worldwide

    Plaud

    San Francisco, CA
    2 days ago
  • A leading AI research company in San Francisco is seeking a Staff Research Engineer to enhance the efficiency of large language models. In this role, you will develop and implement advanced techniques to optimize model performance in production. Ideal candidates will hold... 
    Remote work

    Cohere

    San Francisco, CA
    2 days ago
  • $200k

     ...throughput, ultra-low-latency inference engines for large language models or foundational speech models. Understand the...  ...To-First-Token (or Time-To-First-Audio) in real-time streaming environments...  ...highly collaborative, fast-paced research. Gear & Perks: Choice of top-... 
    Audio
    Full time
    Work at office
    Worldwide

    Plaud

    San Francisco, CA
    2 days ago
  •  ...intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems...  ...to do what’s best for our customers. Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft.... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    17 hours ago
  •  ...Astra, and top-tier AI researchers. As an early member...  ...AI researchers and engineers in the world in a small...  ...and video generation model teams in the world on...  ...training text-to-motion or audio-to-motion models....  ...if you have worked on speech-driven 3D facial animation... 
    Audio
    Work at office
    Visa sponsorship

    Spellbrush

    San Francisco, CA
    9 days ago
  • $26 - $28 per hour

     ...join our team as Data Labeling Analysts, supporting speech and voice AI systems. This is a high-impact...  ...power real-world AI systems. You'll be working with audio, speech, and language data — helping ensure models are trained on accurate, well-structured, and representative... 
    Audio
    Full time
    Work experience placement
    Remote work
    Visa sponsorship

    Welocalize

    San Francisco, CA
    17 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Engineer - Audio & Speech Models. Be the first to apply!