Research Engineer - Audio & Speech Models

Zyphra

Zyphra is an artificial intelligence company based in San Francisco, California.

The Role:
As a Research Engineer - Audio & Speech Models, you will be a core contributor on Zyphra’s Audio Team, building the next generation of open-source autoencoders, ASR, TTS, SSL, and speech-to-speech models. You will be deeply involved in the entire model training process, from data gathering and processing to designing novel architectures and training methodologies.

You’ll Work Across:
- Large-scale audio training runs
- Performance optimization of our training stack
- Audio dataset collection, processing, and evaluation
- Architecture and training methodology ablations and improvements

What We're Looking For / Requirements:
- Strong research taste and intuition
- The ability to work through a research project from conception to execution to write-up
- Strong implementation and prototyping ability (can take an idea from conception to experimentation quickly)
- The ability to work well with others in a fast-paced research setting
- Excellent communication and collaboration skills, and the ability to work effectively on both research and engineering implementation at scale

Qualifications / Additional Skills:
- Expertise and intuition for training models in the audio domain, including text-to-speech, ASR, speech-to-speech, speech-emotion-recognition, or other models
- Experience in training audio autoencoders
- Understanding of signal processing, especially of audio signals
- Experience with diffusion models, consistency models, or GANs
- Experience with training on large-scale (multi-node) GPU clusters
- Strong grasp of proper experimental methodology for running rigorous ablations and other hypothesis testing
- Understanding of and interest in large-scale, highly parallel data processing pipelines
- Proficiency with PyTorch and Python
- Experience contributing to large pre-existing codebases and rapidly getting up to speed
- Previously published machine learning research in well-respected venues
- Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Mathematics, Physics, Machine Learning)

Why Work at Zyphra:
- Our research methodology is grounded in methodical, step-by-step approaches to ambitious goals. Deep research and engineering excellence are equally valued
- We strongly value new and crazy ideas and are very willing to bet big on them
- We move as quickly as we can; we aim to keep the bar to impact as low as possible
- We all enjoy what we do and love discussing AI

Benefits and Perks:
- Comprehensive medical, dental, vision, and FSA plans
- Competitive compensation and 401(k) plan
- Relocation and immigration support on a case-by-case basis
- In-office snacks and meals provided
- Unlimited PTO and company holidays
- In-person team in San Francisco with a collaborative, high-energy environment

Vacancy posted 5 days ago
