Research Engineer - Audio & Speech Models
Zyphra
Zyphra is an artificial intelligence company based in San Francisco, California.

The Role:
As a Research Engineer - Audio & Speech Models, you will be a core contributor on Zyphra’s Audio Team, building the next generation of open-source autoencoders, ASR, TTS, SSL, and speech-to-speech models. You will be deeply involved in the entire model training process, from data gathering and processing to designing novel architectures and training methodologies.

You’ll Work Across:
- Large-scale audio training runs
- Performance optimization of our training stack
- Audio dataset collection, processing, and evaluation
- Architecture and training methodology ablations and improvements

What We're Looking For / Requirements:
- Strong research taste and intuition
- The ability to work through a research project from conception to execution to write-up
- Strong implementation and prototyping ability (can take an idea from conception to experimentation quickly)
- The ability to work well with others in a fast-paced research setting
- Excellent communication and collaboration skills, and the ability to work effectively on both research and engineering implementation at scale

Qualifications / Additional Skills:
- Expertise and intuition for training models in the audio domain, including text-to-speech, ASR, speech-to-speech, speech emotion recognition, or other models
- Experience in training audio autoencoders
- Understanding of signal processing, especially of audio signals
- Experience with diffusion models, consistency models, or GANs
- Experience with training on large-scale (multi-node) GPU clusters
- Strong grasp of proper experimental methodology for running rigorous ablations and other hypothesis testing
- Understanding of and interest in large-scale, highly parallel data processing pipelines
- Proficiency with PyTorch and Python
- Experience contributing to large pre-existing codebases and rapidly getting up to speed
- Previously published machine learning research in well-respected venues
- Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Mathematics, Physics, Machine Learning)

Why Work at Zyphra:
- Our research methodology is grounded in methodical, step-by-step approaches to ambitious goals
- Deep research and engineering excellence are equally valued
- We strongly value new and crazy ideas and are very willing to bet big on them
- We move as quickly as we can; we aim to keep the bar to impact as low as possible
- We all enjoy what we do and love discussing AI

Benefits and Perks:
- Comprehensive medical, dental, vision, and FSA plans
- Competitive compensation and 401(k) plan
- Relocation and immigration support on a case-by-case basis
- In-office snacks and meals provided
- Unlimited PTO and company holidays
- In-person team in San Francisco with a collaborative, high-energy environment


