Machine Learning Engineer, Data Quality
RIME
Rime builds voice AI for enterprises running customer experiences at scale. Our text-to-speech models are purpose-built for high-volume conversational deployments, engineered for the pronunciation accuracy, latency, and deployment flexibility that production environments actually demand.
We started from a different premise than the rest of the field: voice AI isn't bottlenecked by model architecture. It's bottlenecked by data. So before we trained a single model, we built our own corpus: full-duplex, studio-quality conversational speech, recorded and annotated by PhD linguists. That's our moat. It's also why enterprises pick Rime when pilots need to convert into production.
Role Overview
We're hiring a Machine Learning Engineer, Data Quality to own the operational data pipeline that produces our training corpus end-to-end — and to bring a vision for where it should go next. We take that seriously: if you can plan an overhaul, justify it, and orchestrate the human and machine migration work, we'll do it together.
This is a sociotechnical role. You'll be in the loop on everything and talking to everyone who touches the data across 42+ languages: 50+ annotators, 32+ external vendors, an in-house recording studio, and the systems behind them (ingestion, quality assurance, pre-processing, cataloging, export to training). At any given moment, dozens of deliverables are in flight, each on its own clock.
The people who thrive here want to listen to the audio clips and design the system that scales their judgment to the next million. You don't need deep expertise across the whole stack on day one — you need the judgment to know what good looks like at each stage, and the engineering depth to build (or learn to build) the parts that need building.
What You'll Own
Linguist- and annotation-team-facing tooling: Annotation UI, project-management workflow, QC dashboards. This is the surface the frontline uses every day.
Vendor data QA workflows: A large share of incoming data arrives from vendors in various states and needs to pass QA before it can be trusted. The tooling, routing, and tracking for that work is yours.
Quality systems across the network: The signals, dashboards, and review loops that surface when a corner of the network is drifting — a vendor's transcripts getting sloppy, an annotator's IAA slipping, a language's gold set going stale — before it lands in the training pool.
End-to-end audio annotation pipeline: Some stages exist only as prototypes; productionizing and rebuilding them is already in flight.
Dataset versioning and experimenter tooling: the model team will want to subset the vetted pool ("speakers X/Y/Z, duration 3–12s, quality > 0.8") into reproducible training manifests. The query interface, manifest format, and lineage tracking are all yours.
Pipelines for full- and half-duplex training data
What We're Looking For
Instinct for data quality. You can tell good data from bad. You know what "bad" looks like in this specific domain — not just generic "anomalies," but the particular ways audio and transcripts go wrong.
Willing to look at the data. Open the file. Listen to the clip. Read the transcript. You don't outsource the first-pass checks to a script.
Opinionated, and curious when challenged. You arrive with a perspective informed by what you've seen work and what you've seen fail — and you're equally interested in pressure-testing it. A "what about..." question isn't a threat; it's where the work happens.
Project sense. You can hold a lot of moving parts in your head — what's in flight, what's blocked, what's about to slip — and keep the picture clear enough that others can step into it.
Designs, doesn't just execute. You want to take on more design responsibility over time, not less. You're looking for a role where you (co-)own things end-to-end, not one where someone hands you tasks to implement.
Comfort being out of your depth at the boundary. You'll sometimes debug code you didn't write in tools you don't use daily. You should find this energizing, not threatening.
Solid software and data engineering fundamentals. Python, schemas you can reason about, production data pipelines you've built and operated on cloud-native infrastructure.
Nice to have — in rough order from hardest-to-acquire to most learnable:
Audio pipeline tooling: ffmpeg, Silero VAD, faster-whisper, neural audio codecs (Encodec, SNAC, SoundStream).
TTS frontend work: G2P (phonemizer, g2p-en), text normalization (NeMo TN or equivalent), prosody and phoneme alignment.
Annotation platforms: Label Studio, Argilla, or equivalent — particularly customizing or replacing them.
Direct experience with our stack: GCP (Cloud Run, Cloud Batch, GCS, Pub/Sub), Supabase / Postgres. AWS or Azure experience maps fine.
Why Join Rime
Build the data infrastructure behind a category-defining voice AI company.
The pipelines you build determine what models we can train.
Meaningful equity upside.
High ownership, high standards, low bureaucracy.
What We Offer
Competitive base + meaningful early-stage equity
Remote-friendly
Visa sponsorship available
Access to a proprietary, full-duplex, studio-quality conversational speech corpus
Compute and tooling to do the work
Direct influence on the future of voice AI
At Rime, we...
Are outliers
Cut through the hype to focus on the craft
Move fast with agency and freedom
Maintain a growth mindset, finding joy in the struggle
Do the right things, knowing that it'll lead to making money
If that sounds like you too, you'll be a great fit for Rime!



