Machine Learning Engineer, Data Quality
Rime Labs
Rime builds voice AI for enterprises running customer experiences at scale. Our text-to-speech models are purpose-built for high-volume conversational deployments, engineered for the pronunciation accuracy, latency, and deployment flexibility that production environments actually demand.

We started from a different premise than the rest of the field: voice AI isn't bottlenecked by model architecture. It's bottlenecked by data. So before we trained a single model, we built our own corpus: full-duplex, studio-quality conversational speech, recorded and annotated by PhD linguists. That's our moat. It's also why enterprises pick Rime when pilots need to convert into production.

We're backed by top-tier investors including Unusual Ventures, and we've built a team at the intersection of product, research, and craft. Building voice models is an art. We intend to master it. The path is the craft itself: the loop between theory and practice - the shared mental model of how things should behave, met by the reality that doesn't quite conform, sharpened by the meeting.

Role Overview

We're hiring a Machine Learning Engineer, Data Quality to own the operational data pipeline that produces our training corpus end-to-end - and to bring a vision for where it should go next. We take that seriously: if you can plan an overhaul, justify it, and orchestrate the human and machine migration work, we'll do it together.

This is a sociotechnical role. You'll be in the loop on everything and talking to everyone that touches the data across 42+ languages: 50+ annotators, 32+ external vendors and an in-house recording studio, and the systems behind them - ingestion, quality assurance, pre-processing, cataloging, export to training. At any given moment, dozens of deliverables are in flight, each on its own clock. The people who thrive here want to listen to the audio clips and design the system that scales their judgment to the next million.

You don't need deep expertise across the whole stack on day one - you need the judgment to know what good looks like at each stage, and the engineering depth to build (or learn to build) the parts that need building.

What You'll Own
- Linguist- and annotation-team-facing tooling: annotation UI, project management (PM) workflow, QC dashboards. This is the surface the frontline uses every day.
- Vendor data QA workflows: A large share of incoming data arrives from vendors in various states and needs to pass QA before it can be trusted. The tooling, routing, and tracking for that work is yours.
- Quality systems across the network: The signals, dashboards, and review loops that surface when a corner of the network is drifting - a vendor's transcripts getting sloppy, an annotator's IAA slipping, a language's gold set going stale - before it lands in the training pool.
- End-to-end audio annotation pipeline: Some stages exist as prototypes today; productionizing and rebuilding them is already in flight.
- Dataset versioning and experimenter tooling: The model team will want to subset the vetted pool ("speakers X/Y/Z, duration 3-12s, quality > 0.8") into reproducible training manifests. The query interface, manifest format, and lineage tracking are all yours.
- Pipelines for full- and half-duplex training data
- Instinct for data quality. You can tell good data from bad. You know what "bad" looks like in this specific domain - not just generic "anomalies," but the particular ways audio and transcripts go wrong.
- Willing to look at the data. Open the file. Listen to the clip. Read the transcript. You don't outsource the first-pass checks to a script.
- Opinionated, and curious when challenged. You arrive with a perspective informed by what you've seen work and what you've seen fail - and you're equally interested in pressure-testing it. A "what about..." question isn't a threat; it's where the work happens.
- Project sense. You can hold a lot of moving parts in your head - what's in flight, what's blocked, what's about to slip - and keep the picture clear enough that others can step into it.
- Designs, doesn't just execute. You want to take on more design responsibility over time, not less. You're looking for a role where you (co-)own things end-to-end, not one where someone hands you tasks to implement.
- Comfort being out of your depth at the boundary. You'll sometimes debug code you didn't write in tools you don't use daily. You should find this energizing, not threatening.
- Solid software and data engineering fundamentals. Python, schemas you can reason about, production data pipelines you've built and operated on cloud-native infrastructure.
- Audio pipeline tooling: ffmpeg, Silero VAD, faster-whisper, neural audio codecs (Encodec, SNAC, SoundStream).
- TTS frontend work: G2P (phonemizer, g2p-en), text normalization (NeMo TN or equivalent), prosody and phoneme alignment.
- Annotation platforms: Label Studio, Argilla, or equivalent - particularly customizing or replacing them.
- Direct experience with our stack: GCP (Cloud Run, Cloud Batch, GCS, Pub/Sub), Supabase / Postgres. AWS or Azure experience maps fine.
- Build the data infrastructure behind a category-defining voice AI company.
- The pipelines you build determine what models we can train.
- Meaningful equity upside.
- High ownership, high standards, low bureaucracy.
- Competitive base + meaningful early-stage equity
- Remote-friendly
- Visa sponsorship available
- Access to a proprietary, full-duplex, studio-quality conversational speech corpus
- Compute and tooling to do the work
- Direct influence on the future of voice AI
- Are outliers
- Cut through the hype to focus on the craft
- Move fast with agency and freedom
- Maintain a growth mindset, finding joy in the struggle
- Do the right things, knowing that it'll lead to making money
Vacancy posted 10 hours ago


