Machine Learning Engineer, Model Evaluations (Speech LLM) - San Francisco

$180k - $270k

Plaud

About Plaud Inc. Plaud is building the world’s most trusted AI work companion for professionals to elevate productivity and performance through note‑taking solutions, loved by over 1,500,000 users worldwide since 2023. With a mission to amplify human intelligence, Plaud is building the next‑generation intelligence infrastructure and interfaces to capture, extract, and utilize what you say, hear, see, and think. Plaud Inc. is a Delaware‑incorporated, San Francisco‑based company pushing the boundary of human‑AI intelligence through a hardware‑software combination. With SOC 2, HIPAA, GDPR, ISO27001, ISO27701, and EN18031 compliance, Plaud is committed to the highest standards of data security and privacy protection. To learn more about Plaud, please visit and follow along on Instagram, X, Facebook, LinkedIn, and YouTube. Why You Should Join Us Plaud is building the next generation intelligence infrastructure and interfaces to capture, extract, and utilize intelligence from what people say, hear, see, and think. Plaud is a bootstrapped, skyrocketing, profitable company with a $250M revenue run rate achieved in just three years. Define the next‑gen paradigm for human‑AI interaction. Gain exposure to cutting‑edge AI for Pro tools and play a direct role in our global expansion. Work with passionate teammates who value innovation, collaboration, and customer success. Grow your career in a culture that champions continuous learning and fast career development. Market‑competitive compensation, global exposure, and a vibrant, creativity‑fueled work atmosphere. You may be a good fit if you: Have a passion for turning ambiguous, subjective concepts like a voice's naturalness, expressiveness, or conversational cadence into clear, defensible, and automated metrics that researchers and leadership can rely on. Possess strong software engineering skills (especially in Python) and have experience building reliable distributed systems, data pipelines, or evaluation harnesses that can run at scale against live model checkpoints. Can deeply partner with ML researchers to define exactly what "good" looks like for a Speech LLM, translating capabilities (like ASR robustness in noisy environments or TTS emotional steerability) into measurable benchmarks. Are comfortable building and owning dashboards that track model health during training, improving signal‑to‑noise ratios, reducing evaluation latency, and making performance regressions impossible to miss. Rapidly debug anomalous mid‑training results to determine if a drop in performance stems from the model architecture, corrupted data, or infrastructure. Communicate complex statistical results and model behaviors clearly to both technical and non‑technical stakeholders. Strong candidates may also have experience with: Speech Metrics: Deep familiarity with both traditional (WER, CER, PESQ, etc) and modern audio evaluation frameworks (automated MOS scoring). LLM‑as‑a‑Judge: Using frontier models or finetune multi‑modal LLMs to evaluate the conversational logic, transcription accuracy, audio quality, and reasoning of audio models. Human Evaluation: Managing large‑scale crowdsourcing operations or preference data collection to support RLHF/DPO efforts. Observability: A strong background in statistics and experimental design, paired with experience building trusted tracking dashboards (e.g., Weights & Biases, MLflow). Adversarial Datasets: Curating complex datasets to test edge cases, such as heavy accents, overlapping speech, or highly noisy acoustic environments. What We Offer Founding Team Initiative: Opportunity to be an early, foundational member of our core SpeechLLM lab, with meaningful ownership and impact on a fast‑growing startup. Competitive Compensation: $180K - $270K base salary + performance bonus + Equity. Comprehensive Benefits: Top‑tier healthcare for employees and dependents, including dental and vision, and a generous employer subsidy. Retirement Planning: 401(k) plan for full‑time employees with company matching. Paid Time Off: Unlimited PTO, plus 13 paid holidays. New Parent Leave: 12 weeks of paid time off to spend time with your new family, regardless of gender. Hybrid Office: Minimum of 3x in‑office per week to foster highly collaborative, fast‑paced research. Gear & Perks: Choice of top‑of‑the‑line laptops/workstations, annual offsites, and a fully stocked office. Plaud is and will continue to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristics. #J-18808-Ljbffr Plaud

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the Machine Learning Engineer, Model Evaluations (Speech LLM) - San Francisco in San Francisco, CA vacancy

Machine Learning Engineer, Inference & Serving (Speech LLM) - San Francisco
$200k
...deploying high‑throughput, ultra‑low‑latency inference engines for large language models or foundational speech models. Understand the intricate trade‑offs between... ...: Deep, under‑the‑hood familiarity with modern LLM serving frameworks like vLLM, TensorRT‑LLM, SGLang,...
Suggested
Full time
Work at office
Plaud
San Francisco, CA
4 days ago
Machine Learning Engineer, Speech LLM Training - San Francisco
$180k - $270k
...culture that champions continuous learning and fast career development.... ...large‑scale audio or speech models from the ground up, whether... ...intersection of research and engineering, eager to design novel sequence... ...(e.g., vLLM, TensorRT‑LLM, SGLang) to minimize latency...
Suggested
Full time
Work at office
Plaud
San Francisco, CA
2 days ago
Speech LLM Model Evaluations Engineer - Hybrid
$180k - $270k
Plaud in San Francisco is seeking skilled professionals to join their fast-growing team dedicated to... ...productivity. The role involves collaborating with machine learning researchers and engineering teams to define metrics, improve model capabilities, and ensure effective...
Suggested
Plaud
San Francisco, CA
1 day ago
Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI San Francisco,[...]
$180.6k - $315k
...foundation healthtech search models. If you are excited... ...systems to directly learn from both process +... ...’d have: 5+ years of LLM training in a production... ...in the locations of San Francisco, New York, Seattle is:... ...ensure a fair and thorough evaluation of all applicants....
Suggested
Full time
Scale AI, Inc.
San Francisco, CA
5 days ago
Machine Learning Engineer San Francisco, CA
...We are a team of engineers and researchers with an... ...open source developers, machine learning researchers, and... ...to hire in-person in San Francisco. We believe that having... ...Design, train, and ship models that guide reviewers... ...generate concise summaries Evaluate retrieval and RAG for...
Suggested
Relocation package
Assert
San Francisco, CA
3 days ago
Product Manager, Model Behavior at OpenAI San Francisco, CA
$245k - $310k
Product Manager, Model Behavior San Francisco, CA. About the Team The Model Behavior... ...closely with research, engineering, product design, and... ...tools, and processes for evaluating, tuning, and iterating on... ...‑planning support Annual learning & development stipend ($1,...
Work at office
Relocation package
kozmetickesluzby.vecnakraska.sk - Jobboard
San Francisco, CA
4 days ago
ML Evaluation Engineer: Benchmark & Model Quality
A leading AI solutions company in San Francisco is seeking an ML Eval Engineer to design evaluation benchmarks and improve model performance. This role involves working with unstructured enterprise data and collaborating closely with the ML and engineering teams. You will...
Reducto
San Francisco, CA
2 days ago
ML Evaluation Engineer: Benchmark & Model Quality
A cutting-edge AI company located in San Francisco is seeking an ML Eval Engineer to enhance model evaluations and ensure quality metrics. This role involves designing benchmarks, collaborating with teams to identify model weaknesses, and developing automated processes....
Reducto, Inc.
San Francisco, CA
3 days ago
Product Manager, Claude Code Model Performance San Francisco, CA | New York City, NY
$305k
...group of committed researchers, engineers, policy experts, and... ...Product Manager on Claude Code's model performance team, you will... ...behavior, prompt engineering, and evaluation methodology Are a systems... ...and love solving puzzles San Francisco and Seattle only The annual...
Visa sponsorship
Anthropic
San Francisco, CA
3 days ago
Technical Business Development (Model Labs) San Francisco
$220k - $270k
...partner companies, particularly Model Labs, focused on driving... ...about trends in AI, machine learning, and generative media to proactively... ...diverse teams, including engineering, product, and customer... ...currently hiring in downtown San Francisco. We offer visa sponsorship...
Temporary work
Currently hiring
Relocation
Visa sponsorship
Fal
San Francisco, CA
1 day ago
ML Engineer — LLM Evaluation
...frontier research for their next generation of LLM products. Join us if you: Wish to work... ...advancement. Responsibilities Own LLM evaluation processes and methods with a focus on... ...abrupt shift in focus. You must be able to learn, implement, and extend state-of-the-art...
Local area
Shift work
Capitolis
San Francisco, CA
2 days ago
Applied ML Engineer, Speech
...reinvent the way people learn, starting with... ...based throughout San Francisco, Seoul, Tokyo,... ...for an experienced Machine Learning Engineer to join our team and... ...develop cutting-edge speech recognition models that help teach... ...such as training/evaluation datasets and labeling...
Live in
Work at office
Worldwide
Dormont Manufacturing Co
San Francisco, CA
1 day ago
Machine Learning Engineer: Agentic AI & LLM Systems
...Digital Space LLC is seeking a Machine Learning Expert to enhance customer... ..., collaborating with engineering and product teams, and significantly... ...a PhD with experience in LLM and has a passion for... ...This position is based in the San Francisco Bay Area. The company is...
United States Digital Space LLC
San Francisco, CA
3 days ago
Research Lead, Model Evaluation & Training Insights
...Research Lead for the Training Insights team to shape the evaluation of model capabilities. This hands-on leadership role involves developing... ...and a passion for AI safety. This role is based primarily in San Francisco with remote-friendly options. #J-18808-Ljbffr Anthropic
Remote work
Anthropic
San Francisco, CA
1 day ago
Senior or Staff ML Systems Engineer, LLMs - San Francisco Only
$200k - $240k
...fraud and financial crime. Our AI engineering team focuses on next‑... ...applications, especially large language models and agentic systems, building... ...Staff ML Systems Engineer - LLM Build reusable CI/CD workflows for model training, evaluation, and deployment using Langfuse...
Dormont Manufacturing Co
San Francisco, CA
3 days ago
Remote Language Model Evaluator - Fact-Check & Analysis
Mercor is seeking an AI model evaluator based in San Francisco, California. The role involves fact-checking using trusted sources and generating high-quality data to assess response quality. Candidates should have excellent writing skills and a strong attention to detail...
Remote job
Mercor
San Francisco, CA
5 days ago
Senior Machine Learning Engineer
...Senior Machine Learning Engineer Location: San Francisco About Hum.ai Hum.ai is building... ...generative transformer diffusion models, designing next-gen... ...models (beyond just LLM fine-tuning). This role... ...design and model evaluation frameworks Building agentic...
Work experience placement
Remote work
Humai
San Francisco, CA
4 days ago
Machine Learning Engineer
$250k - $300k
...involved. Headquartered in San Francisco, we have secured $100M in... .... The Role As a Machine Learning Engineer at Ambience, you will help... ...research opportunities. Scale Model Evaluation: Collaborate with clinical... ...particularly in LLMs, NLP, and speech recognition—and champion...
Work at office
Dormont Manufacturing Company
San Francisco, CA
1 day ago
Founding Machine Learning Engineer
...a new foundation model for investing in U... ...Location & Workstyle San Francisco Bay Area (near... ...hiring our Founding ML Engineer, the first full-time machine learning hire who will turn... ...backtesting and evaluation frameworks with... .... Experience with LLM/RAG workflows for...
Full time
Immediate start
Relocation
Visa sponsorship
Relocation package
Poesis LLC
San Francisco, CA
3 days ago
Benchmarking Research Engineer: Frontier Model Evaluations
Refresh AI is seeking a Research Engineer in San Francisco to push the boundaries of benchmarking technology... ...build benchmarks that labs use for evaluating coding abilities and computer-use... ...require expertise in reinforcement learning and supervised fine-tuning, as well as...
Full time
Refresh AI
San Francisco, CA
2 days ago
ML Engineer
...ML Engineer San Francisco, California, United States... ...businesses learn from and optimize... ...Applied AI, Machine Learning, and... ...intelligence from model optimization... ...enabling natural speech interaction... ...innovation in LLM and audio ML applications... ...training and evaluation....
Full time
Catalyst Labs, LLC
San Francisco, CA
4 days ago
Founding Machine Learning Engineer
$150k - $300k
...Founding ML Engineer Location: San Francisco, CA Company Stage: Early-... ...boundaries of applied machine learning to power the next... ...—from research and modeling to production deployment... ...Continuously evaluate and improve model performance... ...Experience scaling LLM inference pipelines...
Visa sponsorship
Recruiting from Scratch
San Francisco, CA
1 day ago
Senior Machine Learning Engineer, Animation Integration
$180k - $270k
...Location Genies LA; Genies San Francisco Employment Type Full... ...AI persona. Senior Machine Learning Engineer to join our Avatar... ...work across data, modeling, and runtime systems... ..., fine-tuning, and evaluation. Build data... ...Collaborate with Behavior and LLM teams to integrate...
Full time
Work experience placement
Work at office
Cerebras
San Francisco, CA
3 days ago
Spanish Voice Actors for AI Training (in studio) - San Francisco
...with voice acting experience to create content for generative AI models to ensure accuracy and relevance across a wide array of topics... ...freelancers to record in person in a studio that we select in San Francisco. Travel expenses will generally not be reimbursed, so please...
Freelance
Upwork
San Francisco, CA
3 days ago
Staff ML Inference Engineer — Model Efficiency (Remote)
Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems while enhancing core performance... ...skills in C++ or Python and insights into the LLM inference ecosystem. A commitment to diversity and inclusive...
Remote job
Jaide Health
San Francisco, CA
2 days ago
Staff ML Engineer: End-to-End Model Training & Deployment
Parallel Bio in San Francisco is looking for a candidate to own the training pipeline behind models essential for both their search stack... ...fine-tuning and evaluating models for safe deployment... ...deep understanding of modern machine learning models and care about practical...
Parallel Bio
San Francisco, CA
2 days ago
AI & ML Engineer: Scale NLP & LLM Solutions
$180k - $260k
Slope is looking for an AI and ML Engineer to design and build large-scale NLP and LLM systems. You'll have the chance... ...creating prompt libraries and evaluating content performance, all while collaborating... ...and fast-paced execution in San Francisco or NYC, with a competitive...
Slope
San Francisco, CA
4 days ago
Product Designer / Demo Storyteller - San Francisco, USA
...company currently operating out of San Francisco, building AI‑enabled systems... ...devices, user experience, machine data, and real‑world... ...czak, founder of Nethermind. To learn more about the company, vision... ...work directly with founders and engineers. Ability to move quickly from...
Immediate start
Nethermind
San Francisco, CA
14 hours ago
AI Data Quality & Model Evaluation Associate
Welocalize is seeking a Data Quality Associate to evaluate AI model outputs and provide structured feedback. This is a full-time, onsite role located in San Francisco. The ideal candidate possesses a Bachelor's degree and has 1-2 years of professional writing experience...
Full time
Welocalize
San Francisco, CA
2 days ago
Senior Machine Learning Engineer, AI Agent
$199k - $298.4k
...Location Genies San Francisco; Genies LA Employment... ...Hybrid Department Engineering Compensation Machine Learning Zone A $199K – $298... ...solutions using foundation models, prompt engineering... ...benchmarks to evaluate and continuously... ...Fine‑tune and train LLM/ML models when off-...
Full time
Work experience placement
Work at office
Local area
Flexible hours
Voiceflow
San Francisco, CA
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning Engineer, Model Evaluations (Speech LLM) - San Francisco. Be the first to apply!