Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Machine Learning Engineer, Model Evaluations (Speech LLM) - San Francisco

$180k - $270k

Plaud

About Plaud Inc. Plaud is building the world’s most trusted AI work companion for professionals to elevate productivity and performance through note‑taking solutions, loved by over 1,500,000 users worldwide since 2023. With a mission to amplify human intelligence, Plaud is building the next‑generation intelligence infrastructure and interfaces to capture, extract, and utilize what you say, hear, see, and think. Plaud Inc. is a Delaware‑incorporated, San Francisco‑based company pushing the boundary of human‑AI intelligence through a hardware‑software combination. With SOC 2, HIPAA, GDPR, ISO27001, ISO27701, and EN18031 compliance, Plaud is committed to the highest standards of data security and privacy protection. To learn more about Plaud, please visit and follow along on Instagram, X, Facebook, LinkedIn, and YouTube. Why You Should Join Us Plaud is building the next generation intelligence infrastructure and interfaces to capture, extract, and utilize intelligence from what people say, hear, see, and think. Plaud is a bootstrapped, skyrocketing, profitable company with a $250M revenue run rate achieved in just three years. Define the next‑gen paradigm for human‑AI interaction. Gain exposure to cutting‑edge AI for Pro tools and play a direct role in our global expansion. Work with passionate teammates who value innovation, collaboration, and customer success. Grow your career in a culture that champions continuous learning and fast career development. Market‑competitive compensation, global exposure, and a vibrant, creativity‑fueled work atmosphere. You may be a good fit if you: Have a passion for turning ambiguous, subjective concepts like a voice's naturalness, expressiveness, or conversational cadence into clear, defensible, and automated metrics that researchers and leadership can rely on. Possess strong software engineering skills (especially in Python) and have experience building reliable distributed systems, data pipelines, or evaluation harnesses that can run at scale against live model checkpoints. Can deeply partner with ML researchers to define exactly what "good" looks like for a Speech LLM, translating capabilities (like ASR robustness in noisy environments or TTS emotional steerability) into measurable benchmarks. Are comfortable building and owning dashboards that track model health during training, improving signal‑to‑noise ratios, reducing evaluation latency, and making performance regressions impossible to miss. Rapidly debug anomalous mid‑training results to determine if a drop in performance stems from the model architecture, corrupted data, or infrastructure. Communicate complex statistical results and model behaviors clearly to both technical and non‑technical stakeholders. Strong candidates may also have experience with: Speech Metrics: Deep familiarity with both traditional (WER, CER, PESQ, etc) and modern audio evaluation frameworks (automated MOS scoring). LLM‑as‑a‑Judge: Using frontier models or finetune multi‑modal LLMs to evaluate the conversational logic, transcription accuracy, audio quality, and reasoning of audio models. Human Evaluation: Managing large‑scale crowdsourcing operations or preference data collection to support RLHF/DPO efforts. Observability: A strong background in statistics and experimental design, paired with experience building trusted tracking dashboards (e.g., Weights & Biases, MLflow). Adversarial Datasets: Curating complex datasets to test edge cases, such as heavy accents, overlapping speech, or highly noisy acoustic environments. What We Offer Founding Team Initiative: Opportunity to be an early, foundational member of our core SpeechLLM lab, with meaningful ownership and impact on a fast‑growing startup. Competitive Compensation: $180K - $270K base salary + performance bonus + Equity. Comprehensive Benefits: Top‑tier healthcare for employees and dependents, including dental and vision, and a generous employer subsidy. Retirement Planning: 401(k) plan for full‑time employees with company matching. Paid Time Off: Unlimited PTO, plus 13 paid holidays. New Parent Leave: 12 weeks of paid time off to spend time with your new family, regardless of gender. Hybrid Office: Minimum of 3x in‑office per week to foster highly collaborative, fast‑paced research. Gear & Perks: Choice of top‑of‑the‑line laptops/workstations, annual offsites, and a fully stocked office. Plaud is and will continue to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristics. #J-18808-Ljbffr

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Machine Learning Engineer, Model Evaluations (Speech LLM) - San Francisco in San Francisco, CA vacancy
  • $180k - $270k

     ...Delaware-incorporated, San Francisco-based company pushing the...  ...privacy protection. To learn more about Plaud,...  ...-low-latency inference engines for large language models or foundational speech models. Understand the...  ...familiarity with modern LLM serving frameworks like... 
    Suggested
    Full time
    Work at office
    Worldwide

    Plaud

    San Francisco, CA
    3 days ago
  • $200k

     ...Delaware-incorporated, San Francisco-based company pushing...  ...privacy protection. To learn more about Plaud,...  ...large-scale audio or speech models from the ground up, whether...  ...of research and engineering, eager to design novel...  ...(e.g., vLLM, TensorRT-LLM, SGLang) to minimize latency... 
    Suggested
    Full time
    Work at office
    Worldwide

    Plaud

    San Francisco, CA
    3 days ago
  • $240.45k - $300.3k

     ...Senior Machine Learning Engineer - Model Evaluations, Public Sector The Public Sector ML team at Scale deploys...  ...robustness, and safety metrics, including LLM-judge-based evaluations. Design...  ...time position in the locations of San Francisco, New York, Seattle is: $240,450—$3... 
    Suggested
    Full time

    Scale AI

    San Francisco, CA
    9 days ago
  • $250k - $300k

     ...Machine Learning Engineer - Speech Model Training $250,000 - $300,000 San Francisco, CA Hybrid, 3x per week in office Full time / Permanent In this role...  ...inference performance with vLLM, TensorRT-LLM, or SGLang Apply RL alignment techniques... 
    Suggested
    Permanent employment
    Full time
    Work at office
    Immediate start
    Worldwide

    DeepRec.ai

    San Francisco, CA
    3 days ago
  • $180k - $270k

     ...Plaud in San Francisco is seeking skilled professionals to join their fast-growing team dedicated to...  ...productivity. The role involves collaborating with machine learning researchers and engineering teams to define metrics, improve model capabilities, and ensure effective... 
    Suggested

    Plaud

    San Francisco, CA
    3 days ago
  •  ...We are a team of engineers and researchers with an...  ...open source developers, machine learning researchers, and...  ...to hire in-person in San Francisco. We believe that having...  ...Design, train, and ship models that guide reviewers...  ...generate concise summaries Evaluate retrieval and RAG for... 
    Relocation package

    Assert

    San Francisco, CA
    3 days ago
  • $200k - $400k

     ...Machine Learning Engineer, Life Sciences About Goodfire Behind our name...  ...corporation headquartered in San Francisco with a team of the world’s...  ...platform for training, evaluating, and deploying interpretable...  ...and biological foundation models(e.g., genomic foundation... 

    Goodfire

    San Francisco, CA
    2 days ago
  • $320k

     ...committed researchers, engineers, policy experts, and...  ...to build the evaluations that tell us — and the...  ...leadership use to monitor model health during training...  ...Problems in AI Safety, and Learning from Human...  ...corporation headquartered in San Francisco. We offer competitive... 
    Remote job
    Work at office
    Visa sponsorship
    Flexible hours
    San Francisco, CA
    a month ago
  • $200k - $250k

     ...Job Description Founding Machine Learning Engineer - On-site - San Francisco, CA Location: San...  ...reinforcement learning models that operate in high-stakes...  ...reinforcement learning, evaluation systems, and the...  ..., perception models, or LLM-powered products. Ability... 
    Work at office
    Immediate start

    Connect Staffing Professional

    San Francisco, CA
    28 days ago
  •  ...A leading AI solutions company in San Francisco is seeking an ML Eval Engineer to design evaluation benchmarks and improve model performance. This role involves working with unstructured enterprise data and collaborating closely with the ML and engineering teams. You... 

    Reducto

    San Francisco, CA
    2 days ago
  • $204k - $259k

     ...A leading autonomous driving technology company in San Francisco is seeking an experienced engineer to develop evaluation techniques for machine learning models. The role involves metrics development, simulation strategies, and collaboration with top-tier engineering teams... 

    Waymo

    San Francisco, CA
    2 days ago
  • A cutting-edge AI company located in San Francisco is seeking an ML Eval Engineer to enhance model evaluations and ensure quality metrics. This role involves designing benchmarks, collaborating with teams to identify model weaknesses, and developing automated processes.... 

    Reducto, Inc.

    San Francisco, CA
    2 days ago
  • $147.6k - $274k

     ...Machine Learning Engineer - Infra San Francisco, CA The Opportunity We are revolutionizing drug discovery with cutting-edge machine learning techniques...  ...Preferred Extensive experience with large‑scale ML model platforms and tools. Deep understanding and... 
    Relocation package

    ESR Healthcare

    San Francisco, CA
    2 days ago
  •  ...intersection of software engineering and applied AI,...  ...in language models and agent frameworks...  ...leveraging modern LLM's (strong plus for...  ...Familiarity with AI evaluation techniques and...  ...notified about new Machine Learning Engineer jobs in San Francisco, CA . Oakland,... 
    Full time
    Immediate start

    Greylock Partners

    San Francisco, CA
    2 days ago
  •  ...Job Description: Machine Learning Engineer (Operations) Location: South San Francisco CA (Hybrid, 3 days/week) (Not remote...  ..., managing, and deploying ML models using core AWS services such as...  ...use cases. Ability to monitor LLM performance, fine-tune... 
    3 days per week

    ESR Healthcare

    San Bruno, CA
    1 day ago
  • $204k - $259k

     ...Senior Machine Learning Engineer – VLM/LLM Evaluation Waymo is an autonomous driving technology company with the mission to be the world's most trusted...  ...learning, learning from demonstration, generative modeling, Bayesian inference, hierarchical learning, and robust... 
    Full time
    Temporary work
    Remote work

    Waymo

    San Francisco, CA
    1 day ago
  •  ...the person who takes the newest open-source models (image, video, 3D, audio, multimodal…) and...  ...to run natively in the ComfyUI core engine Design and build the native nodes that expose...  ...next Bonus: If you've worked with diffusion/LLM models before or built custom nodes for... 

    Comfy

    San Francisco, CA
    2 days ago
  •  ...do cutting‑edge deep learning research—conducting experiments...  ...more human‑like machine intelligence. Note:...  ...requires being onsite in San Francisco. Example projects:...  ...to be a world‑class ML engineer. A fan of paired programming...  ...its own foundation models optimized for... 

    Stars Arena

    San Francisco, CA
    2 days ago
  • $220k - $270k

     ...infrastructure, tools, and model access that teams need to move...  ...about trends in AI, machine learning, and generative media to proactively...  ...diverse teams, including engineering, product, and customer...  ...currently hiring in downtown San Francisco. We offer visa sponsorship... 
    Contract work
    Temporary work
    Currently hiring
    Relocation
    Visa sponsorship

    fal

    San Francisco, CA
    5 days ago
  • $140k - $160k

     ...Location: San Francisco, CA Reports To: VP of Engineering FLSA Status: Exempt Employment...  ...powers analytics, machine learning, and AI‑driven...  ...data platforms and LLM‑enabled...  ...enterprise‑grade data models, architect RAG systems...  ..., monitoring, and evaluation frameworks to mitigate... 
    Full time
    Local area
    Flexible hours

    Argo AI

    San Francisco, CA
    3 days ago
  • $320k - $405k

     ...committed researchers, engineers, policy experts, and...  ...an experienced Machine Learning Systems Engineer to join...  ...directly impacts how our models learn from and...  ...analytical skills and can evaluate the impact of engineering...  ...headquartered in San Francisco. We offer competitive... 
    Work at office
    Visa sponsorship
    Flexible hours
    San Francisco, CA
    more than 2 months ago
  •  ...committed researchers, engineers, policy experts, and...  ...that train AI models like Claude. You're excited...  ...work at the frontier of machine learning, implementing and...  ...systems Large scale LLM training Python...  ...corporation headquartered in San Francisco. We offer competitive... 
    Work at office
    Visa sponsorship
    Flexible hours
    San Francisco, CA
    more than 2 months ago
  • $154k - $188k

     ...Machine Learning Application Engineer II At Maze Therapeutics, we believe precision medicine...  ...deploy machine learning models to support workflows in...  ...data analysis). Lead the evaluation and integration of Large...  ...employees located in the San Francisco Bay Area is $154,000 -$18... 

    Initial Therapeutics, Inc.

    South San Francisco, CA
    2 days ago
  • $350k

     ...committed researchers, engineers, policy experts, and...  ...'s production models undergo sophisticated...  ...model fine-tuning and evaluation Develop tools to measure...  ...in Python, deep learning frameworks, and distributed...  ...headquartered in San Francisco. We offer competitive... 
    Work at office
    Visa sponsorship
    Flexible hours
    San Francisco, CA
    more than 2 months ago
  • $200k - $260k

     ...Senior Machine Learning Engineer, Voice AI San Francisco About the Role Together AI...  ...applications — serving speech-to-text and text-to-speech models with best-in-class...  ...inference engines like TRT-LLM and SGLang to...  ...infrastructure. Build quality evaluation frameworks that... 
    Full time

    Together AI

    San Francisco, CA
    4 days ago
  • $148.5k - $266.2k

     ...Machine Learning Engineering Manager, Model Delivery page is loaded## Machine Learning Engineering Manager, Model Deliverylocations: San Francisco, CA, USA: California, USA - Remotetime type: Full timeposted...  ...deployment, monitoring, evaluation, reliability, and operational... 
    Remote work

    Autodesk

    San Francisco, CA
    2 days ago
  •  ...frontier research for their next generation of LLM products. Join us if you: Wish to work...  ...advancement. Responsibilities Own LLM evaluation processes and methods with a focus on...  ...abrupt shift in focus. You must be able to learn, implement, and extend state-of-the-art... 
    Local area
    Shift work

    Capitolis

    San Francisco, CA
    1 day ago
  •  ...reinvent the way people learn, starting with...  ...based throughout San Francisco, Seoul, Tokyo,...  ...for an experienced Machine Learning Engineer to join our team and...  ...develop cutting-edge speech recognition models that help teach...  ...such as training/evaluation datasets and labeling... 
    Live in
    Work at office
    Worldwide

    Speak LLC

    San Francisco, CA
    3 days ago
  • $10k

     ...Machine Learning Engineer Full Time (San Francisco) We’re looking for someone to continue leveraging our vast trove of medical imaging data in order to train and deploy deep neural network models. These models enable our surgical robot to understand and reason about... 
    Full time

    ESR Healthcare

    San Francisco, CA
    15 days ago
  • $155k - $235k

     ...Senior AI Data Engineer### Job summarySan Francisco### Work modelHybrid...  ...and evolve data modeling and metadata patterns...  ...or LLM-powered applications...  ...trained, aligned and evaluated (RLHF, fine-tuning...  ...location. San Francisco is our...  ...employee equity* Learning and development... 
    Local area
    Home office
    Flexible hours

    byebyeoffice

    San Francisco, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning Engineer, Model Evaluations (Speech LLM) - San Francisco. Be the first to apply!