Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Machine Learning Engineer, LLM Evals & Observability

$200k - $300k

Glean.info

About Glean:

Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry's most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI agents on one secure, open platform. With over 100 enterprise SaaS connectors, flexible LLM choice, and robust APIs, Glean gives organizations the infrastructure to govern, scale, and customize AI across their entire business - without vendor lock-in or costly implementation cycles.

At its core, Glean is redefining how enterprises find, use, and act on knowledge. Its Enterprise Graph and Personal Knowledge Graph map the relationships between people, content, and activity, delivering deeply personalized, context-aware responses for every employee. This foundation powers Glean's agentic capabilities - AI agents that automate real work across teams by accessing the industry's broadest range of data: enterprise and world, structured and unstructured, historical and real-time. The result: measurable business impact through faster onboarding, hours of productivity gained each week, and smarter, safer decisions at every level.

Recognized by Fast Company as one of the World's Most Innovative Companies (Top 10, 2025), by CNBC's Disruptor 50, Bloomberg's AI Startups to Watch (2026), Forbes AI 50, and Gartner's Tech Innovators in Agentic AI, Glean continues to accelerate its global impact. With customers across 50+ industries and 1,000+ employees in more than 25 countries, we're helping the world's largest organizations make every employee AI-fluent, and turning the superintelligent enterprise from concept into reality.

If you're excited to shape how the world works, you'll help build systems used daily across Microsoft Teams, Zoom, ServiceNow, Zendesk, GitHub, and many more - deeply embedded where people get things done. You'll ship agentic capabilities on an open, extensible stack, with the craft and care required for enterprise trust, as we bring Work AI to every employee, in every company.

About the Role:

Building a great AI assistant is only half the battle - knowing whether it's actually great is the other half. Our team owns the measurement and quality layer that make Glean's Assistant and Agents reliably better over time: evaluation pipelines, quality eval-sets, LLM-powered judges, agent observability, and the tooling engineers use to understand what changed and why. It's a rare combination of infrastructure engineering, applied ML, and direct product impact. If you care deeply about quality and want to build the systems that make it measurable, this role is for you.

You will:
  • Design and curate evaluation datasets - sampling strategies, query diversity, and golden sets that give reliable, representative coverage of real assistant behavior.
  • Build and maintain large-scale evaluation pipelines that measure assistant quality across thousands of real user queries.
  • Build LLM-powered judges that score metrics like correctness, completeness, and response quality, and align them against human judgment.
  • Evaluate new models and product changes before they ship - providing the quality signal that gates launches and prevents regressions.
  • Build observability infrastructure for AI agents: trace enrichment, data pipelines, and dashboards that make assistant behavior inspectable.
  • Close the loop between quality measurement and improvement using eval results, customer feedback, and techniques like automated prompt iteration to help drive concrete gains in assistant behavior.
  • Collaborate with engineers across the company to make evals a first-class part of how we ship.
About you:
  • 2+ years of software engineering experience with strong coding skills.
  • Strong backend fundamentals in Go and Python; comfortable with distributed data pipelines.
  • Experience working with LLM evaluation, reinforcement learning from human feedback, natural language processing, or other large systems involving machine learning.
  • Analytically rigorous - you think carefully about what offline metrics actually predict about real user experience.
  • Thrive in a customer-focused, tight-knit and cross-functional environment - being a team player and willing to take on whatever is most impactful for the company
  • You care about quality - not just in the systems you build, but in the product you're helping measure and improve.
Location:
  • This role is hybrid (3-4 days a week in one of our SF Bay Area offices)

Compensation & Benefits:

The standard base salary range for this position is $200,000 - $300,000 annually. Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for variable compensation, equity, and benefits.

We offer a comprehensive benefits package including competitive compensation, Medical, Vision, and Dental coverage, generous time-off policy, and the opportunity to contribute to your 401k plan to support your long-term goals. When you join, you'll receive a home office improvement stipend, as well as an annual education and wellness stipends to support your growth and wellbeing. We foster a vibrant company culture through regular events, and provide healthy lunches daily to keep you fueled and focused.

We are a diverse bunch of people and we want to continue to attract and retain a diverse range of people into our organization. We're committed to an inclusive and diverse company. We do not discriminate based on gender, ethnicity, sexual orientation, religion, civil or family status, age, disability, or race.

#LI-HYBRID

AI-First Mindset at Glean:

At Glean, AI fluency is core to how we work and we're committed to ensuring every new hire feels confident integrating AI into their everyday work. As part of the interview process, you'll complete a brief AI-focused exercise or discussion so we can understand how you think about, design, and use AI to drive impact in your role. Feel free to reference any tools, platforms, or workflows you use today - prior Glean experience isn't required.

Global Data Privacy Notice for Job Candidates and Applicants:

Depending on your location, the General Data Protection Regulation (GDPR), California Consumer Privacy Act (CCPA), or other privacy laws may regulate the way we manage the data of job applicants. Our full notice outlining how data will be processed as part of the application procedure for applicable locations is available in our Privacy Policy . By submitting your application, you are agreeing to our use and processing of your data as required. US applicants and their applications are subject to arbitration of disputes as outlined in our Applicant Arbitration Agreement .

By clicking "Submit Application," I confirm that I have read the Global Data Privacy Notice and the Applicant Arbitration Agreement , and I agree to the terms.
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Machine Learning Engineer, LLM Evals & Observability in Mountain View, CA vacancy
  • $200k - $300k

     ...connectors flexible LLM choice and robust APIs...  ...judges agent observability and the tooling engineers use to understand what...  ...the company to make evals a first-class part of...  ...evaluation reinforcement learning from human feedback...  ...systems involving machine learning. ~ Analytically... 
    Suggested
    Full time
    Home office
    Flexible hours
    3 days per week

    Glean

    Mountain View, CA
    5 days ago
  • $204k - $259k

     ...Senior Machine Learning Engineer, Perception LLM/VLM Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver... 
    Suggested
    Full time
    Remote work

    Waymo

    Mountain View, CA
    5 days ago
  •  ...running. Our Team's Vision: Our Engineering team is shaping the future of...  ...and prompt engineering. ~ Agentic observability using Langfuse, Evals frameworks for Testing/Resilience...  ...scale. AI Ops: Experience with LLM deployment optimization (e.g., vLLM,... 
    Suggested
    Immediate start

    Illumio

    Sunnyvale, CA
    1 day ago
  • $213k - $263k

     ...Senior Machine Learning Engineer, Multimodal Perception (LLM/VLM) Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building... 
    Suggested
    Full time
    Remote work

    Waymo

    Mountain View, CA
    1 day ago
  • $204k - $259k

     ...Senior Machine Learning Engineer – VLM/LLM Evaluation Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver... 
    Suggested
    Full time
    Temporary work
    Remote work

    Waymo

    Mountain View, CA
    2 days ago
  •  ...complex workflows, and continuously learn and adapt. Moveworks is...  ...with Moveworks’ Reasoning Engine and natural language capabilities...  ...Role We are looking for a Machine Learning Engineer to help build...  ...for building and serving LLM’s at Moveworks. This role will... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Servicenow

    Mountain View, CA
    3 hours ago
  • $140k - $220k

     ...ABOUT THE JOB We are looking for a Machine Learning Engineer to help build and develop our ML capabilities...  ..., automated training pipelines, and observability solutions to maintain model health...  ...Use of artificial intelligence or a LLM such as ChatGPT during the interview... 
    Work at office
    Flexible hours

    RADAR

    Sunnyvale, CA
    6 days ago
  • $181.1k - $272.1k

     ...Machine Learning Engineer - LLM Imagine what you could do here. At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish... 
    Relocation

    Apple

    Cupertino, CA
    1 day ago
  • $196k - $221k

     ...alongside industry-veteran scientists and engineers. As a Machine Learning Engineer, you'll bring your strong...  ...large-scale SID / ASR / NLP / LLM systems that power mission-critical...  ...in model performance, robustness, observability, and operational excellence using real... 
    Permanent employment

    Otter.ai

    Mountain View, CA
    1 day ago
  •  ...act, and continuously improve. As a Machine Learning Engineer , you won't just build models, you'll...  ...real-time ML serving systems Develop LLM-Powered Applications Build and optimize...  ...Python code Ensure reliability, observability, and performance across distributed systems... 

    Barker Staffing Solutions LLC

    Mountain View, CA
    5 days ago
  • $147.4k - $272.1k

     ...Machine Learning Engineer - Agentic AI The VCV organization has pioneered human-centric, real-time...  ..., and cost, and build monitoring and observability systems to understand behavior in...  ...environments. Strong proficiency with LLM-assisted coding, including using AI... 
    Relocation

    Apple

    Sunnyvale, CA
    4 days ago
  • $213k - $263k

     ...Senior ML Engineer, LLM / VLM Distillation Waymo is an autonomous driving technology company with the mission to be the world's...  ...The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    3 days ago
  • $148.91k - $252k

     ...Machine Learning Engineer - LLM, AI & Robotics Santa Clara, CA XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical... 
    Full time

    XPENG

    Santa Clara, CA
    1 day ago
  • $232k - $310k

     ...collaboration, and high standards. Our engineers, product leaders, and go-to-market teams...  ...on pushing the boundaries of applied machine learning. We work with massive datasets, tackle...  ...inference optimization (vLLM, TensorRT-LLM). Desired Skills & Experience: Research... 
    Work experience placement
    Work at office
    Remote work
    Flexible hours
    3 days per week

    Eightfold LLC

    Santa Clara, CA
    1 day ago
  • $214k - $289.5k

     ...Come join Intuit as a Senior Staff Machine Learning Engineer (MLE). Senior Staff MLEs deliver end...  ...systems are built for adaptability, observability, and secure customer data usage. Build...  ...-augmented generation (RAG), and LLM fine-tuning pipelines to accelerate product... 
    Local area

    Intuit

    Mountain View, CA
    5 days ago
  • $162.51k - $342.75k

     ...Staff Machine Learning Engineer We are Omnissa. The world is evolving quickly, and organizations...  ...testing as needed. Implement model observability, monitoring, and drift detection to ensure...  ...in machine learning, AI, and LLM technologies, and apply them to product... 
    Work experience placement
    Local area
    Flexible hours

    Omnissa

    Mountain View, CA
    3 days ago
  • $60 - $70 per hour

     ...Overview: We are seeking a Machine Learning Engineer to join a high-impact team focused on advancing LLM evaluation, NLP, and AI-driven automation. This role centers on designing scalable evaluation frameworks, optimizing prompt strategies, and building systems that... 
    Contract work
    Temporary work
    Remote work
    3 days per week

    TEKsystems

    Cupertino, CA
    6 days ago
  •  ...interacting with content, we're engineering the next generation of...  ...versatile, self-driven Staff Machine Learning Engineer to raise the bar...  ...compatibility in mind, and drive observability across system health and...  ...powering asset generation, LLM inference, and the ML-... 
    Full time
    Temporary work
    Work at office
    Worldwide
    Relocation package

    Unity Technologies

    Mountain View, CA
    3 hours ago
  •  ...and personalized agent experiences. Knowledge and passion in machine learning algorithms, Gen AI, LLMs, and natural language processing (NLP...  ...strategies (QLORA, DPO) and inference optimization (vLLM, TensorRT-LLM). Research experience in agentic AI or related fields.... 
    Full time
    Work experience placement

    Eightfold

    Santa Clara, CA
    3 hours ago
  • $150k

     ...Machine Learning Engineer About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and...  ...during large scale data processing ~ Hands-on experience with LLM algorithms, such as Supervised Fine-Tuning (SFT) and... 
    Worldwide
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    1 day ago
  • $200k - $300k

     ...platform. With over 100 enterprise SaaS connectors, flexible LLM choice, and robust APIs, Glean gives organizations the...  ...every company. About the Role: Glean is seeking a few Machine Learning engineers who want to focus on a combination of Quality and traditional... 
    Work at office
    Home office
    Flexible hours

    Glean.info

    Mountain View, CA
    3 days ago
  •  ...Jayaraman from Info Way Solutions, LLC We have job opening for Machine Learning Engineer and the detailed Job description is given below: Kindly...  ...data pipelines using python • Experience with GenAI/ LLM. Thanks & Regards, Jayaraman Email: ****@*****.***... 

    Info Way Solutions

    Sunnyvale, CA
    4 days ago
  •  ...Criteria (Must Haves) • Top 3 Required skills: Machine Learning, Gen AI, Python • Years of experience in...  ...Create, test, and refine prompts (prompt engineering) including system instructions, chains, and templates to improve LLM output quality and reliability.... 
    Hourly pay
    Permanent employment
    Work at office
    Remote work
    3 days per week

    eTeam

    Sunnyvale, CA
    5 days ago
  •  ...Role Overview: As a Machine Learning Engineer, you will play a central role in translating cutting-edge machine learning research into scalable...  ...in inference time optimization, deep understanding of LLM serving optimizations for LLMs/VLMs. Hands on experience... 

    Nace AI

    Palo Alto, CA
    1 day ago
  • $213k - $263k

     ...Machine Learning Engineer, Runtime & Optimization Waymo is an autonomous driving technology company with the mission to be the world's most trusted...  ...efficient deep learning techniques for foundation models or LLM. Experience with GPU HW or TPU HW and related system... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    5 days ago
  •  ...Summary We are seeking a highly experienced Machine Learning Engineer to build, deploy, and optimize Large Language Model (LLM)-based applications, with a strong emphasis...  ...Qualifications Hands-on experience with observability and evaluation tools for LLMs. Solid... 

    Apple

    Cupertino, CA
    4 days ago
  • $108k - $170k

    About Us Observe.AI is the AI Agents platform for customer experience...  ...We’re looking for an AI Agent Engineer to lead the charge in building...  ...Write and refine prompts for LLM‑driven agents, monitor performance...  ...stipend to support wellness, learning and professional development,... 
    Full time
    Work at office
    Local area
    Remote work
    Flexible hours

    AI Chopping Block, Inc.

    Redwood City, CA
    5 days ago
  •  ...Description We are seeking an experienced GenAI engineer to join our seasoned founding team to...  ...distributed infrastructure to support machine learning training, inference, and evaluation....  ...with real‑time responsiveness and edge‑llm deployment. Compensation Competitive compensation... 

    Spector.ai

    Mountain View, CA
    5 days ago
  •  ...video experiences at Apple. The Video Engineering group develops key image and video technologies...  ...many Apple products and services. As a Machine Learning Engineer, you'll be developing...  ...Ability to leverage AI coding assistants and LLM-based tools to accelerate development,... 

    Apple

    Sunnyvale, CA
    5 days ago
  • $213k - $263k

     ...Senior Machine Learning Engineer, Runtime and Serving Waymo is an autonomous driving technology company with the mission to be the world's most...  ...Runtime, OpenXLA/PjRT, TVM). Experience building or scaling LLM serving systems, including expertise in distributed... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning Engineer, LLM Evals & Observability. Be the first to apply!