Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Machine Learning Engineer, LLM Evals & Observability

$200k - $300k

Glean.info

About Glean:

Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry's most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI agents on one secure, open platform. With over 100 enterprise SaaS connectors, flexible LLM choice, and robust APIs, Glean gives organizations the infrastructure to govern, scale, and customize AI across their entire business - without vendor lock-in or costly implementation cycles.

At its core, Glean is redefining how enterprises find, use, and act on knowledge. Its Enterprise Graph and Personal Knowledge Graph map the relationships between people, content, and activity, delivering deeply personalized, context-aware responses for every employee. This foundation powers Glean's agentic capabilities - AI agents that automate real work across teams by accessing the industry's broadest range of data: enterprise and world, structured and unstructured, historical and real-time. The result: measurable business impact through faster onboarding, hours of productivity gained each week, and smarter, safer decisions at every level.

Recognized by Fast Company as one of the World's Most Innovative Companies (Top 10, 2025), by CNBC's Disruptor 50, Bloomberg's AI Startups to Watch (2026), Forbes AI 50, and Gartner's Tech Innovators in Agentic AI, Glean continues to accelerate its global impact. With customers across 50+ industries and 1,000+ employees in more than 25 countries, we're helping the world's largest organizations make every employee AI-fluent, and turning the superintelligent enterprise from concept into reality.

If you're excited to shape how the world works, you'll help build systems used daily across Microsoft Teams, Zoom, ServiceNow, Zendesk, GitHub, and many more - deeply embedded where people get things done. You'll ship agentic capabilities on an open, extensible stack, with the craft and care required for enterprise trust, as we bring Work AI to every employee, in every company.

About the Role:

Building a great AI assistant is only half the battle - knowing whether it's actually great is the other half. Our team owns the measurement and quality layer that make Glean's Assistant and Agents reliably better over time: evaluation pipelines, quality eval-sets, LLM-powered judges, agent observability, and the tooling engineers use to understand what changed and why. It's a rare combination of infrastructure engineering, applied ML, and direct product impact. If you care deeply about quality and want to build the systems that make it measurable, this role is for you.

You will:
  • Design and curate evaluation datasets - sampling strategies, query diversity, and golden sets that give reliable, representative coverage of real assistant behavior.
  • Build and maintain large-scale evaluation pipelines that measure assistant quality across thousands of real user queries.
  • Build LLM-powered judges that score metrics like correctness, completeness, and response quality, and align them against human judgment.
  • Evaluate new models and product changes before they ship - providing the quality signal that gates launches and prevents regressions.
  • Build observability infrastructure for AI agents: trace enrichment, data pipelines, and dashboards that make assistant behavior inspectable.
  • Close the loop between quality measurement and improvement using eval results, customer feedback, and techniques like automated prompt iteration to help drive concrete gains in assistant behavior.
  • Collaborate with engineers across the company to make evals a first-class part of how we ship.
About you:
  • 2+ years of software engineering experience with strong coding skills.
  • Strong backend fundamentals in Go and Python; comfortable with distributed data pipelines.
  • Experience working with LLM evaluation, reinforcement learning from human feedback, natural language processing, or other large systems involving machine learning.
  • Analytically rigorous - you think carefully about what offline metrics actually predict about real user experience.
  • Thrive in a customer-focused, tight-knit and cross-functional environment - being a team player and willing to take on whatever is most impactful for the company
  • You care about quality - not just in the systems you build, but in the product you're helping measure and improve.
Location:
  • This role is hybrid (3-4 days a week in one of our SF Bay Area offices)

Compensation & Benefits:

The standard base salary range for this position is $200,000 - $300,000 annually. Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for variable compensation, equity, and benefits.

We offer a comprehensive benefits package including competitive compensation, Medical, Vision, and Dental coverage, generous time-off policy, and the opportunity to contribute to your 401k plan to support your long-term goals. When you join, you'll receive a home office improvement stipend, as well as an annual education and wellness stipends to support your growth and wellbeing. We foster a vibrant company culture through regular events, and provide healthy lunches daily to keep you fueled and focused.

We are a diverse bunch of people and we want to continue to attract and retain a diverse range of people into our organization. We're committed to an inclusive and diverse company. We do not discriminate based on gender, ethnicity, sexual orientation, religion, civil or family status, age, disability, or race.

#LI-HYBRID

AI-First Mindset at Glean:

At Glean, AI fluency is core to how we work and we're committed to ensuring every new hire feels confident integrating AI into their everyday work. As part of the interview process, you'll complete a brief AI-focused exercise or discussion so we can understand how you think about, design, and use AI to drive impact in your role. Feel free to reference any tools, platforms, or workflows you use today - prior Glean experience isn't required.

Global Data Privacy Notice for Job Candidates and Applicants:

Depending on your location, the General Data Protection Regulation (GDPR), California Consumer Privacy Act (CCPA), or other privacy laws may regulate the way we manage the data of job applicants. Our full notice outlining how data will be processed as part of the application procedure for applicable locations is available in our Privacy Policy . By submitting your application, you are agreeing to our use and processing of your data as required. US applicants and their applications are subject to arbitration of disputes as outlined in our Applicant Arbitration Agreement .

By clicking "Submit Application," I confirm that I have read the Global Data Privacy Notice and the Applicant Arbitration Agreement , and I agree to the terms.
Vacancy posted 4 hours ago
Similar jobs that could be interesting for youBased on the Machine Learning Engineer, LLM Evals & Observability in Mountain View, CA vacancy
  • $200k - $300k

     ...Assistant Quality Engineer Glean is the Work...  ..., flexible LLM choice, and robust...  ...powered judges, agent observability, and the tooling engineers...  ...company to make evals a first-class part...  ..., reinforcement learning from human...  ...systems involving machine learning. ~ Analytically... 
    Suggested
    Home office
    Flexible hours
    3 days per week

    Colorwave Inc

    Mountain View, CA
    12 hours ago
  •  ...world running. Our Team's Vision: Our Engineering team is shaping the future of...  ...models and prompt engineering. Agentic observability using Langfuse, Evals frameworks for Testing/Resilience. Bonus...  ...at scale. AI Ops: Experience with LLM deployment optimization (e.g., vLLM,... 
    Suggested
    Immediate start

    Illumio

    Sunnyvale, CA
    12 hours ago
  • $170k - $216k

     ...Machine Learning Engineer Perception LLM/VLM (PhD, New Grad) Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the... 
    Suggested
    Full time
    Remote work

    Waymo

    Mountain View, CA
    4 days ago
  • $150k - $230k

     ...About the Role We are looking for a hands‑on Machine Learning Engineer to drive the post‑training of our large language models, with a strong emphasis...  ...into working, production‑ready code. Requirements Hands‑on LLM post‑training experience. You have personally run CPT, SFT,... 
    Suggested
    Full time

    GoToMeeting

    Mountain View, CA
    1 day ago
  • $210k - $350k

     ...Rewards, and Great Careers. Distinguished Engineer GEICO is seeking a Distinguished...  ...Generative AI applications, including LLM orchestration(e.g., LangChain, LlamaIndex...  ...workflows, RAG systems, and evaluation/observability tooling, while partnering on underling AI... 
    Suggested
    Hourly pay
    Work experience placement
    Local area

    GEICO

    Palo Alto, CA
    1 day ago
  •  ...act, and continuously improve. As a Machine Learning Engineer , you won't just build models, you'll...  ...real-time ML serving systems Develop LLM-Powered Applications Build and optimize...  ...Python code Ensure reliability, observability, and performance across distributed systems... 

    Barker Staffing Solutions LLC

    Mountain View, CA
    4 days ago
  • $145k - $165k

     ...optimization. Our mission is to apply machine learning to enhance user experiences,...  ...roles: Machine Learning Engineers (this role) who focus on...  ...automated evaluation Exposure to observability and monitoring for ML systems Exposure to LLM-related use cases or applied... 
    Work experience placement
    Casual work
    Work at office

    Match Group

    Palo Alto, CA
    12 hours ago
  • $210k - $350k

     ...Distinguished Engineer (AI Platforms) to join our AI organization. This individual will serve...  ...for Generative AI applications, including LLM orchestration (LangChain, LlamaIndex), agentic workflows, RAG systems, and observability tooling. Ensure architectural consistency... 
    Local area

    Government Employees Insurance Company

    Palo Alto, CA
    4 days ago
  • $196k - $221k

     ...alongside industry-veteran scientists and engineers. As a Machine Learning Engineer, you'll bring your strong...  ...large-scale SID / ASR / NLP / LLM systems that power mission-critical...  ...in model performance, robustness, observability, and operational excellence using real... 
    Permanent employment

    Otter.ai

    Mountain View, CA
    3 days ago
  • $188.5k - $282.7k

     ...Rubrik's Semantic AI Governance Engine, which is the first system...  .... At its core, SAGE is "LLM-as-judge" applied to AI...  ...path with the right latency, observability, rollback, and tenancy guarantees...  ...) in Computer Science, Machine Learning, Computer Engineering, Statistics... 
    Permanent employment
    Local area

    Rubrik

    Palo Alto, CA
    1 day ago
  • $213k - $263k

     ...Senior ML Engineer, LLM / VLM Distillation Waymo is an autonomous driving technology company with the mission to be the world's...  ...The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    4 days ago
  • $148.91k - $252k

     ...Machine Learning Engineer - LLM, AI & Robotics Santa Clara, CA XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical... 
    Full time

    XPENG

    Santa Clara, CA
    2 days ago
  • $214k - $289.5k

     ...Senior Staff Machine Learning Engineer Category: Software Engineering Location: Mountain View, California...  ...systems are built for adaptability, observability, and secure customer data usage....  ...retrieval‑augmented generation (RAG), and LLM fine‑tuning pipelines to accelerate... 
    Worldwide

    ATX Venture Partners

    Mountain View, CA
    4 days ago
  •  ...personalized agent experiences. Qualifications Knowledge and passion in machine learning algorithms, Gen AI, LLMs, and natural language processing (NLP...  ...(QLORA, DPO) and inference optimization (vLLM, TensorRT‑LLM). Research experience in agentic AI or related fields.... 
    Work experience placement

    Nutanix

    Santa Clara, CA
    4 days ago
  • $238k - $302k

     ...The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards...  ...schedule and you will report to a Senior Staff Software Engineer. You will: * Work with a creative team of people who help... 
    Full time
    Temporary work
    Remote work

    Waymo

    Mountain View, CA
    15 days ago
  • $162.51k - $342.75k

     ...Staff Machine Learning Engineer We are Omnissa. The world is evolving quickly, and organizations...  ...testing as needed. Implement model observability, monitoring, and drift detection to ensure...  ...in machine learning, AI, and LLM technologies, and apply them to product... 
    Work experience placement
    Local area
    Flexible hours

    Omnissa

    Mountain View, CA
    4 days ago
  • $200k - $275k

     ...What You Will Do We are looking for a Machine Learning Engineer to help build cutting edge ML infrastructure for building and serving LLM’s  at Moveworks. This role will be critical in building, optimizing and scaling end-to-end machine learning systems. The ML infra... 
    Full time

    Moveworks

    Mountain View, CA
    more than 2 months ago
  •  ...Description We are seeking an experienced GenAI engineer to join our seasoned founding team to...  ...distributed infrastructure to support machine learning training, inference, and evaluation....  ...with real‑time responsiveness and edge‑llm deployment. Compensation Competitive compensation... 

    Spector.ai

    Mountain View, CA
    12 hours ago
  • $150k

     ...researchers, data scientists, and engineers, tackling the most...  ...performance computing in deep learning, driving impactful discoveries...  ...pioneers. The Role As a Machine Learning Engineer at the Institute...  ...~ Hands‑on experience with LLM algorithms, such as Supervised... 
    Worldwide
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    4 days ago
  •  ...Role Overview: As a Machine Learning Engineer, you will play a central role in translating cutting-edge machine learning research into scalable...  ...in inference time optimization, deep understanding of LLM serving optimizations for LLMs/VLMs. Hands on experience... 

    Nace AI

    Palo Alto, CA
    3 days ago
  • $175.5k - $219.5k

     ...Machine Learning Engineer III, Search Relevance Box (NYSE:BOX) is the leader in Intelligent Content...  ...embeddings, semantic/hybrid search, and LLM-enabled retrieval under mentorship and...  ...and ****@*****.***. Develop reliable, observable microservices and near real-time... 
    Live in
    Work at office
    Shift work
    3 days per week

    Box

    Redwood City, CA
    4 days ago
  •  ...collaboration, and high standards. Our engineers, product leaders, and go-to-...  ...and deploy cutting-edge deep learning models across all Eightfold...  ...Knowledge and passion in machine learning algorithms, Gen AI,...  ...optimization (vLLM, TensorRT-LLM). Desired Skills &... 
    Work experience placement
    Work at office
    Remote work
    Flexible hours
    3 days per week

    Eightfold LLC

    Santa Clara, CA
    2 days ago
  • $213k - $263k

     ...Machine Learning Engineer, Runtime & Optimization Waymo is an autonomous driving technology company with the mission to be the world's most trusted...  ...efficient deep learning techniques for foundation models or LLM. Experience with GPU HW or TPU HW and related system... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    1 day ago
  • $200k - $300k

     ...platform. With over 100 enterprise SaaS connectors, flexible LLM choice, and robust APIs, Glean gives organizations the...  ...every company. About the Role: Glean is seeking a few Machine Learning engineers who want to focus on a combination of Quality and traditional... 
    Work at office
    Home office
    Flexible hours

    Glean.info

    Mountain View, CA
    4 days ago
  •  ...building the world's most reliable on-demand logistics engine for delivery! We are bringing on a talented Machine Learning Engineer to help us develop and improve the ETA...  ...Learning, Operations Research/Optimization, and LLM. Experience shipping production‑grade ML models... 

    Fairygodboss

    Sunnyvale, CA
    12 hours ago
  •  ...Jayaraman from Info Way Solutions, LLC We have job opening for Machine Learning Engineer and the detailed Job description is given below: Kindly...  ...data pipelines using python • Experience with GenAI/ LLM. Thanks & Regards, Jayaraman Email: ****@*****.***... 

    Info Way Solutions

    Sunnyvale, CA
    4 hours ago
  • $140k - $220k

     ...feedback and needs. ABOUT THE JOB We are looking for a Machine Learning Engineer to help build and develop our ML capabilities at RADAR....  ...model monitoring, automated training pipelines, and observability solutions to maintain model health and performance Accelerate... 
    Work at office
    Flexible hours

    Radar

    Sunnyvale, CA
    1 day ago
  • $115k - $230k

     ...Overview We are seeking an accomplished Senior ML Engineer to serve as an individual contributor to the generative AI space at GEICO. In...  ...solutions such as Azure and AWS. 2 years with tools that power LLM‑based AI agents: evaluation frameworks, agent tooling, RAG pipelines... 
    Local area

    Government Employees Insurance Company

    Palo Alto, CA
    12 hours ago
  • $195k - $230k

     ...! For more information, visit About the Role We are looking for a Senior Machine Learning Engineer to help evolve our large-scale recommendation systems and apply AI / LLM technologies to real-world production problems. You will work on core feed, retrieval... 
    Full time
    Local area
    Work from home

    NewsBreak

    Mountain View, CA
    1 day ago
  • $120k - $215k

     ...Senior Machine Learning Engineer – Fine-Tuning and On-device AI Palo Alto, CA Who We Are HP IQ is HP's new AI innovation lab. Combining...  ...in applied machine learning, including at least 3 years in LLM fine-tuning. ~ Proficiency in Python and ML frameworks ecosystem... 
    Full time
    Temporary work
    Local area
    Flexible hours

    HP IQ

    Palo Alto, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning Engineer, LLM Evals & Observability. Be the first to apply!