Machine Learning Engineer, LLM Evals & Observability

$200k - $300k

Glean.info

About Glean:

Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry's most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI agents on one secure, open platform. With over 100 enterprise SaaS connectors, flexible LLM choice, and robust APIs, Glean gives organizations the infrastructure to govern, scale, and customize AI across their entire business - without vendor lock-in or costly implementation cycles.

At its core, Glean is redefining how enterprises find, use, and act on knowledge. Its Enterprise Graph and Personal Knowledge Graph map the relationships between people, content, and activity, delivering deeply personalized, context-aware responses for every employee. This foundation powers Glean's agentic capabilities - AI agents that automate real work across teams by accessing the industry's broadest range of data: enterprise and world, structured and unstructured, historical and real-time. The result: measurable business impact through faster onboarding, hours of productivity gained each week, and smarter, safer decisions at every level.

Recognized by Fast Company as one of the World's Most Innovative Companies (Top 10, 2025), by CNBC's Disruptor 50, Bloomberg's AI Startups to Watch (2026), Forbes AI 50, and Gartner's Tech Innovators in Agentic AI, Glean continues to accelerate its global impact. With customers across 50+ industries and 1,000+ employees in more than 25 countries, we're helping the world's largest organizations make every employee AI-fluent, and turning the superintelligent enterprise from concept into reality.

If you're excited to shape how the world works, you'll help build systems used daily across Microsoft Teams, Zoom, ServiceNow, Zendesk, GitHub, and many more - deeply embedded where people get things done. You'll ship agentic capabilities on an open, extensible stack, with the craft and care required for enterprise trust, as we bring Work AI to every employee, in every company.

About the Role:

Building a great AI assistant is only half the battle - knowing whether it's actually great is the other half. Our team owns the measurement and quality layer that makes Glean's Assistant and Agents reliably better over time: evaluation pipelines, quality eval-sets, LLM-powered judges, agent observability, and the tooling engineers use to understand what changed and why. It's a rare combination of infrastructure engineering, applied ML, and direct product impact. If you care deeply about quality and want to build the systems that make it measurable, this role is for you.

You will:

Design and curate evaluation datasets - sampling strategies, query diversity, and golden sets that give reliable, representative coverage of real assistant behavior.
Build and maintain large-scale evaluation pipelines that measure assistant quality across thousands of real user queries.
Build LLM-powered judges that score metrics like correctness, completeness, and response quality, and align them against human judgment.
Evaluate new models and product changes before they ship - providing the quality signal that gates launches and prevents regressions.
Build observability infrastructure for AI agents: trace enrichment, data pipelines, and dashboards that make assistant behavior inspectable.
Close the loop between quality measurement and improvement using eval results, customer feedback, and techniques like automated prompt iteration to help drive concrete gains in assistant behavior.
Collaborate with engineers across the company to make evals a first-class part of how we ship.

About you:

2+ years of software engineering experience with strong coding skills.
Strong backend fundamentals in Go and Python; comfortable with distributed data pipelines.
Experience working with LLM evaluation, reinforcement learning from human feedback, natural language processing, or other large systems involving machine learning.
Analytically rigorous - you think carefully about what offline metrics actually predict about real user experience.
Thrive in a customer-focused, tight-knit, and cross-functional environment - you are a team player, willing to take on whatever is most impactful for the company.
You care about quality - not just in the systems you build, but in the product you're helping measure and improve.

Location:

This role is hybrid (3-4 days a week in one of our SF Bay Area offices).

Compensation & Benefits:

The standard base salary range for this position is $200,000 - $300,000 annually. Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for variable compensation, equity, and benefits.

We offer a comprehensive benefits package including competitive compensation, Medical, Vision, and Dental coverage, a generous time-off policy, and the opportunity to contribute to your 401k plan to support your long-term goals. When you join, you'll receive a home office improvement stipend, as well as annual education and wellness stipends to support your growth and wellbeing. We foster a vibrant company culture through regular events, and provide healthy lunches daily to keep you fueled and focused.

We are a diverse bunch of people and we want to continue to attract and retain a diverse range of people into our organization. We're committed to an inclusive and diverse company. We do not discriminate based on gender, ethnicity, sexual orientation, religion, civil or family status, age, disability, or race.

AI-First Mindset at Glean:

At Glean, AI fluency is core to how we work and we're committed to ensuring every new hire feels confident integrating AI into their everyday work. As part of the interview process, you'll complete a brief AI-focused exercise or discussion so we can understand how you think about, design, and use AI to drive impact in your role. Feel free to reference any tools, platforms, or workflows you use today - prior Glean experience isn't required.

Global Data Privacy Notice for Job Candidates and Applicants:

Depending on your location, the General Data Protection Regulation (GDPR), California Consumer Privacy Act (CCPA), or other privacy laws may regulate the way we manage the data of job applicants. Our full notice outlining how data will be processed as part of the application procedure for applicable locations is available in our Privacy Policy. By submitting your application, you are agreeing to our use and processing of your data as required. US applicants and their applications are subject to arbitration of disputes as outlined in our Applicant Arbitration Agreement.

By clicking "Submit Application," I confirm that I have read the Global Data Privacy Notice and the Applicant Arbitration Agreement, and I agree to the terms.

Vacancy posted 4 hours ago
