Machine Learning Engineer, LLM Evals & Observability
$200k - $300kGlean.info
About Glean: Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry's most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI agents on one secure, open platform. With over 100 enterprise SaaS connectors, flexible LLM choice, and robust APIs, Glean gives organizations the infrastructure to govern, scale, and customize AI across their entire business - without vendor lock-in or costly implementation cycles. At its core, Glean is redefining how enterprises find, use, and act on knowledge. Its Enterprise Graph and Personal Knowledge Graph map the relationships between people, content, and activity, delivering deeply personalized, context-aware responses for every employee. This foundation powers Glean's agentic capabilities - AI agents that automate real work across teams by accessing the industry's broadest range of data: enterprise and world, structured and unstructured, historical and real-time. The result: measurable business impact through faster onboarding, hours of productivity gained each week, and smarter, safer decisions at every level. Recognized by Fast Company as one of the World's Most Innovative Companies (Top 10, 2025), by CNBC's Disruptor 50, Bloomberg's AI Startups to Watch (2026), Forbes AI 50, and Gartner's Tech Innovators in Agentic AI, Glean continues to accelerate its global impact. With customers across 50+ industries and 1,000+ employees in more than 25 countries, we're helping the world's largest organizations make every employee AI-fluent, and turning the superintelligent enterprise from concept into reality. If you're excited to shape how the world works, you'll help build systems used daily across Microsoft Teams, Zoom, ServiceNow, Zendesk, GitHub, and many more - deeply embedded where people get things done. You'll ship agentic capabilities on an open, extensible stack, with the craft and care required for enterprise trust, as we bring Work AI to every employee, in every company. About the Role: Building a great AI assistant is only half the battle - knowing whether it's actually great is the other half. Our team owns the measurement and quality layer that make Glean's Assistant and Agents reliably better over time: evaluation pipelines, quality eval-sets, LLM-powered judges, agent observability, and the tooling engineers use to understand what changed and why. It's a rare combination of infrastructure engineering, applied ML, and direct product impact. If you care deeply about quality and want to build the systems that make it measurable, this role is for you. You will:
Compensation & Benefits: The standard base salary range for this position is $200,000 - $300,000 annually. Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for variable compensation, equity, and benefits. We offer a comprehensive benefits package including competitive compensation, Medical, Vision, and Dental coverage, generous time-off policy, and the opportunity to contribute to your 401k plan to support your long-term goals. When you join, you'll receive a home office improvement stipend, as well as an annual education and wellness stipends to support your growth and wellbeing. We foster a vibrant company culture through regular events, and provide healthy lunches daily to keep you fueled and focused. We are a diverse bunch of people and we want to continue to attract and retain a diverse range of people into our organization. We're committed to an inclusive and diverse company. We do not discriminate based on gender, ethnicity, sexual orientation, religion, civil or family status, age, disability, or race. #LI-HYBRID AI-First Mindset at Glean: At Glean, AI fluency is core to how we work and we're committed to ensuring every new hire feels confident integrating AI into their everyday work. As part of the interview process, you'll complete a brief AI-focused exercise or discussion so we can understand how you think about, design, and use AI to drive impact in your role. Feel free to reference any tools, platforms, or workflows you use today - prior Glean experience isn't required. Global Data Privacy Notice for Job Candidates and Applicants: Depending on your location, the General Data Protection Regulation (GDPR), California Consumer Privacy Act (CCPA), or other privacy laws may regulate the way we manage the data of job applicants. Our full notice outlining how data will be processed as part of the application procedure for applicable locations is available in our Privacy Policy . By submitting your application, you are agreeing to our use and processing of your data as required. US applicants and their applications are subject to arbitration of disputes as outlined in our Applicant Arbitration Agreement . By clicking "Submit Application," I confirm that I have read the Global Data Privacy Notice and the Applicant Arbitration Agreement , and I agree to the terms.
- Design and curate evaluation datasets - sampling strategies, query diversity, and golden sets that give reliable, representative coverage of real assistant behavior.
- Build and maintain large-scale evaluation pipelines that measure assistant quality across thousands of real user queries.
- Build LLM-powered judges that score metrics like correctness, completeness, and response quality, and align them against human judgment.
- Evaluate new models and product changes before they ship - providing the quality signal that gates launches and prevents regressions.
- Build observability infrastructure for AI agents: trace enrichment, data pipelines, and dashboards that make assistant behavior inspectable.
- Close the loop between quality measurement and improvement using eval results, customer feedback, and techniques like automated prompt iteration to help drive concrete gains in assistant behavior.
- Collaborate with engineers across the company to make evals a first-class part of how we ship.
- 2+ years of software engineering experience with strong coding skills.
- Strong backend fundamentals in Go and Python; comfortable with distributed data pipelines.
- Experience working with LLM evaluation, reinforcement learning from human feedback, natural language processing, or other large systems involving machine learning.
- Analytically rigorous - you think carefully about what offline metrics actually predict about real user experience.
- Thrive in a customer-focused, tight-knit and cross-functional environment - being a team player and willing to take on whatever is most impactful for the company
- You care about quality - not just in the systems you build, but in the product you're helping measure and improve.
- This role is hybrid (3-4 days a week in one of our SF Bay Area offices)
Compensation & Benefits: The standard base salary range for this position is $200,000 - $300,000 annually. Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for variable compensation, equity, and benefits. We offer a comprehensive benefits package including competitive compensation, Medical, Vision, and Dental coverage, generous time-off policy, and the opportunity to contribute to your 401k plan to support your long-term goals. When you join, you'll receive a home office improvement stipend, as well as an annual education and wellness stipends to support your growth and wellbeing. We foster a vibrant company culture through regular events, and provide healthy lunches daily to keep you fueled and focused. We are a diverse bunch of people and we want to continue to attract and retain a diverse range of people into our organization. We're committed to an inclusive and diverse company. We do not discriminate based on gender, ethnicity, sexual orientation, religion, civil or family status, age, disability, or race. #LI-HYBRID AI-First Mindset at Glean: At Glean, AI fluency is core to how we work and we're committed to ensuring every new hire feels confident integrating AI into their everyday work. As part of the interview process, you'll complete a brief AI-focused exercise or discussion so we can understand how you think about, design, and use AI to drive impact in your role. Feel free to reference any tools, platforms, or workflows you use today - prior Glean experience isn't required. Global Data Privacy Notice for Job Candidates and Applicants: Depending on your location, the General Data Protection Regulation (GDPR), California Consumer Privacy Act (CCPA), or other privacy laws may regulate the way we manage the data of job applicants. Our full notice outlining how data will be processed as part of the application procedure for applicable locations is available in our Privacy Policy . By submitting your application, you are agreeing to our use and processing of your data as required. US applicants and their applications are subject to arbitration of disputes as outlined in our Applicant Arbitration Agreement . By clicking "Submit Application," I confirm that I have read the Global Data Privacy Notice and the Applicant Arbitration Agreement , and I agree to the terms.
Vacancy posted 4 hours ago
Similar jobs that could be interesting for youBased on the Machine Learning Engineer, LLM Evals & Observability in Mountain View, CA vacancy
$200k - $300k
...Assistant Quality Engineer Glean is the Work... ..., flexible LLM choice, and robust... ...powered judges, agent observability, and the tooling engineers... ...company to make evals a first-class part... ..., reinforcement learning from human... ...systems involving machine learning. ~ Analytically...SuggestedHome officeFlexible hours3 days per week- ...world running. Our Team's Vision: Our Engineering team is shaping the future of... ...models and prompt engineering. Agentic observability using Langfuse, Evals frameworks for Testing/Resilience. Bonus... ...at scale. AI Ops: Experience with LLM deployment optimization (e.g., vLLM,...SuggestedImmediate start
$170k - $216k
...Machine Learning Engineer Perception LLM/VLM (PhD, New Grad) Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the...SuggestedFull timeRemote work$150k - $230k
...About the Role We are looking for a hands‑on Machine Learning Engineer to drive the post‑training of our large language models, with a strong emphasis... ...into working, production‑ready code. Requirements Hands‑on LLM post‑training experience. You have personally run CPT, SFT,...SuggestedFull time$210k - $350k
...Rewards, and Great Careers. Distinguished Engineer GEICO is seeking a Distinguished... ...Generative AI applications, including LLM orchestration(e.g., LangChain, LlamaIndex... ...workflows, RAG systems, and evaluation/observability tooling, while partnering on underling AI...SuggestedHourly payWork experience placementLocal area- ...act, and continuously improve. As a Machine Learning Engineer , you won't just build models, you'll... ...real-time ML serving systems Develop LLM-Powered Applications Build and optimize... ...Python code Ensure reliability, observability, and performance across distributed systems...
$145k - $165k
...optimization. Our mission is to apply machine learning to enhance user experiences,... ...roles: Machine Learning Engineers (this role) who focus on... ...automated evaluation Exposure to observability and monitoring for ML systems Exposure to LLM-related use cases or applied...Work experience placementCasual workWork at office$210k - $350k
...Distinguished Engineer (AI Platforms) to join our AI organization. This individual will serve... ...for Generative AI applications, including LLM orchestration (LangChain, LlamaIndex), agentic workflows, RAG systems, and observability tooling. Ensure architectural consistency...Local area$196k - $221k
...alongside industry-veteran scientists and engineers. As a Machine Learning Engineer, you'll bring your strong... ...large-scale SID / ASR / NLP / LLM systems that power mission-critical... ...in model performance, robustness, observability, and operational excellence using real...Permanent employment$188.5k - $282.7k
...Rubrik's Semantic AI Governance Engine, which is the first system... .... At its core, SAGE is "LLM-as-judge" applied to AI... ...path with the right latency, observability, rollback, and tenancy guarantees... ...) in Computer Science, Machine Learning, Computer Engineering, Statistics...Permanent employmentLocal area$213k - $263k
...Senior ML Engineer, LLM / VLM Distillation Waymo is an autonomous driving technology company with the mission to be the world's... ...The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards...Full timeRemote work$148.91k - $252k
...Machine Learning Engineer - LLM, AI & Robotics Santa Clara, CA XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical...Full time$214k - $289.5k
...Senior Staff Machine Learning Engineer Category: Software Engineering Location: Mountain View, California... ...systems are built for adaptability, observability, and secure customer data usage.... ...retrieval‑augmented generation (RAG), and LLM fine‑tuning pipelines to accelerate...Worldwide- ...personalized agent experiences. Qualifications Knowledge and passion in machine learning algorithms, Gen AI, LLMs, and natural language processing (NLP... ...(QLORA, DPO) and inference optimization (vLLM, TensorRT‑LLM). Research experience in agentic AI or related fields....Work experience placement
$238k - $302k
...The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards... ...schedule and you will report to a Senior Staff Software Engineer. You will: * Work with a creative team of people who help...Full timeTemporary workRemote work$162.51k - $342.75k
...Staff Machine Learning Engineer We are Omnissa. The world is evolving quickly, and organizations... ...testing as needed. Implement model observability, monitoring, and drift detection to ensure... ...in machine learning, AI, and LLM technologies, and apply them to product...Work experience placementLocal areaFlexible hours$200k - $275k
...What You Will Do We are looking for a Machine Learning Engineer to help build cutting edge ML infrastructure for building and serving LLM’s at Moveworks. This role will be critical in building, optimizing and scaling end-to-end machine learning systems. The ML infra...Full time- ...Description We are seeking an experienced GenAI engineer to join our seasoned founding team to... ...distributed infrastructure to support machine learning training, inference, and evaluation.... ...with real‑time responsiveness and edge‑llm deployment. Compensation Competitive compensation...
$150k
...researchers, data scientists, and engineers, tackling the most... ...performance computing in deep learning, driving impactful discoveries... ...pioneers. The Role As a Machine Learning Engineer at the Institute... ...~ Hands‑on experience with LLM algorithms, such as Supervised...WorldwideVisa sponsorship- ...Role Overview: As a Machine Learning Engineer, you will play a central role in translating cutting-edge machine learning research into scalable... ...in inference time optimization, deep understanding of LLM serving optimizations for LLMs/VLMs. Hands on experience...
$175.5k - $219.5k
...Machine Learning Engineer III, Search Relevance Box (NYSE:BOX) is the leader in Intelligent Content... ...embeddings, semantic/hybrid search, and LLM-enabled retrieval under mentorship and... ...and ****@*****.***. Develop reliable, observable microservices and near real-time...Live inWork at officeShift work3 days per week- ...collaboration, and high standards. Our engineers, product leaders, and go-to-... ...and deploy cutting-edge deep learning models across all Eightfold... ...Knowledge and passion in machine learning algorithms, Gen AI,... ...optimization (vLLM, TensorRT-LLM). Desired Skills &...Work experience placementWork at officeRemote workFlexible hours3 days per week
$213k - $263k
...Machine Learning Engineer, Runtime & Optimization Waymo is an autonomous driving technology company with the mission to be the world's most trusted... ...efficient deep learning techniques for foundation models or LLM. Experience with GPU HW or TPU HW and related system...Full timeRemote work$200k - $300k
...platform. With over 100 enterprise SaaS connectors, flexible LLM choice, and robust APIs, Glean gives organizations the... ...every company. About the Role: Glean is seeking a few Machine Learning engineers who want to focus on a combination of Quality and traditional...Work at officeHome officeFlexible hours- ...building the world's most reliable on-demand logistics engine for delivery! We are bringing on a talented Machine Learning Engineer to help us develop and improve the ETA... ...Learning, Operations Research/Optimization, and LLM. Experience shipping production‑grade ML models...
- ...Jayaraman from Info Way Solutions, LLC We have job opening for Machine Learning Engineer and the detailed Job description is given below: Kindly... ...data pipelines using python • Experience with GenAI/ LLM. Thanks & Regards, Jayaraman Email: ****@*****.***...
$140k - $220k
...feedback and needs. ABOUT THE JOB We are looking for a Machine Learning Engineer to help build and develop our ML capabilities at RADAR.... ...model monitoring, automated training pipelines, and observability solutions to maintain model health and performance Accelerate...Work at officeFlexible hours$115k - $230k
...Overview We are seeking an accomplished Senior ML Engineer to serve as an individual contributor to the generative AI space at GEICO. In... ...solutions such as Azure and AWS. 2 years with tools that power LLM‑based AI agents: evaluation frameworks, agent tooling, RAG pipelines...Local area$195k - $230k
...! For more information, visit About the Role We are looking for a Senior Machine Learning Engineer to help evolve our large-scale recommendation systems and apply AI / LLM technologies to real-world production problems. You will work on core feed, retrieval...Full timeLocal areaWork from home$120k - $215k
...Senior Machine Learning Engineer – Fine-Tuning and On-device AI Palo Alto, CA Who We Are HP IQ is HP's new AI innovation lab. Combining... ...in applied machine learning, including at least 3 years in LLM fine-tuning. ~ Proficiency in Python and ML frameworks ecosystem...Full timeTemporary workLocal areaFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Machine Learning Engineer, LLM Evals & Observability. Be the first to apply!
Related searches
- machine learning engineer Mountain View, CA
- senior ml engineer Mountain View, CA
- computer vision machine learning engineer Mountain View, CA
- ai ml engineer Mountain View, CA
- machine learning software engineer Mountain View, CA
- machine learning ai engineer Mountain View, CA
- machine learning scientist Mountain View, CA
- machine learning remote Mountain View, CA
- machine learning Mountain View, CA
- machine learning researcher Mountain View, CA

