Machine Learning Engineer, LLM Evals & Observability
$200k - $300kGlean.info
About Glean: Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry's most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI agents on one secure, open platform. With over 100 enterprise SaaS connectors, flexible LLM choice, and robust APIs, Glean gives organizations the infrastructure to govern, scale, and customize AI across their entire business - without vendor lock-in or costly implementation cycles. At its core, Glean is redefining how enterprises find, use, and act on knowledge. Its Enterprise Graph and Personal Knowledge Graph map the relationships between people, content, and activity, delivering deeply personalized, context-aware responses for every employee. This foundation powers Glean's agentic capabilities - AI agents that automate real work across teams by accessing the industry's broadest range of data: enterprise and world, structured and unstructured, historical and real-time. The result: measurable business impact through faster onboarding, hours of productivity gained each week, and smarter, safer decisions at every level. Recognized by Fast Company as one of the World's Most Innovative Companies (Top 10, 2025), by CNBC's Disruptor 50, Bloomberg's AI Startups to Watch (2026), Forbes AI 50, and Gartner's Tech Innovators in Agentic AI, Glean continues to accelerate its global impact. With customers across 50+ industries and 1,000+ employees in more than 25 countries, we're helping the world's largest organizations make every employee AI-fluent, and turning the superintelligent enterprise from concept into reality. If you're excited to shape how the world works, you'll help build systems used daily across Microsoft Teams, Zoom, ServiceNow, Zendesk, GitHub, and many more - deeply embedded where people get things done. You'll ship agentic capabilities on an open, extensible stack, with the craft and care required for enterprise trust, as we bring Work AI to every employee, in every company. About the Role: Building a great AI assistant is only half the battle - knowing whether it's actually great is the other half. Our team owns the measurement and quality layer that make Glean's Assistant and Agents reliably better over time: evaluation pipelines, quality eval-sets, LLM-powered judges, agent observability, and the tooling engineers use to understand what changed and why. It's a rare combination of infrastructure engineering, applied ML, and direct product impact. If you care deeply about quality and want to build the systems that make it measurable, this role is for you. You will:
Compensation & Benefits: The standard base salary range for this position is $200,000 - $300,000 annually. Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for variable compensation, equity, and benefits. We offer a comprehensive benefits package including competitive compensation, Medical, Vision, and Dental coverage, generous time-off policy, and the opportunity to contribute to your 401k plan to support your long-term goals. When you join, you'll receive a home office improvement stipend, as well as an annual education and wellness stipends to support your growth and wellbeing. We foster a vibrant company culture through regular events, and provide healthy lunches daily to keep you fueled and focused. We are a diverse bunch of people and we want to continue to attract and retain a diverse range of people into our organization. We're committed to an inclusive and diverse company. We do not discriminate based on gender, ethnicity, sexual orientation, religion, civil or family status, age, disability, or race. #LI-HYBRID AI-First Mindset at Glean: At Glean, AI fluency is core to how we work and we're committed to ensuring every new hire feels confident integrating AI into their everyday work. As part of the interview process, you'll complete a brief AI-focused exercise or discussion so we can understand how you think about, design, and use AI to drive impact in your role. Feel free to reference any tools, platforms, or workflows you use today - prior Glean experience isn't required. Global Data Privacy Notice for Job Candidates and Applicants: Depending on your location, the General Data Protection Regulation (GDPR), California Consumer Privacy Act (CCPA), or other privacy laws may regulate the way we manage the data of job applicants. Our full notice outlining how data will be processed as part of the application procedure for applicable locations is available in our Privacy Policy . By submitting your application, you are agreeing to our use and processing of your data as required. US applicants and their applications are subject to arbitration of disputes as outlined in our Applicant Arbitration Agreement . By clicking "Submit Application," I confirm that I have read the Global Data Privacy Notice and the Applicant Arbitration Agreement , and I agree to the terms.
- Design and curate evaluation datasets - sampling strategies, query diversity, and golden sets that give reliable, representative coverage of real assistant behavior.
- Build and maintain large-scale evaluation pipelines that measure assistant quality across thousands of real user queries.
- Build LLM-powered judges that score metrics like correctness, completeness, and response quality, and align them against human judgment.
- Evaluate new models and product changes before they ship - providing the quality signal that gates launches and prevents regressions.
- Build observability infrastructure for AI agents: trace enrichment, data pipelines, and dashboards that make assistant behavior inspectable.
- Close the loop between quality measurement and improvement using eval results, customer feedback, and techniques like automated prompt iteration to help drive concrete gains in assistant behavior.
- Collaborate with engineers across the company to make evals a first-class part of how we ship.
- 2+ years of software engineering experience with strong coding skills.
- Strong backend fundamentals in Go and Python; comfortable with distributed data pipelines.
- Experience working with LLM evaluation, reinforcement learning from human feedback, natural language processing, or other large systems involving machine learning.
- Analytically rigorous - you think carefully about what offline metrics actually predict about real user experience.
- Thrive in a customer-focused, tight-knit and cross-functional environment - being a team player and willing to take on whatever is most impactful for the company
- You care about quality - not just in the systems you build, but in the product you're helping measure and improve.
- This role is hybrid (3-4 days a week in one of our SF Bay Area offices)
Compensation & Benefits: The standard base salary range for this position is $200,000 - $300,000 annually. Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for variable compensation, equity, and benefits. We offer a comprehensive benefits package including competitive compensation, Medical, Vision, and Dental coverage, generous time-off policy, and the opportunity to contribute to your 401k plan to support your long-term goals. When you join, you'll receive a home office improvement stipend, as well as an annual education and wellness stipends to support your growth and wellbeing. We foster a vibrant company culture through regular events, and provide healthy lunches daily to keep you fueled and focused. We are a diverse bunch of people and we want to continue to attract and retain a diverse range of people into our organization. We're committed to an inclusive and diverse company. We do not discriminate based on gender, ethnicity, sexual orientation, religion, civil or family status, age, disability, or race. #LI-HYBRID AI-First Mindset at Glean: At Glean, AI fluency is core to how we work and we're committed to ensuring every new hire feels confident integrating AI into their everyday work. As part of the interview process, you'll complete a brief AI-focused exercise or discussion so we can understand how you think about, design, and use AI to drive impact in your role. Feel free to reference any tools, platforms, or workflows you use today - prior Glean experience isn't required. Global Data Privacy Notice for Job Candidates and Applicants: Depending on your location, the General Data Protection Regulation (GDPR), California Consumer Privacy Act (CCPA), or other privacy laws may regulate the way we manage the data of job applicants. Our full notice outlining how data will be processed as part of the application procedure for applicable locations is available in our Privacy Policy . By submitting your application, you are agreeing to our use and processing of your data as required. US applicants and their applications are subject to arbitration of disputes as outlined in our Applicant Arbitration Agreement . By clicking "Submit Application," I confirm that I have read the Global Data Privacy Notice and the Applicant Arbitration Agreement , and I agree to the terms.
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Machine Learning Engineer, LLM Evals & Observability in Mountain View, CA vacancy
$200k - $300k
...connectors flexible LLM choice and robust APIs... ...judges agent observability and the tooling engineers use to understand what... ...the company to make evals a first-class part of... ...evaluation reinforcement learning from human feedback... ...systems involving machine learning. ~ Analytically...SuggestedFull timeHome officeFlexible hours3 days per week$204k - $259k
...Senior Machine Learning Engineer, Perception LLM/VLM Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver...SuggestedFull timeRemote work- ...running. Our Team's Vision: Our Engineering team is shaping the future of... ...and prompt engineering. ~ Agentic observability using Langfuse, Evals frameworks for Testing/Resilience... ...scale. AI Ops: Experience with LLM deployment optimization (e.g., vLLM,...SuggestedImmediate start
$213k - $263k
...Senior Machine Learning Engineer, Multimodal Perception (LLM/VLM) Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building...SuggestedFull timeRemote work$204k - $259k
...Senior Machine Learning Engineer – VLM/LLM Evaluation Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver...SuggestedFull timeTemporary workRemote work- ...complex workflows, and continuously learn and adapt. Moveworks is... ...with Moveworks’ Reasoning Engine and natural language capabilities... ...Role We are looking for a Machine Learning Engineer to help build... ...for building and serving LLM’s at Moveworks. This role will...Full timeWork at officeRemote workFlexible hours
$140k - $220k
...ABOUT THE JOB We are looking for a Machine Learning Engineer to help build and develop our ML capabilities... ..., automated training pipelines, and observability solutions to maintain model health... ...Use of artificial intelligence or a LLM such as ChatGPT during the interview...Work at officeFlexible hours$181.1k - $272.1k
...Machine Learning Engineer - LLM Imagine what you could do here. At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish...Relocation$196k - $221k
...alongside industry-veteran scientists and engineers. As a Machine Learning Engineer, you'll bring your strong... ...large-scale SID / ASR / NLP / LLM systems that power mission-critical... ...in model performance, robustness, observability, and operational excellence using real...Permanent employment- ...act, and continuously improve. As a Machine Learning Engineer , you won't just build models, you'll... ...real-time ML serving systems Develop LLM-Powered Applications Build and optimize... ...Python code Ensure reliability, observability, and performance across distributed systems...
$147.4k - $272.1k
...Machine Learning Engineer - Agentic AI The VCV organization has pioneered human-centric, real-time... ..., and cost, and build monitoring and observability systems to understand behavior in... ...environments. Strong proficiency with LLM-assisted coding, including using AI...Relocation$213k - $263k
...Senior ML Engineer, LLM / VLM Distillation Waymo is an autonomous driving technology company with the mission to be the world's... ...The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards...Full timeRemote work$148.91k - $252k
...Machine Learning Engineer - LLM, AI & Robotics Santa Clara, CA XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical...Full time$232k - $310k
...collaboration, and high standards. Our engineers, product leaders, and go-to-market teams... ...on pushing the boundaries of applied machine learning. We work with massive datasets, tackle... ...inference optimization (vLLM, TensorRT-LLM). Desired Skills & Experience: Research...Work experience placementWork at officeRemote workFlexible hours3 days per week$214k - $289.5k
...Come join Intuit as a Senior Staff Machine Learning Engineer (MLE). Senior Staff MLEs deliver end... ...systems are built for adaptability, observability, and secure customer data usage. Build... ...-augmented generation (RAG), and LLM fine-tuning pipelines to accelerate product...Local area$162.51k - $342.75k
...Staff Machine Learning Engineer We are Omnissa. The world is evolving quickly, and organizations... ...testing as needed. Implement model observability, monitoring, and drift detection to ensure... ...in machine learning, AI, and LLM technologies, and apply them to product...Work experience placementLocal areaFlexible hours$60 - $70 per hour
...Overview: We are seeking a Machine Learning Engineer to join a high-impact team focused on advancing LLM evaluation, NLP, and AI-driven automation. This role centers on designing scalable evaluation frameworks, optimizing prompt strategies, and building systems that...Contract workTemporary workRemote work3 days per week- ...interacting with content, we're engineering the next generation of... ...versatile, self-driven Staff Machine Learning Engineer to raise the bar... ...compatibility in mind, and drive observability across system health and... ...powering asset generation, LLM inference, and the ML-...Full timeTemporary workWork at officeWorldwideRelocation package
- ...and personalized agent experiences. Knowledge and passion in machine learning algorithms, Gen AI, LLMs, and natural language processing (NLP... ...strategies (QLORA, DPO) and inference optimization (vLLM, TensorRT-LLM). Research experience in agentic AI or related fields....Full timeWork experience placement
$150k
...Machine Learning Engineer About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and... ...during large scale data processing ~ Hands-on experience with LLM algorithms, such as Supervised Fine-Tuning (SFT) and...WorldwideVisa sponsorship$200k - $300k
...platform. With over 100 enterprise SaaS connectors, flexible LLM choice, and robust APIs, Glean gives organizations the... ...every company. About the Role: Glean is seeking a few Machine Learning engineers who want to focus on a combination of Quality and traditional...Work at officeHome officeFlexible hours- ...Jayaraman from Info Way Solutions, LLC We have job opening for Machine Learning Engineer and the detailed Job description is given below: Kindly... ...data pipelines using python • Experience with GenAI/ LLM. Thanks & Regards, Jayaraman Email: ****@*****.***...
- ...Criteria (Must Haves) • Top 3 Required skills: Machine Learning, Gen AI, Python • Years of experience in... ...Create, test, and refine prompts (prompt engineering) including system instructions, chains, and templates to improve LLM output quality and reliability....Hourly payPermanent employmentWork at officeRemote work3 days per week
- ...Role Overview: As a Machine Learning Engineer, you will play a central role in translating cutting-edge machine learning research into scalable... ...in inference time optimization, deep understanding of LLM serving optimizations for LLMs/VLMs. Hands on experience...
$213k - $263k
...Machine Learning Engineer, Runtime & Optimization Waymo is an autonomous driving technology company with the mission to be the world's most trusted... ...efficient deep learning techniques for foundation models or LLM. Experience with GPU HW or TPU HW and related system...Full timeRemote work- ...Summary We are seeking a highly experienced Machine Learning Engineer to build, deploy, and optimize Large Language Model (LLM)-based applications, with a strong emphasis... ...Qualifications Hands-on experience with observability and evaluation tools for LLMs. Solid...
$108k - $170k
About Us Observe.AI is the AI Agents platform for customer experience... ...We’re looking for an AI Agent Engineer to lead the charge in building... ...Write and refine prompts for LLM‑driven agents, monitor performance... ...stipend to support wellness, learning and professional development,...Full timeWork at officeLocal areaRemote workFlexible hours- ...Description We are seeking an experienced GenAI engineer to join our seasoned founding team to... ...distributed infrastructure to support machine learning training, inference, and evaluation.... ...with real‑time responsiveness and edge‑llm deployment. Compensation Competitive compensation...
- ...video experiences at Apple. The Video Engineering group develops key image and video technologies... ...many Apple products and services. As a Machine Learning Engineer, you'll be developing... ...Ability to leverage AI coding assistants and LLM-based tools to accelerate development,...
$213k - $263k
...Senior Machine Learning Engineer, Runtime and Serving Waymo is an autonomous driving technology company with the mission to be the world's most... ...Runtime, OpenXLA/PjRT, TVM). Experience building or scaling LLM serving systems, including expertise in distributed...Full timeRemote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Machine Learning Engineer, LLM Evals & Observability. Be the first to apply!
Related searches
- machine learning ai engineer Mountain View, CA
- machine learning engineer Mountain View, CA
- machine learning software engineer Mountain View, CA
- ai ml engineer Mountain View, CA
- senior ml engineer Mountain View, CA
- computer vision machine learning engineer Mountain View, CA
- data scientist machine learning engineer Mountain View, CA
- machine learning research scientist Mountain View, CA
- artificial intelligence - machine learning intern Mountain View, CA
- machine learning Mountain View, CA



