Engineering Manager, AI Observability
Netflix, Inc.
The AI Observability team at Netflix makes AI, ML, and Agentic systems transparent, reliable, and production‑ready at scale. We build end‑to‑end observability for ML and GenAI workloads, capturing model inputs, features, predictions, outcomes, and behavior across online and batch systems.

Responsibilities
- Partner with ML researchers, engineers, and platform teams to embed “observability‑by‑default” into new AI services, ensuring telemetry, monitoring, and evaluation are built into systems from day one.
- Lead the end‑to‑end observability strategy for AI workloads, including LLMs, generative AI systems, and classical ML models; drive build‑vs‑buy decisions and scale solutions across model training, online inference, and agent orchestration.
- Drive the evolution of LLM evaluation frameworks, covering prompt instrumentation, response quality measurement, grounding correctness, hallucination rates, and human/LLM‑as‑a‑judge scoring.
- Define and execute a platform roadmap focused on incremental delivery, with clear success metrics, migration goals, and strong adoption across teams.
- Communicate progress to stakeholders, customers, and senior leadership.
- Hire, grow, and mentor a high‑performing engineering team while fostering an inclusive and collaborative culture.

Qualifications
- 10+ years of software engineering experience and 3+ years of management experience.
- Experience leading teams responsible for building high‑traffic distributed systems and ML infrastructure.
- Deep familiarity with AI and ML operations, including model evaluation, drift detection, and continuous monitoring at scale.
- Experience with AI observability and monitoring tools (Arize AI, Fiddler AI, Weights & Biases, Vertex AI Model Monitoring, SageMaker Model Monitor).
- Exposure to LLM or generative AI systems, including prompt/result logging, evaluation metrics, LLM‑as‑a‑judge frameworks, and human‑in‑the‑loop review.
- Strong technical acumen and ability to act as a credible technical advisor, set and enforce a high‑quality bar for code and system design, and mentor the team.
- Strong communication and collaboration skills, and the ability to build strong relationships with internal customers and external partners.
- A demonstrated ability to develop, drive, and execute a technical vision and roadmap.
- Experience managing a hybrid team with partners and team members distributed across (US) geographies & time zones.

Compensation
Generally, our compensation structure consists solely of an annual salary; we do not have bonuses. The range for this role is $523,000.00 – $920,000.00.

Benefits
We provide comprehensive benefits including Health Plans, Mental Health support, a 401(k) Retirement Plan with employer match, Stock Option Program, Disability Programs, Health Savings and Flexible Spending Accounts, Family‑forming benefits, and Life and Serious Injury Benefits. We also offer paid leave of absence programs. Full‑time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off. Full‑time salaried employees are immediately entitled to flexible time off.

Equal‑Opportunity Employer
We are an equal‑opportunity employer and celebrate diversity, recognizing that diversity builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

