Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Engineering Manager, AI Observability

Netflix, Inc.

The AI Observability team at Netflix makes AI, ML, and Agentic systems transparent, reliable, and production‑ready at scale. We build end‑to‑end observability for ML and GenAI workloads, capturing model inputs, features, predictions, outcomes, and behavior across online and batch systems. Responsibilities Partner with ML researchers, engineers, and platform teams to embed “observability‑by‑default” into new AI services, ensuring telemetry, monitoring, and evaluation are built into systems from day one. Lead the end‑to‑end observability strategy for AI workloads, including LLMs, generative AI systems, and classical ML models; drive build‑vs‑buy decisions and scale solutions across model training, online inference, and agent orchestration. Drive the evolution of LLM evaluation frameworks, covering prompt instrumentation, response quality measurement, grounding correctness, hallucination rates, and human/LLM‑as‑a‑judge scoring. Define and execute a platform roadmap focused on incremental delivery, with clear success metrics, migration goals, and strong adoption across teams. Communicate progress to stakeholders, customers, and senior leadership. Hire, grow, and mentor a high‑performing engineering team while fostering an inclusive and collaborative culture. Qualifications 10+ years of software engineering experience and 3+ years of management experience. Experience leading teams responsible for building high‑traffic distributed systems and ML infrastructure. Deep familiarity with AI and ML operations, including model evaluation, drift detection, and continuous monitoring at scale. Experience with AI observability and monitoring tools (Arize AI, Fiddler AI, Weights & Biases, Vertex AI Model Monitoring, SageMaker Model Monitor). Exposure to LLM or generative AI systems, including prompt/result logging, evaluation metrics, LLM‑as‑a‑judge frameworks, and human‑in‑the‑loop review. Strong technical acumen and ability to act as a credible technical advisor, set and enforce a high‑quality bar for code and system design, and mentor the team. Strong communication and collaboration skills, and the ability to build strong relationships with internal customers and external partners. A demonstrated ability to develop, drive, and execute a technical vision and roadmap. Experience managing a hybrid team with partners and team members distributed across (US) geographies & time zones. Compensation Generally, our compensation structure consists solely of an annual salary; we do not have bonuses. The range for this role is $523,000.00 – $920,000.00. Benefits We provide comprehensive benefits including Health Plans, Mental Health support, a 401(k) Retirement Plan with employer match, Stock Option Program, Disability Programs, Health Savings and Flexible Spending Accounts, Family‑forming benefits, and Life and Serious Injury Benefits. We also offer paid leave of absence programs. Full‑time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off. Full‑time salaried employees are immediately entitled to flexible time off. Equal‑Opportunity Employer We are an equal‑opportunity employer and celebrate diversity, recognizing that diversity builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service. #J-18808-Ljbffr Netflix, Inc.

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Engineering Manager, AI Observability in Los Gatos, CA vacancy
  • The AI Observability team at Netflix, Inc. seeks an experienced leader to drive observability for AI and ML workloads. Responsibilities...  ...LLMs. The ideal candidate will have over 10 years of engineering experience, management capability, and proficiency in AI operations. We offer... 
    Suggested

    Netflix, Inc.

    Los Gatos, CA
    3 days ago
  • Netflix, Inc. is seeking an experienced Engineering Manager to lead the Client Delivery & Observability (CDO) team. In this role, you will ensure every client release, server canary, and A/B test is safely delivered while building a high-performing team of engineers. Responsibilities... 
    Suggested
    Flexible hours

    Netflix, Inc.

    Los Gatos, CA
    3 days ago
  • $224k - $356.5k

     ...tapping into the unlimited potential of AI to define the next era of computing...  ..., and scaling. As Technical Lead Manager, you will lead the engineering team within NVIDIA’s Dynamo...  ...including operators, Helm charts, and GPU observability tooling (DCGM, dcgm-exporter,... 
    Suggested
    Local area
    Worldwide

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $146.3k - $289.9k

     ...impressive content effortlessly. The AI Foundations team builds the flexible,...  ...personalization. We are looking for an Engineering Manager to lead and grow a team of engineers...  ...and meet engineering quality standards: observability, fault tolerance, latency guarantees,... 
    Suggested
    Temporary work
    Local area
    Immediate start
    Worldwide
    Flexible hours

    Adobe

    San Jose, CA
    4 days ago
  • Overview We are looking for an experienced Engineering Manager to lead the Client Delivery & Observability (CDO) team, a newly formed group that owns the release delivery automation platform and the real‑time observability stack, ensuring every client release, server canary... 
    Suggested
    Flexible hours

    Netflix, Inc.

    Los Gatos, CA
    15 hours ago
  • At Coram AI, we’re reimagining video security for the modern world. Our cloud-native...  ...We are looking for a technically deep Engineering Manager to lead the AI team at Coram. This...  ...standards around reliability, observability, and model evaluation What We’re Looking... 
    Shift work

    Coram AI

    Sunnyvale, CA
    2 days ago
  • $224k - $356.5k

     ...into the unlimited potential of AI to define the next era of...  ...leadership with expertise in systems engineering, inference infrastructure,...  ...forward. As Technical Lead Manager, you will lead the...  ...operators, Helm charts, and GPU observability tooling (DCGM, dcgm‑exporter,... 
    Local area
    Worldwide

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $206k - $303k

     ...Principal Engineer - Observability CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    5 days ago
  • $272k - $431.25k

     ...Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We're looking...  ...Proficiency in Out-of-Band and In-Band management architectures, device management protocols...  ...degree in Computer Science, Electrical Engineering or related field (or equivalent experience... 
    Shift work

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $197k - $291k

    A leading technology company seeks a Software Engineering Manager II for YouTube to lead engineering teams in optimizing ML infrastructure and building recommendation systems. The ideal candidate has extensive software development experience, strong technical leadership... 
    Full time

    Jobleads-US

    Mountain View, CA
    15 hours ago
  • $224k - $356.5k

     ...We are looking for a highly motivated Engineering Manager, Hardware Infrastructure Build Systems...  ...performance, reliability, reproducibility, and observability. Partnering with hardware, software...  ...an existing vacancy. NVIDIA uses AI tools in its recruiting processes.... 
    Remote work

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $207k - $304k

    X Development, LLC is seeking an Engineering Manager in Mountain View, CA to lead a world-class engineering team dedicated to evolving the electric grid. This role requires a background in technical leadership to ensure that systems support global deployments effectively... 

    X Development, LLC

    Mountain View, CA
    1 day ago
  • $141.8k - $258.6k

    Engineering Project Manager - AI Features Internationalization, L&RE Cupertino, California, United States Software and Services Apple's Software Engineering Operations (SWE Ops) organization is seeking a highly technical Engineering Project Manager (EPM) to drive the... 
    Worldwide
    Relocation

    Apple Inc.

    Cupertino, CA
    5 hours ago
  • $207k - $304k

     ...r i n g ( T a p e s t r y ) Software Engineering Mountain View, CA (HQ) About Tapestry...  ...frontier where energy's complexity meets AI's potential. We were born at X, the innovation...  ...and Leader to serve as the Engineering Manager for our Infrastructure and Developer... 
    Full time
    Flexible hours

    X: The Moonshot Factory

    Mountain View, CA
    4 days ago
  • $207k - $300k

    Google Inc. is seeking a Software Engineering Manager for Merchant Shopping in Mountain View, CA. In this role, you will lead a team responsible for developing AI-driven solutions and managing large-scale projects. The ideal candidate will have at least 8 years of experience... 

    Google Inc.

    Mountain View, CA
    4 days ago
  • $222k - $312.6k

    A leading data and AI company is seeking a Senior Engineering Manager for Customer Experience Intelligence. This role involves shaping intelligent customer experiences and managing support workflows. Candidates should have over 10 years of experience in building customer... 

    Databricks

    Mountain View, CA
    4 days ago
  • $160.2k - $320k

     ...Adobe, we are seeking an experienced manager to join our world-class engineering team in Product Engagement Systems...  ..., scalability, reliability, and observability are built into product design and...  ...turn ideas into impact, powered by AI and driven by human ingenuity.... 
    Temporary work
    Local area
    Worldwide

    Adobe

    San Jose, CA
    2 days ago
  • $272k - $431.25k

     ...Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We’re looking...  ...Proficiency in Out‑of‑Band and In‑Band management architectures, device management protocols...  ...degree in Computer Science, Electrical Engineering or related field (or equivalent experience... 
    Shift work

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...Senior Engineering Manager At F5, we strive to bring a better digital world to life. Our teams...  ...engineering, Quality Engineering, and AI Transformation across internal...  ...regression automation, release readiness, and observability. Improve reliability, change... 

    F5

    San Jose, CA
    4 days ago
  • $200k - $250k

     ...Type Full time Department Engineering Compensation Estimated Base...  ...industry. Our growing suite of AI solutions spans ambient AI...  ...coding, revenue cycle management and more — all designed for...  ...Define standards for frontend observability, performance metrics, and user... 
    Full time
    Work at office
    Local area
    Remote work

    Monograph

    Mountain View, CA
    1 day ago
  • $190k - $290k

    Engineering Manager — Foundational Data Systems for AI Location: Downtown Mountain View, CA (office-based, 5 days/week) Team: Foundational Data Systems About...  ...Distributed compute orchestration Reliability, observability, and operational tooling These systems operate on... 
    Work at office
    Flexible hours

    Dormont Manufacturing Co

    Mountain View, CA
    1 day ago
  • $206.4k - $384.68k

     ...venture at Adobe - an enterprise managed-service offering for custom multimedia generative AI. The offering includes deep-...  ...We are hiring a Director, ML Engineering to own the engineering...  ...services. Own analytics and observability across every model pipeline -... 
    Temporary work
    Local area
    Worldwide

    Adobe

    San Jose, CA
    2 days ago
  • $291.5k - $369.1k

     ...Modelsteam at Splunk, where we advance the state of AI for highvolume, realtime, multimodal...  ...operational excellence of Splunk and Cisco's global engineering capabilities. Our work spans networking, security, observability, and customer experience - designing and deploying... 
    Full time
    Temporary work
    Local area
    Flexible hours

    Cisco

    San Jose, CA
    4 days ago
  •  ...Splunk AI Models Team Splunk, a Cisco company, is building a safer, more resilient digital world...  ...excellence of Splunk and Cisco's global engineering capabilities. Our work spans networking, security, observability, and customer experience — designing and deploying... 
    Flexible hours

    Webex Events (formerly Socio)

    Milpitas, CA
    3 days ago
  • $156k - $229k

    A leading technology company in Sunnyvale seeks a Senior Silicon Validation Engineer to drive cutting-edge TPU technology in AI/ML applications. You'll own silicon validation across the life-cycle, ensuring robust performance and optimization. The ideal candidate has a... 

    Google Inc.

    Sunnyvale, CA
    1 day ago
  • A leading technology company based in Sunnyvale is seeking a Silicon Validation Engineering Manager to lead their validation efforts for custom TPU silicon. This role involves strategizing and executing test plans, managing resources, and ensuring the silicon meets all... 

    Google Inc.

    Sunnyvale, CA
    3 days ago
  • Business Area Engineering Seniority Level Mid-Senior level Job Description At Cloudera, we...  ...actionable insights. With as much data under management as the hyperscalers, we’re the preferred...  ..., streaming, operational databases, and AI. Cloudera is looking for a Senior... 
    Work from home
    Worldwide
    Flexible hours

    Nerdleveltech

    Santa Clara, CA
    4 days ago
  • $141.8k - $258.6k

     ...leading technology company in Cupertino is seeking a Software Engineering Project Manager to join their team. This role involves driving cross-...  ...in technical project management and a solid background in AI/ML projects. The role offers a competitive salary range between... 

    Apple Inc.

    Cupertino, CA
    1 day ago
  • $160k - $240k

     ...Sr. Engineering Manager – Product Engineering Calling all innovators - find your future at Fiserv. We're Fiserv, a global leader in Fintech...  ..., and success metrics beyond just technical requirements. AI-Powered Engineering: Promote the effective use of AI software... 
    Temporary work
    Work at office
    Worldwide
    Monday to Friday

    BentoBox

    Sunnyvale, CA
    5 days ago
  • $188k - $275k

    CoreWeave, the AI Hyperscaler™, acquired Weights & Biases to create the most powerful...  ...training experiment. Mentor and Grow Engineers: Manage and coach a high-caliber team of...  ...exceeding millions of events per second. Deep Observability Expertise: Hands-on experience with... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Immediate start
    Remote work
    Flexible hours

    Weights & Biases

    Sunnyvale, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Engineering Manager, AI Observability. Be the first to apply!