Executive Director, AI Ops Engineering

$175.1k - $334.75k

CVS Health

We're building a world of health around every individual - shaping a more connected, convenient and compassionate health experience. At CVS Health®, you'll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold ourselves accountable and prioritize safety and quality in everything we do. Join us and be part of something bigger - helping to simplify health care one person, one family and one community at a time.

Executive Director, AI Platform SRE

About the Role

CVS Health is seeking an Executive Director, AI Ops Engineering to build and lead a team of professionals responsible for the continuous operation, monitoring, and optimization of CVS's Enterprise AI environment. This is first and foremost an engineering leadership role - your core accountability is ensuring the platform is always on, always performing, and always improving.

CVS Health's AI platform is a critical enterprise asset powering clinical, operational, and consumer capabilities at scale across one of the nation's largest healthcare organizations. Keeping it reliable, observable, and continuously improving is the mission. Reporting to the Global Head of Infrastructure/AI Operations and Service Delivery, you will establish and maintain operational baselines across the full infrastructure stack, ensure all changes are continuously monitored, observed, and adjusted, and drive the highest levels of availability, reliability, and scalability across every layer of the environment.

This is a greenfield organizational build - the person in this role will define the operating model, shape the team culture, and establish the engineering standards that will govern CVS's AI infrastructure for years ahead. If you thrive on building from the ground up, this role was designed for you.

Teams You Will Lead

You will build and lead a multi-disciplinary SRE organization structured across nine functional areas spanning core platform operations and innovation. The team is organized to ensure full-spectrum coverage of the AI environment - from hardware and network through platform reliability, security, observability, and 24/7 operations - while continuously developing advanced automation and self-healing capabilities.

Core operational teams cover the following domains:

  • Platform Reliability - SLO/SLI/error budget management, availability baseline enforcement, cluster administration, GPU quota governance, and infrastructure-as-code

  • Infrastructure - Compute, storage, and hardware lifecycle management, including compliance controls and data isolation

  • Network - High-performance GPU networking, fabric management, security segmentation, and continuous network baseline enforcement

  • Observability - End-to-end monitoring strategy, alerting pipelines, SLI/SLO dashboards, and the feedback loops that connect operational data to improvement

  • Security SRE - Security posture, access controls, audit logging, vulnerability management, and regulatory compliance (HIPAA, NIST AI RMF)

  • 24/7 Operations Center - Round-the-clock incident response, on-call protocols, escalation management, and shift-level change execution, structured for sustainable coverage with no mandatory overtime

  • Change & Release Management - Change lifecycle governance, ITIL process management, compliance frameworks, ModelOps boundary definition, and platform knowledge base

  • FinOps - GPU cost governance, utilization optimization, tenant quota enforcement, and chargeback models in partnership with Finance

In addition to core operations, you will oversee three Innovation PODs - focused on AI-driven automation, infrastructure-as-code and self-service capabilities, and chaos engineering and resilience testing - with the goal of continuously reducing manual toil and building a self-healing, self-optimizing platform over time.

What You'll Do

Leadership

  • Own the SRE vision, strategy, and long-range roadmap with availability (>99.99%), reliability, and scalability as the primary measures of success

  • Lead, develop, and integrate all functional teams into a cohesive, always-on operations organization - setting clear ownership, accountability, and performance expectations for each team and each engineer

  • Establish and enforce operational baselines across all platform components; ensure deviations are detected, escalated, and resolved within defined SLAs

  • Drive end-to-end observability with continuous feedback loops connecting monitoring data to incident response, change decisions, and improvement cycles

  • Oversee change management ensuring every modification is risk-assessed, monitored during rollout, and baseline-validated post-deployment

  • Ensure configuration consistency and drift detection across all platform components to prevent baseline degradation over time

  • Build and sustain a high-performing 24/7 operations model - zero mandatory overtime, zero burnout attrition, and measurable team health and retention

  • Empower the Security SRE Lead to implement and maintain a world-class security posture, minimizing risk and ensuring robust compliance with frameworks like HIPAA and NIST AI RMF

  • Direct Innovation POD strategy to develop self-healing and autonomous capabilities that proactively prevent degradation before it impacts availability

  • Lead GPU FinOps governance - utilization optimization, tenant quota enforcement, and cost reduction - in partnership with the Finance organization

  • Manage vendor relationships and performance accountability

Program Governance

  • Lead the structured transition of operational ownership from the incumbent managed services provider to CVS's internal SRE organization, governing phased handoffs, competency validation, and milestone sign-offs, ensuring a seamless transition with minimal disruption to platform availability and business operations

  • Establish and lead the long-term operating model by institutionalizing key technical, architectural, and delivery leadership capabilities into permanent CVS roles, ensuring the organization is fully self-sustaining at program close

What You'll Bring

  • 10+ years in SRE, platform operations, or DevOps engineering leadership with a demonstrated focus on availability and reliability outcomes

  • 5+ years leading multiple technical teams simultaneously, including 24/7 operations organizations - with measurable team health, retention, and performance outcomes

  • Proven success establishing and enforcing operational baselines, SLO/SLI/error budget frameworks, and observability-driven continuous improvement in complex environments

  • Deep expertise in Kubernetes/OpenShift, IaC, GPU computing, and AI/ML infrastructure

  • Experience managing large-scale MSP transitions or platform operational handoffs while ensuring business continuity and minimizing disruption

  • Demonstrated FinOps and GPU cost optimization experience in cloud or on-premises environments

  • Security framework implementation and compliance program management in regulated industries (HIPAA, NIST AI RMF)

  • Track record building sustainable 24/7 operations models with measurable retention and no burnout-related attrition

  • Executive stakeholder communication, vendor negotiation, and budget ownership

  • Background in innovation programs, POD structures, or centers of excellence

  • Willingness to travel and work off hours as required. Our 24/7 model is designed for sustainable, predictable coverage that eliminates mandatory overtime. As a leader, you will be an escalation point for critical incidents, but our goal is a resilient system and culture that protects our team's time

Preferred Qualifications

  • NVIDIA AI Enterprise, Run:AI, or GPU orchestration platform experience

  • Healthcare or regulated industry background

  • Certifications: ITIL Expert, PMP, AWS/Azure/GCP, CISSP

  • Familiarity with Cisco UCS, VAST storage, EVPN-VXLAN, and RDMA/RoCE protocols

  • Chaos engineering and AI-driven operations experience

  • Thought leadership: published work or speaking at industry conferences

Education

Required: Bachelor's in Computer Science, Engineering, or related field | Preferred: Master's degree

Pay Range

The typical pay range for this role is:

$175,100.00 - $334,750.00

This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. This position also includes an award target in the company's equity award program.

Our people fuel our future. Our teams reflect the customers, patients, members and communities we serve and we are committed to fostering a workplace where every colleague feels valued and that they belong.

Great benefits for great people

We take pride in offering a comprehensive and competitive mix of pay and benefits that reflects our commitment to our colleagues and their families.

This full-time position is eligible for a comprehensive benefits package designed to support the physical, emotional, and financial well-being of colleagues and their families. The benefits for this position include medical, dental, and vision coverage, paid time off, retirement savings options, wellness programs, and other resources, based on eligibility.

Additional details about available benefits are provided during the application process and on Benefits Moments.

We anticipate the application window for this opening will close on: 05/31/2026

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.

CVS Health is an equal opportunity/affirmative action employer, including Disability/Protected Veteran - committed to diversity in the workplace.

Vacancy posted 20 hours ago