Executive Director, AI Ops Engineering

$175.1k - $334.75k

CVS

We're building a world of health around every individual - shaping a more connected, convenient and compassionate health experience. At CVS Health®, you'll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold ourselves accountable and prioritize safety and quality in everything we do. Join us and be part of something bigger - helping to simplify health care one person, one family and one community at a time.

Executive Director, AI Platform SRE

About the Role

CVS Health is seeking an Executive Director, AI Ops Engineering to build and lead a team of professionals responsible for the continuous operation, monitoring, and optimization of CVS's Enterprise AI environment. This is first and foremost an engineering leadership role - your core accountability is ensuring the platform is always on, always performing, and always improving.

CVS Health's AI platform is a critical enterprise asset powering clinical, operational, and consumer capabilities at scale across one of the nation's largest healthcare organizations. Keeping it reliable, observable, and continuously improving is the mission. Reporting to the Global Head of Infrastructure/AI Operations and Service Delivery, you will establish and maintain operational baselines across the full infrastructure stack, ensure all changes are continuously monitored, observed, and adjusted, and drive the highest levels of availability, reliability, and scalability across every layer of the environment.

This is a greenfield organizational build - the person in this role will define the operating model, shape the team culture, and establish the engineering standards that will govern CVS's AI infrastructure for years ahead. If you thrive on building from the ground up, this role was designed for you.

Teams You Will Lead

You will build and lead a multi-disciplinary SRE organization structured across nine functional areas spanning core platform operations and innovation. The team is organized to ensure full-spectrum coverage of the AI environment - from hardware and network through platform reliability, security, observability, and 24/7 operations - while continuously developing advanced automation and self-healing capabilities.

Core operational teams cover the following domains:

  • Platform Reliability - SLO/SLI/error budget management, availability baseline enforcement, cluster administration, GPU quota governance, and infrastructure-as-code

  • Infrastructure - Compute, storage, and hardware lifecycle management, including compliance controls and data isolation

  • Network - High-performance GPU networking, fabric management, security segmentation, and continuous network baseline enforcement

  • Observability - End-to-end monitoring strategy, alerting pipelines, SLI/SLO dashboards, and the feedback loops that connect operational data to improvement

  • Security SRE - Security posture, access controls, audit logging, vulnerability management, and regulatory compliance (HIPAA, NIST AI RMF)

  • 24/7 Operations Center - Round-the-clock incident response, on-call protocols, escalation management, and shift-level change execution, structured for sustainable coverage with no mandatory overtime

  • Change & Release Management - Change lifecycle governance, ITIL process management, compliance frameworks, ModelOps boundary definition, and platform knowledge base

  • FinOps - GPU cost governance, utilization optimization, tenant quota enforcement, and chargeback models in partnership with Finance

In addition to core operations, you will oversee three Innovation PODs - focused on AI-driven automation, infrastructure-as-code and self-service capabilities, and chaos engineering and resilience testing - with the goal of continuously reducing manual toil and building a self-healing, self-optimizing platform over time.

What You'll Do

Leadership

  • Own the SRE vision, strategy, and long-range roadmap with availability (>99.99%), reliability, and scalability as the primary measures of success

  • Lead, develop, and integrate all functional teams into a cohesive, always-on operations organization - setting clear ownership, accountability, and performance expectations for each team and each engineer

  • Establish and enforce operational baselines across all platform components; ensure deviations are detected, escalated, and resolved within defined SLAs

  • Drive end-to-end observability with continuous feedback loops connecting monitoring data to incident response, change decisions, and improvement cycles

  • Oversee change management ensuring every modification is risk-assessed, monitored during rollout, and baseline-validated post-deployment

  • Ensure configuration consistency and drift detection across all platform components to prevent baseline degradation over time

  • Build and sustain a high-performing 24/7 operations model - zero mandatory overtime, zero burnout attrition, and measurable team health and retention

  • Empower the Security SRE Lead to implement and maintain a world-class security posture, minimizing risk and ensuring robust compliance with frameworks like HIPAA and NIST AI RMF

  • Direct Innovation POD strategy to develop self-healing and autonomous capabilities that proactively prevent degradation before it impacts availability

  • Lead GPU FinOps governance - utilization optimization, tenant quota enforcement, and cost reduction - in partnership with the Finance organization

  • Manage vendor relationships and performance accountability

Program Governance

  • Lead the structured transition of operational ownership from the incumbent managed services provider to CVS's internal SRE organization, governing phased handoffs, competency validation, and milestone sign-offs, ensuring a seamless transition with minimal disruption to platform availability and business operations

  • Establish and lead the long-term operating model by institutionalizing key technical, architectural, and delivery leadership capabilities into permanent CVS roles, ensuring the organization is fully self-sustaining at program close

What You'll Bring

  • 10+ years in SRE, platform operations, or DevOps engineering leadership with a demonstrated focus on availability and reliability outcomes

  • 5+ years leading multiple technical teams simultaneously, including 24/7 operations organizations - with measurable team health, retention, and performance outcomes

  • Proven success establishing and enforcing operational baselines, SLO/SLI/error budget frameworks, and observability-driven continuous improvement in complex environments

  • Deep expertise in Kubernetes/OpenShift, IaC, GPU computing, and AI/ML infrastructure

  • Experience managing large-scale MSP transitions or platform operational handoffs while ensuring business continuity and minimizing disruption

  • Demonstrated FinOps and GPU cost optimization experience in cloud or on-premises environments

  • Security framework implementation and compliance program management in regulated industries (HIPAA, NIST AI RMF)

  • Track record building sustainable 24/7 operations models with measurable retention and no burnout-related attrition

  • Executive stakeholder communication, vendor negotiation, and budget ownership

  • Background in innovation programs, POD structures, or centers of excellence

  • Willingness to travel and work off hours as required. Our 24/7 model is designed for sustainable, predictable coverage that eliminates mandatory overtime. As a leader, you will be an escalation point for critical incidents, but our goal is a resilient system and culture that protects our team's time

Preferred Qualifications

  • NVIDIA AI Enterprise, Run:AI, or GPU orchestration platform experience

  • Healthcare or regulated industry background

  • Certifications: ITIL Expert, PMP, AWS/Azure/GCP, CISSP

  • Familiarity with Cisco UCS, VAST storage, EVPN-VXLAN, and RDMA/RoCE protocols

  • Chaos engineering and AI-driven operations experience

  • Thought leadership: published work or speaking at industry conferences

Education

Required: Bachelor's in Computer Science, Engineering, or related field | Preferred: Master's degree

Pay Range

The typical pay range for this role is:

$175,100.00 - $334,750.00

This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. This position also includes an award target in the company's equity award program.

Our people fuel our future. Our teams reflect the customers, patients, members and communities we serve and we are committed to fostering a workplace where every colleague feels valued and that they belong.

Great benefits for great people

We take pride in offering a comprehensive and competitive mix of pay and benefits that reflects our commitment to our colleagues and their families.

This full-time position is eligible for a comprehensive benefits package designed to support the physical, emotional, and financial well-being of colleagues and their families. The benefits for this position include medical, dental, and vision coverage, paid time off, retirement savings options, wellness programs, and other resources, based on eligibility.

Additional details about available benefits are provided during the application process and on Benefits Moments.

We anticipate the application window for this opening will close on: 05/31/2026

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.

CVS Health is an equal opportunity/affirmative action employer, including Disability/Protected Veteran - committed to diversity in the workplace.

Vacancy posted 20 hours ago