Executive Director, AI Ops Engineering
$175.1k - $334.75k
CVS Health
We're building a world of health around every individual - shaping a more connected, convenient and compassionate health experience. At CVS Health®, you'll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold ourselves accountable and prioritize safety and quality in everything we do. Join us and be part of something bigger - helping to simplify health care one person, one family and one community at a time.
Executive Director, AI Platform SRE
About the Role
CVS Health is seeking an Executive Director, AI Ops Engineering to build and lead a team of professionals responsible for the continuous operation, monitoring, and optimization of CVS's Enterprise AI environment. This is first and foremost an engineering leadership role - your core accountability is ensuring the platform is always on, always performing, and always improving.
CVS Health's AI platform is a critical enterprise asset powering clinical, operational, and consumer capabilities at scale across one of the nation's largest healthcare organizations. Keeping it reliable, observable, and continuously improving is the mission. Reporting to the Global Head of Infrastructure/AI Operations and Service Delivery, you will establish and maintain operational baselines across the full infrastructure stack, ensure all changes are continuously monitored, observed, and adjusted, and drive the highest levels of availability, reliability, and scalability across every layer of the environment.
This is a greenfield organizational build - the person in this role will define the operating model, shape the team culture, and establish the engineering standards that will govern CVS's AI infrastructure for years ahead. If you thrive on building from the ground up, this role was designed for you.
Teams You Will Lead
You will build and lead a multi-disciplinary SRE organization structured across nine functional areas spanning core platform operations and innovation. The team is organized to ensure full-spectrum coverage of the AI environment - from hardware and network through platform reliability, security, observability, and 24/7 operations - while continuously developing advanced automation and self-healing capabilities.
Core operational teams cover the following domains:
Platform Reliability - SLO/SLI/error budget management, availability baseline enforcement, cluster administration, GPU quota governance, and infrastructure-as-code
Infrastructure - Compute, storage, and hardware lifecycle management, including compliance controls and data isolation
Network - High-performance GPU networking, fabric management, security segmentation, and continuous network baseline enforcement
Observability - End-to-end monitoring strategy, alerting pipelines, SLI/SLO dashboards, and the feedback loops that connect operational data to improvement
Security SRE - Security posture, access controls, audit logging, vulnerability management, and regulatory compliance (HIPAA, NIST AI RMF)
24/7 Operations Center - Round-the-clock incident response, on-call protocols, escalation management, and shift-level change execution, structured for sustainable coverage with no mandatory overtime
Change & Release Management - Change lifecycle governance, ITIL process management, compliance frameworks, ModelOps boundary definition, and platform knowledge base
FinOps - GPU cost governance, utilization optimization, tenant quota enforcement, and chargeback models in partnership with Finance
In addition to core operations, you will oversee three Innovation PODs - focused on AI-driven automation, infrastructure-as-code and self-service capabilities, and chaos engineering and resilience testing - with the goal of continuously reducing manual toil and building a self-healing, self-optimizing platform over time.
What You'll Do
Leadership
Own the SRE vision, strategy, and long-range roadmap with availability (>99.99%), reliability, and scalability as the primary measures of success
Lead, develop, and integrate all functional teams into a cohesive, always-on operations organization - setting clear ownership, accountability, and performance expectations for each team and each engineer
Establish and enforce operational baselines across all platform components; ensure deviations are detected, escalated, and resolved within defined SLAs
Drive end-to-end observability with continuous feedback loops connecting monitoring data to incident response, change decisions, and improvement cycles
Oversee change management ensuring every modification is risk-assessed, monitored during rollout, and baseline-validated post-deployment
Ensure configuration consistency and drift detection across all platform components to prevent baseline degradation over time
Build and sustain a high-performing 24/7 operations model - zero mandatory overtime, zero burnout attrition, and measurable team health and retention
Empower the Security SRE Lead to implement and maintain a world-class security posture, minimizing risk and ensuring robust compliance with frameworks like HIPAA and NIST AI RMF
Direct Innovation POD strategy to develop self-healing and autonomous capabilities that proactively prevent degradation before it impacts availability
Lead GPU FinOps governance - utilization optimization, tenant quota enforcement, and cost reduction - in partnership with the Finance organization
Manage vendor relationships and performance accountability
Program Governance
Lead the structured transition of operational ownership from the incumbent managed services provider to CVS's internal SRE organization, governing phased handoffs, competency validation, and milestone sign-offs with minimal disruption to platform availability and business operations
Establish and lead the long-term operating model by institutionalizing key technical, architectural, and delivery leadership capabilities into permanent CVS roles, ensuring the organization is fully self-sustaining at program close
What You'll Bring
10+ years in SRE, platform operations, or DevOps engineering leadership with a demonstrated focus on availability and reliability outcomes
5+ years leading multiple technical teams simultaneously, including 24/7 operations organizations - with measurable team health, retention, and performance outcomes
Proven success establishing and enforcing operational baselines, SLO/SLI/error budget frameworks, and observability-driven continuous improvement in complex environments
Deep expertise in Kubernetes/OpenShift, IaC, GPU computing, and AI/ML infrastructure
Experience managing large-scale MSP transitions or platform operational handoffs while ensuring business continuity and minimizing disruption
Demonstrated FinOps and GPU cost optimization experience in cloud or on-premises environments
Security framework implementation and compliance program management in regulated industries (HIPAA, NIST AI RMF)
Track record building sustainable 24/7 operations models with measurable retention and no burnout-related attrition
Executive stakeholder communication, vendor negotiation, and budget ownership
Background in innovation programs, POD structures, or centers of excellence
Willingness to travel and work off hours as required. Our 24/7 model is designed for sustainable, predictable coverage that eliminates mandatory overtime. As a leader, you will be an escalation point for critical incidents, but our goal is a resilient system and culture that protects our team's time.
Preferred Qualifications
NVIDIA AI Enterprise, Run:AI, or GPU orchestration platform experience
Healthcare or regulated industry background
Certifications: ITIL Expert, PMP, AWS/Azure/GCP, CISSP
Familiarity with Cisco UCS, VAST storage, EVPN-VXLAN, and RDMA/RoCE protocols
Chaos engineering and AI-driven operations experience
Thought leadership: published work or speaking at industry conferences
Education
Required: Bachelor's in Computer Science, Engineering, or related field | Preferred: Master's degree
Pay Range
The typical pay range for this role is:
$175,100.00 - $334,750.00
This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. This position also includes an award target in the company's equity award program.
Our people fuel our future. Our teams reflect the customers, patients, members and communities we serve and we are committed to fostering a workplace where every colleague feels valued and that they belong.
Great benefits for great people
We take pride in offering a comprehensive and competitive mix of pay and benefits that reflects our commitment to our colleagues and their families.
This full-time position is eligible for a comprehensive benefits package designed to support the physical, emotional, and financial well-being of colleagues and their families. The benefits for this position include medical, dental, and vision coverage, paid time off, retirement savings options, wellness programs, and other resources, based on eligibility.
Additional details about available benefits are provided during the application process and on Benefits Moments.
We anticipate the application window for this opening will close on: 05/31/2026
Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.
CVS Health is an equal opportunity/affirmative action employer, including Disability/Protected Veteran - committed to diversity in the workplace.