Executive Director, AI Ops Engineering
$175.1k - $334.75kCVS Health
We're building a world of health around every individual - shaping a more connected, convenient and compassionate health experience. At CVS Health®, you'll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold ourselves accountable and prioritize safety and quality in everything we do. Join us and be part of something bigger - helping to simplify health care one person, one family and one community at a time.
Executive Director, AI Platform SRE
About the Role
CVS Health is seeking an Executive Director, AI Ops Engineering to build and lead a team of professionals responsible for the continuous operation, monitoring, and optimization of CVS's Enterprise AI environment. This is first and foremost an engineering leadership role - your core accountability is ensuring the platform is always on, always performing, and always improving.
CVS Health's AI platform is a critical enterprise asset powering clinical, operational, and consumer capabilities at scale across one of the nation's largest healthcare organizations. Keeping it reliable, observable, and continuously improving is the mission. Reporting to the Global Head of Infrastructure/AI Operations and Service Delivery, you will establish and maintain operational baselines across the full infrastructure stack, ensure all changes are continuously monitored, observed, and adjusted, and drive the highest levels of availability, reliability, and scalability across every layer of the environment.
This is a greenfield organizational build - the person in this role will define the operating model, shape the team culture, and establish the engineering standards that will govern CVS's AI infrastructure for years ahead. If you thrive on building from the ground up, this role was designed for you.
Teams You Will Lead
You will build and lead a multi-disciplinary SRE organization structured across nine functional areas spanning core platform operations and innovation. The team is organized to ensure full-spectrum coverage of the AI environment - from hardware and network through platform reliability, security, observability, and 24/7 operations - while continuously developing advanced automation and self-healing capabilities.
Core operational teams cover the following domains:
Platform Reliability - SLO/SLI/error budget management, availability baseline enforcement, cluster administration, GPU quota governance, and infrastructure-as-code
Infrastructure - Compute, storage, and hardware lifecycle management, including compliance controls and data isolation
Network - High-performance GPU networking, fabric management, security segmentation, and continuous network baseline enforcement
Observability - End-to-end monitoring strategy, alerting pipelines, SLI/SLO dashboards, and the feedback loops that connect operational data to improvement
Security SRE - Security posture, access controls, audit logging, vulnerability management, and regulatory compliance (HIPAA, NIST AI RMF)
24/7 Operations Center - Round-the-clock incident response, on-call protocols, escalation management, and shift-level change execution, structured for sustainable coverage with no mandatory overtime
Change & Release Management - Change lifecycle governance, ITIL process management, compliance frameworks, ModelOps boundary definition, and platform knowledge base
FinOps - GPU cost governance, utilization optimization, tenant quota enforcement, and chargeback models in partnership with Finance
In addition to core operations, you will oversee three Innovation PODs - focused on AI-driven automation, infrastructure-as-code and self-service capabilities, and chaos engineering and resilience testing - with the goal of continuously reducing manual toil and building a self-healing, self-optimizing platform over time.
What You'll Do
Leadership
Own the SRE vision, strategy, and long-range roadmap with availability (>99.99%), reliability, and scalability as the primary measures of success
Lead, develop, and integrate all functional teams into a cohesive, always-on operations organization - setting clear ownership, accountability, and performance expectations for each team and each engineer
Establish and enforce operational baselines across all platform components; ensure deviations are detected, escalated, and resolved within defined SLAs
Drive end-to-end observability with continuous feedback loops connecting monitoring data to incident response, change decisions, and improvement cycles
Oversee change management ensuring every modification is risk-assessed, monitored during rollout, and baseline-validated post-deployment
Ensure configuration consistency and drift detection across all platform components to prevent baseline degradation over time
Build and sustain a high-performing 24/7 operations model - zero mandatory overtime, zero burnout attrition, and measurable team health and retention
Empower the Security SRE Lead to implement and maintain a world-class security posture, minimizing risk and ensuring robust compliance with frameworks like HIPAA and NIST AI RMF
Direct Innovation POD strategy to develop self-healing and autonomous capabilities that proactively prevent degradation before it impacts availability
Lead GPU FinOps governance - utilization optimization, tenant quota enforcement, and cost reduction - in partnership with the Finance organization
Manage vendor relationships and performance accountability
Program Governance
Lead the structured transition of operational ownership from the incumbent managed services provider to CVS's internal SRE organization, governing phased handoffs, competency validation, and milestone sign-offs, ensuring a seamless transition with minimal disruption to platform availability and business operations
Establish and lead the long-term operating model by institutionalizing key technical, architectural, and delivery leadership capabilities into permanent CVS roles, ensuring the organization is fully self-sustaining at program close
What You'll Bring
10+ years in SRE, platform operations, or DevOps engineering leadership with a demonstrated focus on availability and reliability outcomes
5+ years leading multiple technical teams simultaneously, including 24/7 operations organizations - with measurable team health, retention, and performance outcomes
Proven success establishing and enforcing operational baselines, SLO/SLI/error budget frameworks, and observability-driven continuous improvement in complex environments
Deep expertise in Kubernetes/OpenShift, IaC, GPU computing, and AI/ML infrastructure
Experience managing large-scale MSP transitions or platform operational handoffs while ensuring business continuity and minimizing disruption.
Demonstrated FinOps and GPU cost optimization experience in cloud or on-premises environments
Security framework implementation and compliance program management in regulated industries (HIPAA, NIST AI RMF)
Track record building sustainable 24/7 operations models with measurable retention and no burnout-related attrition
Executive stakeholder communication, vendor negotiation, and budget ownership
Background in innovation programs, POD structures, or centers of excellence
Willingness to travel and work off hours as required. Our 24/7 model is designed for sustainable, predictable coverage that eliminates mandatory overtime. As a leader, you will be an escalation point for critical incidents, but our goal is a resilient system and culture that protects our team's time
Preferred Qualifications
NVIDIA AI Enterprise, Run:AI, or GPU orchestration platform experience
Healthcare or regulated industry background
Certifications: ITIL Expert, PMP, AWS/Azure/GCP, CISSP
Familiarity with Cisco UCS, VAST storage, EVPN-VXLAN, and RDMA/RoCE protocols
Chaos engineering and AI-driven operations experience
Thought leadership: published work or speaking at industry conferences
Education
Required: Bachelor's in Computer Science, Engineering, or related field | Preferred: Master's degree
Pay Range
The typical pay range for this role is:
$175,100.00 - $334,750.00
This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. This position also includes an award target in the company's equity award program.
Our people fuel our future. Our teams reflect the customers, patients, members and communities we serve and we are committed to fostering a workplace where every colleague feels valued and that they belong.
Great benefits for great people
We take pride in offering a comprehensive and competitive mix of pay and benefits that reflects our commitment to our colleagues and their families.
This full-time position is eligible for a comprehensive benefits package designed to support the physical, emotional, and financial well-being of colleagues and their families. The benefits for this position include medical, dental, and vision coverage, paid time off, retirement savings options, wellness programs, and other resources, based on eligibility.
Additional details about available benefits are provided during the application process and on Benefits Moments ( .
We anticipate the application window for this opening will close on: 05/31/2026
Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.
CVS Health is an equal opportunity/affirmative action employer, including Disability/Protected Veteran - committed to diversity in the workplace.
$60 per hour
A technology company focused on AI development is seeking proficient programmers to join their remote team. This position allows for a flexible schedule where you can select projects that fit your availability. The role involves designing coding problems, writing quality...SuggestedHourly payRemote workFlexible hours$60 per hour
...A leading AI development company is seeking proficient programmers to join its remote team. You will design coding solutions for AI systems, particularly in Android development. This role offers flexibility, allowing you to choose projects and set your own schedule. Ideal...SuggestedRemote work$90k - $120k
...Job Description - AI Engineer Location: Salt Lake City, UT - 4 days onsite role Job Type: Fulltime Salary: $90k-120k/year(Bit flexible) Role Overview An AI Engineer is responsible for designing, building, deploying, and optimizing AI, Machine Learning...SuggestedFull timeFlexible hours- ...Job Title: Junior AI Engineer Location: Salt Lake City, Utah, United States Contract to hire role Description: Junior AI Engineer Position Description Ready to take your career to the next level? CGI is seeking a Junior AI Engineer to design, build, and...SuggestedFull timeContract workLocal area
- ...Job Title Artificial Intelligence (AI) / Machine Learning (ML) Engineers Working Title AI Engineer... ...with responsibilities beyond routine execution. Evidence of applying advanced technical... ...harassment, you may contact the Director/Title IX Coordinator in the Office...SuggestedFull timePart timeWork experience placementWork at officeMonday to FridayShift work
- ...An innovative tech company is seeking proficient programmers to join their remote team in the development of cutting-edge AI systems. This flexible position allows you to work from anywhere and choose projects that align with your skills and schedule. Responsibilities...Hourly payRemote workFlexible hours
$60 per hour
A cutting-edge AI development company is seeking proficient programmers to contribute remotely. You'll tackle diverse coding problems, focusing on Android development, and have the flexibility to choose projects and your working hours. The ideal candidate is fluent in...Hourly payRemote workFlexible hours- ...Junior AI Engineer We are seeking a Junior AI Engineer to design, build, and scale AI-powered product features from concept to production. In this role, you will work as a full-time consultant supporting enterprise clients, solving complex business and technical...Full time
- ...to win. We set the highest standards and execute beyond them. And if you're like us, we can... ...more human interactions - at scale. Our engineering teams are at the center of that mission,... ...engineer who is genuinely excited about AI - not because it's a trend, but because they...Worldwide
$120k - $130k
...AI Engineer Location: Salt Lake City, UT or New York City, NY – 4 days Onsite Role Fulltime role with Mphasis only, no C2C Salary: $120-130K Plus, only visa independent candidate, no OPT EAD Required Technical Skills: Programming & Frameworks Strong...Full time$102.6k - $120.7k
...listen to your ideas. This is an early-career engineering role focused on building, operating, and... ...or cloud operations role (internships/co-ops count). Comfort with one or more of the... ..., etc) Demonstrated experience with AI/ML/GenAI enablement (model lifecycle, AI...Full timeContract workInternshipLocal areaFlexible hours$40 per hour
A dynamic cybersecurity firm is seeking experienced professionals to evaluate AI-generated security content and solve technical problems. In this remote role, candidates will work on their schedule, with payment starting at $40+ per hour. Ideal candidates have over 2 years...Hourly payRemote workFlexible hours$40 per hour
A cybersecurity company seeks experienced professionals to evaluate AI-generated content and solve technical security problems. This fully remote role allows flexibility in project choice and scheduling, with hourly pay starting at $40. Candidates should have 2+ years...Hourly payRemote work$40 per hour
A leading cybersecurity firm is seeking experienced professionals to join their team in evaluating AI-generated security content. The role involves solving technical problems and providing feedback to improve AI models. Candidates should have over two years of hands-on...Hourly payRemote work$40 per hour
A cybersecurity company is seeking experienced professionals to join their remote team. You will evaluate AI-generated security content, solve technical cybersecurity issues, and provide feedback to strengthen AI models. Ideal candidates should have over 2 years of cybersecurity...Hourly payRemote workFlexible hours- A cybersecurity firm is seeking experienced cybersecurity professionals to evaluate AI-generated content and solve technical cybersecurity problems. You will provide feedback to enhance AI models and contribute to building reliable tools for the cybersecurity industry....Remote workFlexible hours
$40 per hour
...experienced cybersecurity professionals to join our team to help train AI models. In this role, you will evaluate AI-generated security... ...penetration testing, red teaming, incident response, detection engineering, DFIR, malware analysis, threat intelligence, or similar) Some...Hourly payFull timePart timeRemote work- ...Principal Cybersecurity Engineer Zions Bancorporation is transforming what it means to work for a financial... ...strategies for securing the Bank against AI threats. This position will report directly to the Director of Cybersecurity Operations and will focus on mitigation...Work experience placementWork at officeWork from homeFlexible hours3 days per week
$40 per hour
...A cybersecurity consulting firm is seeking experienced cybersecurity professionals to work remotely. Candidates will evaluate AI-generated security content and solve technical problems related to cybersecurity. Qualifications include 2+ years of hands-on experience in...Hourly payRemote workFlexible hours$146k - $241k
...Position Overview The Principal Data/AI Engineer helps drive the technical strategy and architecture of enterprise-scale data and AI platforms that power mission-critical data products, analytics, and AI-driven solutions. In this role, you will operate as a technical...Remote workWork from home- Detail Page LLC, located in Utah, is seeking a Technical Account Manager (TAM) to turn client goals into optimized product listings. This role involves data wrangling, workflow automation, and content refinement. The ideal candidate will have strong analytical and spreadsheet...
$60 per hour
...A leading AI development company is seeking proficient programmers to contribute to cutting-edge AI systems. This fully remote role allows you to set your own schedule and choose projects that align with your availability. Responsibilities include solving coding problems...Remote work$99.6k - $223.4k
...the future of healthcare - cloud-native Healthcare Solutions with AI at their core, designed to operate at nation-scale. Our mission... ...administrative burden. We're looking for highly skilled AI engineers to design and build high-scale, cloud-based data processing pipelines...Temporary workFlexible hours- ...Job Title: AI/ML Engineer - MCP - Agentic AI Location : Salt Lake City, Utah Mandatory Skills: TEAM OVERVIEW The Machine Learning... ...combine retrieval, structured reasoning, and secure action execution (function calling, change orchestration, policy enforcement)...
$40 per hour
A leading tech company is seeking experienced cybersecurity professionals to join their remote team. In this role, you'll evaluate AI-generated security content and solve real cybersecurity challenges. Candidates should have over 2 years of hands-on experience in various...Hourly payRemote work- ...Scientists. We welcome candidates with all visas and citizens to apply. Who Should Apply : Recent Computer science/Engineering /Mathematics/Statistics or Science Graduates looking to make their careers in IT Industry Candidates who are serious...
- ...Senior AI/ML Engineer Anywhere Type: Contract-to-Hire Category: Development Industry: Government Workplace Type: Remote Reference ID: JN -052026-107129 Date Posted: 05/26/2026 Shortcut: Description Recommended Jobs Description:...Hourly payPermanent employmentContract workLocal areaRemote work
- ...A healthcare technology company is seeking an Audit Defense Specialist to join its team in training AI models. The successful candidate will have expertise in healthcare, handling diverse problems related to AI chatbots' logic and performance evaluations. This position...Hourly payRemote workFlexible hours
$101.9k - $163k
...programs, benefits, and initiatives that are integrated into the fabric of how we work every day. To learn more, please see The AI/ML Engineer - Higher Education builds AI capabilities for Cengage's higher education products to improve student engagement, learning...Live inLocal areaWorldwide$40 per hour
A cybersecurity technology firm is seeking experienced professionals to evaluate AI-generated security content and improve AI systems’ reasoning about threats. Candidates should have 2+ years of cybersecurity experience, including areas like penetration testing and incident...Hourly payRemote workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Executive Director, AI Ops Engineering. Be the first to apply!
- chief industries Salt Lake City, UT
- executive support Salt Lake City, UT
- chief content officer Salt Lake City, UT
- chief of police Salt Lake City, UT
- chief diversity officer Salt Lake City, UT
- executive Salt Lake City, UT
- executive director Salt Lake City, UT
- chief Salt Lake City, UT
- board member Salt Lake City, UT
- store executive Salt Lake City, UT

