Executive Director, AI Ops Engineering
$175.1k - $334.75kHispanic Alliance for Career Enhancement
We're building a world of health around every individual - shaping a more connected, convenient and compassionate health experience. At CVS Health®, you'll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold ourselves accountable and prioritize safety and quality in everything we do. Join us and be part of something bigger - helping to simplify health care one person, one family and one community at a time. Executive Director, AI Platform SRE About the Role CVS Health is seeking an Executive Director, AI Ops Engineering to build and lead a team of professionals responsible for the continuous operation, monitoring, and optimization of CVS's Enterprise AI environment. This is first and foremost an engineering leadership role – your core accountability is ensuring the platform is always on, always performing, and always improving. CVS Health's AI platform is a critical enterprise asset powering clinical, operational, and consumer capabilities at scale across one of the nation's largest healthcare organizations. Keeping it reliable, observable, and continuously improving is the mission. Reporting to the Global Head of Infrastructure/AI Operations and Service Delivery, you will establish and maintain operational baselines across the full infrastructure stack, ensure all changes are continuously monitored, observed, and adjusted, and drive the highest levels of availability, reliability, and scalability across every layer of the environment. This is a greenfield organizational build – the person in this role will define the operating model, shape the team culture, and establish the engineering standards that will govern CVS's AI infrastructure for years ahead. If you thrive on building from the ground up, this role was designed for you. Teams You Will Lead You will build and lead a multi-disciplinary SRE organization structured across nine functional areas spanning core platform operations and innovation. The team is organized to ensure full-spectrum coverage of the AI environment – from hardware and network through platform reliability, security, observability, and 24/7 operations – while continuously developing advanced automation and self-healing capabilities. Platform Reliability – SLO/SLI/error budget management, availability baseline enforcement, cluster administration, GPU quota governance, and infrastructure-as-code Infrastructure – Compute, storage, and hardware lifecycle management, including compliance controls and data isolation Network – High-performance GPU networking, fabric management, security segmentation, and continuous network baseline enforcement Observability – End-to-end monitoring strategy, alerting pipelines, SLI/SLO dashboards, and the feedback loops that connect operational data to improvement Security SRE – Security posture, access controls, audit logging, vulnerability management, and regulatory compliance (HIPAA, NIST AI RMF) 24/7 Operations Center – Round-the-clock incident response, on-call protocols, escalation management, and shift-level change execution, structured for sustainable coverage with no mandatory overtime Change & Release Management – Change lifecycle governance, ITIL process management, compliance frameworks, ModelOps boundary definition, and platform knowledge base FinOps – GPU cost governance, utilization optimization, tenant quota enforcement, and chargeback models in partnership with Finance In addition to core operations, you will oversee three Innovation PODs – focused on AI-driven automation, infrastructure-as-code and self-service capabilities, and chaos engineering and resilience testing – with the goal of continuously reducing manual toil and building a self-healing, self-optimizing platform over time. What You'll Do Leadership Own the SRE vision, strategy, and long-range roadmap with availability (>99.99%), reliability, and scalability as the primary measures of success Lead, develop, and integrate all functional teams into a cohesive, always-on operations organization – setting clear ownership, accountability, and performance expectations for each team and each engineer Establish and enforce operational baselines across all platform components; ensure deviations are detected, escalated, and resolved within defined SLAs Drive end-to-end observability with continuous feedback loops connecting monitoring data to incident response, change decisions, and improvement cycles Oversee change management ensuring every modification is risk-assessed, monitored during rollout, and baseline-validated post-deployment Ensure configuration consistency and drift detection across all platform components to prevent baseline degradation over time Build and sustain a high-performing 24/7 operations model – zero mandatory overtime, zero burnout attrition, and measurable team health and retention Empower the Security SRE Lead to implement and maintain a world-class security posture, minimizing risk and ensuring robust compliance with frameworks like HIPAA and NIST AI RMF Direct Innovation POD strategy to develop self-healing and autonomous capabilities that proactively prevent degradation before it impacts availability Lead GPU FinOps governance – utilization optimization, tenant quota enforcement, and cost reduction – in partnership with the Finance organization Manage vendor relationships and performance accountability Program Governance Lead the structured transition of operational ownership from the incumbent managed services provider to CVS's internal SRE organization, governing phased handoffs, competency validation, and milestone sign-offs, ensuring a seamless transition with minimal disruption to platform availability and business operations Establish and lead the long-term operating model by institutionalizing key technical, architectural, and delivery leadership capabilities into permanent CVS roles, ensuring the organization is fully self-sustaining at program close What You'll Bring 10+ years in SRE, platform operations, or DevOps engineering leadership with a demonstrated focus on availability and reliability outcomes 5+ years leading multiple technical teams simultaneously, including 24/7 operations organizations – with measurable team health, retention, and performance outcomes Proven success establishing and enforcing operational baselines, SLO/SLI/error budget frameworks, and observability-driven continuous improvement in complex environments Deep expertise in Kubernetes/OpenShift, IaC, GPU computing, and AI/ML infrastructure Experience managing large-scale MSP transitions or platform operational handoffs while ensuring business continuity and minimizing disruption. Demonstrated FinOps and GPU cost optimization experience in cloud or on-premises environments Security framework implementation and compliance program management in regulated industries (HIPAA, NIST AI RMF) Track record building sustainable 24/7 operations models with measurable retention and no burnout-related attrition Executive stakeholder communication, vendor negotiation, and budget ownership Background in innovation programs, POD structures, or centers of excellence Willingness to travel and work off hours as required. Our 24/7 model is designed for sustainable, predictable coverage that eliminates mandatory overtime. As a leader, you will be an escalation point for critical incidents, but our goal is a resilient system and culture that protects our team's time Preferred Qualifications NVIDIA AI Enterprise, Run:AI, or GPU orchestration platform experience Healthcare or regulated industry background Certifications: ITIL Expert, PMP, AWS/Azure/GCP, CISSP Familiarity with Cisco UCS, VAST storage, EVPN-VXLAN, and RDMA/RoCE protocols Chaos engineering and AI-driven operations experience Thought leadership: published work or speaking at industry conferences Education Required: Bachelor's in Computer Science, Engineering, or related field | Preferred: Master's degree Pay Range The typical pay range for this role is: $175,100.00 - $334,750.00. This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. This position also includes an award target in the company's equity award program. Great benefits for great people We take pride in offering a comprehensive and competitive mix of pay and benefits that reflects our commitment to our colleagues and their families. This full‑time position is eligible for a comprehensive benefits package designed to support the physical, emotional, and financial well‑being of colleagues and their families. The benefits for this position include medical, dental, and vision coverage, paid time off, retirement savings options, wellness programs, and other resources, based on eligibility. Our people fuel our future. Our teams reflect the customers, patients, members and communities we serve and we are committed to fostering a workplace where every colleague feels valued and that they belong. Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws. #J-18808-Ljbffr
$160k - $300k
...health technology startup in New York seeks an ML operations engineer to manage the lifecycle of AI systems, from experimentation to production. The role... ...AI performance on biopharma data, implementing ML Ops best practices, and cross-functional collaboration in a...Suggested$174.99k - $209.98k
...Staff AI Engineer - Grafana Ops, AI/ML | USA | Remote United States (Remote) Grafana Labs is a remote-first, open-source powerhouse. There are more than 20M users of Grafana, the open source visualization tool, around the globe, monitoring everything from beehives to climate...SuggestedLocal areaRemote work- ...Calix is seeking a highly skilled Staff AI Ops Engineer to join their AI/ML team, supporting machine learning and generative AI applications. This remote role requires expertise in GCP, container orchestration, and CI/CD pipelines. The ideal candidate will have at least...SuggestedRemote work
- CVS Health seeks an Executive Director, AI Ops Engineering to lead a team ensuring optimal operation of the AI platform. You'll oversee SRE practices, building a multi-disciplinary team, managing vendor relationships, and driving success with top-notch availability and...Suggested
- ...Sevaro is seeking an AI Operations Engineer to help optimize workflows and drive AI initiatives across the organization. This role involves collaborating with clinical and administrative teams to identify operational efficiencies, while ensuring compliance with healthcare...Suggested
$200k - $240k
...Traba is the AI operating layer for the industrial supply chain. We started in workforce... ...the workers already on our platform to execute against them, we are building applied AI... ...an entrepreneurial Senior Applied Agent Engineer to join as a founding member of the Agents...Temporary workLocal areaFlexible hoursShift workDay shift- PointFive is looking for a Business AI Engineer in New York to enhance internal operations by building AI-powered workflows for various teams. This unique role requires 5+ years in data engineering and familiarity with automation tools like n8n and Zapier. Candidates should...
- ...Bristol Myers Squibb is seeking an Associate Director to act as a technology leader in an AI-first product team focused on Global Development Operations... ...should have at least 15 years of experience in software engineering, strong leadership capabilities, and a proven track...
$140k - $203k
...MUFG Bank, Ltd. is seeking a highly motivated Security Engineer in Hoboken, New Jersey, to design and deploy AI agents that enhance cyber security decision-making. The successful candidate will develop autonomous workflows, integrate AI models, and optimize data pipelines...- ...We are actively seeking a highly skilled and experienced Senior AI/ML Engineer with a focus on MLOps to join our innovative team. If you have 6 to 10 years of hands-on experience in the AI/ML space and a passion for driving technological advancements, this role is for...Full timeRemote work
- Arlo is seeking AI Forward Deployed Engineers to revolutionize health insurance workflows. You will engage deeply with underwriting, claims, and operations to implement AI solutions that automate tasks and elevate team capacities. Your contributions will directly enhance...
$195k - $275k
...partner with the Advanced Analytics, Machine learning and Gen AI Platform team(s), across multiple project areas, and work in collaboration... ...person would also be part of the overall cloud adoption and engineering roadmap and ensure scalable, agile and robust architecture and...Temporary workWorldwide- ...A technology consulting firm in Jersey City seeks an AI Operations Platform Consultant with extensive experience in deploying and managing large-scale GPU-accelerated AI platforms. Responsibilities include overseeing the deployment of LLM inference systems using Kubernetes...
$150k - $250k
David Protein in New York, NY is seeking a Software Engineer to build internal tools and AI-powered workflows. The role requires strong proficiency in Python and TypeScript, with a minimum of 2 years in software engineering. Responsibilities include designing solutions...Full time- ...A technology solutions company in Jersey City seeks an AI Operations Platform Consultant to oversee the deployment and management of GPU-accelerated AI platforms and LLM inference systems. Candidates should have strong expertise in Kubernetes, TensorRT-LLM, and Triton...
- ...Principal Software Engineer If you are looking for a game-changing career, working for one of the world's leading financial institutions... ...support one or more of the firm's portfolios. As an Agentic AI Principal Software Engineer at JPMorganChase within the Asset &...
$175.1k - $334.75k
...CVS Health is seeking an Executive Director of AI Platform SRE based in Kentucky. This pivotal role entails leading a multi-disciplinary SRE organization focused on ensuring platform reliability and continuous improvement of AI-driven services. Candidates must possess...$45k - $50k
...Okcnp is seeking an Executive Director to oversee daily operations, program development, fundraising, and community engagement within Canadian County. The ideal candidate will have a minimum of three years of nonprofit management experience and strong leadership skills...$191.85k - $287.78k
...sales channels. Job Summary FreeWheel, a Comcast company, is seeking a seasoned and strategic finance leader to serve as Executive Director, Global Financial Operations & Compliance. Reporting directly to the CFO, this role sits at the center of our global growth...Work experience placementWork at officeLocal area$150k - $300k
...A tech company is seeking a Full-Stack AI Engineer in New York. The ideal candidate will design and build core product surfaces powered by AI, taking ownership of complex systems from interfaces to backend services. Responsibilities include end-to-end development, scalable...Work at office- ...Join to apply for the Marketing Ops AI Manager role at Cyera Come join the company reinventing data security, empowering businesses to... ...optimizing AI‑driven systems and workflows that power the marketing engine. This role sits at the intersection of marketing, data, and...Full timeTemporary workWork at officeRemote work
- ...Framework Ventures is seeking an Ecosystem Operator to lead OP Mainnet, the flagship L2 in the Web3 space. You'll define its long... ...through strategic partnerships, and collaborate with product and engineering teams to ensure its success. This fully remote role offers...Remote work
$100k
...volume and growing fast. We're building the AI-powered revenue platform for modern... ...product we're building. We're looking for engineers who are ambitious and take ownership end-to... ...billing with AI: guest piece on AI in finance ops. Sequence blog: Introducing Sequence 2....Live inWork at officeVisa sponsorshipFlexible hours$175k - $250k
...Swayable is a fast-growing AI and automated data science platform... ..., it is led by the former Executive Director for Digital Strategy at the... ...Swayable is seeking a Senior Engineer blending Python software development... ...toolset for ML and AI Ops. * You are knowledgeable about...$140k - $180k
...raise the bar on what agentic AI, CTV, eCommerce, social, and mobile... ...Mission As the AI Engineer at Kargo, you will architect,... ...leadership has a clear, prioritized AI Ops roadmap that you own and drive... ...defining the problem and executing without a fully specified...Work experience placementLocal area$110k - $120k
...client we serve. We’re seeking a Azure Data & AI Strategy Expert who can guide enterprise... ...and establishing enterprise‑scale ML ops foundations. Support the implementation of... ...metadata, and operational AI enablement. Executive presence with ability to translate complex...Worldwide- ...Socure builds, deploys, and scales AI-driven identity solutions... ...Reporting to the Head of New Product Engineering, you'll join a new Internal AI... ...sales, marketing, revenue ops, finance, talent, and legal.... ...what to build - not just execute what's been scoped. You'll be...Remote work
$100k - $120k
...Job Description Junior AI Engineer Location: New York, US Type: Full-time Department... ...agentic AI workflows that can plan, execute, and validate steps across operational processes... ...). Collaborate with domain SMEs (ops, compliance, risk) to translate...Full time- ...Founding AI Engineer (AI + Production) New York City (5 days on-site) · Top of market + equity + benefits TL;DR: Build AI that accelerates... ...design, guardrails, physics-AI integration, and the entire ML ops stack Mentor and lead: as the team scales, you'll hire and...Immediate startWeekend work
$187k - $240k
..., and more. We're building a new wave of AI-powered capabilities that help customers... ...paths This is a highly product-minded engineering role: you'll work from problem discovery... ...Python; strong API/service design; production ops (monitoring, alerting, on-call rotation)...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Executive Director, AI Ops Engineering. Be the first to apply!
- executive associate New York, NY
- chief communications officer New York, NY
- managing director sales New York, NY
- college president New York, NY
- chief intellectual property counsel New York, NY
- executive search consultant New York, NY
- credit union executive New York, NY
- chief dental officer New York, NY
- executive program manager New York, NY
- chief growth officer New York, NY

