
Executive Director, AI Infrastructure & Platform Engineering

$175.1k - $334.75k

Hispanic Alliance for Career Enhancement

The Executive Director, AI Infrastructure & Platform Engineering is a senior engineering leadership role responsible for standing up, operating, and continuously improving CVS Health's on-premises AI compute platform. This position owns the physical and platform layers of CVS's Enterprise AI Factory – a frontier-class GPU compute environment running NVIDIA Blackwell systems across a high-throughput RoCE v2 fabric, hosted in co-located data center facilities, with multi-site expansion underway.

Reporting to the Global Head of Infrastructure/AI Operations and Service Delivery, this leader will establish operational baselines across the full infrastructure stack – hardware, network fabric, GPU clusters, storage, and the operating systems and orchestration layers above – and build the Site Reliability Engineering practice that delivers the availability, reliability, and performance that frontier AI workloads demand.

This is a greenfield organizational build. The Executive Director will define the operating model, set the engineering standards, hire and develop the team, and establish the long-term operations capability that will govern CVS's AI infrastructure for years ahead.

Key Responsibilities

Strategy and Leadership:
  • Define and execute the long-range vision and strategy for AI infrastructure and platform engineering, with availability (>99.99%), reliability, and platform performance as the primary measures of success.
  • Recruit, hire, develop, and retain a high-performing engineering organization spanning infrastructure, network, platform reliability, observability, security, 24/7 operations, change and release management, and FinOps.
  • Establish clear ownership, accountability, and performance expectations across all functional teams; foster a culture of operational excellence, engineering rigor, and continuous improvement.
  • Provide executive-level communication to senior leadership on platform status, milestones, risk posture, and strategic initiatives.

Infrastructure and Platform Engineering:
  • Own the physical layer of the AI compute environment – GPU compute, storage, network fabric, capacity planning, and hardware lifecycle accountability.
  • Direct bare-metal Kubernetes and OpenShift operations, including cluster administration, GPU quota governance, infrastructure-as-code adoption, and availability baseline enforcement.
  • Govern high-performance network fabric operations – RoCE v2, spine-leaf topology, lossless Ethernet tuning, congestion management, and segmentation.
  • Establish and enforce operational baselines across every layer of the stack – hardware, fabric, platform, and workload – with deviations detected, escalated, and resolved within defined SLAs.
  • Direct Innovation POD strategy to develop self-healing and autonomous capabilities that proactively prevent service degradation before it impacts availability.

Operations and Reliability:
  • Build and sustain a high-performing 24/7 operations model – designed for sustainable, predictable coverage with no mandatory overtime and measurable team health and retention.
  • Drive end-to-end observability across the physical and platform layers, with continuous feedback loops connecting monitoring data to incident response, change decisions, and improvement cycles.
  • Oversee change management so every modification is risk-assessed, monitored during rollout, and baseline-validated post-deployment.
  • Ensure configuration consistency and drift detection across all platform components to prevent baseline degradation over time.
  • Lead GPU FinOps governance – utilization optimization, tenant quota enforcement, and cost reduction – in partnership with the Finance organization.

Security and Compliance:
  • Empower the Security SRE Lead to maintain a world-class security posture across the infrastructure and platform layers, with robust compliance to frameworks including HIPAA and NIST AI RMF.
  • Govern access controls, audit logging, vulnerability management, and network segmentation across the AI compute environment.

Program Transition and Operating Model:
  • Lead the operational transition from program-launch staffing to permanent CVS-owned operations – governing phased handoffs, competency validation, and milestone sign-offs to ensure minimal disruption to platform availability and business operations.
  • Establish and lead the long-term operating model by institutionalizing key technical, architectural, and delivery leadership capabilities into permanent CVS roles, ensuring the organization is fully self-sustaining at program close.

Vendor and Stakeholder Management:
  • Own vendor relationships, contract performance, and accountability across the hardware, networking, platform, and managed-services stack.
  • Manage budget ownership for the AI infrastructure and platform engineering organization, including capital planning and operational expense governance.

Required Qualifications
  • 10+ years of engineering leadership experience, with substantial time directly owning physical infrastructure at data center scale – including hardware lifecycle, capacity planning, and facility coordination (power, cooling, rack-and-stack execution).
  • Hands-on production ownership of bare-metal Kubernetes or OpenShift. Managed cloud services (EKS, GKE, AKS) alone do not substitute for the practitioner expertise this role requires.
  • Fluency with high-speed cluster fabrics – RoCE v2, InfiniBand, EVPN-VXLAN, or carrier-grade equivalent – and the operational discipline these fabrics require (PFC, ECN, lossless tuning, congestion management).
  • 5+ years leading multiple technical teams simultaneously, including 24/7 operations organizations, with measurable team health, retention, and performance outcomes.
  • Proven success establishing and enforcing operational baselines, SLO / SLI / error-budget frameworks, and observability-driven continuous improvement in physical-infrastructure-anchored environments.
  • Hardware lifecycle, vendor accountability, and facility coordination experience – including capacity planning, RMA management, and multi-vendor escalation.
  • Experience leading operational transitions or organizational build-outs at scale, with business continuity and minimal disruption as non-negotiables.
  • Executive-level stakeholder communication, vendor negotiation, and budget ownership.

Preferred Qualifications
  • Hands-on experience with Cisco UCS, NVIDIA HGX / DGX / Blackwell systems, and VAST or comparable distributed NVMe storage.
  • Direct experience operating GPU clusters of 32 or more GPUs in production environments – including HPC, AI training, research computing, or comparable workloads.
  • NVIDIA AI Enterprise, NVIDIA Run:AI, NVIDIA Base Command Manager, or comparable GPU orchestration platform experience.
  • Healthcare or other regulated-industry background (HIPAA, NIST AI RMF, SOX, FedRAMP, ITAR).
  • Chaos engineering and AI-driven operations experience – predictive alerting and automated remediation patterns.
  • Background in innovation programs, POD structures, or centers of excellence.

Education

Required: Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or related technical field.

Pay Range

Typical pay range: $175,100.00 – $334,750.00

This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors.

This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. This position also includes an award target in the company's equity award program.

Benefits

We take pride in offering a comprehensive and competitive mix of pay and benefits that reflects our commitment to our colleagues and their families. This full-time position is eligible for a comprehensive benefits package designed to support the physical, emotional, and financial well-being of colleagues and their families. The benefits for this position include medical, dental, and vision coverage, paid time off, retirement savings options, wellness programs, and other resources, based on eligibility. Additional details about available benefits are provided during the application process and on Benefits Moments.

Equal Opportunity

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.

Vacancy posted 3 days ago
Similar jobs that could be interesting for you
Based on the Executive Director, AI Infrastructure & Platform Engineering in Kansas City, MO vacancy
  • Empower Retirement, LLC is seeking a Platform Engineer in Overland Park, Kansas to design and implement innovative platform solutions for the Innovation Lab. You will work with cross-functional teams to enhance the architecture of projects, focusing on scalability and seamless... 
    Suggested

    Empower Retirement, LLC

    Overland Park, KS
    1 day ago
  • $160k - $185k

    LEVI, RAY & SHOUP, INC is seeking a Software Engineer in Kansas City, MO, responsible for building platform infrastructure and developer tools. The role involves designing core components, ensuring compliance, and mentoring teams. The ideal candidate has 8+ years of experience... 
    Suggested

    LEVI, RAY & SHOUP, INC

    Kansas City, MO
    6 days ago
  • $125.4k - $181.88k

    Job Overview The Platform Engineer will design and implement platform solutions...  ...in areas such as generative AI, advanced analytics, digital...  ...in the development and execution of a technology strategy for...  ...performance issues in the platform infrastructure. Contribute to the... 
    Suggested
    16 hours
    Temporary work
    Casual work
    Work at office
    Local area
    Remote work

    Empower Retirement, LLC

    Overland Park, KS
    1 day ago
  • $142.6k - $261.5k

     ...across mobile, web, and tablet platforms. Engage in coding,...  ...workstreams from planning through execution and closure, maintaining regular...  ...roadmap contributions. Advise on AI‑enabled ITOM use cases,...  ...Information Systems Management, Engineering, or a related discipline. 4-... 
    Suggested
    Flexible hours

    EY

    Kansas City, MO
    19 hours ago
  • $142.6k - $261.5k

     ...interfaces (UI) across various platforms including mobile, web, and...  ...workstreams from planning through execution and closure. Travel may be...  ...process or evaluating how AI can streamline delivery. Wherever...  ...Systems Management, Engineering or similar discipline Typically... 
    Summer holiday
    Flexible hours

    Ernst & Young Oman

    Kansas City, MO
    2 days ago
  • $60 per hour

     ...A progressive AI development firm is seeking proficient programmers to advance AI systems while enjoying fully remote flexibility. Candidates will undertake challenging coding tasks, including developing applications and evaluating AI-generated code. This role offers... 
    Hourly pay
    Remote work

    DataAnnotation

    Kansas City, MO
    1 day ago
  • Platform Engineering & Operations Manager Role Description As the Platform...  ...will own the Azure-hosted infrastructure, reliability, security, and...  ....NET Core Server Apps, and AI services. The role delivers...  ...the matured Azure estate. Execute the day‑to‑day operations of... 
    Work at office
    Local area
    Flexible hours
    2 days per week

    BusinessOptix

    Kansas City, MO
    2 days ago
  • RadNet, Inc. seeks an AI Platform Administrator to oversee the administration, security, and operational support of AI platforms including Microsoft Copilot and Claude. Responsibilities include ensuring compliance with policies, managing access controls, and supporting... 

    RadNet, Inc.

    Kansas City, MO
    3 days ago
  • $101.9k - $175k

     ...here: In your role as Technology Lead, Platforms & Ecommerce Engineering within our Digital organization, you...  .... You will also help advance how AI is applied within engineering workflows...  ...and DevOps to align on priorities and execution. You will collaborate with other technology... 
    Work experience placement
    Live in
    Local area

    Cengage Group

    Kansas City, MO
    2 days ago
  • $142.6k - $261.5k

     ...programming languages like Java and C#. The successful candidate will manage external client engagements and guide teams through project executions, ensuring high quality and risk management standards. This position offers a vibrant work atmosphere focusing on professional... 

    Ernst & Young Oman

    Kansas City, MO
    2 days ago
  • $60 per hour

    A tech company specializing in AI is seeking proficient programmers to work remotely from the United States. The role involves solving coding problems and developing AI systems, particularly for Android. Candidates should be fluent in English and have proficiency in Kotlin... 
    Remote job
    Hourly pay

    DataAnnotation

    Kansas City, MO
    1 day ago
  • $60 per hour

    A technology company is seeking proficient programmers for remote work on cutting-edge AI systems. Responsibilities include designing coding problems, evaluating AI-generated code, and writing clear code snippets. Ideal candidates should be fluent in English and proficient... 
    Remote job

    DataAnnotation

    Kansas City, MO
    3 days ago
  • $89.6k - $167.6k

     ...From strategy to execution, the Government & Public Sector...  ...for leading the delivery of platform and infrastructure capabilities from initial design...  ...a bridge between product, engineering, and platform teams to advance...  ...AZ-400, AZ-500, AZ-700, AI-102 Certified Kubernetes Administrator... 
    Full time
    Summer holiday
    Local area
    Flexible hours
    Shift work

    EY

    Kansas City, MO
    1 day ago
  • $101.9k - $132.45k

     ...’ll do here: As the Software Engineering Manager, you will lead a team...  ...content for our Cengage Learning Platforms (CLP). Working closely with...  ...and implement an innovative, AI‑first roadmap ensuring...  ...tools and report progress to executive teams. Balance technical debt... 
    Work experience placement
    Local area
    Remote work

    Cengage

    Kansas City, MO
    4 days ago
  • Netsmart is seeking a Technical Enablement Lead in Overland Park, KS. This role requires strong collaboration with engineering teams to align learning priorities with evolving needs, ensuring that content is reflective of real-world application. The successful candidate... 

    Netsmart

    Overland Park, KS
    1 day ago
  •  ...Tech Lead - Sr. Software Engineer In this role, you'll mentor the...  ...accelerate delivery through AI- and agentic-first engineering...  ...Why Join Us Impact: Build platforms used across the business; modernize...  .../EKS, ECS, or Lambda Infrastructure-as-Code experience, using tools... 
    Contract work

    BOK Financial

    Overland Park, KS
    2 days ago
  •  ...-Sharing technology platform that drives action for...  ...of Product and Sr. Director of Engineering author), capturing...  ...documentation, clarity, and execution discipline come from...  ...engineers, QA, and infrastructure. Experience...  ...infrastructure, platform, AI, compliance, data)... 
    Full time
    Local area
    Shift work

    GO Project

    Kansas City, MO
    8 hours ago
  • $165k - $195k

    AI Solutions Executive The AI Solutions Executive is a high-impact, quota-carrying...  ...AI solutions, use cases, platforms and technologies with...  ...targeting both AI infrastructure/product deals and consulting...  ..., GenAI, agentic AI, data engineering, AI security, and industry... 
    Shift work

    World Wide Technology

    Kansas City, MO
    2 days ago
  • $138.2k - $180k

     ...Director of Software Engineering – Cengage We believe in the power and joy of learning...  ...products and shape AI-driven engineering teams....  ...Docker, microservices), and Infrastructure as Code (Terraform, AWS CDK...  ...systems with cloud‑native platforms. ~ Proven ability to compose... 
    For contractors
    Work experience placement
    Local area

    Cengage Group

    Kansas City, MO
    1 day ago
  • SRE DevOps Engineer Location: Overland Park, KS / Atlanta, GA / Frisco...  ...CI/CD pipelines, GitOps, and infrastructure as code. Solid problem‑...  ...API Proxy, WAF, DBs, and infra platforms. Design and improve runbooks,...  ...ChatOps) Expectation: Integrate AI/ML‑powered tools for anomaly... 

    Highbrow LLC

    Overland Park, KS
    3 days ago
  • $142.6k - $261.5k

     ...opportunity As a Deployment Manager, you will primarily focus on executing large SAP implementation procedures. You will collaborate with...  ...planet, while building trust in capital markets. Enabled by data, AI and advanced technology, EY teams help clients shape the future... 
    Summer holiday
    Flexible hours

    EY

    Kansas City, MO
    1 day ago
  • $85k - $115k

     ...Improvement: Recommend enhancements to the lab infrastructure, integrations, and processes to optimize...  ..., and technical experimentation platforms. Strong understanding of identity and access...  ...IT for resolution. Familiarity with AI technologies (e.g., generative AI,... 
    Full time
    Work at office
    Remote work

    Mariner Holdings

    Overland Park, KS
    2 days ago
  •  ...The Technical Enablement Lead partners with Cloud, DevOps, and AI leaders to align learning priorities with evolving platform and business needs. This role ensures learning experiences reflect how engineering teams actually work, while collaborating with the Associate... 
    For contractors
    Work at office

    Netsmart

    Overland Park, KS
    1 day ago
  • $92.5k - $166.8k

    Job Overview The Software Engineer is essential for designing, implementing, and deploying scalable software solutions that meet customer...  ...GitLab CI, Docker, Kubernetes, AWS Strong knowledge of Agentic AI Strong knowledge of using AI tools in analysis and design (Required... 
    Full time
    Temporary work
    Part time
    Work experience placement
    Local area
    Flexible hours

    T-Mobile

    Overland Park, KS
    1 day ago
  •  ...from inception to delivery. The scope of this job includes executing WellSky's solution strategy in order to deliver best-in-class...  ...developed in order to meet business objectives. Leverage AI tools and platforms as an integral part of developing detailed product requirements... 
    Full time
    Work experience placement

    WellSky

    Overland Park, KS
    19 hours ago
  • $240k - $334k

     ...Data Center Network Infrastructure Construction...  ...recommendations to executives as you are discussing...  ...development with engineers. To lead the...  ...stakeholders. The AI and...  ...providing the essential platforms that enable developers...  ...Alphabet Inc. board of directors or its delegate,... 
    For contractors
    Work at office
    Worldwide
    Flexible hours

    Google Inc.

    Kansas City, MO
    2 days ago
  • Cox Enterprises is seeking a Sr. Software Engineer for the Dealer.com team in Overland Park, Kansas. You will design...  ...automotive industry's leading digital marketing platform, collaborating with talented engineers and AI agents. Your responsibilities include architecting... 

    Cox Enterprises

    Overland Park, KS
    2 days ago
  •  ...will provide technical leadership — setting standards, mentoring engineers, and ensuring UI excellence across the team. Key...  ...serving as a technical anchor on UI projects Basic proficiency with AI coding tools (e.g., GitHub Copilot, Cursor, or ChatGPT) to assist... 
    Full time

    BuzzClan

    Overland Park, KS
    19 hours ago
  • Colgate in Overland Park, Kansas is seeking a strategic leader in data analytics with over 15 years of experience. The role involves executing multi-year strategies and leading a high-performing team in a data-driven environment. Candidates should have a strong data... 
    Relocation package

    Colgate

    Overland Park, KS
    4 days ago
