
Executive Director, AI Infrastructure & Platform Engineering

$175.1k - $334.75k

Hispanic Alliance for Career Enhancement

The Executive Director, AI Infrastructure & Platform Engineering is a senior engineering leadership role responsible for standing up, operating, and continuously improving CVS Health's on‑premises AI compute platform. This position owns the physical and platform layers of CVS's Enterprise AI Factory – a frontier‑class GPU compute environment running NVIDIA Blackwell systems across a high‑throughput RoCE v2 fabric, hosted in co‑located data center facilities, with multi‑site expansion underway.

Reporting to the Global Head of Infrastructure/AI Operations and Service Delivery, this leader will establish operational baselines across the full infrastructure stack – hardware, network fabric, GPU clusters, storage, and the operating systems and orchestration layers above – and build the Site Reliability Engineering practice that delivers the availability, reliability, and performance that frontier AI workloads demand.

This is a greenfield organizational build. The Executive Director will define the operating model, set the engineering standards, hire and develop the team, and establish the long‑term operations capability that will govern CVS’s AI infrastructure for years ahead.

Key Responsibilities

Strategy and Leadership:
  • Define and execute the long‑range vision and strategy for AI infrastructure and platform engineering, with availability (>99.99%), reliability, and platform performance as the primary measures of success.
  • Recruit, hire, develop, and retain a high‑performing engineering organization spanning infrastructure, network, platform reliability, observability, security, 24/7 operations, change and release management, and FinOps.
  • Establish clear ownership, accountability, and performance expectations across all functional teams; foster a culture of operational excellence, engineering rigor, and continuous improvement.
  • Provide executive‑level communication to senior leadership on platform status, milestones, risk posture, and strategic initiatives.

Infrastructure and Platform Engineering:
  • Own the physical layer of the AI compute environment – GPU compute, storage, network fabric, capacity planning, and hardware lifecycle accountability.
  • Direct bare‑metal Kubernetes and OpenShift operations, including cluster administration, GPU quota governance, infrastructure‑as‑code adoption, and availability baseline enforcement.
  • Govern high‑performance network fabric operations – RoCE v2, spine‑leaf topology, lossless Ethernet tuning, congestion management, and segmentation.
  • Establish and enforce operational baselines across every layer of the stack – hardware, fabric, platform, and workload – with deviations detected, escalated, and resolved within defined SLAs.
  • Direct Innovation POD strategy to develop self‑healing and autonomous capabilities that proactively prevent service degradation before it impacts availability.

Operations and Reliability:
  • Build and sustain a high‑performing 24/7 operations model – designed for sustainable, predictable coverage with no mandatory overtime and measurable team health and retention.
  • Drive end‑to‑end observability across the physical and platform layers, with continuous feedback loops connecting monitoring data to incident response, change decisions, and improvement cycles.
  • Oversee change management so every modification is risk‑assessed, monitored during rollout, and baseline‑validated post‑deployment.
  • Ensure configuration consistency and drift detection across all platform components to prevent baseline degradation over time.
  • Lead GPU FinOps governance – utilization optimization, tenant quota enforcement, and cost reduction – in partnership with the Finance organization.

Security and Compliance:
  • Empower the Security SRE Lead to maintain a world‑class security posture across the infrastructure and platform layers, with robust compliance to frameworks including HIPAA and NIST AI RMF.
  • Govern access controls, audit logging, vulnerability management, and network segmentation across the AI compute environment.

Program Transition and Operating Model:
  • Lead the operational transition from program‑launch staffing to permanent CVS‑owned operations – governing phased handoffs, competency validation, and milestone sign‑offs to ensure minimal disruption to platform availability and business operations.
  • Establish and lead the long‑term operating model by institutionalizing key technical, architectural, and delivery leadership capabilities into permanent CVS roles, ensuring the organization is fully self‑sustaining at program close.

Vendor and Stakeholder Management:
  • Own vendor relationships, contract performance, and accountability across the hardware, networking, platform, and managed‑services stack.
  • Manage budget ownership for the AI infrastructure and platform engineering organization, including capital planning and operational expense governance.

Required Qualifications
  • 10+ years of engineering leadership experience, with substantial time directly owning physical infrastructure at data center scale – including hardware lifecycle, capacity planning, and facility coordination (power, cooling, rack‑and‑stack execution).
  • Hands‑on production ownership of bare‑metal Kubernetes or OpenShift. Managed cloud services (EKS, GKE, AKS) alone do not substitute for the practitioner expertise this role requires.
  • Fluency with high‑speed cluster fabrics – RoCE v2, InfiniBand, EVPN‑VXLAN, or carrier‑grade equivalent – and the operational discipline these fabrics require (PFC, ECN, lossless tuning, congestion management).
  • 5+ years leading multiple technical teams simultaneously, including 24/7 operations organizations, with measurable team health, retention, and performance outcomes.
  • Proven success establishing and enforcing operational baselines, SLO / SLI / error‑budget frameworks, and observability‑driven continuous improvement in physical‑infrastructure‑anchored environments.
  • Hardware lifecycle, vendor accountability, and facility coordination experience – including capacity planning, RMA management, and multi‑vendor escalation.
  • Experience leading operational transitions or organizational build‑outs at scale, with business continuity and minimal disruption as non‑negotiables.
  • Executive‑level stakeholder communication, vendor negotiation, and budget ownership.

Preferred Qualifications
  • Hands‑on experience with Cisco UCS, NVIDIA HGX / DGX / Blackwell systems, and VAST or comparable distributed NVMe storage.
  • Direct experience operating GPU clusters of 32 or more GPUs in production environments – including HPC, AI training, research computing, or comparable workloads.
  • NVIDIA AI Enterprise, NVIDIA Run:AI, NVIDIA Base Command Manager, or comparable GPU orchestration platform experience.
  • Healthcare or other regulated‑industry background (HIPAA, NIST AI RMF, SOX, FedRAMP, ITAR).
  • Chaos engineering and AI‑driven operations experience – predictive alerting and automated remediation patterns.
  • Background in innovation programs, POD structures, or centers of excellence.

Education
  • Required: Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or related technical field.

Pay Range

Typical pay range: $175,100.00 – $334,750.00

This pay range represents the base hourly rate or base annual full‑time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short‑term incentive program in addition to the base pay range listed above. This position also includes an award target in the company’s equity award program.

Benefits

We take pride in offering a comprehensive and competitive mix of pay and benefits that reflects our commitment to our colleagues and their families. This full‑time position is eligible for a comprehensive benefits package designed to support the physical, emotional, and financial well‑being of colleagues and their families. The benefits for this position include medical, dental, and vision coverage, paid time off, retirement savings options, wellness programs, and other resources, based on eligibility. Additional details about available benefits are provided during the application process and on Benefits Moments.

Equal Opportunity

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.

Vacancy posted 2 days ago
