Executive Director, AI Infrastructure & Platform Engineering
$175.1k - $334.75k
Hispanic Alliance for Career Enhancement
The Executive Director, AI Infrastructure & Platform Engineering is a senior engineering leadership role responsible for standing up, operating, and continuously improving CVS Health's on‑premises AI compute platform. This position owns the physical and platform layers of CVS's Enterprise AI Factory – a frontier‑class GPU compute environment running NVIDIA Blackwell systems across a high‑throughput RoCE v2 fabric, hosted in co‑located data center facilities, with multi‑site expansion underway. Reporting to the Global Head of Infrastructure/AI Operations and Service Delivery, this leader will establish operational baselines across the full infrastructure stack – hardware, network fabric, GPU clusters, storage, and the operating systems and orchestration layers above – and build the Site Reliability Engineering practice that delivers the availability, reliability, and performance that frontier AI workloads demand.

This is a greenfield organizational build. The Executive Director will define the operating model, set the engineering standards, hire and develop the team, and establish the long‑term operations capability that will govern CVS’s AI infrastructure for years ahead.

Key Responsibilities

Strategy and Leadership:
- Define and execute the long‑range vision and strategy for AI infrastructure and platform engineering, with availability (>99.99%), reliability, and platform performance as the primary measures of success.
- Recruit, hire, develop, and retain a high‑performing engineering organization spanning infrastructure, network, platform reliability, observability, security, 24/7 operations, change and release management, and FinOps.
- Establish clear ownership, accountability, and performance expectations across all functional teams; foster a culture of operational excellence, engineering rigor, and continuous improvement.
- Provide executive‑level communication to senior leadership on platform status, milestones, risk posture, and strategic initiatives.

Infrastructure and Platform Engineering:
- Own the physical layer of the AI compute environment – GPU compute, storage, network fabric, capacity planning, and hardware lifecycle accountability.
- Direct bare‑metal Kubernetes and OpenShift operations, including cluster administration, GPU quota governance, infrastructure‑as‑code adoption, and availability baseline enforcement.
- Govern high‑performance network fabric operations – RoCE v2, spine‑leaf topology, lossless Ethernet tuning, congestion management, and segmentation.
- Establish and enforce operational baselines across every layer of the stack – hardware, fabric, platform, and workload – with deviations detected, escalated, and resolved within defined SLAs.
- Direct Innovation POD strategy to develop self‑healing and autonomous capabilities that proactively prevent service degradation before it impacts availability.

Operations and Reliability:
- Build and sustain a high‑performing 24/7 operations model – designed for sustainable, predictable coverage with no mandatory overtime and measurable team health and retention.
- Drive end‑to‑end observability across the physical and platform layers, with continuous feedback loops connecting monitoring data to incident response, change decisions, and improvement cycles.
- Oversee change management so every modification is risk‑assessed, monitored during rollout, and baseline‑validated post‑deployment.
- Ensure configuration consistency and drift detection across all platform components to prevent baseline degradation over time.
- Lead GPU FinOps governance – utilization optimization, tenant quota enforcement, and cost reduction – in partnership with the Finance organization.

Security and Compliance:
- Empower the Security SRE Lead to maintain a world‑class security posture across the infrastructure and platform layers, with robust compliance to frameworks including HIPAA and NIST AI RMF.
- Govern access controls, audit logging, vulnerability management, and network segmentation across the AI compute environment.

Program Transition and Operating Model:
- Lead the operational transition from program‑launch staffing to permanent CVS‑owned operations – governing phased handoffs, competency validation, and milestone sign‑offs to ensure minimal disruption to platform availability and business operations.
- Establish and lead the long‑term operating model by institutionalizing key technical, architectural, and delivery leadership capabilities into permanent CVS roles, ensuring the organization is fully self‑sustaining at program close.

Vendor and Stakeholder Management:
- Own vendor relationships, contract performance, and accountability across the hardware, networking, platform, and managed‑services stack.
- Manage budget ownership for the AI infrastructure and platform engineering organization, including capital planning and operational expense governance.

Required Qualifications
- 10+ years of engineering leadership experience, with substantial time directly owning physical infrastructure at data center scale – including hardware lifecycle, capacity planning, and facility coordination (power, cooling, rack‑and‑stack execution).
- Hands‑on production ownership of bare‑metal Kubernetes or OpenShift. Managed cloud services (EKS, GKE, AKS) alone do not substitute for the practitioner expertise this role requires.
- Fluency with high‑speed cluster fabrics – RoCE v2, InfiniBand, EVPN‑VXLAN, or carrier‑grade equivalent – and the operational discipline these fabrics require (PFC, ECN, lossless tuning, congestion management).
- 5+ years leading multiple technical teams simultaneously, including 24/7 operations organizations, with measurable team health, retention, and performance outcomes.
- Proven success establishing and enforcing operational baselines, SLO / SLI / error‑budget frameworks, and observability‑driven continuous improvement in physical‑infrastructure‑anchored environments.
- Hardware lifecycle, vendor accountability, and facility coordination experience – including capacity planning, RMA management, and multi‑vendor escalation.
- Experience leading operational transitions or organizational build‑outs at scale, with business continuity and minimal disruption as non‑negotiables.
- Executive‑level stakeholder communication, vendor negotiation, and budget ownership.

Preferred Qualifications
- Hands‑on experience with Cisco UCS, NVIDIA HGX / DGX / Blackwell systems, and VAST or comparable distributed NVMe storage.
- Direct experience operating GPU clusters of 32 or more GPUs in production environments – including HPC, AI training, research computing, or comparable workloads.
- NVIDIA AI Enterprise, NVIDIA Run:AI, NVIDIA Base Command Manager, or comparable GPU orchestration platform experience.
- Healthcare or other regulated‑industry background (HIPAA, NIST AI RMF, SOX, FedRAMP, ITAR).
- Chaos engineering and AI‑driven operations experience – predictive alerting and automated remediation patterns.
- Background in innovation programs, POD structures, or centers of excellence.

Education
Required: Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or related technical field.

Pay Range
Typical pay range: $175,100.00 – $334,750.00

This pay range represents the base hourly rate or base annual full‑time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors.

This position is eligible for a CVS Health bonus, commission or short‑term incentive program in addition to the base pay range listed above. This position also includes an award target in the company’s equity award program.

Benefits
We take pride in offering a comprehensive and competitive mix of pay and benefits that reflects our commitment to our colleagues and their families. This full‑time position is eligible for a comprehensive benefits package designed to support the physical, emotional, and financial well‑being of colleagues and their families. The benefits for this position include medical, dental, and vision coverage, paid time off, retirement savings options, wellness programs, and other resources, based on eligibility. Additional details about available benefits are provided during the application process and on Benefits Moments.

Equal Opportunity
Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.

