Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Executive Director, AI Infrastructure & Platform Engineering

$175.1k - $334.75k

Hispanic Alliance for Career Enhancement

Executive Director, AI Infrastructure & Platform Engineering The Executive Director, AI Infrastructure & Platform Engineering is a senior engineering leadership role responsible for standing up, operating, and continuously improving CVS Health's on‑premises AI compute platform. This position owns the physical and platform layers of CVS's Enterprise AI Factory – a frontier‑class GPU compute environment running NVIDIA Blackwell systems across a high‑throughput RoCE v2 fabric, hosted in co‑located data center facilities, with multi‑site expansion underway. Reporting to the Global Head of Infrastructure/AI Operations and Service Delivery, this leader will establish operational baselines across the full infrastructure stack – hardware, network fabric, GPU clusters, storage, and the operating systems and orchestration layers above – and build the Site Reliability Engineering practice that delivers the availability, reliability, and performance that frontier AI workloads demand. This is a greenfield organizational build. The Executive Director will define the operating model, set the engineering standards, hire and develop the team, and establish the long‑term operations capability that will govern CVS’s AI infrastructure for years ahead. Key Responsibilities Strategy and Leadership: Define and execute the long‑range vision and strategy for AI infrastructure and platform engineering, with availability (>99.99%), reliability, and platform performance as the primary measures of success. Recruit, hire, develop, and retain a high‑performing engineering organization spanning infrastructure, network, platform reliability, observability, security, 24/7 operations, change and release management, and FinOps. Establish clear ownership, accountability, and performance expectations across all functional teams; foster a culture of operational excellence, engineering rigor, and continuous improvement. Provide executive‑level communication to senior leadership on platform status, milestones, risk posture, and strategic initiatives. Infrastructure and Platform Engineering: Own the physical layer of the AI compute environment – GPU compute, storage, network fabric, capacity planning, and hardware lifecycle accountability. Direct bare‑metal Kubernetes and OpenShift operations, including cluster administration, GPU quota governance, infrastructure‑as‑code adoption, and availability baseline enforcement. Govern high‑performance network fabric operations – RoCE v2, spine‑leaf topology, lossless Ethernet tuning, congestion management, and segmentation. Establish and enforce operational baselines across every layer of the stack – hardware, fabric, platform, and workload – with deviations detected, escalated, and resolved within defined SLAs. Direct Innovation POD strategy to develop self‑healing and autonomous capabilities that proactively prevent service degradation before it impacts availability. Operations and Reliability: Build and sustain a high‑performing 24/7 operations model – designed for sustainable, predictable coverage with no mandatory overtime and measurable team health and retention. Drive end‑to‑end observability across the physical and platform layers, with continuous feedback loops connecting monitoring data to incident response, change decisions, and improvement cycles. Oversee change management so every modification is risk‑assessed, monitored during rollout, and baseline‑validated post‑deployment. Ensure configuration consistency and drift detection across all platform components to prevent baseline degradation over time. Lead GPU FinOps governance – utilization optimization, tenant quota enforcement, and cost reduction – in partnership with the Finance organization. Security and Compliance: Empower the Security SRE Lead to maintain a world‑class security posture across the infrastructure and platform layers, with robust compliance to frameworks including HIPAA and NIST AI RMF. Govern access controls, audit logging, vulnerability management, and network segmentation across the AI compute environment. Program Transition and Operating Model: Lead the operational transition from program‑launch staffing to permanent CVS‑owned operations – governing phased handoffs, competency validation, and milestone sign‑offs to ensure minimal disruption to platform availability and business operations. Establish and lead the long‑term operating model by institutionalizing key technical, architectural, and delivery leadership capabilities into permanent CVS roles, ensuring the organization is fully self‑sustaining at program close. Vendor and Stakeholder Management: Own vendor relationships, contract performance, and accountability across the hardware, networking, platform, and managed‑services stack. Manage budget ownership for the AI infrastructure and platform engineering organization, including capital planning and operational expense governance. Required Qualifications 10+ years of engineering leadership experience, with substantial time directly owning physical infrastructure at data center scale – including hardware lifecycle, capacity planning, and facility coordination (power, cooling, rack‑and‑stack execution). Hands‑on production ownership of bare‑metal Kubernetes or OpenShift. Managed cloud services (EKS, GKE, AKS) alone do not substitute for the practitioner expertise this role requires. Fluency with high‑speed cluster fabrics – RoCE v2, InfiniBand, EVPN‑VXLAN, or carrier‑grade equivalent – and the operational discipline these fabrics require (PFC, ECN, lossless tuning, congestion management). 5+ years leading multiple technical teams simultaneously, including 24/7 operations organizations, with measurable team health, retention, and performance outcomes. Proven success establishing and enforcing operational baselines, SLO / SLI / error‑budget frameworks, and observability‑driven continuous improvement in physical‑infrastructure‑anchored environments. Hardware lifecycle, vendor accountability, and facility coordination experience – including capacity planning, RMA management, and multi‑vendor escalation. Experience leading operational transitions or organizational build‑outs at scale, with business continuity and minimal disruption as non‑negotiables. Executive‑level stakeholder communication, vendor negotiation, and budget ownership. Preferred Qualifications Hands‑on experience with Cisco UCS, NVIDIA HGX / DGX / Blackwell systems, and VAST or comparable distributed NVMe storage. Direct experience operating GPU clusters of 32 or more GPUs in production environments – including HPC, AI training, research computing, or comparable workloads. NVIDIA AI Enterprise, NVIDIA Run:AI, NVIDIA Base Command Manager, or comparable GPU orchestration platform experience. Healthcare or other regulated‑industry background (HIPAA, NIST AI RMF, SOX, FedRAMP, ITAR). Chaos engineering and AI‑driven operations experience – predictive alerting and automated remediation patterns. Background in innovation programs, POD structures, or centers of excellence. Education Required: Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or related technical field. Pay Range Typical pay range: $175,100.00 – $334,750.00 This pay range represents the base hourly rate or base annual full‑time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short‑term incentive program in addition to the base pay range listed above. This position also includes an award target in the company’s equity award program. Benefits We take pride in offering a comprehensive and competitive mix of pay and benefits that reflects our commitment to our colleagues and their families. This full‑time position is eligible for a comprehensive benefits package designed to support the physical, emotional, and financial well‑being of colleagues and their families. The benefits for this position include medical, dental, and vision coverage, paid time off, retirement savings options, wellness programs, and other resources, based on eligibility. Additional details about available benefits are provided during the application process and on Benefits Moments. Equal Opportunity Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws. #J-18808-Ljbffr Hispanic Alliance for Career Enhancement

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Executive Director, AI Infrastructure & Platform Engineering in Florida, NY vacancy
  • $60.5k - $86.1k

     ...is expanding and we're looking for a **Platform Software Engineer** to join our team! Our office is...  ...drive them to completion.* Understand and execute test strategies to deliver quality software...  ..., land, sea, space, and cyber. From AI-powered drones and loitering munitions... 
    Suggested
    Permanent employment
    Contract work
    Work experience placement
    Work at office

    AeroVironment, Inc.

    Florida, NY
    3 days ago
  • The Hispanic Alliance for Career Enhancement is looking for an Executive Director for AI Infrastructure & Platform Engineering. This role focuses on leading the AI compute platform, establishing operational standards, and recruiting a high-performing team. Your success... 
    Suggested

    Hispanic Alliance for Career Enhancement

    Florida, NY
    4 days ago
  • Mercor is seeking a Software Engineer to build an AI-native platform that replaces traditional operations with real-time dashboards. The ideal candidate will have strong practical software engineering skills, experience with SaaS products, and comfort with integrations.... 
    Suggested
    Remote job

    Mercor

    Florida, NY
    3 days ago
  • $65k - $150k

    Ascensus is seeking candidates for multiple positions on their AI Technology team. The roles involve troubleshooting AI systems, enhancing performance, and training users on technology. Ideal applicants should possess a bachelor’s degree in Computer Science or a related... 
    Suggested
    Remote job

    Ascensus

    Florida, NY
    3 days ago
  •  ...Principal Data & Smart Grid Platform Engineer Date: May 14, 2026 Company:...  ...Grid and Advanced Metering Infrastructure (AMI) systems. This role is...  ...excellence. Key Responsibilities AI first engineer and...  ...technology and patterns to execute on work as well as product solutions... 
    Suggested
    Full time
    For contractors
    Local area
    Relocation

    NextEra Energy Resources

    Florida, NY
    2 days ago
  •  ...Success Managers, Technical Engineering teams, and vendor partners to...  ...teams, leveraging automation/AI tools, and building a culture...  ...technical teams, and CIO‑level executives. Demonstrated ability to lead...  .... Proven success using ITSM platforms (ConnectWise preferred; others... 
    For contractors
    Currently hiring
    Work at office
    Remote work
    Work visa
    All shifts
    Shift work
    Night shift
    Weekend work

    SUCRE

    Florida, NY
    1 day ago
  •  ...handle time, and abandoned rate. Leverage AI tools to automate service desk tasks and...  ...effective resolution of IT issues. Maintain and execute performance management standards in...  ...and maintain security and social engineering protocols to protect the organization. Develop... 
    Work at office
    Local area

    LAM Lennar Associates Mgmt LLC

    Florida, NY
    3 days ago
  • $50 - $100 per hour

    DataAnnotation is seeking a DevOps Engineer to advance AI development in New York. You will evaluate AI outputs and give coding challenges to chatbots while working from home on your schedule. This independent contractor position offers hourly payment between $50-$100+... 
    Remote job
    Hourly pay
    For contractors
    Work from home
    Flexible hours

    DataAnnotation

    Florida, NY
    4 days ago
  •  ...'s investible assets. Every day, our teams harness cutting‑edge AI and breakthrough technologies to collaborate with clients, driving...  ...based function into a scalable, automated, product centric platform that enables technology and product owners to onboard, self‑manage... 
    Work experience placement
    Worldwide
    Flexible hours

    BNY Mellon

    Florida, NY
    2 days ago
  •  ...seeking a Software Asset Management Lead to enhance IT Asset Management through AI and automation. The role involves transforming the Software Asset Management function into a scalable platform, improving data quality, and facilitating product management. Candidates... 

    BNY Mellon

    Florida, NY
    3 days ago
  •  ...sites or provide remote access to support in‑person visits. Teach hands‑on applications training for technologists. Manage use of Subtle AI across fixed locations and partner with Subtle applications team to ensure higher productivity. Work with local site administrators... 
    Work at office
    Local area
    Remote work

    Akumin

    Florida, NY
    4 days ago
  • $118.8k - $178.2k

     ...for an experienced DevOps Engineering Manager to lead the design,...  ...maintenance of Green Dot’s infrastructure. The role is structured hybrid...  ...cloud infrastructure, and AI‑driven tools while...  ...reliable deployments. Plan and execute platform migrations and reliability... 
    Remote work
    3 days per week

    Green Dot Corporation

    Florida, NY
    4 days ago
  •  ...AssistRx , we combine human insight and AI-powered technology to simplify patient access...  ...healthcare outcomes. As a Salesforce Engineering Manager, you’ll help shape the future of...  ...engagement, and business operations platforms. Key Responsibilities: Salesforce Engineering... 
    Temporary work
    Local area
    Immediate start

    AssistRx

    Florida, NY
    10 hours ago
  • $129.5k - $186.1k

     ...Management, Coaching and Development, Execution Excellence responsibilities...  ..., code reviews, and helping engineers optimize their code....  ...optimization of our modern data platform. Provide technical leadership...  .... Lead by example in AI augmented development: use AI... 

    UKG

    Florida, NY
    2 days ago
  • $163.4k - $219.1k

     ...warehouse management and execution systems that move all...  ...the intersection of engineering leadership, operational...  ...and execution platforms connected to them. Lead...  ...workstreams. Champion AI tool adoption across the...  ...and risks clearly to Director‑ and VP‑level stakeholders... 
    Contract work
    For contractors
    Work experience placement
    Worldwide

    Disney Cruise Line

    Florida, NY
    10 hours ago
  • The Johnson is seeking a Head of Organic Growth (SEO / CRO / AI Search) to lead strategic initiatives and drive organic performance...  ...track record in using tools like Google Analytics 4 and A/B testing platforms. Key responsibilities include overseeing the organic growth... 
    Work from home

    The Johnson

    Florida, NY
    3 days ago
  •  ...SEO, CRO, and emerging AI/Generative search (GEO)...  ...AI Search). This is a director‑level role that owns the...  ...enterprise A/B testing platform (VWO, Optimizely, or Convert...  ...Development Team on execution, and personally own the...  ...generative answer engines (e.g., Profound, Peec,... 
    Permanent employment
    Full time
    Summer work
    Work at office
    Local area
    Work from home
    Shift work

    The Johnson

    Florida, NY
    3 days ago
  •  ...compliance, Agile delivery, and enterprise platform transformation. Strong background in stakeholder management, risk mitigation, executive reporting, roadmap execution, and delivery...  ...SharePoint Database Management Generative AI Agentic AI and workflows leverage AI/ML... 

    Virtusa

    Florida, NY
    10 hours ago
  •  ...Director of Software Engineering & Architecture Position Summary :...  ...rapidly growing data platform to improve patient outcomes...  ..., and increase AI adoption, they are...  ...comfortably between executive discussions and hands...  ...applications, infrastructure, and cloud environments... 

    Ranger Technical Resources

    Florida, NY
    2 days ago
  • Genpact LLC is seeking a Business Development Executive located in New York to grow the business by pursuing new clients and managing the...  ...analytics, and client relations. Join Genpact to be part of a team driving AI-powered transformation. #J-18808-Ljbffr Genpact LLC
    Contract work

    Genpact LLC

    Florida, NY
    4 days ago
  • $86.8k - $198k

     ...with us to transform the future of digital platforms.Join us. The world can’t wait.**You Have...  ...* Bachelor's degree in CS, Computer Engineering, Mathematics, Statistics, or Engineering...  ...identity and prevent fraud.**Candidate AI Usage Policy**AI is a part of our daily... 
    Full time
    Contract work
    Part time
    Work at office
    Remote work

    Booz Allen Hamilton

    Florida, NY
    10 hours ago
  • $118.8k - $178.2k

    Green Dot Corporation is seeking an experienced DevOps Engineering Manager to lead the design and maintenance of their infrastructure. This structured hybrid role requires a minimum of three days onsite in Los Angeles, CA. The ideal candidate will have a Bachelor's degree... 

    Green Dot Corporation

    Florida, NY
    4 days ago
  •  ...Inc is seeking a President for FounderHut to oversee day-to-day operations and manage its P&L, KPIs, and team dynamics as it scales AI-powered fundraising. The ideal candidate will be a senior operator with experience in high-growth startups, capable of leading teams... 

    Cyber Technologies, Inc

    Florida, NY
    3 days ago
  •  ...pricing models, negotiating agreements, and providing insights into purchasing behaviors. This position is vital for shaping the commercial strategy of a leading AI research lab in New York, offering opportunities to influence key market dynamics. #J-18808-Ljbffr Mercor

    Mercor

    Florida, NY
    1 day ago
  •  ...ensuring work entering execution is well‑defined and...  ...resolution paths for engineering and product leadership...  ...variability over time. Leverage AI tooling to accelerate...  ...QE, architecture, and platform teams to manage cross‑...  ...pipelines, deployment infrastructure, or cloud services... 
    Remote job
    Currently hiring
    Local area
    Work from home
    Shift work

    Prog Leasing LLC

    Florida, NY
    2 days ago
  • 3Core Systems Inc. is seeking a hands-on Scrum Master to lead Agile delivery for an AI Transformation Program. Located in Lake Mary, FL, this role requires strong communication and organizational skills to manage risks and facilitate Scrum ceremonies. The ideal candidate... 

    3Core Systems Inc.

    Florida, NY
    2 days ago
  • $152k - $222k

     ...Experience with cloud engineering, on-premise engineering...  ..., or containerization platforms. Experience engaging...  ...technical stakeholders or executive leaders. Experience...  ...of the following: Infrastructure Modernization, Application...  ...Data Analytics, Cloud AI, Networking,... 
    Remote work

    Google

    Florida, NY
    10 hours ago
  •  ...Hands‑on Scrum Master role Not a Project Manager position Must personally lead Scrum ceremonies AI‑focused program with active AI initiatives Drives Agile execution and removes delivery roadblocks POSITION OVERVIEW Seeking a hands‑on Scrum Master to support an AI... 
    Long term contract
    Contract work
    Local area

    3Core Systems Inc.

    Florida, NY
    2 days ago
  • $130k - $160k

     ...Disney Company is inviting applications for a Senior Software Engineer to join their Payments and Accounting Technology team. This role...  ...solutions integrated with core financial operations across platforms. The ideal candidate will possess strong software engineering... 

    Disney Cruise Line - The Walt Disney Company

    Florida, NY
    3 days ago
  • Greenberg Traurig, LLP is seeking an Innovation Manager, Applied AI to advance the firm's use of generative AI in legal practice. This role involves collaborating with attorneys to enhance workflows and improve engagement with AI-enabled tools. The ideal candidate will... 

    Greenberg Traurig, LLP

    Florida, NY
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Executive Director, AI Infrastructure & Platform Engineering. Be the first to apply!