Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Executive Director, AI Infrastructure & Platform Engineering

$175.1k - $334.75k

Hispanic Alliance for Career Enhancement

Executive Director, AI Infrastructure & Platform Engineering The Executive Director, AI Infrastructure & Platform Engineering is a senior engineering leadership role responsible for standing up, operating, and continuously improving CVS Health's on‑premises AI compute platform. This position owns the physical and platform layers of CVS's Enterprise AI Factory – a frontier‑class GPU compute environment running NVIDIA Blackwell systems across a high‑throughput RoCE v2 fabric, hosted in co‑located data center facilities, with multi‑site expansion underway. Reporting to the Global Head of Infrastructure/AI Operations and Service Delivery, this leader will establish operational baselines across the full infrastructure stack – hardware, network fabric, GPU clusters, storage, and the operating systems and orchestration layers above – and build the Site Reliability Engineering practice that delivers the availability, reliability, and performance that frontier AI workloads demand. This is a greenfield organizational build. The Executive Director will define the operating model, set the engineering standards, hire and develop the team, and establish the long‑term operations capability that will govern CVS’s AI infrastructure for years ahead. Key Responsibilities Strategy and Leadership: Define and execute the long‑range vision and strategy for AI infrastructure and platform engineering, with availability (>99.99%), reliability, and platform performance as the primary measures of success. Recruit, hire, develop, and retain a high‑performing engineering organization spanning infrastructure, network, platform reliability, observability, security, 24/7 operations, change and release management, and FinOps. Establish clear ownership, accountability, and performance expectations across all functional teams; foster a culture of operational excellence, engineering rigor, and continuous improvement. Provide executive‑level communication to senior leadership on platform status, milestones, risk posture, and strategic initiatives. Infrastructure and Platform Engineering: Own the physical layer of the AI compute environment – GPU compute, storage, network fabric, capacity planning, and hardware lifecycle accountability. Direct bare‑metal Kubernetes and OpenShift operations, including cluster administration, GPU quota governance, infrastructure‑as‑code adoption, and availability baseline enforcement. Govern high‑performance network fabric operations – RoCE v2, spine‑leaf topology, lossless Ethernet tuning, congestion management, and segmentation. Establish and enforce operational baselines across every layer of the stack – hardware, fabric, platform, and workload – with deviations detected, escalated, and resolved within defined SLAs. Direct Innovation POD strategy to develop self‑healing and autonomous capabilities that proactively prevent service degradation before it impacts availability. Operations and Reliability: Build and sustain a high‑performing 24/7 operations model – designed for sustainable, predictable coverage with no mandatory overtime and measurable team health and retention. Drive end‑to‑end observability across the physical and platform layers, with continuous feedback loops connecting monitoring data to incident response, change decisions, and improvement cycles. Oversee change management so every modification is risk‑assessed, monitored during rollout, and baseline‑validated post‑deployment. Ensure configuration consistency and drift detection across all platform components to prevent baseline degradation over time. Lead GPU FinOps governance – utilization optimization, tenant quota enforcement, and cost reduction – in partnership with the Finance organization. Security and Compliance: Empower the Security SRE Lead to maintain a world‑class security posture across the infrastructure and platform layers, with robust compliance to frameworks including HIPAA and NIST AI RMF. Govern access controls, audit logging, vulnerability management, and network segmentation across the AI compute environment. Program Transition and Operating Model: Lead the operational transition from program‑launch staffing to permanent CVS‑owned operations – governing phased handoffs, competency validation, and milestone sign‑offs to ensure minimal disruption to platform availability and business operations. Establish and lead the long‑term operating model by institutionalizing key technical, architectural, and delivery leadership capabilities into permanent CVS roles, ensuring the organization is fully self‑sustaining at program close. Vendor and Stakeholder Management: Own vendor relationships, contract performance, and accountability across the hardware, networking, platform, and managed‑services stack. Manage budget ownership for the AI infrastructure and platform engineering organization, including capital planning and operational expense governance. Required Qualifications 10+ years of engineering leadership experience, with substantial time directly owning physical infrastructure at data center scale – including hardware lifecycle, capacity planning, and facility coordination (power, cooling, rack‑and‑stack execution). Hands‑on production ownership of bare‑metal Kubernetes or OpenShift. Managed cloud services (EKS, GKE, AKS) alone do not substitute for the practitioner expertise this role requires. Fluency with high‑speed cluster fabrics – RoCE v2, InfiniBand, EVPN‑VXLAN, or carrier‑grade equivalent – and the operational discipline these fabrics require (PFC, ECN, lossless tuning, congestion management). 5+ years leading multiple technical teams simultaneously, including 24/7 operations organizations, with measurable team health, retention, and performance outcomes. Proven success establishing and enforcing operational baselines, SLO / SLI / error‑budget frameworks, and observability‑driven continuous improvement in physical‑infrastructure‑anchored environments. Hardware lifecycle, vendor accountability, and facility coordination experience – including capacity planning, RMA management, and multi‑vendor escalation. Experience leading operational transitions or organizational build‑outs at scale, with business continuity and minimal disruption as non‑negotiables. Executive‑level stakeholder communication, vendor negotiation, and budget ownership. Preferred Qualifications Hands‑on experience with Cisco UCS, NVIDIA HGX / DGX / Blackwell systems, and VAST or comparable distributed NVMe storage. Direct experience operating GPU clusters of 32 or more GPUs in production environments – including HPC, AI training, research computing, or comparable workloads. NVIDIA AI Enterprise, NVIDIA Run:AI, NVIDIA Base Command Manager, or comparable GPU orchestration platform experience. Healthcare or other regulated‑industry background (HIPAA, NIST AI RMF, SOX, FedRAMP, ITAR). Chaos engineering and AI‑driven operations experience – predictive alerting and automated remediation patterns. Background in innovation programs, POD structures, or centers of excellence. Education Required: Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or related technical field. Pay Range Typical pay range: $175,100.00 – $334,750.00 This pay range represents the base hourly rate or base annual full‑time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short‑term incentive program in addition to the base pay range listed above. This position also includes an award target in the company’s equity award program. Benefits We take pride in offering a comprehensive and competitive mix of pay and benefits that reflects our commitment to our colleagues and their families. This full‑time position is eligible for a comprehensive benefits package designed to support the physical, emotional, and financial well‑being of colleagues and their families. The benefits for this position include medical, dental, and vision coverage, paid time off, retirement savings options, wellness programs, and other resources, based on eligibility. Additional details about available benefits are provided during the application process and on Benefits Moments. Equal Opportunity Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws. #J-18808-Ljbffr Hispanic Alliance for Career Enhancement

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Executive Director, AI Infrastructure & Platform Engineering in Annapolis, MD vacancy
  • $295k - $345k

     ...network for trusted trade. Our AI-powered product network...  ...at Altana As the Senior Director of Platform Engineering, you will own the foundation...  ...builds on: the cloud infrastructure that runs our products and...  ...Responsibilities Technical Strategy & Execution Define and own a clear,... 
    Suggested
    Full time
    Temporary work
    Work experience placement
    Local area
    Immediate start
    Remote work
    Flexible hours

    AlleyCorp

    Annapolis, MD
    4 days ago
  • A leading data analytics company is seeking a Director of AI Engineering to spearhead AI initiatives and develop AI agents to enhance enterprise product offerings. The role involves establishing best practices, driving architecture standards, and leading a team to innovate... 
    Suggested

    Teradata Corporation (SE)

    Annapolis, MD
    3 days ago
  • $60 per hour

    A leading technology company in the US seeks proficient programmers to contribute to cutting-edge AI development. This remote role offers a flexible schedule, allowing you to choose projects and work hours. Ideal candidates should have fluency in English and preferred... 
    Suggested
    Remote job
    Flexible hours

    DataAnnotation

    Annapolis, MD
    17 hours ago
  • $142.6k - $261.5k

     ...opportunity As a Deployment Manager, you will primarily focus on executing large SAP implementation procedures. You will collaborate with...  ...planet, while building trust in capital markets. Enabled by data, AI and advanced technology, EY teams help clients shape the future... 
    Suggested
    Summer holiday
    Flexible hours

    Ernst & Young Oman

    Annapolis, MD
    2 days ago
  • $145k - $205k

     ...Innovation & Technology (AI&T) teams harness the...  ...Plan, lead, and execute high-impact offensive...  ...Information Security, Engineering, or IT required while...  ...domains (e.g. enterprise infrastructure and services, cloud, identity...  ..., serverless platforms, and CI/CD pipelines... 
    Suggested
    Work experience placement
    Shift work

    Edwards Lifesciences Belgium

    Annapolis, MD
    2 days ago
  •  ...software reliability and performance. The ideal candidate will utilize AI technologies to proactively mitigate operational risks and ensure system health. This position involves collaboration with engineers and compliance teams, offering a dynamic environment for... 

    Teradata Corporation (SE)

    Annapolis, MD
    1 day ago
  • Missionforce is seeking a Vice President of Product Management to lead the product vision and roadmap for Agentforce applications in the public sector. This role requires extensive experience in government markets and the ability to drive innovation across defense and local...
    Local area

    B Capital

    Annapolis, MD
    1 day ago
  • An environmental research organization located near Annapolis, MD, is seeking an experienced Executive Director to provide leadership and support to the team. The candidate should have a graduate degree, at least 10 years of relevant experience, and substantial knowledge... 
    Part time

    Cedarfield - Pinnacle Living

    Annapolis, MD
    4 days ago
  • Karsun Solutions, LLC is seeking a highly skilled Sr. AI Data Engineer based in Maryland to build scalable data platforms and integrate Generative AI into workflows. The role requires expertise in Databricks, where you will operationalize MLOps and enhance data quality... 

    Karsun Solutions, LLC

    Annapolis, MD
    2 days ago
  • $92k - $123k

     ...Function/Branch: Engineering / Digital Enterprise Directorate: Engineering Tech...  ...ground test data infrastructure, facility operations...  ...team projects are executed safely, on schedule...  ..., data engineers, AI engineers, project...  ...architectures, and data platforms. Prior... 
    Full time
    For contractors
    Work experience placement

    Bnh Jv

    Arnold, MD
    4 days ago
  • Operations DevOps / Platform Engineer This position is onsite at Joint Base Andrews in Maryland and requires an active Secret clearance to start. As the Operations DevOps / Platform Engineer, you will work alongside a dedicated group of professionals to design, implement... 
    Work experience placement

    Leidos

    Annapolis, MD
    2 days ago
  • Position Description for Executive Director of the Chesapeake Research Consortium Although this position will remain open until filled,please...  ...for emerging environmental professionals, (3) building platforms for sharing knowledge and developing solutions, and (4) supporting... 
    Full time
    Part time
    Interim role
    Internship
    Work at office
    Local area
    Immediate start
    Remote work

    Cedarfield - Pinnacle Living

    Annapolis, MD
    4 days ago
  • $118.45k - $236.9k

     ...operational teams, client success executives, and client support...  ...role will report to our Lead Director of Day of Service (DOS) Execution...  ...Responsibilities Architect Signify’s AI Outreach and Rescheduling...  ...complex logic for AI-driven engines. Has e a track record of... 
    Hourly pay
    Full time
    Temporary work
    Local area
    Flexible hours

    CVSHealth

    Annapolis, MD
    4 days ago
  • $83.43k - $222.48k

     ...environment, focusing on the platform capabilities that make data discoverable...  ...teams including data engineering, software engineering,...  ...Proven experience defining and executing a data governance strategy, from...  ...transformation tooling. AI/ML experience, experience leveraging... 
    Hourly pay
    Full time
    Temporary work
    Local area

    Hispanic Alliance for Career Enhancement

    Annapolis, MD
    1 day ago
  •  ...looking for a DevSecOps Compliance Engineer. This role involves implementing automated compliance platforms in software development...  ...experience with CI/CD platforms, Infrastructure as Code tools, and knowledge...  ...with various security tools. #J-18808-Ljbffr Bigbear.ai

    Bigbear.ai

    Annapolis, MD
    2 days ago
  •  ...Description & Requirements Maximus is currently seeking a Cloud Platform Engineer. This is a remote position. Maximus is a trusted...  ...enterprise or federal settings. - Proven experience with Infrastructure as Code (e.g., ARM templates, Bicep, Terraform) for... 
    Minimum wage
    Full time
    Contract work
    Temporary work
    Work experience placement
    Remote work

    Maximus

    Annapolis, MD
    3 days ago
  •  ...and Responsibilities: - Provide Tier‑3 engineering support for Microsoft 365 GCC, Exchange...  ...SharePoint Online environments, ensuring platform availability, performance, and security....  ...messaging services. - Plan, test, execute, and support upgrades, patches, and migrations... 
    Minimum wage
    Full time
    Contract work
    Temporary work
    Work experience placement

    Maximus

    Annapolis, MD
    4 days ago
  • $86.9k - $198k

    Booz Allen Hamilton is looking for a skilled Platform Engineer to join their team in Annapolis, Maryland. In this role, you will create innovative solutions to complex problems, leveraging your expertise in technologies like Kubernetes and Docker. Candidates must have a... 
    Remote job

    Booz Allen Hamilton

    Annapolis, MD
    4 days ago
  •  ...Autonomous Knowledge Platform activates enterprise intelligence...  ...business value with AI. What You’ll Do As...  ...-to-end operational engine that powers Teradata’s...  ...into repeatable execution, navigate the technical...  ...marketplace and co-sell infrastructure — including SKU/listing... 
    Permanent employment
    Contract work
    Flexible hours

    Teradata Corporation (SE)

    Annapolis, MD
    1 day ago
  • Government Employees Insurance Company is looking for a Staff Engineer specializing in Platform Security Engineering focused on Encryption and Tokenization. This role is essential for implementing a secure data protection platform, ensuring sensitive data safety throughout... 

    Government Employees Insurance Company

    Annapolis, MD
    3 days ago
  • $115k - $133k

    TekSynap is seeking a Linux OS Engineer to join our team at Annapolis Junction, MD. The ideal candidate will engineer and administer RHEL and Debian systems, ensuring maximum reliability and security compliance. You will collaborate across teams to implement automation... 

    TekSynap

    Annapolis, MD
    17 hours ago
  •  ...experience in software development/engineering, including requirements...  ...debugging Proficiency in Infrastructure as Code (IaC) tools like Salt...  ..., integration/leveraging of platforms and prototypes, architecting...  ...Familiarity with using AI to aid in debugging and development... 

    Peraton

    Annapolis, MD
    17 hours ago
  • Peraton is looking for a Cloud Software Engineer to join our Cyber Intelligence team in Maryland. This role involves developing cutting-edge...  ...cloud solutions supporting national security, utilizing AWS and AI/ML technologies. Ideal candidates should possess strong Java... 

    Peraton

    Annapolis, MD
    4 days ago
  •  ...HS+16 years of experience Expert knowledge of big data platforms Experience with cloud infrastructure (AWS, Azure) Understanding of distributed systems...  ...Department of Defense Cyber Defense Command (DCDC) with DCO AI Cyber Security Support. Location: Fort Meade, MD. In... 

    Peraton

    Annapolis, MD
    17 hours ago
  • $165.56k - $188.09k

    Under Armour, Inc. is seeking a Sr. Data Analytics Engineer to provide quality data modeling and visualization solutions across business...  ...00% remote work, and includes involvement in a modernized data platform aimed at enhancing business decision-making. A competitive... 
    Remote job

    Under Armour, Inc.

    Annapolis, MD
    2 days ago
  • $72 - $80 per hour

     ...commercial enterprises, helping them to feel confident in their IT infrastructure. With DCCA, these organizations can be confident in the...  ...solve today’s IT problems. Equal Opportunity Employer including Disability/Vets #J-18808-Ljbffr International Executive Service Corps
    Part time
    Flexible hours

    International Executive Service Corps

    Annapolis, MD
    3 days ago
  • $50k - $120k

    (Hiring) Information System Security Officer $50,000-$120,000 + Benefits We are seeking an Information System Security Officer to join our team! You will implement security measures for the protection of computer networks and information. Responsibilities Implement and...

    Viper Staffing Services L.L.C.

    Annapolis, MD
    17 hours ago
  • Cybersecurity Information System Security Officer (ISSO) job at Link Solutions, Inc. Aberdeen Proving Ground, MD. Link Solutions is seeking a Cybersecurity Information System Security Officer (ISSO) to join a team of dedicated professionals at an industry-leading organization...
    Temporary work
    Work at office
    Relocation package

    Payfuture Technologies

    Annapolis, MD
    2 days ago
  •  ...certification you need to propel you to the next level. Work/Life Balance: A healthy work/life balance is essential for building and executing your work effectively at ProSync, but it’s also necessary to allow you the room to pursue everything else you want to develop in... 
    Flexible hours

    Prosync

    Annapolis, MD
    2 days ago
  • $200k - $280k

    GliaCell Technologies is seeking a Senior DevOps Engineer / Team Lead to support a U.S. Government customer in Annapolis Junction, MD. This full-time position involves architecting AWS infrastructure, developing infrastructure as code, and managing CI/CD pipelines while... 
    Full time

    GliaCell Technologies

    Annapolis, MD
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Executive Director, AI Infrastructure & Platform Engineering. Be the first to apply!