Director - Hyperscale, HPC & Sovereign AI Deployment and Fleet Operations

Advanced Micro Devices

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

THE ROLE:

As the Director of Cloud, HPC & Sovereign AI Customer Engineering within the Compute & Enterprise AI Solutions Customer Engineering organization, you will lead the team responsible for enabling successful deployment, adoption, and lifecycle support of AMD compute and AI solutions across Cloud, HPC, and Sovereign AI customers. This highly visible leadership role oversees a team of Customer Program Managers (CPMs) responsible for guiding customers through new product introductions, deployment readiness, production ramp, and long‑term fleet sustainment. Working closely with customers, Customer Platform Engineering, Product Management, Engineering and Architecture you will drive successful customer outcomes across the full lifecycle of AMD compute and AI platforms.

THE PERSON:

The ideal candidate is a strong technical and organizational leader with experience supporting large‑scale cloud, AI, HPC, or datacenter deployments. You have a proven track record of leading customer‑facing technical teams, managing complex customer engagements, and driving successful deployment and sustainment of infrastructure at scale. You are equally comfortable engaging with customer executives, architects, operations teams, and engineering organizations while driving alignment across AMD. You possess strong technical credibility, customer advocacy skills, and the ability to lead through influence in complex environments.

KEY RESPONSIBILITIES:

Lead and scale the Cloud, HPC & Sovereign AI Customer Engineering organization supporting strategic customer deployments and lifecycle management. Lead a team of Customer Program Managers responsible for customer engagement, deployment readiness, new product introduction (NPI) execution, production ramp, and fleet sustainment activities. Drive successful deployment, adoption, and operational readiness of AMD compute and AI solutions across Cloud, HPC, and Sovereign AI customers. Serve as the executive escalation point for strategic customer issues and drive resolution of complex deployment, platform, performance, and operational challenges. Partner closely with Customer Platform Engineering teams, including PAE, BAE, Security Engineering, and Debug Engineering, to ensure successful customer outcomes. Develop deployment methodologies, operational best practices, and customer engagement frameworks that accelerate customer time‑to‑production. Drive fleet sustainment strategies including observability, telemetry, remote diagnostics, lifecycle management, and operational readiness. Partner with customers, and AMD engineering teams to support successful platform deployment and long‑term fleet success. Act as the voice of the customer, ensuring customer deployment experiences and operational insights influence future products, platforms, and solutions. Mentor and develop CPM leaders while building a high‑performance, customer‑focused culture.

PREFERRED EXPERIENCE:

Experience leading customer‑facing engineering, technical program management, cloud infrastructure, AI infrastructure, HPC, datacenter operations, or related technical organizations. Experience leading technical teams responsible for customer deployments, operational readiness, and lifecycle support of complex infrastructure platforms. Experience supporting large‑scale cloud, AI, HPC, or enterprise infrastructure deployments from new product introduction through fleet sustainment. Strong understanding of datacenter infrastructure, compute platforms, AI systems, observability, telemetry, and operational readiness. Experience managing customer escalations and driving resolution of complex technical and operational issues. Experience working with hyperscalers, cloud providers, HPC customers, sovereign AI customers. Proven ability to drive alignment across engineering, product, operations, and customer‑facing organizations. Strong communication and executive engagement skills.

ACADEMIC CREDENTIALS:

Bachelor's or Master's degree in Engineering, Computer Science, or a related technical field. This role is not eligible for visa sponsorship. Benefits offered are described: AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee‑based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third‑party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process. AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here. This posting is for an existing vacancy. #J-18808-Ljbffr Advanced Micro Devices

Apply

Vacancy posted 3 days ago

Similar jobs that could be interesting for youBased on the Director - Hyperscale, HPC & Sovereign AI Deployment and Fleet Operations in Santa Clara, CA vacancy

Lead Technical Product Manager
$171k - $232k
...WeRide.ai is looking for a lead Technical Product Manager (PMT... ...and its global commercial deployment. This role reports directly to... ...customer desires, business goals, operational constraints, regulatory... ...strategies for new ODD expansion and fleet scale-up, balance user...
Operations
Fleet
Odd job
Temporary work
WeRide.ai
San Jose, CA
1 day ago
Program Manager III, NPI Technical Operations, Cloud Supply Chain
$159k - $231k
...Experience in the data center operations or technology industry.... ...on progress and deadlines. Fleet Transition Management (FTM) team... ...You will focus on optimizing deployment and orderability infrastructure... ...to team workflows, advocating AI adoption, and maintaining operational...
Operations
Fleet
Full time
PMs for Hire
Sunnyvale, CA
3 days ago
Staff Hyperscale Product Manager - 2248
$144.33k - $240.55k
...with sales, engineering, operations, and management to drive product... ...deep engagement with hyperscale customers (cloud, AI, and large-scale data center... ..., qualification, and deployment at scale Define and execute... ..., deployment models, and fleet-level optimization (performance...
Operations
Fleet
Local area
Worldwide
Flexible hours
Kioxia
San Jose, CA
27 days ago
Senior Technical Program Manager
$168k - $258.75k
...Manager with a strong focus on operational rigor and cross-functional... ..., you will lead the DGX Cloud Fleet Configuration initiative. You... ...impact on the world’s leading AI infrastructure! What You'll Be... ...compliance in large-scale cloud or HPC environments. Hands-on...
Operations
Fleet
Full time
NVIDIA
Santa Clara, CA
13 hours ago
Senior Technical Program Manager
$180k - $220k
...Our technology enables a single operator to supervise and control... ...announced its merger with Havoc AI, a fast-growing defense technology... ...developing coordinated fleets of autonomous maritime vessels... ...are ready for production and deployment. Partner with engineering managers...
Operations
Fleet
Full time
Teleo
Palo Alto, CA
3 days ago
Senior Staff Technical Program Manager, NPI
$216.15k - $262k
...the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack — from... ...the Role NVIDIA Vera Rubin deployments begin in early 2027. We are... ...firmware versioning matters for fleet reliability at scale. Networking...
Operations
Fleet
Temporary work
Crusoe
Sunnyvale, CA
15 days ago
Senior Technical Product Manager DGX Enterprise Infrastructure and Cloud-Native Operations
$208k - $327.75k
...Product Manager For Enterprise Ai NVIDIA is seeking a world-... ...Manager to architect for the operational future of Enterprise AI. While... ...most sophisticated companies deploy, manage, and scale their Enterprise... ...health checks that keep the fleet at peak performance without...
Operations
Fleet
Night shift
NVIDIA
Santa Clara, CA
1 day ago
Senior Technical Program Manager
...Technical Program Manager with a passion for data‑driven operations, you will lead the DGX Cloud Fleet Health reporting program — delivering real‑time,... ...making a significant impact on the world’s most powerful AI infrastructure. What You’ll Be Doing: Define and own the...
Operations
Fleet
NVIDIA Gruppe
Santa Clara, CA
2 days ago
Staff Software Technical Program Manager, Platform
$165k - $215k
...overhead sensors, software, and AI-powered analytics to locate... ..., down to the fixture. We are deployed across 1,400+ stores with retailers... ..., noisy environments - at fleet scale. RADAR is one of the... ...development, delivery, and ongoing operations of fleetwide RFID solutions to...
Operations
Fleet
Worldwide
Flexible hours
RADAR
San Jose, CA
11 days ago
Sr Technical Program Manager, Hyperscale Operations
$182k - $273k
THE ROLE As a Technical Program Manager for Hyperscale Operations, you will be the strategic glue between hardware engineering and global manufacturing... ...and improve the predictability of large-scale hardware deployments. WHAT YOU BRING Technical Program Leadership: Proven...
Operations
Contract work
Work at office
Flexible hours
Pure Storage
Santa Clara, CA
1 day ago
Sr. Technical Program Manager, Networking Capacity Delivery
$157k - $210k
CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform... ...Architecture, Supply Chain, and Data Center Operations to solve deployment and topology challenges at hyperscale. Beyond delivering individual programs, you will...
Operations
Permanent employment
Full time
Temporary work
Casual work
Work at office
Flexible hours
CoreWeave
Sunnyvale, CA
4 days ago
Manager, Technical Support Engineer
$198k - $264k
CoreWeave is The Essential Cloud for AI™. Built for pioneers by... ...performant. You'll lead daily support operations, triage incidents, drive... ...to quickly learn and adapt to HPC environments. Proven track... ...hardware lifecycle management: deployment, maintenance, thermal/power concerns...
Operations
Permanent employment
Full time
Temporary work
Casual work
Work at office
Flexible hours
Shift work
CoreWeave
Sunnyvale, CA
3 days ago
Technical Operations Manager
...Wayve is the leading developer of Embodied AI technology. Our advanced AI software and... .... The role We’re looking for a Technical Operations Manager to join Wayve’s Technical... ...actionable on‑road plans in partnership with Fleet Operations leadership, ensuring alignment...
Operations
Fleet
Full time
Work at office
Remote work
Work from home
Icehouseventures
Sunnyvale, CA
1 day ago
Staff Product Manager
$180k - $240k
...developed an artificial intelligence (AI) powered technology stack... ...software to develop, test and deploy autonomous capabilities for... ...functional leaders align to, and operating as a trusted thought partner... ...product decisions to fleet economics, operational outcomes...
Operations
Fleet
Contract work
Temporary work
Work at office
Visa sponsorship
Flexible hours
Omaze
Mountain View, CA
15 hours ago
Staff Product Manager - Data Center High-Speed Connectivity & AI Infrastructure
$205k - $307.6k
...connectivity solutions powering hyperscale AI infrastructure. As a... ...silicon and AI infrastructure deployments. Customer Engagement... ...architecture, engineering, operations, sales, and marketing. Track... ...infrastructure, hyperscale data centers, HPC systems, or cloud...
Operations
Work experience placement
Work at office
Work from home
Qualcomm
Santa Clara, CA
20 hours ago
Technical Program Manager
$190k - $220k
...are shaping the future of network reliability, security, and AI‑ready operations. Technical Program Manager Core accountability: Drive... ...SDLC coordination from product definition through production deployment, dependency mapping, release cadence, risk identification,...
Operations
Forward
Santa Clara, CA
1 day ago
Senior Technical Program Manager AI Infrastructure, Site Operations
...builds the world's largest AI chip, 56 times larger... ...faster than GPU-based hyperscale cloud inference... ...partnership with Cerebras, to deploy 750 megawatts of scale,... ...owns site and data center operations programs supporting... ...Preferred Experience AI/ML, HPC, or accelerator-based...
Operations
Cerebras Systems
Sunnyvale, CA
4 days ago
Technical Program Manager
$100 - $111 per hour
...Strategic Sourcing & Procurement, Cost Operations, Hyperscaler Operations, Supply Chain, Finance... ...sessions Manage cutover planning and deployment readiness activities Own project risks,... ...—from Software and Aerospace to AI, Clean Tech, Medical Devices, and Connected...
Operations
Hourly pay
Contract work
Protingent
Santa Clara, CA
15 hours ago
Platform Technical Program Manager
$2,000 per month
...Etched Etched is building the world’s first AI inference system purpose-built for... ...are aligned with system integration and deployment. Provide clear, regular program updates... ...involving EE, PD, SI, test, supply chain, and operations. Comfort operating in fast‑paced,...
Operations
Contract work
Work at office
Relocation package
ETCHED LLC
San Jose, CA
4 days ago
Technical Program Manager
$108.36k - $154.8k
...working in close partnership with the Forward Deployed Engineer. This individual owns the data... ...PM is both a technical thinker and an operational coordinator — responsible for ensuring that... ...technical specifications for the Astreya AI operations hub, data pipelines,...
Operations
Temporary work
Flexible hours
Astreya Partners
Santa Clara, CA
3 days ago
Director, SEC Reporting and Technical Accounting
$175k - $220k
...Director, Sec Reporting & Technical Accounting PlusAI is a Physical AI company pioneering AI-based virtual driver software for factory... ...in Silicon Valley with operations in the United States and Europe... ...with Plus to accelerate the deployment of next-generation autonomous...
Operations
Immediate start
PlusAI
Santa Clara, CA
20 hours ago
Senior AI Inference Capacity TPM: Fleet & Forecasting
...role, you will oversee capacity planning and fleet strategy for the AI Inference Service organization,... ...with teams across Engineering, Product, and Operations. Your responsibilities include managing daily deployment tracking, leading strategic initiatives related...
Operations
Fleet
Cerebras
Sunnyvale, CA
1 day ago
Senior NPI Tech Ops Program Manager, Cloud Supply Chain
...and Process Program Manager to lead the daily operations cadence, optimize deployment and orderability infrastructure, and ensure fleet transitions land at scale with minimal... ...will bring TPM best practices and advocate AI adoption across cross-functional teams. The...
Operations
Fleet
PMs for Hire
Sunnyvale, CA
3 days ago
Defense Technical Program Manager
$180k - $220k
...developed an artificial intelligence (AI) powered technology stack... ...software to develop, test and deploy autonomous capabilities for... ...the Marine Corps) to autonomous fleet deployment programs for other... ...hardware, and different operational demands. This is a cross-functional...
Operations
Fleet
Temporary work
For contractors
Work at office
Visa sponsorship
Flexible hours
Kodiak
Mountain View, CA
2 days ago
Technical Program Manager
...Compute platform that complements GPU acceleration in large AI clusters We build and operate high-throughput, low-latency infrastructure so customers... ...-functional teams to improve development processes, deployment pipelines, and system reliability Experience in incident...
Operations
Flexible hours
CoreWeave
Sunnyvale, CA
3 days ago
Manager, Technical Program Management, Workspace GenAI Foundations
$240k - $334k
...direct reports. Experience in building AI features or AI Platforms. Preferred Qualifications... ...of experience in product, engineering, operations, or technical. 10 years of experience... ...organization wide. e.g., effective (re)deployment of machine and people resources leading...
Operations
Full time
Google
Sunnyvale, CA
20 hours ago
Staff Technical Program Manager
$150k - $188.1k
...designing, manufacturing, and operating an all-electric aircraft that... ...execution of the ArcherOS / AI roadmap—a top priority for our... ...Infrastructure & Network Deployment Infrastructure Ownership: Own... ...integration of the incoming DS/BI Director into DTO PMO structures and...
Operations
For contractors
Work at office
Local area
Dormont Manufacturing Company
San Jose, CA
15 hours ago
Staff Technical Program Manager
$193k - $234k
...the only vertically integrated AI infrastructure company built... ...the ground up, we own and operate each layer of the stack — from... ...developments; NPI and large data center deployments; and highly cross‑functional... ...reports, as the Senior Director of TPM manages the broader...
Operations
Temporary work
Crusoe Energy Systems
Sunnyvale, CA
15 hours ago
Technical Program Manager, IaaS
$109k - $160k
CoreWeave is The Essential Cloud for AI. Built for pioneers by pioneers, CoreWeave delivers... ...in large AI clusters. We build and operate high‑throughput, low‑latency infrastructure... ...teams to improve development processes, deployment pipelines, and system reliability....
Operations
Permanent employment
Temporary work
Casual work
Work at office
Flexible hours
Coreweave
Sunnyvale, CA
2 days ago
Technical Program Manager (Performance & Benchmarking)
Responsibilities The AI/ML TPM team owns delivery and execution across CoreWeave’s AI... ...across models, hardware generations, and deployment contexts Drive end-to-end program... ...infrastructure initiatives Establish dashboards, operating cadences, and success metrics to improve...
Operations
CoreWeave
Sunnyvale, CA
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Director - Hyperscale, HPC & Sovereign AI Deployment and Fleet Operations. Be the first to apply!