Technical Program Manager – AI Cluster Validation
Advanced Micro Devices
WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond.

The Role
We are seeking a Technical Program Manager to lead execution of AI cluster engineering programs with a deep focus on GPU platforms, rack‑level solutions, and AI cluster validation. This role is responsible for driving end‑to‑end delivery from GPU + server integration through rack bring‑up, scale testing, failure analysis, and system debug closure, ensuring platform readiness for hyperscale and enterprise AI deployments. This role operates at the intersection of hardware, firmware, networking, and scale‑test execution, and requires strong technical depth combined with disciplined program execution.

The Person
You are a hands‑on TPM who thrives in complex, fast‑moving ecosystems and can connect deep technical details to crisp program plans, executive reporting, and customer outcomes. You are comfortable driving execution during bring‑up and EVT/DVT/PVT, working closely with engineers to root‑cause issues, unblock debug, and make data‑driven tradeoffs to keep programs moving. You bring urgency, ownership, and clarity to ambiguous problem spaces and can communicate effectively from lab floor to executive review.

Key Responsibilities

Program Leadership & Execution
- Define and drive program plans for AI infrastructure systems validation and readiness, including server integration, rack bring‑up, and cluster‑scale deployment readiness.
- Create and maintain core PM artifacts: schedules, dependency maps, resource forecasts, risk/issue logs, and program dashboards/status reports.
- Identify and drive mitigation plans for issues and risks, including cross‑team escalations and corrective actions across multiple engineering areas.
- Drive regular execution reviews with engineering teams and provide concise, data‑driven updates to senior leadership.

GPU & Platform Execution
- Own program execution for GPU‑based AI platforms, spanning system bring‑up, qualification, scale readiness, and deployment validation across server, rack, and cluster levels.
- Drive alignment across GPU, CPU, firmware, BIOS/BMC, and system teams to ensure readiness for scale testing and customer workloads.
- Track platform issues and debug dependencies; ensure risks are clearly documented, owned, and mitigated.

AI Rack / Cluster Validation
- Own program planning and execution for multi‑node and multi‑rack scale testing, including test strategy, scheduling, coverage tracking, and readiness gates.
- Lead end‑to‑end delivery of rack‑level AI solutions, including compute trays, switch trays, cabling, power, cooling, and management infrastructure.
- Ensure rack bring‑up plans are executable, resourced, and gated with clear entry/exit criteria across EVT, DVT, and scale phases.
- Drive coordination across lab operations, infrastructure, and engineering teams to unblock rack access, power, networking, and test readiness.
- Partner with scale, performance, and automation teams to ensure workloads, stress tests, and regression plans are ready before hardware arrives.

Debug, Failure Analysis & Risk Management
- Act as the execution lead for platform debug, coordinating across engineering teams to ensure fast triage, root‑cause analysis, and resolution of system‑level issues.
- Track high‑impact failures (GPU, HSIO, FW, rack, network) through debug forums, ensuring clear ownership and closure plans.
- Balance debug depth vs. program timelines, escalating tradeoffs when needed and ensuring leadership has a clear view of risk and impact.

Required Qualifications
- Experience leading complex hardware or AI infrastructure programs with ownership across bring‑up, validation, and deployment phases.
- Strong technical understanding of GPU‑based AI systems, rack architectures, and datacenter infrastructure.
- Proven ability to manage ambiguity, drive debug execution, and lead cross‑functional teams without direct authority.
- Strong written and verbal communication skills, including executive‑level status reporting.
- Proficiency with program management and execution tools (Jira, Confluence, dashboards, Excel/PowerPoint).

Preferred Qualifications
- Hands‑on experience with GPU cluster scale testing, system stress, or performance validation.
- Familiarity with rack‑level bring‑up, power/cooling constraints, networking, and failure modes at scale.
- Experience working through hardware/firmware debug cycles in pre‑production or customer‑facing environments.

Academic Credentials
- Bachelor’s or master’s degree in systems, EE, CS, or related engineering discipline.
- PMP, Scrum Master, or equivalent program management training.

Location
Austin, TX

This role is not eligible for visa sponsorship.

AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third‑party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process. AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here. This posting is for an existing vacancy.

