
Technical Program Manager – AI Cluster Validation

Advanced Micro Devices

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming, and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond.

The Role

We are seeking a Technical Program Manager to lead execution of AI cluster engineering programs with a deep focus on GPU platforms, rack‑level solutions, and AI cluster validation. This role is responsible for driving end‑to‑end delivery from GPU + server integration through rack bring‑up, scale testing, failure analysis, and system debug closure, ensuring platform readiness for hyperscale and enterprise AI deployments. This role operates at the intersection of hardware, firmware, networking, and scale‑test execution, and requires strong technical depth combined with disciplined program execution.

The Person

You are a hands‑on TPM who thrives in complex, fast‑moving ecosystems and can connect deep technical details to crisp program plans, executive reporting, and customer outcomes. You are comfortable driving execution in bring‑up and EVT/DVT/PVT, working closely with engineers to root‑cause issues, unblock debug, and make data‑driven tradeoffs to keep programs moving. You bring urgency, ownership, and clarity to ambiguous problem spaces and can communicate effectively from lab floor to executive review.

Key Responsibilities

Program Leadership & Execution
  • Define, plan, and drive program plans for AI infrastructure systems validation and readiness, including server integration, rack bring‑up, and cluster‑scale deployment readiness.
  • Create and maintain core PM artifacts: schedules, dependency maps, resource forecasts, risk/issue logs, and program dashboards/status reports.
  • Identify and drive mitigation plans for issues/risks, including cross‑team escalations and corrective actions across multiple engineering areas.
  • Drive regular execution reviews with engineering teams and provide concise, data‑driven updates to senior leadership.

GPU & Platform Execution
  • Own program execution for GPU‑based AI platforms, spanning system bring‑up, qualification, scale readiness, and deployment validation across server, rack, and cluster levels.
  • Drive alignment across GPU, CPU, firmware, BIOS/BMC, and system teams to ensure readiness for scale testing and customer workloads.
  • Track platform issues and debug dependencies; ensure risks are clearly documented, owned, and mitigated.

AI Rack / Cluster Validation
  • Own program planning and execution for multi‑node and multi‑rack scale testing, including test strategy, scheduling, coverage tracking, and readiness gates.
  • Lead end‑to‑end delivery of rack‑level AI solutions, including compute trays, switch trays, cabling, power, cooling, and management infrastructure.
  • Ensure rack bring‑up plans are executable, resourced, and gated with clear entry/exit criteria across EVT, DVT, and scale phases.
  • Drive coordination across lab operations, infrastructure, and engineering teams to unblock rack access, power, networking, and test readiness.
  • Partner with scale, performance, and automation teams to ensure workloads, stress tests, and regression plans are ready before hardware arrives.

Debug, Failure Analysis & Risk Management
  • Act as the execution lead for platform debug, coordinating across engineering teams to ensure fast triage, root‑cause analysis, and resolution of system‑level issues.
  • Track high‑impact failures (GPU, HSIO, FW, rack, network) through debug forums, ensuring clear ownership and closure plans.
  • Balance debug depth vs. program timelines, escalating tradeoffs when needed and ensuring leadership has a clear view of risk and impact.

Required Qualifications
  • Experience leading complex hardware or AI infrastructure programs with ownership across bring‑up, validation, and deployment phases.
  • Strong technical understanding of GPU‑based AI systems, rack architectures, and datacenter infrastructure.
  • Proven ability to manage ambiguity, drive debug execution, and lead cross‑functional teams without direct authority.
  • Strong written and verbal communication skills, including executive‑level status reporting.
  • Proficiency with program management and execution tools (Jira, Confluence, dashboards, Excel/PowerPoint).

Preferred Qualifications
  • Hands‑on experience with GPU cluster scale testing, system stress, or performance validation.
  • Familiarity with rack‑level bring‑up, power/cooling constraints, networking, and failure modes at scale.
  • Experience working through hardware/firmware debug cycles in pre‑production or customer‑facing environments.

Academic Credentials
  • Bachelor’s or master’s degree in systems, EE, CS, or related engineering discipline.
  • PMP, Scrum Master, or equivalent program management training.

Location

Austin, TX

This role is not eligible for visa sponsorship.

AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third‑party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here.

This posting is for an existing vacancy.

Vacancy posted 4 days ago
