System Design & Debug Manager - AI Customer Engineering
Advanced Micro Devices , Inc.
What You Do At AMD Changes Everything
At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
The Role
This role serves as the debug execution backbone of AMD's AI Customer Engineering organization, driving complex silicon, system, and fleet-level issues to resolution across all major customer segments. The System Design Manager plays a critical role in ensuring customer success, product quality, and large-scale deployment confidence through disciplined, end-to-end debug execution.
This is a high-visibility, high-impact position requiring deep technical expertise and strong cross-functional program leadership.
The Person
The ideal candidate is a highly experienced technical leader with deep expertise in pre- and post-silicon debug across CPU, GPU, and SoC platforms. They bring a strong program execution mindset, with the ability to translate complex, fragmented debug data into structured analysis, clear hypotheses, and actionable plans. This individual has a proven track record of leading critical customer escalations across hyperscale, OEM, and enterprise environments, while effectively influencing cross-functional teams without direct authority. They are an excellent communicator who can distill complex technical challenges into clear, concise, and decision-oriented messaging for executive leadership and customers.
Key Responsibilities
- Debug Program Leadership - Lead debug execution across hyperscale, OEM, HPC, and enterprise customer programs. Own high-impact, cross-customer and systemic issues and maintain visibility into top risks and trends.
- Customer Program Integration - Partner with Customer Program Managers to align debug execution with customer deliverables, platform readiness, and deployment schedules. Support escalations and executive-level customer engagements.
- Technical Debug Coordination - Drive cross-functional debug efforts across design, validation, product engineering, and failure analysis. Align pre- and post-silicon debug strategies and connect lab debug to real-world customer environments.
- Field Failure & Fleet Quality Management - Lead resolution of field failures, fleet anomalies, and data center reliability issues. Aggregate fleet, RMA, and production signals and feed learnings back into design, validation, and manufacturing.
- Governance & Process Improvement - Own debug tracking, prioritization, risk management, and executive reporting. Apply structured methodologies (8D, CAPA, FMEA) and drive continuous improvement in execution speed and consistency.
Preferred Experience
- Deep understanding of data center system architecture (CPU, GPU, FPGA, memory, connectivity, RAS, hotplug)
- Familiarity with hardware bring up, validation, manufacturing, and test flows
- Knowledge of reliability and quality metrics (yield, DPM, FIT)
- Proven years of experience in the semiconductor industry
- Deep hands-on experience with silicon debug (pre-silicon and post-silicon)
- Strong background in product development, debug tools, validation, failure analysis, or customer engineering
- Proven experience managing complex debug programs across multiple customer segments
- Strong functional team and project management skills with ability to drive execution across global, cross-functional teams
- Excellent written and verbal communication skills, including executive-level engagement
Education
- Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, or related field required
- Master's degree preferred
This role is not eligible for visa sponsorship.
- ...Machine Learning Systems Engineer We are looking for Machine... ...in 3D generative AI, recognized as the No.... ...in industrial product design, in enablement of novel... ...full stack of AI, from debugging and monitoring the hardware... ...3D specific custom operators in Triton or...SuggestedPart timeRemote work
$152k - $208.5k
...leader in materials engineering solutions used to... ...in the world. We design, build and service... ...equipment that helps our customers manufacture... ...our world - like AI and IoT. If you want... ..., developing, and debugging software solutions... ...in intricate systems, deciphering code,...Suggested- ...computing experiences-from AI and data centers, to... ..., gaming and embedded systems. Grounded in a culture... ...THE ROLE: The AI Customer Engineering organization is... ...Principal AI Systems Design Engineer to help customers... ...role leading full-stack debug of AI infrastructure...Suggested
- ...in Santa Clara, CA, is seeking a Staff Runtime Systems Engineer to lead the development of runtime software for AI inference platforms. You'll be responsible for architecting... ...strong skills in C/C++, Linux programming, and debugging complex issues. A hybrid work model is offered,...Suggested3 days per week
$152k - $241.5k
...built in the age of Generative AI? Join NVIDIA’s TensorRT... ...kind, AI-native initiative designed to make TensorRT the default... ...scale. If you are a systems-thinking C++ engineer who wants to help scale out... ...throughput gains for critical customer use cases. What we need...Suggested$200k - $322k
NVIDIA AI in Santa Clara is seeking a Senior System Debug Engineer to drive failure analysis during the New Product Introduction phase. You will collaborate with industry experts to ensure quality in GPU Server products while working in a diverse and supportive environment...$190.2k - $360.5k
...Creative Cloud Engineering organization is... ...generation of AI-powered engineering... ...a Senior AI Systems Engineer who operates... ...focuses on designing, orchestrating,... ...and autonomous debugging • Develop... ...and personalized customer experiences. Adobe... ...Experience Manager, and GenStudio...Temporary workLocal areaWorldwide- A leading technology company in Santa Clara is seeking a Senior AI-Native Systems Software Engineer to design an AI-native framework, optimizing performance for critical use cases. This role requires strong modern C++ skills, familiarity with deep learning frameworks, and...
- ...Principal AI/ML System Software Engineer At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of... ...optimize and trade-off various aspects of hardware-software co-design. You are able to build and scale software deliverables in...Work experience placement3 days per week
$170.5k - $240.71k
...The Role We are looking for a Senior AI Software Engineer — Agentic AI System to help build the infrastructure,... ...operating at scale. Key Responsibilities Design and implement deployment and... ...infrastructure-as-code or configuration management (Ansible, Terraform) Experience...Local areaImmediate startRemote workShift work- Apple Inc. in Sunnyvale is seeking an experienced engineer to design and implement cutting-edge agentic systems leveraging large language models (LLMs). This role focuses... ...of agentic systems, strong proficiency with AI coding, and the ability to design end-to-end solutions...
- ...Senior Staff AI/ML System Software Engineer At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation... ...optimize and trade off various aspects of hardware-software co-design. You are able to build and scale software deliverables in...3 days per week
$100k - $140k
Fortinet, Inc. is seeking an early-career engineer in Santa Clara, California to help design and implement resilient, scalable multi-agent systems for network security. This full-time... ...Bachelor’s degree in Computer Science, AI, or ML, along with strong Python skills...Full time- A leading AI technology company located in Santa Clara is seeking a Principal AI/ML System Software Engineer to develop and enhance next-generation AI deployment software. The ideal candidate will have over 12 years of experience in software development and strong skills...
- Applied Materials is hiring an Agentic AI Systems Engineer in Santa Clara, CA. This role involves designing the infrastructure for GenAI applications, bridging AI and software needs. Candidates must have 7+ years of experience with a strong proficiency in programming languages...
- ...Clinton). With a strong customer pipeline, Sage Care is... ...harnessing the latest AI innovations. Building... ...collaborations with health systems across the U.S., we... ...hiring a product-minded AI Engineer to help build and... ...helping ship features, debug issues from live calls,...
- Google Inc. is seeking a Software Engineer III for its Sunnyvale, CA location to work on AI/ML projects. You will be responsible for designing, developing, testing, and deploying large software systems, especially in the areas of user voice model serving and other ML-based...
$207k - $300k
Google Inc. is looking for a Staff Software Engineer for AI and Infrastructure to contribute to Google Cloud's mission.... ...8 years in related fields. Responsibilities include designing and implementing computer systems, collaborating on impactful projects, and providing...$134.7k - $207.6k
Design Manager, System Utilities This role is categorized as hybrid. This means the successful candidate is expected to report to Warren, MI... ...in the vehicle. You’ll work closely with software, engineering, and product partners to deliver intuitive, coherent, and...Flexible hours$152k - $241.5k
...unlimited potential of AI to define the next era... ...Deep Learning Compiler Engineer. NVIDIA is hiring software... ...AI, recommendation systems, image classification,... ...including analyzing and debugging performance bottlenecks... ...programming and software design skills, including debugging...$184k - $287.5k
...motivated, creative engineer with experience in software design who is passionate... ...in designing power management software architectures... ...and develop GPU system software components... ...driver programming and debugging, windows driver... ...vacancy. NVIDIA uses AI tools in its...Work experience placement$184k - $287.5k
...creative and highly motivated engineer with expertise in system s software to join the GPU Software team. You will design key aspects of our... ...with complex system-level debugging ~ Kernel experience with... ...existing vacancy. NVIDIA uses AI tools in its recruiting processes...$184k - $287.5k
...We are now looking for a Systems Software Engineer. Do you like to think creatively... ...device testing, Silicon debug, and Silicon failure analysis... ...you'll be doing: Work with design team to understand Hardware... ...and yield learning Enable AI applications to optimize all...- ...responsible for designing, defining,... ...benchmark tests for AI infrastructure... ...box-level GPU systems, multi-GPU... ...Marketing, Product Management and System... ...business and customer use cases into... ...bottlenecks - Debugging knowledge on... ...Science or Computer Engineering with 5+ years...Full timeTemporary workWork at officeRemote workFlexible hoursShift work
$95k - $154k
...for entry-level software engineering and data roles, especially... ...engineering, data science, and ML/AI-full-time opportunities... ...Program (JOPP) is designed to solve: the gap between... ...master fundamentals-coding, debugging, data structures, system thinking-and then layer modern...Full timeH1b- A leading technology company is seeking a Senior Manager, System Co-Design Manager to oversee product co-design and verification in the Santa... ...to analyze and design innovative solutions while mentoring engineers. A competitive salary range and equity opportunities are included...
$147.4k - $220.9k
...Applied AI Software Engineer - Vision Products Group &... ...software, services, and design, we deliver end-... ...of AI, system software, and user... ...experience design, debugging complex cross-device... ..., memory management, and concurrency... ...ability to translate customer needs into technical...Relocation$170k - $277k
...LinkedIn's Machine Learning Engineers are both data/research... ...of LinkedIn's system Collaborate with 10+... ...newsfeed Build scalable AI innovations with foundation... ...experience in software design, development, and... ...diagnose technical problems, debug code, and automate...For contractorsWork at officeFlexible hours- ...Details Backend/Systems Experience 3+... ...systems (pre-AI experience required)... ...Kubernetes - can build, deploy, debug, and scale services themselves... ...) Hands-On Engineer Not just an... ...servers/integrations or custom tool-use systems for LLMs...
$150k - $160k
...Role: pplied AI Engineer Location: Sunnyvale, CA/Austin, TX - Onsite Job Job Type... ...Salary- $150-160K Production AI Systems : Has shipped AI/LLM features serving... ...and Kubernetes - can build, deploy, debug, and scale services themselves LLM...Full time
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to System Design & Debug Manager - AI Customer Engineering. Be the first to apply!
- interior design director Santa Clara, CA
- industrial design manager Santa Clara, CA
- architectural design manager Santa Clara, CA
- director of design Santa Clara, CA
- director of design and construction Santa Clara, CA
- interior design manager Santa Clara, CA
- ux design manager Santa Clara, CA
- director experience design Santa Clara, CA
- design manager Santa Clara, CA
- director ux design Santa Clara, CA

