Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Systems Debug Engineer - Data Center GPU

Advanced Micro Devices

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

THE TEAM:

AMD's Data Center GPU organization is transforming the industry with our AI based Graphic Processors. Our primary objective is to design exceptional products that drive the evolution of computing experiences, serving as the cornerstone for enterprise Data Centers, (AI) Artificial Intelligence, HPC and Embedded systems. If this resonates with you, come and join our Data Center GPU organization where we are building amazing AI powered products with amazing people.

THE ROLE:

AMD is looking for a lead systems engineer to provide thought leadership and subject matter expertise to our growing team. As a key contributor, you will have a strong technical background to contribute to all aspects of the software development process. We have competitive benefit packages and an award‑winning culture. Join us! The Datacenter Graphics and Accelerated Computing (DCGPU) organization is looking for an experienced system level debug engineer. Individual will be part of a team that as to bring‑up, validate and ensure the platform being used is fully validated: including electrical, power, networking and SOC. Individual will be required to lead and document the plan for validating the system itself as well put in documentation for unique steps to enable it. Individual will need to be able to drive to root closure any issues encountered and communicate with the different Functional and IP layers for resolution.

THE PERSON:

You are highly motivated hands‑on leader with a strong development background, problem solving mentality, excellent communication skills, ability to prioritize tasks along with willingness to learn and adapt. Excellent teamwork skills and capable of leading a highly technical team. Experience in debugging of complex HW/FW issues is a must, understand the flow of a GPU through the different layers of a system and be able to validate the items connecting to the GPU SOC (PCIe, VRs, RMs, retimers, HBM, internal networking). Communication is essential in working with different owners of the functional code stack as well as the ability to drive issues via phone calls, chat messages, e-mails. Hands on experience with Hardware in a DataCenter environment will be required.

KEY RESPONSIBILITIES:

Debug / triage engineer and understanding of industry tools for root causing complex issues Understanding of GPU/System level HW and SW flow Ability to probe parts of a board; check electrical and power currents and validate a system Provide leadership for driving to root cause issues Communicate / Document flows and methods of bring‑up, boot‑up, system initialization and debug Lead technical presentations demonstrating a good understanding of application, data, infrastructure, architecture expertise and application systems design Collaborate with application and infrastructure architects and be responsible for the defining‑designing‑delivering of the technical architectures, patterns, technical quality, risks, fitness for purpose and operability of technical architecture solutions Be a leader and mentor to the operation team; be hands‑on and lead by example Be able to hand‑on troubleshoot and solve the technical issues; own the problem and drive for resolution Able to proactively support team culture that fosters knowledge sharing, excellence, and collaboration

PREFERRED EXPERIENCE:

Significant experience in SoC and/or System debug of complex issues Develop / Document debug capabilities on a given SOC and System Go‑to person for debugging of issues for the Production level Platform validation Collaborate with internal teams on root causing issues, finding optimum resolutions Hands‑on experience in using industry debug tools, scopes as well examine board level power Proven experience with C/C++ Demonstrable experience in facilitating Agile, Scrum or Kanban Skilled in scripting languages such as Perl, Ruby, and Shell script Proficient with revision control (GIT, SVN and CVS) Experience crafting and supporting cloud environments, including IaaS and PaaS Database development, PostgreSQL, Oracle, MS SQL Server Good balance of hardware, architecture, and software expertise Proven ability to drive resolution of critical problems within a lab, DataCenter Relationship with external customers/partners and able to help resolve problems in their Data Center Relationship with external customers/partners on ability to work manufacturing issues/failures Relationship with external customers/partners on ability to define rqmts for manufacturing validation

ACADEMIC CREDENTIALS:

Bachelor’s/Master’s degree in Computer Science or related field strongly preferred + minimum 8 yrs experience in System or SOC level debug and triage.

LOCATION:

Austin, TX Benefits offered are described: AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third‑party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process. AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here. This posting is for an existing vacancy. #J-18808-Ljbffr Advanced Micro Devices

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Systems Debug Engineer - Data Center GPU in Austin, TX vacancy
  • Advanced Micro Devices is seeking a hands-on lead systems engineer for its Data Center GPU organization in Austin, TX. You will guide a technical team...  ...of next-generation AI products. Strong background in debugging, hardware validation, and experience in Data Center environments... 
    Suggested

    Advanced Micro Devices

    Austin, TX
    2 days ago
  •  ...experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of...  ...Senior Failure Analysis (FA) Engineer, you will play a critical...  ...complex failures across GPU-accelerated server...  ...server bring‑up, system‑level debug, and rack integration troubleshooting... 
    Suggested

    Advanced Micro Devices

    Austin, TX
    2 days ago
  • A leading semiconductor company in Austin, TX, seeks a Systems Software Engineer to enhance product development and technical innovation. Responsibilities include debugging SOC programs and collaborating with cross-functional teams. Ideal candidates will have virtualization... 
    Suggested

    Advanced Micro Devices

    Austin, TX
    3 days ago
  • A leading technology company in Austin, TX is seeking a Senior Signal Integrity Engineer to ensure high-speed interfaces deliver reliability and performance for datacenter GPU systems. The ideal candidate will lead design efforts, optimize performance for cost-efficiency... 
    Suggested

    Advanced Micro Devices

    Austin, TX
    2 days ago
  • A leading semiconductor company in Austin, Texas, is seeking a System Application Engineer to support Data Center GPU customers. This role involves interacting with OEM partners and internal teams to facilitate the deployment of AMD’s Instinct™ Accelerators. Candidates... 
    Suggested

    Advanced Micro Devices

    Austin, TX
    5 days ago
  • Hardware Systems Engineer - Data Center HWE At Apple, new ideas have a way of becoming products, services,...  ...configurations, and accelerator integration (GPU, SmartNIC, DPU). Optimize server...  ...problem-solving skills and the ability to debug complex issues across multiple layers... 

    Apple Inc.

    Austin, TX
    2 days ago
  •  ...computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of...  ...dynamic, energetic Systems Software Engineer to join our growing team. As a key...  ...cross-functional initiatives Debugging issues found during the SOC (System... 

    Advanced Micro Devices , Inc.

    Austin, TX
    1 day ago
  • Advanced Micro Devices is seeking a seasoned systems engineering professional fluent in Mandarin for the Data Center GPU team in Austin. This role involves collaborating on scalable AI systems and engaging with China-based customers. Key responsibilities include analyzing... 

    Advanced Micro Devices

    Austin, TX
    1 day ago
  •  ...seeks an analytical individual to develop low-level GPU exercisers for data center products. The successful candidate will have strong expertise in GPU programming and Linux systems. Responsibilities include debugging critical issues and contributing to application frameworks... 

    Advanced Micro Devices

    Austin, TX
    5 days ago
  •  ...now looking for a motivated Engineering Technician for one of our semiconductor...  ...a compute farm of systems which includes Builders,...  ...maintain and drive our world-class data centers and labs to produce timely,...  ...more) to craft, develop, debug, and release next-generation... 
    Contract work
    Worldwide

    Insight Global

    Austin, TX
    5 days ago
  •  ...accelerate next‑generation computing experiences—from AI and data centers to PCs, gaming, and embedded systems. We believe real progress comes from bold ideas,...  ...teams to deliver solutions. Mentor junior power engineers and verify their design and validation work.... 

    AMD

    Austin, TX
    2 days ago
  • Systems Signal Integrity Engineer - Apple Data center Apple is seeking an enthusiastic signal integrity engineer for the system technology team. The candidate will help with concept and feature development, rapid prototyping, The Datacenter organization is looking for an... 
    Work experience placement

    Apple Inc.

    Austin, TX
    4 days ago
  • A leading data center provider in Texas is looking for a Systems Engineer I to support the deployment and maintenance of Windows and Linux environments. Responsibilities include monitoring system performance, troubleshooting infrastructure issues, and providing Tier 2 support... 

    Switch

    Austin, TX
    3 days ago
  •  ...experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of...  ...opportunity due to growth for an Engineering Technician - AI Data...  ...to support the bring‑up, debug, and sustainment of AI data center servers and GPU‑based compute platforms.... 
    Work experience placement
    Work at office

    AMD

    Austin, TX
    5 days ago
  • $152k - $241.5k

     ...and Visualization. The GPU, our invention, serves...  ...our team of innovative engineers who develop and maintain...  ...support deployment and debug of our hardware and Infrastructure...  ...of operating systems, computer networks, and...  ...and debugging complex data center networks.* Experience developing... 

    NVIDIA Corporation

    Austin, TX
    1 day ago
  •  ...experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of...  ...data center CPUs/GPU products. In this role,...  ...languages. Familiarity with debugging techniques and methodologies...  ...Science, Computer Engineering or EECS is preferred.... 

    Advanced Micro Devices , Inc.

    Austin, TX
    1 day ago
  •  ...Systems Integration Engineer - Data Center HWE As a highly adaptable Systems Integration Engineer, your primary responsibility is to provide robust,...  ...production hardware and document results. Build, maintain, and debug both custom and OEM hardware to support multiple... 
    Remote work

    Apple

    Austin, TX
    5 days ago
  • A leading technology company in Austin, Texas is looking for an EngOps Engineer to maintain high-performance management solutions in datacenter environments. The position requires at least 5 years of experience in deploying clusters and managing infrastructure, along with... 

    NVIDIA Corporation

    Austin, TX
    1 day ago
  •  ...computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of...  ...responsible for System and Silicon debug of AMD EPYC Server & AMD Instinct...  ...system level failures working with engineering teams across AMD. Candidate will... 

    Advanced Micro Devices

    Austin, TX
    2 days ago
  •  ...seeking a curious and self-motivated UEFI BIOS Engineer to join our team and contribute to the...  ...and security of Apple datacenter systems. You will have the opportunity to make a...  ...implement custom features for the UEFI BIOS. Debug complex firmware issues. Collaborate with... 

    Apple Inc.

    Austin, TX
    3 days ago
  • $140k - $190k

     ...cloud, SaaS, identity, and data center networks in a single platform...  ...information, visit Our Engineering organization is a fast-growing...  ...and API design to backend systems and databases....  ...Swagger, etc.). ~ Skilled in debugging, performance profiling, automated... 
    Worldwide

    VECTRA

    Austin, TX
    1 day ago
  • Tract Capital, based in Austin, Texas, is seeking a Senior Mechanical Design Engineer to support the design and productization of mechanical data center infrastructure. This role involves developing 3D CAD models and detailed manufacturing documentation for cutting-edge... 

    Tract Capital

    Austin, TX
    3 days ago
  • Tract Capital Management, LP is seeking a Senior Mechanical Design Engineer in Austin, Texas. In this role, you will support the mechanical design of data center infrastructure products, develop 3D CAD models, and ensure manufacturability of designs. Candidates should... 

    Tract Capital Management, LP

    Austin, TX
    4 days ago
  • $127.36k - $191.04k

    Join to apply for the Systems Design Engineer role at AMD Join to apply for the...  ...building blocks for the data center, artificial intelligence, PCs...  ...performance architecture, IP (CPU, GPU, memory, data fabric), SoC,...  ...Test, Integration Test, debugging tolls Experience in SW... 
    Full time
    Internship

    AMD

    Austin, TX
    2 days ago
  •  ...experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of...  ...hiring Sr. Systems Design Engineer to research, design, develop...  ...system, or product (CPU, GPU, and/or SoC). Define...  ...data extraction; Hardware debug; ATE level tests; Creating... 

    Advanced Micro Devices, Inc.

    Austin, TX
    1 day ago
  • $200k - $250k

     ...our own, taking pride in the systems we build and the trust we earn...  ...Role Controls Commissioning Engineer will take the lead in commissioning...  ...in controls commissioning, data center MEP systems, or industrial...  ..., write, troubleshoot, and debug control logic. Experience with... 
    For contractors
    Local area

    Fluidstack

    Austin, TX
    4 days ago
  • $148.3k - $222.5k

    A leading semiconductor company is seeking a Data Center Post Silicon Power and Performance Engineer to optimize the performance and power consumption of SoCs...  ...characterizing workloads, defining analysis plans, debugging issues, and ensuring peak performance. Ideal candidates... 

    Qualcomm

    Austin, TX
    1 day ago
  •  ...looking for an experienced Technical Program Manager in Austin, Texas, to oversee AI cluster engineering programs. The role involves cross-functional collaboration to manage GPU platforms and datacenter AI infrastructure. Candidates should possess strong program... 

    Advanced Micro Devices

    Austin, TX
    1 day ago
  • A leading organization is seeking a motivated Engineering Technician to maintain their on-premise, private...  ...collaborating with engineering teams, managing the system's lifecycle, and ensuring optimal performance of data center resources. With a requirement of 5 to 12 years... 

    Insight Global

    Austin, TX
    1 day ago
  •  ...Principal Mechanical Design Engineer Job Family Mechanical Design Engineering Organization Data Center Infrastructure Team Location...  ...lead and oversee the mechanical systems essential for Data Center...  ...manufacturing. Perform tests, debug, and validation of mechanical... 
    Remote job
    Contract work

    Plasticos Castella SA

    Austin, TX
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Systems Debug Engineer - Data Center GPU. Be the first to apply!