Senior Debug & Failure Analysis Engineer - Datacenter GPUs
NVIDIA
NVIDIA is seeking a Senior System Debug Engineer to join its datacenter product engineering team in Santa Clara, California. The role involves driving failure analysis and debugging efforts during the New Product Introduction phase while collaborating with industry experts. The ideal candidate will have over 12 years of experience and a degree in Electrical Engineering. This position offers a competitive salary, equity, and a comprehensive benefits package. The successful candidate will perform rigorous failure analysis and engage with internal teams to ensure product quality and timely delivery. This job presents a unique opportunity to work with innovative technology that shapes the future of datacenters.
$200k - $322k
...lasting impact on the world. Join NVIDIA's datacenter product engineering team in our Operations organization and be... ...forefront of technological advancement! As a Senior System Debug Engineer, you will drive failure analysis and debug efforts during our New Product...
Senior, Work experience placement, Overseas
$200k - $322k
Join NVIDIA's datacenter product engineering team in our Operations organization and be at the forefront of technological advancement! As a Senior System Debug Engineer, you will drive failure analysis and debug efforts during our New Product Introduction (NPI) phase. You...
Senior, Work experience placement, Overseas
$152k - $241.5k
...software team! This software engineering role involves developing datacenter scale performance... ...Python), analytical, and debugging Good understanding of Deep... ...Experience with NVIDIA GPUs, CUDA Programming, and... ...large AI job performance analysis for training/inference workload...
Senior
$140k - $224.25k
...automotive, vision, HPC, datacenters and networking in... ...’, and NVIDIA GPUs are the brains... ...support for root cause analysis on reliability and validation test failures to identify root... ...Build, develop/debug server and OS level... ...Science, Technology, Engineering, Math or Physics)...
Senior
$130k - $200k
...connections among thousands of GPUs and memory units. The... ...and test systems engineer to design, build, and... ...automation and analysis frameworks, and closing... ...limitations and failure modes Debug complex issues across... ...Familiarity with telecom, datacenter optics, or silicon...
Senior, Contract work
$152k - $241.5k
...and seeking top-tier compiler engineers who want an exciting and... ...of programmable networks at datacenter scale deployments of NVIDIA... ...research experience in performance analysis, compiler optimizations,... ...software design skills, including debugging, performance analysis, and...
Senior
$152k - $241.5k
...AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers... ...around the world are using GPUs to power a revolution in deep... ...optimizations and analysis, crafting and implementing compiler... ...software design skills, including debugging, performance analysis, and...
Senior
$200k - $400k
...Of Foundation Models Engineer The Institute of Foundation... ..., distributed debugging, and communication-runtime... ...across thousands of GPUs · Architect fault-... ...under real-world cluster failures Core Technical... ...concepts · Congestion analysis and routing optimization...
Senior, Visa sponsorship
$116k - $184k
NVIDIA Gruppe in Santa Clara is seeking a Product Quality Engineer to lead failure analysis for system products. The ideal candidate will have a strong analytical skillset and experience in problem-solving, focusing on quality improvement initiatives. Your responsibilities...
Senior
- ...connections among thousands of GPUs and memory units. The company... ...Packaging Design Engineer to own bump-bond, die-attach... ...budgets, CTE-mismatch stress analysis, outgassing constraints Fiber... ...reliability data to identify potential failure modes and work with design...
Senior, Contract work
- ...SK HMS Systems engineering / Quality assurance... ...focus on enterprise datacenter SSD development.... ...-dive root-cause analysis for complex,... ...intermittent system-level failures, driving cross-... ...LeCroy) for debugging PCIe/NVMe link-level... ...a contractor or senior consultant,...
Senior, Contract work, For contractors
- ...connections among thousands of GPUs and memory units. The... ...Overview We are looking for a senior individual contributor with... ...Applied Physics, Electrical Engineering, or related field ~7 - 15+... ...Hands-on experience with failure analysis and materials characterization...
Senior
$200k - $351k
...receive an alert: Senior Principal, Design Engineering, Power Design Location... ..., design, and debug of complex power delivery systems for datacenter products. Skills &... ...relevant simulations, analysis, test vehicles, and... ...CPU, GPU components Failure mode analysis Good...
Senior, Local area
$168k - $258.75k
The Senior Datacenter Systems Modeling Engineer leads cross-functional execution and modeling for data-center power-delivery programs. This role owns... ...and optimize the design; drive fast and effective failure analysis with solid root cause and correlation actions. Own technical...
Senior
$116k - $184k
...hear from you! We are looking for a Product Quality Engineer to join our team leading all aspects of failure analysis for NVIDIA’s system product segment throughout... ...on experience in PCB level hardware verification/debug and signal integrity measurement. Experience with...
Senior, Work experience placement
$152k - $241.5k
...company”. We are hiring software engineers for the CUDA Tile team. NVIDIA GPUs are at the center of the deep... ...compiler optimization, performance analysis and IR design. ~ Ability to work... ...software design skills, including debugging, performance analysis, and test design...
Senior
$168k - $264.5k
...are looking for a System Reliability Engineer to join NVIDIA's existing Reliability... ...cluster products. Lead reliability testing, failure analysis, and root cause investigations; drive... ...or hardware engineering from datacenter, systems, or computer industries. Hands...
Senior, Full time
$152k - $241.5k
...and experienced HPC Cluster Engineer to design, deploy, and operate... ...workloads including performance analysis and optimizations. Conduct... ...researchers' velocity, debugging and software performance at scale... ...: Background with NVIDIA GPUs, CUDA Programming, NCCL and MLPerf...
Senior
- ...Senior Systems Engineer Graphcore is one of the world's leading innovators... .... Diagnose system-level failures involving thermal behavior,... ...teams to perform root cause analysis and propose corrective actions... ...and board-level debugging. Experience analyzing system...
Senior
$184k - $287.5k
...NVIDIA is now looking for a Senior Memory System Engineer to join our ASIC Memory Subsystem... ...to the system workloads Debug and bring up memory evaluation / validation and failure issues on memory technology.... ...for memory failure analysis ~ Experience with Python,...
Senior
$191.4k - $281.4k
...are a close-knit team focused on hardware quality and failure analysis, within a larger hardware engineering organization which designs, implements, and delivers... .... Your Impact In this highly visible role as Senior Hardware Engineering Failure Analysis Technical...
Senior, Full time, Temporary work, Local area, Flexible hours
$75 - $85 per hour
...development and hardware engineering company, offering end-... ...com/careers Title: Senior Electrical Engineer... ...electrical measurements and analysis across power rails,... .... ~Execute lab debugging and troubleshooting using... .... ~ Conduct failure analysis of electrical...
Senior, Flexible hours
$152k - $241.5k
We are hiring software engineers for the CUDA Tile team, a new tile‑... ...programming model for NVIDIA GPUs. What you’ll be doing: Work on... ...optimization, performance analysis and IR design. Ability to work... ...software design skills, including debugging, performance analysis, and...
Senior
$168k - $264.5k
...for hardworking systems engineers who will craft FPGA... ...for our next generation GPUs, SOCs, NICs, and Switches... ...are now looking for a Senior Systems Prototyping and... ...platforms and drive complex debug and problem-solving... ...validation, and performance analysis using FPGA prototypes...
Senior
$184k - $287.5k
...As NVIDIA makes inroads into the Datacenter business, our team plays a central role in getting... .... The role of a Deep Learning Systems Engineer would be to analyze the performance and... ...). Work with experts to help develop analysis and profiling tools in Python, bash and...
Senior
- ...About the Role Eurofins EAG Laboratories is seeking a Senior Failure Analysis Engineer to lead hands-on investigations of electronic components... ...is best suited for someone who enjoys deep technical debugging operates lab tools independently and can translate complex...
Senior, Permanent employment, Full time, Casual work, Local area, Remote work
- ...Senior Distributed Storage System Engineer This role has been designed as 'Onsite' with an... ...develops, troubleshoots and debugs software programs for... ...including solution design, analysis, coding, testing, and... ...product quality and mitigate failure risk. Provides domain-...
Senior, Work at office, Local area
$152k - $241.5k
...AI & Deep Learning Compiler Engineer for its Deep Learning & AI Compiler... ...optimizations and analysis, and crafting and implementing... ...workloads and future NVIDIA GPUs. What we need to see Bachelor... ...software design skills, including debugging, performance analysis, and test...
Senior
- ...a highly skilled HPC Cluster Engineer to design, deploy, and operate... ..., including performance analysis and optimizations. Conduct root... ...accelerate researchers' velocity, debugging, and software performance at... ...: Background with NVIDIA GPUs, CUDA Programming, NCCL and MLPerf...
Senior
$152k - $241.5k
...looking for versatile software engineers for our XLA team. NVIDIA is... ...the OpenXLA compiler on NVIDIA GPUs at scale. You’ll collaborate... .... Performance tuning and analysis. Code‑generation for NVIDIA GPU... ...software design skills, including debugging, performance analysis, and...
Senior


