Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Debug & Failure Analysis Engineer - Datacenter GPUs

NVIDIA

NVIDIA is seeking a Senior System Debug Engineer to join its datacenter product engineering team in Santa Clara, California. The role involves driving failure analysis and debugging efforts during the New Product Introduction phase while collaborating with industry experts. The ideal candidate will have over 12 years of experience and a degree in Electrical Engineering. This position offers a competitive salary, equity, and a comprehensive benefits package. The successful candidate will perform rigorous failure analysis and engage with internal teams to ensure product quality and timely delivery. This job presents a unique opportunity to work with innovative technology that shapes the future of datacenters. #J-18808-Ljbffr NVIDIA

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Senior Debug & Failure Analysis Engineer - Datacenter GPUs in Santa Clara, CA vacancy
  • $200k - $322k

     ...lasting impact on the world. Join NVIDIA's datacenter product engineering team in our Operations organization and be...  ...forefront of technological advancement! As a Senior System Debug Engineer, you will drive failure analysis and debug efforts during our New Product... 
    Senior
    Work experience placement
    Overseas

    NVIDIA

    Santa Clara, CA
    17 hours ago
  • $200k - $322k

    Join NVIDIA's datacenter product engineering team in our Operations organization and be at the forefront of technological advancement! As a Senior System Debug Engineer, you will drive failure analysis and debug efforts during our New Product Introduction (NPI) phase. You... 
    Senior
    Work experience placement
    Overseas

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...software team! This software engineering role involves developing datacenter scale performance...  ...Python), analytical, and debugging Good understanding of Deep...  ...Experience with NVIDIA GPUs, CUDA Programming, and...  ...large AI job performance analysis for training/inference workload... 
    Senior

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $140k - $224.25k

     ...automotive, vision, HPC, datacenters and networking in...  ...’, and NVIDIA GPUs are the brains...  ...support for root cause analysis on reliability and validation test failures to identify root...  ...Build, develop/debug server and OS level...  ...Science, Technology, Engineering, Math or Physics)... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $130k - $200k

     ...connections among thousands of GPUs and memory units. The...  ...and test systems engineer to design, build, and...  ...automation and analysis frameworks, and closing...  ...limitations and failure modes Debug complex issues across...  ...Familiarity with telecom, datacenter optics, or silicon... 
    Senior
    Contract work

    nEye.ai

    Santa Clara, CA
    6 days ago
  • $152k - $241.5k

     ...and seeking top-tier compiler engineers who want an exciting and...  ...of programmable networks at datacenter scale deployments of NVIDIA...  ...research experience in performance analysis, compiler optimizations,...  ...software design skills, including debugging, performance analysis, and... 
    Senior

    NVIDIA

    Santa Clara, CA
    17 hours ago
  • $152k - $241.5k

     ...AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers...  ...around the world are using GPUs to power a revolution in deep...  ...optimizations and analysis, crafting and implementing compiler...  ...software design skills, including debugging, performance analysis, and... 
    Senior

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $200k - $400k

     ...Of Foundation Models Engineer The Institute of Foundation...  ..., distributed debugging, and communication-runtime...  ...across thousands of GPUs · Architect fault-...  ...under real-world cluster failures Core Technical...  ...concepts · Congestion analysis and routing optimization... 
    Senior
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    2 days ago
  • $116k - $184k

    NVIDIA Gruppe in Santa Clara is seeking a Product Quality Engineer to lead failure analysis for system products. The ideal candidate will have a strong analytical skillset and experience in problem-solving, focusing on quality improvement initiatives. Your responsibilities... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...connections among thousands of GPUs and memory units. The company...  ...Packaging Design Engineer to own bump-bond, die-attach...  ...budgets, CTE-mismatch stress analysis, outgassing constraints Fiber...  ...reliability data to identify potential failure modes and work with design... 
    Senior
    Contract work

    nEye.ai

    Santa Clara, CA
    3 days ago
  •  ...SK HMS Systems engineering / Quality assurance...  ...focus on enterprise datacenter SSD development....  ...-dive root-cause analysis for complex,...  ...intermittent system-level failures, driving cross-...  ...LeCroy) for debugging PCIe/NVMe link-level...  ...a contractor or senior consultant,... 
    Senior
    Contract work
    For contractors

    SK Hynix Memory Solutions America Inc.

    San Jose, CA
    3 days ago
  •  ...connections among thousands of GPUs and memory units. The...  ...Overview We are looking for a senior individual contributor with...  ...Applied Physics, Electrical Engineering, or related field ~7 - 15+...  ...Hands-on experience with failure analysis and materials characterization... 
    Senior

    nEye.ai

    Santa Clara, CA
    3 days ago
  • $200k - $351k

     ...receive an alert: Senior Principal, Design Engineering, Power Design Location...  ..., design, and debug of complex power delivery systems for datacenter products. Skills &...  ...relevant simulations, analysis, test vehicles, and...  ...CPU, GPU components Failure mode analysis Good... 
    Senior
    Local area

    Celestica Inc.

    San Jose, CA
    17 hours ago
  • $168k - $258.75k

    The Senior Datacenter Systems Modeling Engineer leads cross-functional execution and modeling for data-center power-delivery programs. This role owns...  ...and optimize the design; drive fast and effective failure analysis with solid root cause and correlation actions. Own technical... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $116k - $184k

     ...hear from you! We are looking for a Product Quality Engineer to join our team leading all aspects of failure analysis for NVIDIA’s system product segment throughout...  ...on experience in PCB level hardware verification/debug and signal integrity measurement. Experience with... 
    Senior
    Work experience placement

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...company”. We are hiring software engineers for the CUDA Tile team. NVIDIA GPUs are at the center of the deep...  ...compiler optimization, performance analysis and IR design. ~ Ability to work...  ...software design skills, including debugging, performance analysis, and test design... 
    Senior

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $168k - $264.5k

     ...are looking for a System Reliability Engineer to join NVIDIA's existing Reliability...  ...cluster products. Lead reliability testing, failure analysis, and root cause investigations; drive...  ...or hardware engineering from datacenter, systems, or computer industries. Hands... 
    Senior
    Full time

    NVIDIA

    Santa Clara, CA
    8 hours ago
  • $152k - $241.5k

     ...and experienced HPC Cluster Engineer to design, deploy, and operate...  ...workloads including performance analysis and optimizations. Conduct...  ...researchers' velocity, debugging and software performance at scale...  ...: Background with NVIDIA GPUs, CUDA Programming, NCCL and MLPerf... 
    Senior

    NVIDIA

    Santa Clara, CA
    3 days ago
  •  ...Senior Systems Engineer Graphcore is one of the world's leading innovators...  .... Diagnose system-level failures involving thermal behavior,...  ...teams to perform root cause analysis and propose corrective actions...  ...and board-level debugging. Experience analyzing system... 
    Senior

    Graphcore

    Milpitas, CA
    4 days ago
  • $184k - $287.5k

     ...NVIDIA is now looking for a Senior Memory System Engineer to join our ASIC Memory Subsystem...  ...to the system workloads Debug and bring up memory evaluation / validation and failure issues on memory technology....  ...for memory failure analysis ~ Experience with Python,... 
    Senior

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $191.4k - $281.4k

     ...are a close-knit team focused on hardware quality and failure analysis, within a larger hardware engineering organization which designs, implements, and delivers...  .... Your Impact In this highly visible role as Senior Hardware Engineering Failure Analysis Technical... 
    Senior
    Full time
    Temporary work
    Local area
    Flexible hours

    Cisco

    San Jose, CA
    2 days ago
  • $75 - $85 per hour

     ...development and hardware engineering company, offering end-...  ...com/careers Title: Senior Electrical Engineer...  ...electrical measurements and analysis across power rails,...  .... ~Execute lab debugging and troubleshooting using...  .... ~ Conduct failure analysis of electrical... 
    Senior
    Flexible hours

    Fresh Consulting

    Sunnyvale, CA
    17 hours ago
  • $152k - $241.5k

    We are hiring software engineers for the CUDA Tile team, a new tile‑...  ...programming model for NVIDIA GPUs. What you’ll be doing: Work on...  ...optimization, performance analysis and IR design. Ability to work...  ...software design skills, including debugging, performance analysis, and... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $168k - $264.5k

     ...for hardworking systems engineers who will craft FPGA...  ...for our next generation GPUs, SOCs, NICs, and Switches...  ...are now looking for a Senior Systems Prototyping and...  ...platforms and drive complex debug and problem-solving...  ...validation, and performance analysis using FPGA prototypes... 
    Senior

    NVIDIA

    Santa Clara, CA
    17 hours ago
  • $184k - $287.5k

     ...As NVIDIA makes inroads into the Datacenter business, our team plays a central role in getting...  .... The role of a Deep Learning Systems Engineer would be to analyze the performance and...  ...). Work with experts to help develop analysis and profiling tools in Python, bash and... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  •  ...About the Role Eurofins EAG Laboratories is seeking a Senior Failure Analysis Engineer to lead hands-on investigations of electronic components...  ...is best suited for someone who enjoys deep technical debugging operates lab tools independently and can translate complex... 
    Senior
    Permanent employment
    Full time
    Casual work
    Local area
    Remote work

    Eurofins

    Sunnyvale, CA
    6 days ago
  •  ...Senior Distributed Storage System Engineer This role has been designed as 'Onsite' with an...  ...develops, troubleshoots and debugs software programs for...  ...including solution design, analysis, coding, testing, and...  ...product quality and mitigate failure risk. Provides domain-... 
    Senior
    Work at office
    Local area

    Hewlett Packard Enterprise

    Alviso, CA
    3 days ago
  • $152k - $241.5k

     ...AI & Deep Learning Compiler Engineer for its Deep Learning & AI Compiler...  ...optimizations and analysis, and crafting and implementing...  ...workloads and future NVIDIA GPUs. What we need to see Bachelor...  ...software design skills, including debugging, performance analysis, and test... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...a highly skilled HPC Cluster Engineer to design, deploy, and operate...  ..., including performance analysis and optimizations. Conduct root...  ...accelerate researchers' velocity, debugging, and software performance at...  ...: Background with NVIDIA GPUs, CUDA Programming, NCCL and MLPerf... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...looking for versatile software engineers for our XLA team. NVIDIA is...  ...the OpenXLA compiler on NVIDIA GPUs at scale. You’ll collaborate...  .... Performance tuning and analysis. Code‑generation for NVIDIA GPU...  ...software design skills, including debugging, performance analysis, and... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Debug & Failure Analysis Engineer - Datacenter GPUs. Be the first to apply!