Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Systems Quality and Reliability Engineer - LPU

NVIDIA Corporation

Job Summary We are seeking a Systems Quality and Reliability Engineer to join our LPU team at NVIDIA. The role focuses on ensuring the reliability of NVIDIA AI/ML products through comprehensive root‑cause analysis and failure investigation. Responsibilities Own, build, and manage the RMA and FA debug and root‑cause analysis for existing and new AI/ML products. Conduct tests and root‑cause analysis of field RMAs. Collaborate with Systems Engineers, Hardware Engineers, Software Engineers, and Operations Engineers to address quality issues. Scale root‑cause and FA capabilities within the organization. Create FA result reports that align with the 8D or similar process. Analyze RMA, FA, and repair data to identify trends and raise quality alerts when necessary. Drive resolution, containment, and mitigation plans for quality alerts. Oversee hardware quality performance, monitoring field quality data and metrics such as RMA rates, MTBF, and Reliability Ratio. Manage operational performance of FA at contract manufacturers, ensuring partners achieve key performance indicators including FA cycle times, fault duplication rates, and fault isolation rates. Oversee the setup of new products into Failure Analysis operations. Qualifications BS/MS in Electrical Engineering, Physics, or a related field (or equivalent experience). 5+ years of hands‑on systems test and/or validation engineering experience. Proven experience in systems quality and reliability engineering. Competence with lab equipment such as oscilloscopes, logic analyzers, power analyzers, etc. Experience enabling reliability tests such as HTOL and quality tests such as Burn‑in. Knowledge of FA techniques and tools such as FIB, SEM, TDR, VNA, and CSAM. Strong knowledge of fault isolation techniques such as OBIRCH, DLS/LADA, LVP, and LVI. Proficiency with high‑speed interfaces (SerDes, PCIe, DDR). Proficiency in Python, PERL, C++, or other languages on UNIX/Linux. Excellent knowledge of PCB card and system‑level test and debugging. Ability to manage factory floor partners (CMs) for RMA/FA activities. Compensation Base salary ranges are 136,000–218,500 USD for Level3 and 168,000–264,500 USD for Level4. Eligibility for equity and benefits. Applications are accepted until May30,2026 . NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. Discrimination on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law is prohibited. #J-18808-Ljbffr NVIDIA Corporation

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Systems Quality and Reliability Engineer - LPU in Santa Clara, CA vacancy
  • NVIDIA Corporation is seeking a Systems Quality and Reliability Engineer to join their LPU team. This role is crucial for ensuring the reliability of NVIDIA's AI/ML products through in-depth root-cause analysis and failure investigations. The ideal candidate will have... 
    Quality

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $110.5k - $152k

    Applied Materials, Inc. in Santa Clara, CA is seeking a Quality & Reliability Systems Engineer (E3) to ensure product quality and reliability through testing and evaluation. This full-time position involves developing quality standards, implementing testing methods, and... 
    Quality
    Full time

    Applied Materials, Inc.

    Santa Clara, CA
    1 day ago
  • $110.5k - $152k

    ## Quality & Reliability Systems Engineer - (E3)Applylocations: Santa Clara,CAtime type: Full timeposted on: Posted Todayjob requisition id: R2620088**Who We Are**Applied Materials is a global leader in materials engineering solutions used to produce virtually every new... 
    Quality
    Full time
    Relocation

    Applied Materials, Inc.

    Santa Clara, CA
    1 day ago
  • $136k - $218.5k

     ...markets. As a Silicon Speed Features Engineer, you will co‑design system‑level speed features, build the...  ...hardware, firmware/software, process/reliability, and operations teams to co‑design...  ...satisfy design expectations and product quality. Provide system requirements for... 
    Quality

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $168k - $264.5k

     ...every GPU generation, developing efficient and reliable systems is an imperative. We are looking for a System Reliability Engineer to join NVIDIA's existing Reliability...  ...actions to improve design and manufacturing quality. Establish and continuously improve product... 
    Quality
    Full time

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $147.4k - $220.9k

    Site Reliability Engineer, Customer Systems Sunnyvale, California, United States Software and Services Imagine what you could do here. Apple is a place...  ...other site reliability engineers, software engineers, quality engineers, to gather, define, and analyze non-... 
    Quality
    Relocation

    Apple Inc.

    Sunnyvale, CA
    2 days ago
  • $168k - $264.5k

    Senior Reliability Engineer - LPU Packaging page is loaded## Senior Reliability Engineer - LPU Packaginglocations: US, CA, Santa Claratime type:...  ...Analyzes qual and stress data (including HTOL, package qual, SLT/system stress) and convert to design / process/ material changes... 

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • NVIDIA Gruppe is seeking a Silicon Speed Features Engineer to lead validation and automation infrastructure for silicon issues. You will work across teams to ensure product quality and performance in a dynamic environment. This role requires an MS in EE or equivalent,... 
    Quality

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • Rhoda AI in Palo Alto is looking for a Robot Systems QA Engineer to enhance the quality and reliability of their advanced robotics platform. This role involves designing and executing validation frameworks while collaborating with cross-functional teams to ensure performance... 
    Quality

    Rhoda AI

    Palo Alto, CA
    2 days ago
  •  ...firm in Mountain View, California, is seeking a Senior Quality Assurance and Reliability Engineer to oversee product safety and reliability. In this...  ...experience in design engineering and quality management systems, allowing them to thrive in a fast-paced, innovative environment... 
    Quality

    Reliable Robotics Corporation

    Mountain View, CA
    2 days ago
  • $196k - $310.5k

    Senior System Level Test Engineer Join NVIDIA’s senior engineering team to push the frontiers of system‑level testing...  ...control systems. Improve manufacturing test quality by enhancing test correlation, yield, and reliability across NPI, HVM and RMA processes. Collaborate... 
    Quality

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $125k - $140k

     ...innovative ways to deliver dramatic gains in reliability, efficiency and sustainability in...  ...quickly as the market demands.**Reliability Engineering Department**The Reliability Engineering...  ...overall operating health of critical systems across Vantage global facilities. For... 
    Temporary work
    Work at office
    Local area
    Home office
    Flexible hours

    Vantage Data Centers

    Santa Clara, CA
    1 day ago
  • $110k - $150k

     ...: tools and infrastructure, operating systems, and autonomy. Eighteen of the top 20...  ...commitments. About the role As a Fleet Reliability Engineer at Applied Intuition, you will play a...  ...lifecycle. By driving systemic quality and stability improvements for our on-... 
    Quality
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Remote work
    Day shift

    Applied Intuition

    Sunnyvale, CA
    4 days ago
  • $184k - $287.5k

    NVIDIA Gruppe in Santa Clara is seeking a Senior Hardware Systems Engineer to drive hardware pathfinding for next-generation LPU platforms supporting demanding AI workloads. You will lead the design process, collaborate across teams, and ensure systems are production-ready... 

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $136k - $212.75k

     ...powers transformative AI, HPC, and data center innovations. The LPU (Language Processing Unit) team focuses on creating purpose-...  ...generation AI inference acceleration. We are hiring a Senior Power System Engineer to join the LPU team and lead development of efficient, high-... 

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $136k - $218.5k

    NVIDIA in Santa Clara is seeking a Silicon Speed Features Engineer to co-design system-level speed features across Gaming, Datacenter, Automotive, and Embedded markets. The role involves collaborating cross-functionally and using AI to enhance automation tools for performance... 

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $85k - $130k

     ...HVAC Engineer - Building Systems Consulting San Jose, CA On-site / local travel | Full-time Base salary: $85,000-$130,000 (DOE) An...  ...HVAC scopes across multiple projects, ensuring schedules, quality, and deliverables are met Begin leading small project teams... 
    Quality
    Full time
    Local area
    Relocation

    Building Talent

    Alviso, CA
    23 hours ago
  • $172k - $297.85k

     ...Solutions (RAD) group seeks a Director of Reliability Engineering to lead the Robotics and Digital R&D...  ...improvements of robotic hardware systems. Define and execute a multi‑year hardware...  ...and development meets regulatory and quality standards, maintaining accurate and compliant... 
    Quality
    Temporary work
    Local area

    6267-Auris Health Inc. Legal Entity

    Santa Clara, CA
    23 hours ago
  • $168k - $264.5k

    Join our LPU team as Lead Systems Quality and Reliability Engineer. What you'll be doing: You will own, build, and manage the RMA and FA debug and root-cause analysis for existing and new Nvidia AI/ML products. You will conduct tests and root-cause analysis. Other responsibilities... 
    Quality

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • NVIDIA Corporation in Santa Clara is seeking an experienced hardware engineer to collaborate cross-functionally on system-level features. Responsibilities include defining specifications, performing validation, and leading complex debug efforts to ensure timely product... 

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $168k - $264.5k

    A leading technology company is seeking a Senior Reliability Engineer to join their LPU packaging team in Santa Clara, CA. This role involves owning the package-level reliability specifications, defining qualification requirements, and leading materials selection for reliability... 

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $115k - $140k

    Sr. NPI Systems Electrical Engineer The Company Halo Industries has developed breakthrough technology...  ...collaboration with design, manufacturing, and quality teams to identify potential issues,...  ...data analysis. Experience with high‑reliability and safety‑critical systems. Any... 
    Quality
    Full time
    Contract work
    Temporary work
    Work at office

    Halo Industries, Inc.

    Santa Clara, CA
    3 days ago
  • $185k

     ...support for facility mechanical systems, including HVAC, exhaust,...  ...‑control concerns, reliability issues, and recurring defects...  ...Perform or review practical engineering calculations such as heat load...  ...authority; verify work scope and quality without assuming employee management... 
    Quality
    Full time
    Temporary work
    For contractors
    H1b
    Local area

    WGNSTAR

    Santa Clara, CA
    2 days ago
  • $106k - $170.2k

    Job Title Hardware Reliability Test Engineer - Robotics and Digital R&D Team (Santa Clara, California)...  ...mechanical, mechatronics, robotic controls, systems and software engineers who are...  ...functionally with design, manufacturing, quality, and FA to close reliability issues... 
    Quality
    Temporary work
    Local area
    Worldwide

    6267-Auris Health Inc. Legal Entity

    Santa Clara, CA
    2 days ago
  • A leading medical technology company in Santa Clara, CA is seeking a Systems Quality Assurance Test Engineer. This role focuses on developing and implementing effective test methods for diagnostic systems, ensuring product quality throughout development phases. Candidates... 
    Quality

    Hologic, Inc.

    Santa Clara, CA
    2 days ago
  •  ...Sub Function: R&D Mechanical Engineering Job Category: Scientific/Technology...  ...Engineer - Robotic Systems for the Ottava Team. This is...  ...and a passion for developing reliable, high‑performance hardware for...  ...engineering, robotics and controls, quality, manufacturing, and... 
    Quality
    Work at office
    Local area

    Johnson & Johnson

    Santa Clara, CA
    3 days ago
  • System Quality Assurance Test Engineer, Sustaining We are looking for a Systems Quality Assurance Test Engineer to join our product development team and help ensure the quality and reliability of our diagnostic systems. In this role, you willdesign, develop, and implement... 
    Quality

    Hologic, Inc.

    Santa Clara, CA
    2 days ago
  • $200k - $322k

     ...NVIDIA is seeking a Senior Manager of Site Reliability Engineering to lead and reshape how IT operations...  ...management to build AI-powered systems that enhance reliability, speed, and employee...  ...service-level visibility, signal quality, and actionable insights across the IT... 
    Quality

    NVIDIA

    Santa Clara, CA
    3 days ago
  • A leading data storage company is seeking a Senior Systems QA Engineer in Santa Clara, CA. This role involves ensuring quality for innovative storage solutions, driving automated testing, and high-system reliability. Candidates should possess deep knowledge of the storage... 
    Quality
    Flexible hours

    Pure Storage, Inc.

    Santa Clara, CA
    4 days ago
  • $133.5k - $183.5k

    Product Quality and Reliability Engineer IV - (E4) page is loaded## Product Quality and Reliability Engineer IV - (E4)locations: Santa Clara,CAtime...  ...Demonstrates depth and breadth of expertise in mechanical/systems engineering, mechanisms and components qualification,... 
    Quality
    Full time
    Work experience placement
    Worldwide
    Relocation

    Applied Materials, Inc.

    Santa Clara, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Systems Quality and Reliability Engineer - LPU. Be the first to apply!