Systems Quality and Reliability Engineer - LPU
NVIDIA
Job Summary

We are seeking a Systems Quality and Reliability Engineer to join our LPU team at NVIDIA. The role focuses on ensuring the reliability of NVIDIA AI/ML products through comprehensive root‑cause analysis and failure investigation.

Responsibilities

- Own, build, and manage the RMA and FA debug and root‑cause analysis for existing and new AI/ML products.
- Conduct tests and root‑cause analysis of field RMAs.
- Collaborate with Systems Engineers, Hardware Engineers, Software Engineers, and Operations Engineers to address quality issues.
- Scale root‑cause and FA capabilities within the organization.
- Create FA result reports that align with the 8D or similar process.
- Analyze RMA, FA, and repair data to identify trends and raise quality alerts when necessary.
- Drive resolution, containment, and mitigation plans for quality alerts.
- Oversee hardware quality performance, monitoring field quality data and metrics such as RMA rates, MTBF, and Reliability Ratio.
- Manage operational performance of FA at contract manufacturers, ensuring partners achieve key performance indicators including FA cycle times, fault duplication rates, and fault isolation rates.
- Oversee the setup of new products into Failure Analysis operations.

Qualifications

- BS/MS in Electrical Engineering, Physics, or a related field (or equivalent experience).
- 5+ years of hands‑on systems test and/or validation engineering experience.
- Proven experience in systems quality and reliability engineering.
- Competence with lab equipment such as oscilloscopes, logic analyzers, and power analyzers.
- Experience enabling reliability tests such as HTOL and quality tests such as Burn‑in.
- Knowledge of FA techniques and tools such as FIB, SEM, TDR, VNA, and CSAM.
- Strong knowledge of fault isolation techniques such as OBIRCH, DLS/LADA, LVP, and LVI.
- Proficiency with high‑speed interfaces (SerDes, PCIe, DDR).
- Proficiency in Python, PERL, C++, or other languages on UNIX/Linux.
- Excellent knowledge of PCB card and system‑level test and debugging.
- Ability to manage factory floor partners (CMs) for RMA/FA activities.

Compensation

Base salary ranges are 136,000–218,500 USD for Level 3 and 168,000–264,500 USD for Level 4. The role is eligible for equity and benefits.

Applications are accepted until May 30, 2026.

NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. Discrimination on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law is prohibited.

