Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Systems Quality and Reliability Engineer - LPU

NVIDIA

Job Summary We are seeking a Systems Quality and Reliability Engineer to join our LPU team at NVIDIA. The role focuses on ensuring the reliability of NVIDIA AI/ML products through comprehensive root‑cause analysis and failure investigation. Responsibilities Own, build, and manage the RMA and FA debug and root‑cause analysis for existing and new AI/ML products. Conduct tests and root‑cause analysis of field RMAs. Collaborate with Systems Engineers, Hardware Engineers, Software Engineers, and Operations Engineers to address quality issues. Scale root‑cause and FA capabilities within the organization. Create FA result reports that align with the 8D or similar process. Analyze RMA, FA, and repair data to identify trends and raise quality alerts when necessary. Drive resolution, containment, and mitigation plans for quality alerts. Oversee hardware quality performance, monitoring field quality data and metrics such as RMA rates, MTBF, and Reliability Ratio. Manage operational performance of FA at contract manufacturers, ensuring partners achieve key performance indicators including FA cycle times, fault duplication rates, and fault isolation rates. Oversee the setup of new products into Failure Analysis operations. Qualifications BS/MS in Electrical Engineering, Physics, or a related field (or equivalent experience). 5+ years of hands‑on systems test and/or validation engineering experience. Proven experience in systems quality and reliability engineering. Competence with lab equipment such as oscilloscopes, logic analyzers, power analyzers, etc. Experience enabling reliability tests such as HTOL and quality tests such as Burn‑in. Knowledge of FA techniques and tools such as FIB, SEM, TDR, VNA, and CSAM. Strong knowledge of fault isolation techniques such as OBIRCH, DLS/LADA, LVP, and LVI. Proficiency with high‑speed interfaces (SerDes, PCIe, DDR). Proficiency in Python, PERL, C++, or other languages on UNIX/Linux. Excellent knowledge of PCB card and system‑level test and debugging. Ability to manage factory floor partners (CMs) for RMA/FA activities. Compensation Base salary ranges are 136,000–218,500 USD for Level3 and 168,000–264,500 USD for Level4. Eligibility for equity and benefits. Applications are accepted until May30,2026 . NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. Discrimination on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law is prohibited. #J-18808-Ljbffr NVIDIA Corporation

Vacancy posted 16 hours ago
Similar jobs that could be interesting for youBased on the Systems Quality and Reliability Engineer - LPU in Santa Clara, CA vacancy
  • NVIDIA Corporation is seeking a Systems Quality and Reliability Engineer to join their LPU team. This role is crucial for ensuring the reliability of NVIDIA's AI/ML products through in-depth root-cause analysis and failure investigations. The ideal candidate will have... 
    Quality

    NVIDIA Corporation

    Santa Clara, CA
    7 hours ago
  • $110.5k - $152k

    Applied Materials, Inc. in Santa Clara, CA is seeking a Quality & Reliability Systems Engineer (E3) to ensure product quality and reliability through testing and evaluation. This full-time position involves developing quality standards, implementing testing methods, and... 
    Quality
    Full time

    Applied Materials, Inc.

    Santa Clara, CA
    2 days ago
  • $110.5k - $152k

    ## Quality & Reliability Systems Engineer - (E3)Applylocations: Santa Clara,CAtime type: Full timeposted on: Posted Todayjob requisition id: R2620088**Who We Are**Applied Materials is a global leader in materials engineering solutions used to produce virtually every new... 
    Quality
    Full time
    Relocation

    Applied Materials, Inc.

    Santa Clara, CA
    2 days ago
  •  ...ID: JR2018911 Job Category: Engineering Time Type: Full time SCG...  ...Engineer, you will co-design system-level speed features, build...  ...firmware/software, process/reliability, and operations teams to co‑...  ...design expectations and product quality. Provide system... 
    Quality
    Full time

    NVIDIA AI

    Santa Clara, CA
    1 day ago
  • $147.4k - $220.9k

    Site Reliability Engineer, Customer Systems Sunnyvale, California, United States Software and Services Imagine what you could do here. Apple is a place...  ...other site reliability engineers, software engineers, quality engineers, to gather, define, and analyze non-... 
    Quality
    Relocation

    Apple Inc.

    Sunnyvale, CA
    3 days ago
  • $168k - $264.5k

    Senior Reliability Engineer - LPU Packaging page is loaded## Senior Reliability Engineer - LPU Packaginglocations: US, CA, Santa Claratime type:...  ...Analyzes qual and stress data (including HTOL, package qual, SLT/system stress) and convert to design / process/ material changes... 

    NVIDIA Corporation

    Santa Clara, CA
    16 hours ago
  • NVIDIA Gruppe is seeking a Silicon Speed Features Engineer to lead validation and automation infrastructure for silicon issues. You will work across teams to ensure product quality and performance in a dynamic environment. This role requires an MS in EE or equivalent,... 
    Quality

    NVIDIA Gruppe

    Santa Clara, CA
    16 hours ago
  • Rhoda AI in Palo Alto is looking for a Robot Systems QA Engineer to enhance the quality and reliability of their advanced robotics platform. This role involves designing and executing validation frameworks while collaborating with cross-functional teams to ensure performance... 
    Quality

    Rhoda AI

    Palo Alto, CA
    3 days ago
  •  ...firm in Mountain View, California, is seeking a Senior Quality Assurance and Reliability Engineer to oversee product safety and reliability. In this...  ...experience in design engineering and quality management systems, allowing them to thrive in a fast-paced, innovative environment... 
    Quality

    Reliable Robotics Corporation

    Mountain View, CA
    3 days ago
  • $196k - $310.5k

    Senior System Level Test Engineer Join NVIDIA’s senior engineering team to push the frontiers of system‑level testing...  ...control systems. Improve manufacturing test quality by enhancing test correlation, yield, and reliability across NPI, HVM and RMA processes. Collaborate... 
    Quality

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $292k

     ...seeking a strong technology leader for our Engineering Operations and Site Reliability Engineering for our next-generation datacenter server systems. This role sits at the intersection of...  ...sustaining teams to improve product quality, serviceability, and development velocity... 
    Quality
    Full time

    NVIDIA

    Santa Clara, CA
    9 hours ago
  • $184k - $287.5k

    NVIDIA Gruppe in Santa Clara is seeking a Senior Hardware Systems Engineer to drive hardware pathfinding for next-generation LPU platforms supporting demanding AI workloads. You will lead the design process, collaborate across teams, and ensure systems are production-ready... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $136k - $218.5k

    NVIDIA in Santa Clara is seeking a Silicon Speed Features Engineer to co-design system-level speed features across Gaming, Datacenter, Automotive, and Embedded markets. The role involves collaborating cross-functionally and using AI to enhance automation tools for performance... 

    NVIDIA

    Santa Clara, CA
    16 hours ago
  • $130k - $176k

     ...Job Description Job Description Staff R&D Engineer (Catheter & Mechanical Systems) – NuevoSono Status: Full-time, Exempt Location: Onsite, Santa...  ...platform designed to simplify workflows, improve image quality, and deliver clinically meaningful insights at the... 
    Quality
    Permanent employment
    Full time
    Work at office
    Relocation
    Visa sponsorship
    Work visa

    T45 Labs

    Santa Clara, CA
    20 days ago
  • $110k - $150k

     ...: tools and infrastructure, operating systems, and autonomy. Eighteen of the top 20...  ...commitments. About the role As a Fleet Reliability Engineer at Applied Intuition, you will play a...  ...vehicle lifecycle. By driving systemic quality and stability improvements for our on-... 
    Quality
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Remote work
    Day shift

    Decisive Point

    Sunnyvale, CA
    1 day ago
  • Johnson & Johnson is seeking a Design for Reliability (DfR) Engineer to join their Robotics and Digital R&D Team in Santa Clara, CA. This role...  ...thinking into product design to enhance robotic surgical systems. Ideal candidates should possess a Bachelor’s degree and have... 

    Johnson & Johnson

    Santa Clara, CA
    16 hours ago
  • Intuitive in Sunnyvale is looking for a Vision Systems Electrical Engineer. This unique role requires a mix of engineering skills, including...  ...in cross-functional teams, ensuring product quality and reliability. This position is a great chance to work with innovative... 
    Quality

    Intuitive

    Sunnyvale, CA
    1 day ago
  • $168k - $264.5k

    Join our LPU team as Lead Systems Quality and Reliability Engineer. What you'll be doing: You will own, build, and manage the RMA and FA debug and root-cause analysis for existing and new Nvidia AI/ML products. You will conduct tests and root-cause analysis. Other responsibilities... 
    Quality

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • Santa Clara, CA based Pure Storage, Inc. is seeking a Senior Systems QA Engineer to ensure reliability in their innovative storage solutions. You will lead efforts in quality assurance for the FlashArray team and collaborate closely with Software Engineering and Support... 
    Quality

    Pure Storage

    Santa Clara, CA
    16 hours ago
  • $168k - $264.5k

    A leading technology company is seeking a Senior Reliability Engineer to join their LPU packaging team in Santa Clara, CA. This role involves owning the package-level reliability specifications, defining qualification requirements, and leading materials selection for reliability... 

    NVIDIA Corporation

    Santa Clara, CA
    16 hours ago
  • $115k - $140k

    Sr. NPI Systems Electrical Engineer The Company Halo Industries has developed breakthrough technology...  ...collaboration with design, manufacturing, and quality teams to identify potential issues,...  ...data analysis. Experience with high‑reliability and safety‑critical systems. Any... 
    Quality
    Full time
    Contract work
    Temporary work
    Work at office

    Halo Industries, Inc.

    Santa Clara, CA
    4 days ago
  • $124k - $171k

     ...Materials, Inc. is hiring a Mechanical Engineer III in Santa Clara, California. This...  ...position focuses on supporting automation systems engineering projects, requiring...  ...oversee project management, ensuring quality and reliability throughout the product lifecycle. Qualifications... 
    Quality

    Applied Materials, Inc.

    Santa Clara, CA
    2 days ago
  • $160k - $220k

     ...seeking a Senior Mechanical Engineer to lead the design, prototyping...  ...and production of our ground systems. Matternet Stations ground...  .... They enable continuous, reliable operations and play a critical...  ...design reviews, and produce high-quality documentation and release... 
    Quality
    Flexible hours

    Matternet

    Mountain View, CA
    12 days ago
  • $106k - $170.2k

    Job Title Hardware Reliability Test Engineer - Robotics and Digital R&D Team (Santa Clara, California)...  ...mechanical, mechatronics, robotic controls, systems and software engineers who are...  ...functionally with design, manufacturing, quality, and FA to close reliability issues... 
    Quality
    Temporary work
    Local area
    Worldwide

    6267-Auris Health Inc. Legal Entity

    Santa Clara, CA
    3 days ago
  • $113.67k - $153.8k

     ...Integrity Associates (SIA) is seeking a Senior Mechanical Engineer with expertise in thermo-fluid systems to join our Energy Services Group within the Turbine...  ...scale projects as appropriate Ensure technical quality through peer review, documentation, and adherence to... 
    Quality
    Temporary work
    Casual work
    Flexible hours

    SI Solutions, LLC

    San Jose, CA
    5 days ago
  • $133.5k - $183.5k

     ...a global leader in materials engineering solutions used to produce virtually...  ...to develop the highest quality manufacturing processes and the most advanced and reliable production machines for our worldwide...  ...of expertise in mechanical/systems engineering, mechanisms and... 
    Quality
    Full time
    Work experience placement
    Worldwide
    Relocation

    Applied Materials, Inc.

    Santa Clara, CA
    7 hours ago
  • NVIDIA Corporation is looking for a Senior Systems Software Engineer (SRE) in Santa Clara, California. This role focuses on designing, building...  ...include ensuring GPU cloud services run with maximum reliability, participating in service lifecycles, and leveraging automation... 

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $154k - $193k

     ...California, Antora's thermal batteries deliver reliable and cost-effective heat and power for...  ...Senior or Staff Mechanical Engineer, Fluid Systems to join our Product Development Team....  ...improve speed, insight, or iteration quality. Just as important, you'll bring the judgment... 
    Quality
    Flexible hours

    Antora Energy

    San Jose, CA
    7 days ago
  • NVIDIA Corporation is looking for a Lead Systems Quality and Reliability Engineer in Santa Clara, California. You will own and manage debug and root-cause analysis for AI/ML products, collaborating with various engineering teams. The role requires extensive experience... 
    Quality

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $150k - $195k

     ...Develop best practices alongside engineering/operations teams to improve the scalability and reliability of internal processes....  ...SRE experience with production systems (depending on level) Strong development...  ...building production quality cloud infrastructure that enables... 
    Quality
    Full time
    Worldwide

    Isc2 Eastbay Chapter

    Sunnyvale, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Systems Quality and Reliability Engineer - LPU. Be the first to apply!