Distinguished Resiliency and Safety Architect, GPU Diagnostics

$320k

NVIDIA

We are now looking for a Distinguished Resiliency and Safety Architect, GPU Diagnostics! Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, encouraging environment where everyone is inspired to do their best work. Come join the team and see how we can make a lasting impact on the world.

We are now seeking a Resiliency and Safety Architect to support the development of GPU (graphics processing unit) diagnostics for Resiliency in the Datacenter and Functional Safety in Autonomous Vehicles and Robots. In this role, you will be a key member of a team of innovators, challenging the status quo and pushing beyond boundaries. You will have the opportunity to impact the industry's leading GPUs and SoCs powering product lines ranging from the rapidly growing field of artificial intelligence to self-driving cars and robots.

**What you'll be doing:**

* Design, develop, and maintain a diagnostics software suite to efficiently stress test NVIDIA GPUs and SoCs to identify hardware defects, including defects that cause silent data corruption. These tests will run in large-scale deployments of Datacenter GPUs and Safety SoCs in package/board/rack configurations spanning GPUs, CPUs, and Networking SoCs.
* Address coverage gaps in the NVIDIA diagnostic suite flagged by silicon failures on customer workloads or test suites. Enhance diagnostics to improve repeatability of detected failures and optimize test time.
* Develop tests for GPUs in automotive functional safety contexts, including low-level routines that exercise instruction sets, memory subsystems, and interrupt mechanisms, in compliance with ISO 26262 and related safety standards.
* Collaborate with architecture, RTL, and verification teams to ensure safety coverage, correctness, and robustness across GPU generations.
* Study silent data corruption, intermittent faults, and hard-to-reproduce failures in the field, including customer returns (RMAs), to establish root causes and improve detection by diagnostics.
* Support deployment of diagnostics in pre-production qualification environments as well as large-scale production usage.

**What we need to see:**

* Master’s or PhD degree in Computer Science, Computer Engineering, Electrical Engineering, or a closely related field, or equivalent experience.
* 15+ years of relevant experience.
* Ability to reason across hardware/software boundaries to debug complex system-level issues.
* In-depth understanding of the architecture and micro-architecture of high-performance computing systems. Strong knowledge of hardware failure mechanisms that can result in incorrect computation.
* Proficiency in C/C++ and CUDA programming.
* Scripting and automation with Python or similar.
* Understanding of the software development life cycle, from requirements to testing closure and maintenance, including creating customer releases and documentation.
* Excellent interpersonal skills and ability to collaborate with on-site and remote teams.
* Strong debugging and analytical skills.
* Self-driven and results-oriented.

**Ways to stand out from the crowd:**

* Familiarity with GPU and SoC architectures and Machine Learning/Deep Learning concepts.
* Understanding of the factors that cause silent data corruption in hardware.
* Ability to use high-performance libraries and write hand-crafted kernels where necessary to create stress conditions that induce hardware failures.
* Experience in embedded software development.

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing - with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company”.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 320,000 USD - 488,750 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until February 27, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Vacancy posted 2 days ago
Similar jobs that could be interesting for you
Based on the Distinguished Resiliency and Safety Architect, GPU Diagnostics in Santa Clara, CA vacancy
  • $320k

    A leading AI computing company in California is looking for a Distinguished Resiliency and Safety Architect to develop key diagnostics for GPUs. This role involves designing software for stress testing, improving safety protocols, and addressing hardware defects. Candidates... 
    Suggested

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $136k - $218.5k

     ...We are looking for a Senior GPU Low Power Architect within our Hardware Team! NVIDIA is known as a world leader in providing energy-efficient...  ...including writing test plans and directed or random diagnostics. Widely considered to be one of the technology world’s... 
    Suggested

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

     ...This is an outstanding opportunity to join a world-class team and play a pivotal role in crafting the future of GPU technology. At NVIDIA, you will work with dedicated individuals in an inclusive and collaborative environment where your hardworking nature will drive flawless... 
    Suggested
    Work experience placement
    Night shift

    NVIDIA

    Santa Clara, CA
    7 hours ago
  • $184k - $287.5k

     ...We are now looking for a Senior GPU Architect! The NVIDIA GPU Architecture group is looking for world class architects and software developers to join and lead our various architecture efforts. A key part of NVIDIA's strength is to innovate in the graphics and parallel... 
    Suggested

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $168k - $310.5k

    Senior Applied Power Architect - GPU. Locations: US, CA, Santa Clara; US, TX, Austin. Time type: Full time. Posted on: Posted Today. Job requisition ID: JR1997556. NVIDIA is known as a world leader in providing energy-efficient... 
    Suggested

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • $152k - $287.5k

    A leading technology company is seeking a Senior GPU Memory System Architect in Santa Clara to develop architecture and micro-architecture for GPU memory systems. Candidates should have 3+ years in GPU or CPU architecture and a master's degree in a relevant discipline.... 

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $100k - $166.75k

    NVIDIA Corporation is seeking a GPU Power Architect - New College Grad in Santa Clara, CA. This full-time role focuses on developing power estimation models for cutting-edge GPUs and improving energy efficiency. The ideal candidate will pursue or have recently completed... 
    Full time

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • NVIDIA Corporation is seeking a Power Architect for New College Grad 2026 in Santa Clara, CA. You will be responsible for architecting GPU power features and managing system-level power solutions. Collaboration with various teams is essential to develop energy efficiency... 

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $184k - $287.5k

    Senior Architect, GPU Profiling System. Locations: US, CA, Santa Clara; US, Remote. Time type: Full time. Posted on: Posted Yesterday. Job requisition ID: JR2014379. NVIDIA’s GPU Architecture Group is looking for architects... 

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • A leading tech company is looking for a Senior GPU Memory Architect to define innovative features for next-generation GPU memory. The ideal candidate will have extensive experience in memory systems architecture, strong programming skills in C/C++ and Python, and excellent... 

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...art and science of computer graphics, with our invention of the GPU. The GPU has also shown to be spectacularly effective at...  ...join our team! NVIDIA Architecture Modeling group is looking for Architects, Functional Modeling Engineers, and Simulation experts to join... 

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • Datacenter GPU Power Architect - New College Grad NVIDIA is known as a world leader in providing energy‑efficient high‑performance products and we continue to invest in the research and development of hyper‑efficient GPU and SOC architectures. We continually innovate to... 

    NVIDIA AI

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

    Senior GPU Verification Architect. Locations: US, CA, Santa Clara. Time type: Full time. Posted on: Posted 6 Days Ago. Job requisition ID: JR1989512. This is an outstanding opportunity to join a world-class team and play a... 
    Full time
    Work experience placement

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $272k - $431.25k

    NVIDIA Corporation in Santa Clara seeks a Principal GPU Memory Simulation Architect to design advanced GPU memory systems. Candidates with a Bachelor's degree in engineering or computer science, along with 15+ years of experience, are encouraged to apply. The role includes... 

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • A leading technology firm in Santa Clara seeks a Senior Architect for GPU Profiling Systems. This role involves architecting features for GPU profiling, building performance models, and validating designs. Candidates should have a Master's or PhD in a relevant field, with... 
    Remote job

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $100k - $189.75k

    NVIDIA AI is seeking a Datacenter GPU Power Architect - New College Grad in Santa Clara, California. This role focuses on developing energy-efficient high-performance products, contributing to power estimation models for GPU products, and utilizing machine-learning techniques... 

    NVIDIA AI

    Santa Clara, CA
    1 day ago
  • Advanced Micro Devices is looking for a Principal Engineer in Santa Clara, CA to lead AI infrastructure development, define GPU architecture specifications, and drive performance gains in ML systems. The role involves leading innovative techniques, collaborating with stakeholders... 

    Advanced Micro Devices

    Santa Clara, CA
    5 days ago
  • $184k - $287.5k

    Senior GPU Memory Architect. Locations: US, CA, Santa Clara. Time type: Full time. Posted on: Posted 2 Days Ago. Job requisition ID: JR2009936. NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more... 

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  •  ...ideal candidate has over 10 years of experience, including at least 3 in a leadership role, and strong knowledge of Windows internals, GPU computing, and AI frameworks. A comprehensive salary package is offered, with opportunities for equity and benefits... 

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $175k - $250k

     ...semiconductor startup in Sunnyvale is looking for a highly experienced GPU Compiler Lead to design and implement a high-performance...  ..., optimizing workloads, and collaborating with hardware architects. The role offers a competitive salary range of $175,000-$250,00... 

    Bolt Graphics

    Sunnyvale, CA
    4 days ago
  • NVIDIA Corporation is seeking a Senior GPU Architect in Santa Clara, California to innovate in GPU technology. The role involves developing architecture improvements and collaborating with multiple teams to enhance chip design. Candidates should have a minimum of 10 years... 

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $224k - $356.5k

    Lead Safety Architect - Autonomous Vehicles. Locations: US, CA, Santa Clara; US, DC...  ...AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars... 
    Remote work

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  •  ...are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE. Role Distinguished Technologist, ASIC Design Architect — We are seeking an experienced ASIC Design Architect to own the hardware architecture, microarchitecture, and... 
    Work experience placement
    Work at office

    Hewlett Packard Enterprise

    Sunnyvale, CA
    2 days ago
  • **Distinguished Technologist, ASIC Design Architect** We are seeking an experienced ASIC Design Architect to own the hardware architecture, microarchitecture...  ...to help them succeed. **COVID Policy** The health and safety of our team members, customers and partners is paramount... 
    Local area

    Hewlett Packard Enterprise Development LP

    Sunnyvale, CA
    4 days ago
  •  ...prototype systems and software. Our team is responsible for improving Apple silicon and Apple products through targeted analysis of GPU workloads and proposing improvements at all aspect of the GPU subsystem. In this role, you will analyze existing and new workloads to... 
    Work experience placement

    Apple

    Cupertino, CA
    1 day ago
  •  ...moving fast - and keeping pace means shipping complete, validated GPU stack releases to customers as quickly as the software can...  ...and toolchain configuration ~ Track record of modernizing or architecting a build system used by 100+ developers ~ Strong understanding... 
    Shift work

    Advanced Micro Devices, Inc.

    San Jose, CA
    1 day ago
  • $212k - $386.3k

     ...GPU Benchmark Analysis Architect At Apple, our Platform Architecture group is responsible for connecting our hardware and software into one unified system. You'll collaborate with engineers across Apple to design how all of our technologies work in unison, drive development... 
    Work experience placement
    Relocation

    Apple

    Cupertino, CA
    2 days ago
  • $170k - $220k

     ...keeping mechanisms to maintain explainability, and ensure continued safety of the system for unmanned operations. Visit us at Gatik for...  ...talent and experience to an opportunity to create a more resilient supply chain and contribute to our environment's sustainability... 
    Odd job
    Work at office

    Gatik AI

    Mountain View, CA
    3 days ago
  • $124k - $208.4k

    A leading technology company in San Jose seeks a Senior Engineer, GPU Architect to enhance GPU performance through analysis and optimization. Ideal candidates should have expertise in GPU architecture, programming in C/C++ and Python, and 5+ years in relevant experience... 

    Samsung Electronics Perú

    San Jose, CA
    4 days ago
  • A leading technology company is looking for a GPU Software Architecture Engineer to lead server-side ML acceleration and multi-node distribution initiatives. This role involves architecting next-generation distributed ML infrastructure and optimizing for maximum hardware... 

    Apple Inc.

    Cupertino, CA
    4 days ago
