Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Deep Learning Kernel Software Performance Architect

$152k - $218.5k

NVIDIA

We are now looking for a Senior Kernel Performance Architect for Deep Learning Software!

NVIDIA is seeking extraordinary architects to develop processor and system architectures that accelerate machine learning, data analytics and high-performance computing applications. This position offers the chance to create a meaningful impact in a dynamic, technology-focused company.

What you will be doing:

  • Craft GPU-accelerated system architectures that push the boundaries of deep learning performance.

  • Prototype high-performance software for deep learning and data analytics workloads.

  • Analyze, visualize, and optimize software performance using analytical models, simulators, and test suites.

  • Collaborate closely across NVIDIA teams such as:

  • CUDA Compiler teams to identify performance issues.

  • AI/ML training and inference performance teams to identify and optimize critical deep learning layers.

  • hardware architecture performance teams to define expectation for emerging deep learning hardware features.

What we need to see:

  • A Master's or PhD in Computer Science, Electrical Engineering or Computer Engineering, or equivalent experience.

  • 5+ years of relevant industry or research experience.

  • A strong foundation in machine learning and deep learning fundamentals to complement your expertise in computer architecture.

  • A strong background in high performance kernel (such as CUTLASS), work experience on math library performance analysis and profiling to identify performance bottlenecks.

  • Fluency in programming languages such as Python, C, C++.

  • Experience and familiarity with GPU computing and parallel programming models.

  • You have firsthand work experience with analytical performance modeling, profiling, and analysis.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Are you creative architect interested in pushing silicon to its highest performance? If so, we want to hear from you! Come, join our DL Architecture team and help build the real-time, cost-effective AI computing platform driving our success in this exciting and quickly growing field.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 218,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits ( .

Applications for this job will be accepted at least until January 17, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Senior Deep Learning Kernel Software Performance Architect in Santa Clara, CA vacancy
  • $124k - $195.5k

    Deep Learning Kernel Software Performance Architect - New College Grad 2026 page is loaded## Deep Learning Kernel Software Performance Architect - New College Grad 2026locations: US, CA, Santa Claratime type: Full timeposted on: Posted Todayjob requisition id: JR2011814... 
    Performance
    Work experience placement

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU -- our...  ..., NVSHMEM, and UCX that are crucial for scaling Deep Learning and HPC. We're seeking a Senior Software Architect to help co-design next-gen data center platforms... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $224k - $356.5k

     ...accelerated applications, such as Deep Learning. Data Center SW team architects and develops the end to end software and firmware stack for these...  .... We are looking for a Senior Software Architect who has...  ...architecture for scalable and performant server systems, particularly... 
    Senior
    Performance
    Shift work

    NVIDIA

    Santa Clara, CA
    3 days ago
  • A leading technology company is seeking a Senior Software Architect to innovate server systems for deep learning applications. This role involves leading software...  ...architecture design, requiring deep expertise in server performance and collaboration across teams. Ideal... 
    Senior
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $195.2k - $361.2k

     ...architectures, algorithms, and software inspired by the brain's...  ...systems. In this role, you will architect and lead the development of the firmware, runtime, and performance infrastructure, working...  ...years of experience with AI, deep learning, or optimization workloads using... 
    Senior
    Performance
    Work at office
    Local area
    Immediate start
    Shift work

    Intel

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...NVIDIA is seeking a Senior Software Engineer to join...  ...role combines deep technical expertise...  ...firmware, Linux kernel development, and...  ...HPC workloads. Perform advanced system debugging...  ..., and Solution Architect teams to deliver...  ...(CUDA), deep learning workloads... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $184k - $287.5k

     ...technologies in the inference systems software stack! We build innovative...  ..., code generators, and GPU kernel technologies for NVIDIA's...  ...engineers at NVIDIA across deep learning frameworks, libraries,...  ...GPU kernel development and performance optimizations (especially using... 
    Senior
    Performance
    Remote work

    NVIDIA

    Santa Clara, CA
    11 hours ago
  • $184k - $287.5k

     ...artificial intelligence (AI) / deep learning (DL), high-performance computing (HPC),...  ...team as a Senior System Simulation Architect and help us push performance...  ...system and application software in simulation and emulation...  ...Experience with Linux kernel bringup and debug... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    11 hours ago
  •  ...leading technology company is seeking a Senior Systems Software Engineer to join their GPU Software...  ...driver architecture while optimizing performance for both datacenter and gaming...  ...programming, and a strong background in kernel development. This role offers a competitive... 
    Senior
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    11 hours ago
  • $152k - $241.5k

     ...Artificial Intelligence, High Performance Computing and Visualization....  ...are looking for a motivated Deep Learning engineer to bring advanced...  ...fused compute-communication kernels to showcase ultimate performance...  ...experience) with 5+ software engineering and HPC/AI experience... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $320k

     ...NVIDIA AI and HPC software stack. We’re looking...  ...a strong technical architect to own the end-to-end...  ...firmware, kernel drivers, operating...  ...we need to see: ~ Deep expertise in scalable and performant server system architecture...  ...recently, GPU deep learning ignited modern deep... 
    Performance
    Shift work

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...building a scalable and modular software stack that powers advanced...  ...looking for a highly motivated Senior Systems Software Engineer...  ...the software stack to analyze performance, inter-operation and deterministic...  ...Mapping, or Localization. Deep understanding of real-time... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $224k - $356.5k

     ...NVIDIA's Networking Systems & Software Architecture group is...  ...computing interconnects. The Senior Architect role is to own modules and...  ...and implementing high-performance communication and memory management...  ...of Reinforcement Learning systems. With competitive... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $184k - $287.5k

     ...We are hiring senior engineers to work on...  ...Our team analyzes performance of applications,...  ...investigates bottlenecks in software or hardware and...  ..., ranging from deep learning, scientific...  ...GPU hardware, kernel mode drivers, and...  .... Evangelize, architect, and implement new... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $262k - $365k

    Senior Staff Software Architect, GPU Uber Tech Leads corporate_fare Google place Sunnyvale...  ...with Linux OS internals, kernel development, or systems...  ...OCP)). Knowledge of High-Performance Computing (HPC) systems...  ...bonus, equity, or benefits. Learn more about benefits at... 
    Senior
    Performance
    Full time

    Google Inc.

    Sunnyvale, CA
    3 days ago
  • $152k - $241.5k

     ...NVIDIA’s deep learning and HPC platforms have made a huge impact in various...  ...numeric libraries, modular software design ~ Deep knowledge of...  ...model-parallel programming, performance optimization. Experience...  ..., NCCL), and/or CUDA or GPU kernel programming Experience with... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $208k - $327.75k

     ...Platforms Group is seeking a Senior System Architect to define, design, and...  ...networking, storage, and AI software into repeatable blueprints...  ...The ideal candidate brings deep data center architecture experience...  ...accelerated compute, high-performance networking, storage,... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...about redefining how software is built in the age of...  ...to produce high-performance, high-quality, modern...  ...of state-of-the-art deep learning breakthroughs, and improve...  ...you'll be doing: Architecting an AI-native framework...  ...software. CUDA & kernel expertise: Experience... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    11 hours ago
  • $224k - $356.5k

     ...and medical devices. Our software platforms are central to...  ...globally! We are hiring a Senior Systems Software Engineer...  ...focused on optimizing deep learning inference for autonomous...  ...level. They will locate performance issues through kernel trace analysis and evaluate... 
    Senior
    Performance
    Immediate start

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $104.9k - $184.92k

     ...When it comes to cutting edge machine learning to tackle complex problems, Lockheed Martin...  ...with a strong working knowledge in software engineering, Python, C++, and STAR.OS...  ...GPUs, NPUs, etc) • Experience with high-performance computing • Experience developing... 
    Senior
    Performance
    Full time
    Temporary work
    Part time
    Work experience placement
    Work at office
    Remote work
    Relocation
    Flexible hours
    Shift work
    3 days per week

    Lockheed Martin Corporation

    Sunnyvale, CA
    3 days ago
  • $184k - $287.5k

     ...As a Senior Software Architect in the GPU Networking Architecture team, you will define Software Defined Networking (SDN) architectural solutions...  ...to the modern data center, such as distributed AI and deep learning systems, Networking Operating Systems, Virtualization,... 
    Senior

    NVIDIA

    Santa Clara, CA
    4 days ago
  •  ...Machine Learning System Software Architect Sunnyvale, CA Do you want to be part of the AI revolution...  ...the team to architect and create high-performance machine learning system software and...  ...and HPC area. Experience with deep learning frameworks: TensorFlow/PyTorch... 
    Performance
    Work at office

    Baidu

    Sunnyvale, CA
    11 hours ago
  •  ...company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal candidate excels in collaborative and independent...  ...and PyTorch and working closely with GPU software teams. This role promises a dynamic work... 
    Senior
    Performance

    Advanced Micro Devices

    Santa Clara, CA
    1 day ago
  • A leading technology firm in Santa Clara is looking for a Senior Staff Software Developer to drive AI performance and contribute to groundbreaking innovations. This role emphasizes optimizing GPU efficiency and developing cutting-edge AI solutions. Candidates should possess... 
    Senior
    Performance

    Advanced Micro Devices

    Santa Clara, CA
    1 day ago
  • $224k - $356.5k

     ...We are looking for a Senior Deep Learning Software Engineer to design and build our automated inference...  ..., designing and implementing a high-performance execution environment, low-level GPU...  ...and developing custom GPU kernels in CUDA and/or Triton. This is an exceptional... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  •  ...leading semiconductor company is seeking a Senior Staff Software Developer to shape its AI strategy. You will architect the AI software stack, accelerating performance on cutting-edge AMD hardware. Your role will require deep expertise in high-performance C++ programming... 
    Senior
    Performance

    Advanced Micro Devices

    Santa Clara, CA
    1 day ago
  • $320k

     ...developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our...  ...communication libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC. We are looking for a Distinguished Software Architect to help co-design our next generation data... 
    Performance
    Work experience placement

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $193.3k - $261.5k

     ...builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on...  .... The Acceleration Kernel Library team is at the...  ...of maximizing performance for AWS's custom ML accelerators...  .... You will architect and implement business... 
    Senior
    Performance
    Internship
    Local area
    Work from home
    Flexible hours

    Amazon

    Cupertino, CA
    11 hours ago
  • A leading technology company is seeking a Senior AI Software Engineer to join their team in Santa Clara, California....  ...systems software for inference applications including deep learning framework optimizations and GPU kernel technologies. You will closely collaborate with... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...with innovative solutions. As a Software Solution Architect for NVIS, you will play a crucial...  ...deploy world-class AI and machine learning solutions. Join us in Israel and...  ...profiling and analysis tools to improve deep learning model performance. Working with internal... 
    Performance
    Remote work

    NVIDIA

    Santa Clara, CA
    13 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Deep Learning Kernel Software Performance Architect. Be the first to apply!