Senior Deep Learning Kernel Software Performance Architect
$152k - $218.5k
NVIDIA
We are now looking for a Senior Kernel Performance Architect for Deep Learning Software!
NVIDIA is seeking extraordinary architects to develop processor and system architectures that accelerate machine learning, data analytics and high-performance computing applications. This position offers the chance to create a meaningful impact in a dynamic, technology-focused company.
What you will be doing:
Craft GPU-accelerated system architectures that push the boundaries of deep learning performance.
Prototype high-performance software for deep learning and data analytics workloads.
Analyze, visualize, and optimize software performance using analytical models, simulators, and test suites.
Collaborate closely across NVIDIA teams such as:
CUDA Compiler teams to identify performance issues.
AI/ML training and inference performance teams to identify and optimize critical deep learning layers.
Hardware architecture performance teams to define expectations for emerging deep learning hardware features.
What we need to see:
A Master's or PhD in Computer Science, Electrical Engineering or Computer Engineering, or equivalent experience.
5+ years of relevant industry or research experience.
A strong foundation in machine learning and deep learning fundamentals to complement your expertise in computer architecture.
A strong background in high-performance kernels (such as CUTLASS), with work experience in math library performance analysis and profiling to identify performance bottlenecks.
Fluency in programming languages such as Python, C, C++.
Experience and familiarity with GPU computing and parallel programming models.
Firsthand work experience with analytical performance modeling, profiling, and analysis.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Are you a creative architect interested in pushing silicon to its highest performance? If so, we want to hear from you! Come join our DL Architecture team and help build the real-time, cost-effective AI computing platform driving our success in this exciting and quickly growing field.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 218,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until January 17, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

