Senior Software Engineer, CUTLASS Kernels
$152k - $241.5kNVIDIA
NVIDIA's high-performance computing platforms are powering the AI revolution across many applications and industries. Within our software stack, CUTLASS stands out as a popular open-source ecosystem dedicated to high-performance linear algebra and Tensor Core primitives. Since 2017, it has provided the community with C++ and Python abstractions to implement custom matrix multiply (GEMM) and related math and deep learning computations on NVIDIA GPUs.
If you are passionate about developing and optimizing math kernels to extract the highest performance out of the hardware architecture, apply to join the CUTLASS team today!
What you'll get to do:
Write Tensor Core-based deep learning kernels such as grouped-GEMM, attention, and convolution using CUTLASS CUDA C++ and Python DSL for Blackwell, Rubin, and future architectures.
Optimize kernels for peak throughput on both silicon and software performance simulators.
Collaborate with teams across NVIDIA including the GPU architecture, NVVM/PTX compiler, CUDA library, and DL frameworks teams to ensure fast, functional, and timely kernel delivery to customers.
What we need to see:
Masters or PhD degree in Computer Science, Computer Engineering, or related field (or equivalent experience).
3+ years of relevant industry experience.
Strong proficiency in C++ programming and software design, including debugging, performance evaluation, and testing.
Experience with CUDA, OpenCL, HIP, SYCL, Mojo, Pallas, Triton, Mosaic, Halide, or any general-purpose or domain-specific programming language targeting highly parallel accelerators.
Deep understanding of computer architecture and some experience working at the assembly level.
Ways to stand out from the crowd:
Experience writing code specifically targeting NVIDIA Tensor Cores, particularly through PTX or CUDA/cuTile.
Open-source contributions to math kernel libraries or frameworks.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hard working people in the world working for us. If you're creative, autonomous, and love a challenge, consider joining our Deep Learning Library team and help us build the real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until June 5, 2026.This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.$152k - $241.5k
...applications and industries. Within our software stack, CUTLASS stands out as a popular open-source... ...training passes to identify key GPU kernel and fusion opportunities. Identify... ...degree in Computer Science, Computer Engineering, or related field (or equivalent...Senior$152k - $241.5k
...applications and industries. Within our software stack, CUTLASS stands out as a popular open-source... ...that enable writing high performance kernels, apply to join the CUTLASS team today... ...degree in Computer Science, Computer Engineering, or related field (or equivalent...Senior$184k - $287.5k
...We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We build innovative AI systems software to... ...develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture...SeniorRemote work$184k - $287.5k
...their best work. Come join the team and see how you can make a lasting impact on the world. We are looking for a Senior Linux Kernel Software Engineer to join the Linux networking drivers R&D team. The work environment is versatile, informative, dynamic and challenging...Senior$152k - $218.5k
...We are now looking for a Senior Kernel Performance Architect for Deep Learning Software! NVIDIA is seeking extraordinary architects... ...Computer Science, Electrical Engineering or Computer Engineering, or... ...performance kernel (such as CUTLASS), work experience on math library...SeniorWork experience placement$224k - $356.5k
...We are looking for a Senior Deep Learning Software Engineer to design and build our automated inference and... ...optimizations and developing custom GPU kernels in CUDA and/or Triton. This is an... ...in frameworks such as CUDA, CUTLASS, or Triton. Increasingly known as...Senior$184k - $287.5k
...are seeking highly skilled and motivated software engineers to join us and build AI inference... ...performance inference stacks, optimize GPU kernels and compilers, drive industry benchmarks... ...MLIR/LLVM, XLA), GPU libraries (e.g., CUTLASS) and features (e.g., CUDA Graph, Tensor...Senior- ...Senior Software Engineer - Kernel & Device Drivers Austin, Texas, United States; San Jose, California, United States The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes,...SeniorFull timeTemporary workLocal areaFlexible hours
$213k - $263k
...Senior Software Engineer, Linux Kernel Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World...SeniorFull timeRemote work$100k - $180k
...dataplane and integrate with 3rd party SDK. # Contribute in software design, development, review, and documentations to deliver high... ...solving skills are essential. Exposure to OS system calls and kernel APIs will be plus. Any Scripting knowledge is added...Senior$272k - $431.25k
...architecture and hands-on delivery across system software, drivers, and CUDA to make profiling... .... Set technical direction for an engineering team; mentor engineers, drive technical... ...runtime/driver APIs, CUDA streams/graphs, and kernel behavior. Track record building...Senior$152k - $241.5k
...for automotive and robotics. We build the software stack that enables Large Language,... ...Collaborate with teams across CUDA, kernel libraries, compilers, and robotics to deliver... ...in Computer Science, Electrical/Computer Engineering, or a closely related field. ~4+ years...SeniorRemote work$184k - $287.5k
...Senior Software Engineer For Compiler Team NVIDIA's GPUs are at the core of modern AI infrastructure, from training large-scale models to running... ...and execution stack, targeting high-performance kernel generation for deep learning workloads. What you'll be doing...SeniorWork experience placement$152k - $241.5k
...Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We are hiring Sr. Software Engineer who will help build simulators for our DGX Server platforms... ...DGX Simulator; work closely with hardware modeling, kernel & platform driver teams distributed globally....SeniorWork experience placement$152k - $241.5k
...searching for highly motivated, creative engineers to join the Platform Software team. You will work with a team of... ...across engineering levels and senior management. Strong C/C++ and Python... ...Operating systems fundamentals with Kernel , Hypervisor experience on Windows systems...Senior$165k - $242k
...Senior Software Engineer II, Inference Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers... ...to-end ML system performance by developing and tuning CUDA kernels, reducing model latency, maximizing compute and memory bandwidth...SeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hoursShift work$139k - $204k
...Senior Software Engineer I, Inference Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers... ...TensorRT-LLM, Ray Serve, TorchServe). Experience with CUDA kernels, NCCL/SHARP, RDMA/NUMA, or GPU interconnect topologies....SeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hoursShift work$184k - $287.5k
...NVIDIA is seeking a Senior Software Engineer to join our CSP Engagements team, focusing on system software for Datacenter products such as GB2... ...combines deep technical expertise in embedded firmware, Linux kernel development, and middleware development, with customer-...Senior$150k - $250k
...Senior-Level Engineer Applied Intuition, Inc. is powering the future of physical AI. Founded in 2017 and now valued at $15 billion, the Silicon... ...data layout, memory movement, synchronization, and kernel performance Architect and optimize multi-threaded C++ systems...SeniorFull timeFor contractorsFor subcontractorCasual workWork at officeRemote workDay shift$154.42k - $235.9k
...make complex systems reliable, observable, and fast. As a Senior Software Engineer, you will design and deliver the core communication and... ...resimulationpipelines. Advanced networking/IO:io_uring,RDMA/PCIe messaging, kernel/bpf-based observability, packetcaptureand analysis. Real...SeniorPermanent employmentLocal areaWork from homeRelocationRelocation packageFlexible hours- ...your career. The Role As a senior member of the LLM inference... ...at the intersection of inference engines, distributed systems, and GPU runtime and kernel backends. The Person You are... ...architectures and kernel development Software Engineering ~ Expertise in...Senior
$182k - $242k
...2025. Learn more at About this role We're looking for a Senior Engineer for CoreWeave's Benchmarking & Performance team. You will... ...or multi-region environments. Background working with CUDA kernels, NCCL/SHARP, RDMA/NUMA, or GPU interconnect topologies. The...SeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hours$152k - $241.5k
...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping... ...Software. Proficiency in Python as well as Background in GPU kernel programming using CUDA or OpenCL. Experience in software...Senior$152k - $241.5k
...every new AI‑powered application is built. We are seeking a Senior Software Engineer – AI Inference to advance open‑source LLM serving by... ...paths across layers-from Python orchestration to C++/CUDA kernels-using data to guide optimization work. Improve multi‑GPU...SeniorRemote work$152k - $241.5k
...We are seeking a Senior Software Engineer to drive integration of the NVIDIA Grove project within Dynamo and across a set of leading open-source... ...and optimization (Nsight tools or similar), and/or kernel-level performance tuning. Experience with reproducibility...SeniorRemote work- ...work will be instrumental in enhancing GPU kernel performance, accelerating deep learning... ...will collaborate across internal GPU software teams and engage with open-source communities... ...THE PERSON: Skilled engineer with strong technical and analytical expertise...Senior
$152k - $241.5k
...NVIDIA is seeking outstanding senior engineers to work on the CUDA driver, a key component of... ...GPU computing. You will join a versatile software engineering team that delivers innovative... ...help design software solutions across kernel mode components, compilers, and...Senior$184k - $287.5k
...Senior Security Architect NVIDIA has been transforming computer... ...looking for an outstanding engineer to be a Senior Security Architect... ...computing hardware, software, and datacenter platforms in... ...Based Access Control. Linux kernel hardening (SELinux/AppArmor)...Senior- ...looking for an influential software engineer who is passionate about improving... ...THE PERSON: As a Senior Staff Software Developer,... ...from the lowest-level GPU kernels to large-scale distributed systems... ...(e.g., cuBLAS, cuDNN, CUTLASS, Thrust, CUB, NCCL), or the...Senior
$152k - $241.5k
...We are now looking for a Senior Infrastructure Software Engineer for Deep Learning Libraries! NVIDIA's Deep Learning Libraries Group is seeking excellent... ...products, including cuDNN ( , TensorRT ( , and CUDA kernel libraries. The mission is to design and develop...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Software Engineer, CUTLASS Kernels. Be the first to apply!
- software engineer full time Santa Clara, CA
- startup software engineer Santa Clara, CA
- rust software engineer Santa Clara, CA
- work from home software developer Santa Clara, CA
- software developer Santa Clara, CA
- software development engineer aws Santa Clara, CA
- software qa engineer Santa Clara, CA
- ngo software engineer Santa Clara, CA
- software engineer staff Santa Clara, CA
- software engineer Santa Clara, CA


