Senior Software Engineer, CUTLASS Kernels

$152k - $241.5k

NVIDIA

NVIDIA's high-performance computing platforms are powering the AI revolution across many applications and industries. Within our software stack, CUTLASS stands out as a popular open-source ecosystem dedicated to high-performance linear algebra and Tensor Core primitives. Since 2017, it has provided the community with C++ and Python abstractions to implement custom matrix multiply (GEMM) and related math and deep learning computations on NVIDIA GPUs.

If you are passionate about developing and optimizing math kernels to extract the highest performance out of the hardware architecture, apply to join the CUTLASS team today!

What you'll get to do:

  • Write Tensor Core-based deep learning kernels such as grouped-GEMM, attention, and convolution using CUTLASS CUDA C++ and Python DSL for Blackwell, Rubin, and future architectures.

  • Optimize kernels for peak throughput on both silicon and software performance simulators.

  • Collaborate with teams across NVIDIA including the GPU architecture, NVVM/PTX compiler, CUDA library, and DL frameworks teams to ensure fast, functional, and timely kernel delivery to customers.

What we need to see:

  • Master's or PhD degree in Computer Science, Computer Engineering, or related field (or equivalent experience).

  • 3+ years of relevant industry experience.

  • Strong proficiency in C++ programming and software design, including debugging, performance evaluation, and testing.

  • Experience with CUDA, OpenCL, HIP, SYCL, Mojo, Pallas, Triton, Mosaic, Halide, or any general-purpose or domain-specific programming language targeting highly parallel accelerators.

  • Deep understanding of computer architecture and some experience working at the assembly level.

Ways to stand out from the crowd:

  • Experience writing code specifically targeting NVIDIA Tensor Cores, particularly through PTX or CUDA/cuTile.

  • Open-source contributions to math kernel libraries or frameworks.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative, autonomous, and love a challenge, consider joining our Deep Learning Library team and help us build the real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 5, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Vacancy posted 7 hours ago
