Kernel Engineer (Compute / Accelerator)

$260k - $320k

DensityAI

About The Role

You will write, evaluate, and profile specialized compute kernels that run on a custom AI accelerator. This is the critical interface between high-level ML workloads and silicon: your code directly determines how effectively the hardware performs. You'll work closely with the architecture and compiler teams to define the kernel programming model, implement core tensor operations, and drive the performance profiling workflow that validates silicon design decisions.

What you'll do

  • Write and optimize compute kernels for a custom AI accelerator: tensor operations, data movement patterns, memory hierarchy exploitation
  • Develop and maintain profiling infrastructure to measure kernel performance against architectural targets
  • Define and document shuffle patterns for ML kernel primitives across CPU-like control, tensor cores, and CUTLASS-style operations
  • Drive kernel DSL design decisions: thread spawn mechanisms, register passing conventions, and memory management strategies
  • Enable end-to-end kernel execution on the architectural simulator
  • Collaborate with the compiler team on the MLIR dialect; your kernels are the primary validation target
  • Create onboarding documentation and kernel writing guides for the broader team

What we're looking for

  • C/C++: production-grade systems code, not scripted glue. You'll write performance‑critical kernels
  • CUDA or equivalent accelerator programming: deep experience writing GPU kernels, understanding warp/wavefront execution, memory coalescing, shared memory optimization. The mental model transfers directly
  • Computer architecture: you need to reason about pipelines, memory hierarchies, data movement costs, and how software maps to hardware
  • Performance profiling and optimization: you live in profilers. Identifying bottlenecks, measuring throughput, and iterating until kernels meet targets is the core loop
  • Tensor operations: practical understanding of GEMM, convolution, attention, reduction, and scatter/gather as they map to hardware
  • Python: for scripting, DSL integration, and profiling automation
  • (Optional) RISC‑V, x86, or ARM64 ISA experience
  • (Optional) MLIR or LLVM compiler infrastructure
  • (Optional) HPC or scientific computing background (large‑scale parallel compute intuition)
  • (Optional) FPGA or Verilog/SystemVerilog (ability to read RTL and reason about the hardware you're targeting)
  • (Optional) Familiarity with CUTLASS, Triton, or similar kernel libraries

Compensation

Final offers depend on level, location, and skills relevant to the role. Additional compensation: equity grant per company guidelines; medical/dental/vision; 401(k); standard PTO.

Visa Sponsorship

DensityAI sponsors qualified candidates for H‑1B, O‑1, TN, E‑3, and other employment‑based visas, and we welcome applicants on F‑1 OPT and STEM‑OPT. Work authorization is required at start; we provide immigration support to secure or transfer status.

Equal Opportunity

DensityAI is an Equal Opportunity Employer. We do not discriminate on the basis of race, color, religious creed, national origin, ancestry, physical or mental disability, medical condition, genetic information, marital status, sex, gender, gender identity, gender expression, age (40+), sexual orientation, military or veteran status, pregnancy, or any other status protected by law. We comply with the California CROWN Act and provide reasonable accommodations on request.

Full compensation packages are based on candidate experience and relevant certifications.

California pay range

$260,000 - $320,000 USD

Vacancy posted 7 hours ago