Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Kernel Engineer (Compute / Accelerator)

$260k - $320k

DensityAI

ITAR Notice: This role involves access to ITAR-controlled information. Applicants must be U.S. persons (U.S. citizens, U.S. permanent residents, asylees, or refugees) per 22 CFR 120.62 About the role You will write, evaluate, and profile specialized compute kernels that run on a custom AI accelerator. This is the critical interface between high-level ML workloads and silicon — your code directly determines how effectively the hardware performs. You'll work closely with the architecture and compiler teams to define the kernel programming model, implement core tensor operations, and drive the performance profiling workflow that validates silicon design decisions. What you'll do Write and optimize compute kernels for a custom AI accelerator — tensor operations, data movement patterns, memory hierarchy exploitation Develop and maintain profiling infrastructure to measure kernel performance against architectural targets Define and document shuffle patterns for ML kernel primitives across CPU-like control, tensor cores, and CUTLASS-style operations Drive kernel DSL design decisions — thread spawn mechanisms, register passing conventions, and memory management strategies Enable end-to-end kernel execution on the architectural simulator Collaborate with the compiler team on the MLIR dialect — your kernels are the primary validation target Create onboarding documentation and kernel writing guides for the broader team What we're looking for C/C++ — production‑grade systems code, not scripted glue. You'll write performance‑critical kernels CUDA or equivalent accelerator programming — deep experience writing GPU kernels, understanding warp/wavefront execution, memory coalescing, shared memory optimization. The mental model transfers directly Computer architecture — you need to reason about pipelines, memory hierarchies, data movement costs, and how software maps to hardware Performance profiling and optimization — you live in profilers. Identifying bottlenecks, measuring throughput, and iterating until kernels meet targets is the core loop Tensor operations — practical understanding of GEMM, convolution, attention, reduction, and scatter/gather as they map to hardware Python — for scripting, DSL integration, and profiling automation

  • Optional) RISC-V, x86, or ARM64 ISA experience
  • Optional) MLIR or LLVM compiler infrastructure
  • Optional) HPC or scientific computing background (large‑scale parallel compute intuition)
  • Optional) FPGA or Verilog/SystemVerilog (ability to read RTL and reason about the hardware you’re targeting)
  • Optional) Familiarity with CUTLASS, Triton, or similar kernel libraries
Compensation Final offers depend on level, location, and skills relevant to the role. Additional compensation: equity grant per company guidelines; medical / dental / vision; 401(k); standard PTO. DensityAI sponsors qualified candidates for H‑1B, O‑1, TN, E‑3, and other employment‑based visas, and we welcome applicants on F‑1 OPT and STEM‑OPT. Work authorization is required at start; we provide immigration support to secure or transfer status. DensityAI is an Equal Opportunity Employer. We do not discriminate on the basis of race, color, religious creed, national origin, ancestry, physical or mental disability, medical condition, genetic information, marital status, sex, gender, gender identity, gender expression, age (40+), sexual orientation, military or veteran status, pregnancy, or any other status protected by law. We comply with the California CROWN Act and provide reasonable accommodations on request. Full compensation packages are based on candidate experience and relevant certifications.

$260,000 - $320,000 USD

#J-18808-Ljbffr DensityAI

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Kernel Engineer (Compute / Accelerator) in Mountain View, CA vacancy
  • $260k - $320k

    DensityAI is seeking an expert to develop compute kernels for a specialized AI accelerator in Mountain View, California. Your role will focus on writing performance-critical kernels while collaborating with architecture and compiler teams to enhance silicon design. Qualifications... 
    Suggested

    DensityAI

    Mountain View, CA
    2 days ago
  •  ...new class of silicon purpose-built to accelerate scientific computing beyond the limits of GPUs. We are...  ...applied mathematics-driven software engineers who can translate real-world scientific...  ...workloads into executable models, kernels, libraries, and applications that inform... 
    Suggested

    Vorticity

    Redwood City, CA
    1 day ago
  • Sunday Robotics in Redwood City, California is seeking a System Software Engineer to contribute to the accelerated compute layer of their robot platform. The ideal candidate has over 2 years of experience in GPU systems software development, strong proficiency in CUDA,... 
    Suggested

    Sunday Robotics

    Redwood City, CA
    11 hours ago
  • $120k - $192k

     ...possibilities for our customers by accelerating digital business...  ...opportunity to Implement advanced kernel-level resource management features...  .... Be a successful VMKernel engineer and participate in the design...  ...both virtual and real computer hardware. Work with multiple... 
    Suggested
    Local area

    Israelvcforum

    Palo Alto, CA
    4 days ago
  • $180k

    xAI is seeking a talented individual to join their Compute Infrastructure team, focused on building one of the world’s largest AI supercomputers...  .... Ideal candidates will possess deep programming skills, GPU kernel optimization experience, and a strong grasp of large-scale... 
    Suggested

    xAI

    Palo Alto, CA
    4 days ago
  • Vorticity Inc. is seeking a Kernel Engineer to work on their Scientific Processing Unit (SPU) in Redwood City, California. The position focuses on applied mathematics and scientific computing, where you'll prototype core kernels, optimize performance, and collaborate with... 

    Vorticity Inc.

    Redwood City, CA
    11 hours ago
  • $160.36k - $240.54k

     ...connected future. About the Role We’re looking for senior engineers to build/scale Nuro's large-scale computing infrastructure in the cloud/data center. This...  ...to provide robust, generic tools and APIs that accelerate their development of business critical applications... 

    Icehouseventures

    Mountain View, CA
    4 days ago
  • $165k - $242k

     ...the most performant, efficient and resilient solutions for accelerated computing. Since 2017, CoreWeave has operated a growing footprint of...  ...is seeking a highly skilled and motivated Systems Kernel Engineer to join our HAVOCK Team, reporting into the Manager of Systems... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    more than 2 months ago
  • We are seeking a highly experienced Data Scientist / Computer Vision Engineer to join an innovative AI team building large-scale computer vision...  ...performance, scalability, and reliability across GPU-accelerated environments Troubleshoot performance bottlenecks and optimize... 

    Glint Tech Solutions

    Sunnyvale, CA
    2 days ago
  • $138k - $198k

     ...CA, USA Qualifications Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, or a related field, or equivalent...  ...role, you’ll work to shape the future of AI/ML hardware acceleration. You will have an opportunity to drive cutting‑edge TPU (... 
    Worldwide

    Google Inc.

    Sunnyvale, CA
    2 days ago
  • $138k - $198k

     ...USA Apply Qualifications Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, or a related field, or...  ...this role, you’ll work to shape the future of AI/ML hardware acceleration. You will have an opportunity to drive cutting‑edge TPU (Tensor... 
    Full time
    Worldwide

    Google Inc.

    Sunnyvale, CA
    1 day ago
  •  ...are a technology solutions company helping organizations accelerate business growth. Partnering with Fortune 500 companies, we...  ...two decades now. Job Description Job Title: Senior SLAM Computer Vision Engineer Location: Sunnyvale, CA Duration: 6+ months (with high possibility... 
    Contract work

    Aditi Consulting

    Sunnyvale, CA
    2 days ago
  • Overview Synopsys software engineers are key enablers in the world...  ...simulation solutions utilizing GPU acceleration and parallel programming to...  ...utilizing high-performance computing to solve complex problems....  ...designs. Developing GPU kernels (CUDA/HIP/DPC++) to reduce simulation... 

    Synopsys, Inc.

    Sunnyvale, CA
    2 days ago
  • $180k - $240k

    Vinci4D.ai in Palo Alto is seeking a Geometry / Meshing Engineer to help design and optimize systems for their next-generation copilot platform. You'll work with computational geometry and large-scale data to accelerate hardware design through intelligent automation. The... 

    Vinci4D.ai

    Palo Alto, CA
    3 days ago
  • Israelvcforum is seeking an experienced engineer to join the ESX CPU and Server platform system kernel team at VMware by Broadcom. This role focuses on supporting...  ...will have over a decade of experience in Computer Science or Engineering and possesses excellent C... 

    Israelvcforum

    Palo Alto, CA
    4 days ago
  • A cutting-edge tech company in Santa Clara is seeking a Software Engineer to develop software kernels for AI computation. The ideal candidate will have an MS or PhD, proficient skills in C/C++ and Python, and experience with ML algorithms. This role involves collaborating... 
    Remote job

    d-Matrix

    Santa Clara, CA
    8 days ago
  • Apple Inc. is seeking a Systems & Kernel Software Engineer to work in Cupertino, California. The role entails collaborating with teams to enhance...  ...on various OS layers. Candidates should have a BS in Computer Science or equivalent experience, strong programming skills... 

    Apple Inc.

    Cupertino, CA
    3 days ago
  • $180k - $240k

    Senior/Staff Computer Vision Engineer - Deep Learning Focus Mountain View, California, United States About Us At Phantom AI, we’ve built a team of incredibly talented and ambitious people challenging the norm in the automotive industry. We are building cost‑effective... 
    Work experience placement
    Work at office
    Local area

    Phantom AI

    Mountain View, CA
    3 days ago
  •  ...industry through advanced electrical architectures, high-performance computing, and the integration of ADAS and infotainment capabilities. GM...  ...and seamless scalability across every GM brand. As a Hardware Engineer - Central Compute, you will contribute to the design,... 
    Internship
    Work at office
    Flexible hours

    General Motors

    Sunnyvale, CA
    2 days ago
  •  .... Our team is made up of highly experienced ML engineers and tech industry veterans and we’re backed by leading computing and technology companies. HPC Engineers in our...  ...and optimizing high performance collective and kernel libraries for running LLMs on AMD GPUs, using technologies... 

    Lamini

    Palo Alto, CA
    4 days ago
  • We are seeking a highly skilled High Performance Computing (HPC) Engineer with a strong background in modeling and solving complex discrete optimization...  ...high-performance computing environments to scale and accelerate problem-solving, including multi-core, distributed, and... 

    Schlumberger

    Sunnyvale, CA
    1 day ago
  • A leading cloud technology company is seeking a highly skilled HPC Performance Engineer to join their HAVOCK Team in Sunnyvale, California. In this role, you will optimize bare-metal systems and ensure the performance of complex workloads using various technologies including... 

    CoreWeave

    Sunnyvale, CA
    11 hours ago
  • Icehouseventures is looking for an experienced Hardware Engineer in Sunnyvale, California. The successful candidate will develop next-generation autonomous driving compute platforms, focusing on low-level electronics design and embedded systems. Candidates should have over... 

    Icehouseventures

    Sunnyvale, CA
    11 hours ago
  •  ...Join to apply for the Computer Vision Engineer role at Homekey We’re a Stanford startup working on an AI-driven residential real estate marketplace. Check out our team at Homekey here: We’re building across the full lifecycle of real estate: property valuation... 

    Homekey

    Palo Alto, CA
    2 days ago
  • Fiserv, a leader in Fintech, is seeking a Staff Systems Engineer for its Sunnyvale campus. This role involves leading the engineering of secure payment kernels, system architecture implementation, and feature innovation for payment solutions. Candidates should have 7+ years... 

    Shoptalk

    Sunnyvale, CA
    2 days ago
  •  ...every robot in the fleet. As a System Software Engineer on Robot Platform focused on GPU and accelerated compute, you’ll own how every accelerated workload on the...  ...Efficient model execution and switching: Reduce gpu kernel launch overheads and make swapping between models... 

    Sunday Robotics

    Redwood City, CA
    4 days ago
  •  ...Computer Vision Intern — Data Labeling & Annotation Type: Internship / Temporary Duration: 6 months - 12 months What You’ll Gain Exposure...  ...pipeline, from raw data to deployed model Mentorship from CV engineers working on production systems Hands‑on experience with YOLO,... 
    Temporary work
    Internship

    BrightAI Corporation

    Palo Alto, CA
    1 day ago
  • $147.4k - $272.1k

    3D Perception/Computer Vision Algorithm Engineer Sunnyvale, California, United States Machine Learning and AI The Video Computer Vision organization is working on exciting technologies for future Apple products. Our focus is on real time and low power world tracking and... 
    Relocation

    Apple

    Sunnyvale, CA
    11 hours ago
  •  ..., CA, is seeking a Member of Technical Staff to design and build core components of next-generation AI compute solutions. The ideal candidate has a strong engineering background, experience in systems software or distributed systems, and a deep interest in AI infrastructure... 

    DensityAI

    Mountain View, CA
    11 hours ago
  • General Motors is seeking a Hardware Engineer - Central Compute in Sunnyvale, California, to design and validate next-generation high-compute ECUs that empower automated driving and vehicle intelligence. The ideal candidate will have a Bachelor's in Electrical or Computer... 

    General Motors

    Sunnyvale, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Kernel Engineer (Compute / Accelerator). Be the first to apply!