Kernel Engineer (Compute / Accelerator)
$260k - $320kDensityAI
ITAR Notice: This role involves access to ITAR-controlled information. Applicants must be U.S. persons (U.S. citizens, U.S. permanent residents, asylees, or refugees) per 22 CFR 120.62 About the role You will write, evaluate, and profile specialized compute kernels that run on a custom AI accelerator. This is the critical interface between high-level ML workloads and silicon — your code directly determines how effectively the hardware performs. You'll work closely with the architecture and compiler teams to define the kernel programming model, implement core tensor operations, and drive the performance profiling workflow that validates silicon design decisions. What you'll do Write and optimize compute kernels for a custom AI accelerator — tensor operations, data movement patterns, memory hierarchy exploitation Develop and maintain profiling infrastructure to measure kernel performance against architectural targets Define and document shuffle patterns for ML kernel primitives across CPU-like control, tensor cores, and CUTLASS-style operations Drive kernel DSL design decisions — thread spawn mechanisms, register passing conventions, and memory management strategies Enable end-to-end kernel execution on the architectural simulator Collaborate with the compiler team on the MLIR dialect — your kernels are the primary validation target Create onboarding documentation and kernel writing guides for the broader team What we're looking for C/C++ — production‑grade systems code, not scripted glue. You'll write performance‑critical kernels CUDA or equivalent accelerator programming — deep experience writing GPU kernels, understanding warp/wavefront execution, memory coalescing, shared memory optimization. The mental model transfers directly Computer architecture — you need to reason about pipelines, memory hierarchies, data movement costs, and how software maps to hardware Performance profiling and optimization — you live in profilers. Identifying bottlenecks, measuring throughput, and iterating until kernels meet targets is the core loop Tensor operations — practical understanding of GEMM, convolution, attention, reduction, and scatter/gather as they map to hardware Python — for scripting, DSL integration, and profiling automation
- Optional) RISC-V, x86, or ARM64 ISA experience
- Optional) MLIR or LLVM compiler infrastructure
- Optional) HPC or scientific computing background (large‑scale parallel compute intuition)
- Optional) FPGA or Verilog/SystemVerilog (ability to read RTL and reason about the hardware you’re targeting)
- Optional) Familiarity with CUTLASS, Triton, or similar kernel libraries
$260,000 - $320,000 USD
#J-18808-Ljbffr DensityAI$260k - $320k
DensityAI is seeking an expert to develop compute kernels for a specialized AI accelerator in Mountain View, California. Your role will focus on writing performance-critical kernels while collaborating with architecture and compiler teams to enhance silicon design. Qualifications...Suggested- ...new class of silicon purpose-built to accelerate scientific computing beyond the limits of GPUs. We are... ...applied mathematics-driven software engineers who can translate real-world scientific... ...workloads into executable models, kernels, libraries, and applications that inform...Suggested
- Sunday Robotics in Redwood City, California is seeking a System Software Engineer to contribute to the accelerated compute layer of their robot platform. The ideal candidate has over 2 years of experience in GPU systems software development, strong proficiency in CUDA,...Suggested
$120k - $192k
...possibilities for our customers by accelerating digital business... ...opportunity to Implement advanced kernel-level resource management features... .... Be a successful VMKernel engineer and participate in the design... ...both virtual and real computer hardware. Work with multiple...SuggestedLocal area$180k
xAI is seeking a talented individual to join their Compute Infrastructure team, focused on building one of the world’s largest AI supercomputers... .... Ideal candidates will possess deep programming skills, GPU kernel optimization experience, and a strong grasp of large-scale...Suggested- Vorticity Inc. is seeking a Kernel Engineer to work on their Scientific Processing Unit (SPU) in Redwood City, California. The position focuses on applied mathematics and scientific computing, where you'll prototype core kernels, optimize performance, and collaborate with...
$160.36k - $240.54k
...connected future. About the Role We’re looking for senior engineers to build/scale Nuro's large-scale computing infrastructure in the cloud/data center. This... ...to provide robust, generic tools and APIs that accelerate their development of business critical applications...$165k - $242k
...the most performant, efficient and resilient solutions for accelerated computing. Since 2017, CoreWeave has operated a growing footprint of... ...is seeking a highly skilled and motivated Systems Kernel Engineer to join our HAVOCK Team, reporting into the Manager of Systems...Permanent employmentTemporary workCasual workWork at officeRemote workFlexible hours- We are seeking a highly experienced Data Scientist / Computer Vision Engineer to join an innovative AI team building large-scale computer vision... ...performance, scalability, and reliability across GPU-accelerated environments Troubleshoot performance bottlenecks and optimize...
$138k - $198k
...CA, USA Qualifications Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, or a related field, or equivalent... ...role, you’ll work to shape the future of AI/ML hardware acceleration. You will have an opportunity to drive cutting‑edge TPU (...Worldwide$138k - $198k
...USA Apply Qualifications Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, or a related field, or... ...this role, you’ll work to shape the future of AI/ML hardware acceleration. You will have an opportunity to drive cutting‑edge TPU (Tensor...Full timeWorldwide- ...are a technology solutions company helping organizations accelerate business growth. Partnering with Fortune 500 companies, we... ...two decades now. Job Description Job Title: Senior SLAM Computer Vision Engineer Location: Sunnyvale, CA Duration: 6+ months (with high possibility...Contract work
- Overview Synopsys software engineers are key enablers in the world... ...simulation solutions utilizing GPU acceleration and parallel programming to... ...utilizing high-performance computing to solve complex problems.... ...designs. Developing GPU kernels (CUDA/HIP/DPC++) to reduce simulation...
$180k - $240k
Vinci4D.ai in Palo Alto is seeking a Geometry / Meshing Engineer to help design and optimize systems for their next-generation copilot platform. You'll work with computational geometry and large-scale data to accelerate hardware design through intelligent automation. The...- Israelvcforum is seeking an experienced engineer to join the ESX CPU and Server platform system kernel team at VMware by Broadcom. This role focuses on supporting... ...will have over a decade of experience in Computer Science or Engineering and possesses excellent C...
- A cutting-edge tech company in Santa Clara is seeking a Software Engineer to develop software kernels for AI computation. The ideal candidate will have an MS or PhD, proficient skills in C/C++ and Python, and experience with ML algorithms. This role involves collaborating...Remote job
- Apple Inc. is seeking a Systems & Kernel Software Engineer to work in Cupertino, California. The role entails collaborating with teams to enhance... ...on various OS layers. Candidates should have a BS in Computer Science or equivalent experience, strong programming skills...
$180k - $240k
Senior/Staff Computer Vision Engineer - Deep Learning Focus Mountain View, California, United States About Us At Phantom AI, we’ve built a team of incredibly talented and ambitious people challenging the norm in the automotive industry. We are building cost‑effective...Work experience placementWork at officeLocal area- ...industry through advanced electrical architectures, high-performance computing, and the integration of ADAS and infotainment capabilities. GM... ...and seamless scalability across every GM brand. As a Hardware Engineer - Central Compute, you will contribute to the design,...InternshipWork at officeFlexible hours
- .... Our team is made up of highly experienced ML engineers and tech industry veterans and we’re backed by leading computing and technology companies. HPC Engineers in our... ...and optimizing high performance collective and kernel libraries for running LLMs on AMD GPUs, using technologies...
- We are seeking a highly skilled High Performance Computing (HPC) Engineer with a strong background in modeling and solving complex discrete optimization... ...high-performance computing environments to scale and accelerate problem-solving, including multi-core, distributed, and...
- A leading cloud technology company is seeking a highly skilled HPC Performance Engineer to join their HAVOCK Team in Sunnyvale, California. In this role, you will optimize bare-metal systems and ensure the performance of complex workloads using various technologies including...
- Icehouseventures is looking for an experienced Hardware Engineer in Sunnyvale, California. The successful candidate will develop next-generation autonomous driving compute platforms, focusing on low-level electronics design and embedded systems. Candidates should have over...
- ...Join to apply for the Computer Vision Engineer role at Homekey We’re a Stanford startup working on an AI-driven residential real estate marketplace. Check out our team at Homekey here: We’re building across the full lifecycle of real estate: property valuation...
- Fiserv, a leader in Fintech, is seeking a Staff Systems Engineer for its Sunnyvale campus. This role involves leading the engineering of secure payment kernels, system architecture implementation, and feature innovation for payment solutions. Candidates should have 7+ years...
- ...every robot in the fleet. As a System Software Engineer on Robot Platform focused on GPU and accelerated compute, you’ll own how every accelerated workload on the... ...Efficient model execution and switching: Reduce gpu kernel launch overheads and make swapping between models...
- ...Computer Vision Intern — Data Labeling & Annotation Type: Internship / Temporary Duration: 6 months - 12 months What You’ll Gain Exposure... ...pipeline, from raw data to deployed model Mentorship from CV engineers working on production systems Hands‑on experience with YOLO,...Temporary workInternship
$147.4k - $272.1k
3D Perception/Computer Vision Algorithm Engineer Sunnyvale, California, United States Machine Learning and AI The Video Computer Vision organization is working on exciting technologies for future Apple products. Our focus is on real time and low power world tracking and...Relocation- ..., CA, is seeking a Member of Technical Staff to design and build core components of next-generation AI compute solutions. The ideal candidate has a strong engineering background, experience in systems software or distributed systems, and a deep interest in AI infrastructure...
- General Motors is seeking a Hardware Engineer - Central Compute in Sunnyvale, California, to design and validate next-generation high-compute ECUs that empower automated driving and vehicle intelligence. The ideal candidate will have a Bachelor's in Electrical or Computer...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Kernel Engineer (Compute / Accelerator). Be the first to apply!

