NPU Kernel/Operator Engineer
Black Sesame Technologies Inc
We are looking for a Senior NPU Kernel/Operator Engineer to lead the design and optimization of high-performance kernels for a custom AI accelerator / NPU. This role focuses on general-purpose deep learning operators, fused kernels, and hardware-aware performance optimization across CNNs, transformers, and other neural network workloads. The ideal candidate has strong experience in performance engineering on GPU, NPU, DSP, CPU SIMD, compiler backend, embedded accelerator, or HPC systems. Responsibilities Design and optimize high-performance NPU kernels for a broad range of neural network workloads. Own critical operators such as attention-style kernels, normalization, reduction, layout conversion, gather/scatter, quant/dequant, and fused operators. Develop tiling, blocking, vectorization, and memory scheduling strategies. Optimize data movement across matrix engine, vector engine, SRAM, DMA, NoC, cache, and DRAM. Analyze bottlenecks in compute utilization, memory bandwidth, synchronization, DMA overlap, bank conflicts, and instruction overhead. Build first-principles performance models for key operators. Drive kernels toward hardware roofline limits. Collaborate with hardware, compiler, runtime, and model teams on ISA features, tensor layouts, memory access patterns, and operator APIs. Debug complex correctness, precision, and performance issues on simulator or silicon. Mentor junior engineers and establish kernel optimization best practices. Requirements BS/MS/PhD in CS, EE, Computer Engineering, or related field. 5+ years of experience in performance optimization, accelerator programming, GPU/NPU/DSP development, compiler backend, embedded systems, or HPC. Deep understanding of memory hierarchy, tiling, parallelism, vectorization, synchronization, and bandwidth analysis. Experience optimizing performance-critical kernels or numerical computation. Ability to reason from algorithm requirements to hardware execution and performance bottlenecks. Preferred Experience with CUDA, Triton, CUTLASS, OpenCL, TVM, MLIR, Halide, SIMD intrinsics, DSP SDKs, or custom accelerator SDKs. Experience optimizing operators such as convolution, GEMM, attention, softmax, normalization, reduction, image processing, or fused compute/memory kernels. Familiarity with custom AI accelerator architecture, matrix engines, vector engines, systolic arrays, DMA, SRAM, NoC, or DRAM systems. Experience with mixed precision and quantization: FP32, FP16, BF16, FP8, INT8, INT4. Experience with simulator/emulator/FPGA/silicon bring-up is a plus. #J-18808-Ljbffr Black Sesame Technologies Inc
- ...Senior NPU Kernel / Operator Engineer Overview We are seeking a Senior NPU Kernel / Operator Engineer to lead the development and optimization of high-performance deep learning operators for a next-generation AI accelerator platform . This role focuses on...SuggestedPermanent employmentNight shift
- Black Sesame Technologies Inc is seeking a Senior NPU Kernel/Operator Engineer in San Jose, California. This role focuses on designing and optimizing high-performance kernels for a custom AI accelerator, involving deep learning operations and performance optimization....Suggested
- A global semiconductor company in San Jose seeks a Senior Systems Design Engineer to develop and optimize ML operator kernels for their NPU platform. The candidate will work on end-to-end model performance and collaborate closely with silicon teams to ensure innovation...SuggestedFull time
- ...additional agentic computation. About The Role As a Kernel Engineer on our team, you will develop high-performance software solutions... ...be on implementing, optimizing, and scaling deep learning operations to fully leverage our custom, massively parallel processor...Suggested
$165.2k - $223.6k
...environment at the heart of this system. We build and maintain the operating system that integrates with the Nitro control plane and powers... ...job responsibilities Research, design, and implement Linux kernel changes to meet business requirements Drive kernel...SuggestedInternshipLocal areaFlexible hours$143.2k - $186k
...electric sedan. About the Position We are seeking a senior OS / kernel engineer to join our SkyOS team. The team is responsible for the design and development of NIO's full-domain vehicle operating systems. The position will explore new ideas and designs that can...Full timeTemporary workFlexible hours$165k - $242k
...Systems Engineer, Kernel Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built... ...storage. Kernel Hardware - Acceleration - Virtualization - Operating Systems - Containerization - Kubelite Our Team's Stack:...Permanent employmentTemporary workCasual workWork at officeRemote workFlexible hours- A tech firm specializing in AI is seeking a Software Engineer to develop optimized kernels for AI chips. The role involves working with hardware integration and existing libraries to ensure compatibility. Ideal candidates have 3+ years of software engineering experience...
- ...for delivering a high-quality and performant kernel that powers every Apple product — from Apple Watch... ...looking for a talented new graduate or junior engineer to join us and contribute to the next generation of Apple's operating system. As a member of a small, technically...InternshipRelocation
- NVIDIA Gruppe is looking for a senior engineer to join their Math Libraries team in Santa Clara, California. This role involves designing... ...linear algebra software on GPUs, with a strong focus on kernel generation. The ideal candidate has over 8 years of experience...
- Cerebras Systems is seeking a deeply technical software engineer for its Kernel Reliability team in Sunnyvale, California. This role involves enhancing the reliability of advanced compute clusters. The ideal candidate will have strong programming skills in C/C++ and Python...
- ...The Role We’re looking for a deeply technical, hands-on engineering leader for our on-field Kernel Reliability team. You will lead a high performing team... ...and customer-facing systems Assist System and Cluster Operations teams on reducing system and service downtime after...
- NVIDIA Gruppe is seeking a Senior Formal Verification Engineer for GPU Kernels, focused on creating verification tools that ensure correct behavior in various environments. This role involves designing verification tools, integrating AI into workflows, and participating...
$143.2k - $186k
1600 NIO USA, Inc. is seeking a Senior OS / Kernel Engineer for the SkyOS team to design and develop full-domain vehicle operating systems. Candidates should have a strong background in operating system internals and proficiency in languages like C or Rust. The position...- A leading technology company is seeking a Linux Kernel Software Engineer to develop and optimize the Linux kernel for enterprise storage solutions. This role requires deep experience in kernel development and a strong foundation in computer systems. You will collaborate...
- NVIDIA Gruppe is seeking a Senior Software Engineer to work on system software for datacenter products in Santa Clara, California. This... ...will have over 10 years of experience, a strong grasp on Linux kernel internals, and expertise in data center architectures. Notably,...
$184k - $287.5k
Overview We are looking for a Senior Formal Verification Engineer for GPU Kernels. NVIDIA's Deep Learning Safety Team is hiring engineers to build verification tools that prove GPU kernels behave correctly, enabling their deployment in a wide range of environments, including...Work experience placement$143.2k - $186k
...Master's degree in Computer Science, Computer Engineering, Applied Mathematics, Communications,... .... Strong understanding of GPU/NPU architecture and optimization techniques... ...Familiar with microkernel architecture, Linux kernel, hypervisor, middleware, and application...Full timeTemporary workFlexible hours- Apple Inc. is seeking a Junior OS Kernel Engineer for their Darwin Scheduler team in Cupertino, California. This role offers the chance to... ...engineers while contributing to the next generation of Apple's operating system. The ideal candidate will hold a BS in Computer...
$167k - $246k
...THE ROLE Join a world-class team of engineers building the next generation of... ...innovation, developing and optimizing the Linux kernel to push the boundaries of performance and... ...foundation in computer architecture, operating systems, networking and core concepts like...Work at officeFlexible hours$53 per hour
...keeps cutting‑edge innovation moving! We’re seeking a Building Engineer with strong commercial chiller experience to maintain and... ...governmental agency, and company directives related to building operations and work safety. Maintain an energy management program. Ensure...Hourly payWork at officeLocal areaVisa sponsorship- A leading cloud technology company is seeking a highly skilled HPC Performance Engineer to join their HAVOCK Team in Sunnyvale, California. In this role, you will optimize bare-metal systems and ensure the performance of complex workloads using various technologies including...
$143.2k - $186k
...Master’s degree in Computer Science, Computer Engineering, Applied Mathematics, Communications,... .... Strong understanding of GPU/NPU architecture and optimization techniques... ...familiar with microkernel architecture, Linux kernel, hypervisor, middleware, and application...Full timeTemporary workFlexible hours$147.4k - $272.1k
...Software Development Engineer In Test - Kernel Quality Engineering, Core Os The Darwin Kernel organization plays a vital role in Apple's success... ...for the XNU kernel running at the heart of the operating systems deployed across all iPhone, iPad, Mac, Watch, Apple...WorldwideRelocation$184k - $287.5k
NVIDIA Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries, code generators, and kernel technologies for NVIDIA's architecture, emphasizing high-impact...$165k - $241.4k
...The Cisco Distributed System Engineering (DSE) group owns the development... ...One ASIC in the area of NPU management and health monitoring... ...automated test suites for kernel module validation, including... ...Pytest. Experience with Linux Operating Sytems and debugging tools....Full timeTemporary workLocal areaFlexible hours$184k - $287.5k
...We are looking for software engineers to join our development efforts in the area of dense linear algebra kernels for high-performance libraries such as cuSOLVER. Around the world, leading commercial and academic organizations are revolutionizing AI, data analytics, and...Remote work- ...Sr. Wi-Fi Engineer VARITE is looking for a qualified Sr. Wi-Fi Engineer in Philadelphia... ..., implementation, interoperability, and operator deployment Represent the organization... ..., hostapd, iw, nl80211 Strong Linux kernel knowledge — Wi-Fi driver development, cfg...Full time
$184k - $287.5k
.... We are now looking for an extraordinary Senior Perception Engineer to develop and productize NVIDIA’s autonomous driving solutions... ...with development in CUDA language. The ability to implement CUDA kernels as part of training or inference pipelines. Your base...Work experience placement$275k
...Distinguished Engineer – Wireless Architecture Over 50,000 customers globally trust our end-to-end, cloud-driven networking solutions... ...cloud-native technologies ~ Experience with Linux/FreeBSD kernel, routing stacks and development tools. Preferred Qualifications...Local areaWork from homeWorldwideFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to NPU Kernel/Operator Engineer. Be the first to apply!
- mixing operator San Jose, CA
- list operator San Jose, CA
- pool operator San Jose, CA
- scale operator San Jose, CA
- female phone operator San Jose, CA
- semiconductor operator San Jose, CA
- vehicle operator San Jose, CA
- automation operator San Jose, CA
- machine set up operator San Jose, CA
- hotel operator San Jose, CA

