Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

NPU Kernel/Operator Engineer

Black Sesame Technologies Inc

We are looking for a Senior NPU Kernel/Operator Engineer to lead the design and optimization of high-performance kernels for a custom AI accelerator / NPU. This role focuses on general-purpose deep learning operators, fused kernels, and hardware-aware performance optimization across CNNs, transformers, and other neural network workloads. The ideal candidate has strong experience in performance engineering on GPU, NPU, DSP, CPU SIMD, compiler backend, embedded accelerator, or HPC systems. Responsibilities Design and optimize high-performance NPU kernels for a broad range of neural network workloads. Own critical operators such as attention-style kernels, normalization, reduction, layout conversion, gather/scatter, quant/dequant, and fused operators. Develop tiling, blocking, vectorization, and memory scheduling strategies. Optimize data movement across matrix engine, vector engine, SRAM, DMA, NoC, cache, and DRAM. Analyze bottlenecks in compute utilization, memory bandwidth, synchronization, DMA overlap, bank conflicts, and instruction overhead. Build first-principles performance models for key operators. Drive kernels toward hardware roofline limits. Collaborate with hardware, compiler, runtime, and model teams on ISA features, tensor layouts, memory access patterns, and operator APIs. Debug complex correctness, precision, and performance issues on simulator or silicon. Mentor junior engineers and establish kernel optimization best practices. Requirements BS/MS/PhD in CS, EE, Computer Engineering, or related field. 5+ years of experience in performance optimization, accelerator programming, GPU/NPU/DSP development, compiler backend, embedded systems, or HPC. Deep understanding of memory hierarchy, tiling, parallelism, vectorization, synchronization, and bandwidth analysis. Experience optimizing performance-critical kernels or numerical computation. Ability to reason from algorithm requirements to hardware execution and performance bottlenecks. Preferred Experience with CUDA, Triton, CUTLASS, OpenCL, TVM, MLIR, Halide, SIMD intrinsics, DSP SDKs, or custom accelerator SDKs. Experience optimizing operators such as convolution, GEMM, attention, softmax, normalization, reduction, image processing, or fused compute/memory kernels. Familiarity with custom AI accelerator architecture, matrix engines, vector engines, systolic arrays, DMA, SRAM, NoC, or DRAM systems. Experience with mixed precision and quantization: FP32, FP16, BF16, FP8, INT8, INT4. Experience with simulator/emulator/FPGA/silicon bring-up is a plus. #J-18808-Ljbffr Black Sesame Technologies Inc

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the NPU Kernel/Operator Engineer in San Jose, CA vacancy
  •  ...Senior NPU Kernel / Operator Engineer Overview We are seeking a Senior NPU Kernel / Operator Engineer to lead the development and optimization of high-performance deep learning operators for a next-generation AI accelerator platform . This role focuses on... 
    Suggested
    Permanent employment
    Night shift
    San Jose, CA
    14 days ago
  • Black Sesame Technologies Inc is seeking a Senior NPU Kernel/Operator Engineer in San Jose, California. This role focuses on designing and optimizing high-performance kernels for a custom AI accelerator, involving deep learning operations and performance optimization.... 
    Suggested

    Black Sesame Technologies Inc

    San Jose, CA
    4 days ago
  • A global semiconductor company in San Jose seeks a Senior Systems Design Engineer to develop and optimize ML operator kernels for their NPU platform. The candidate will work on end-to-end model performance and collaborate closely with silicon teams to ensure innovation... 
    Suggested
    Full time

    AMD

    San Jose, CA
    4 days ago
  •  ...additional agentic computation. About The Role As a Kernel Engineer on our team, you will develop high-performance software solutions...  ...be on implementing, optimizing, and scaling deep learning operations to fully leverage our custom, massively parallel processor... 
    Suggested

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    3 days ago
  • $165.2k - $223.6k

     ...environment at the heart of this system. We build and maintain the operating system that integrates with the Nitro control plane and powers...  ...job responsibilities Research, design, and implement Linux kernel changes to meet business requirements Drive kernel... 
    Suggested
    Internship
    Local area
    Flexible hours

    Amazon

    Santa Clara, CA
    8 days ago
  • $143.2k - $186k

     ...electric sedan. About the Position We are seeking a senior OS / kernel engineer to join our SkyOS team. The team is responsible for the design and development of NIO's full-domain vehicle operating systems. The position will explore new ideas and designs that can... 
    Full time
    Temporary work
    Flexible hours

    NIO

    San Jose, CA
    4 days ago
  • $165k - $242k

     ...Systems Engineer, Kernel Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built...  ...storage. Kernel Hardware - Acceleration - Virtualization - Operating Systems - Containerization - Kubelite Our Team's Stack:... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    3 days ago
  • A tech firm specializing in AI is seeking a Software Engineer to develop optimized kernels for AI chips. The role involves working with hardware integration and existing libraries to ensure compatibility. Ideal candidates have 3+ years of software engineering experience... 

    OpenReq

    Cupertino, CA
    3 days ago
  •  ...for delivering a high-quality and performant kernel that powers every Apple product — from Apple Watch...  ...looking for a talented new graduate or junior engineer to join us and contribute to the next generation of Apple's operating system. As a member of a small, technically... 
    Internship
    Relocation

    Apple Inc.

    Cupertino, CA
    3 days ago
  • NVIDIA Gruppe is looking for a senior engineer to join their Math Libraries team in Santa Clara, California. This role involves designing...  ...linear algebra software on GPUs, with a strong focus on kernel generation. The ideal candidate has over 8 years of experience... 

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • Cerebras Systems is seeking a deeply technical software engineer for its Kernel Reliability team in Sunnyvale, California. This role involves enhancing the reliability of advanced compute clusters. The ideal candidate will have strong programming skills in C/C++ and Python... 

    Dormont Manufacturing Co

    Sunnyvale, CA
    4 days ago
  •  ...The Role We’re looking for a deeply technical, hands-on engineering leader for our on-field Kernel Reliability team. You will lead a high performing team...  ...and customer-facing systems Assist System and Cluster Operations teams on reducing system and service downtime after... 

    Dormont Manufacturing Co

    Sunnyvale, CA
    4 days ago
  • NVIDIA Gruppe is seeking a Senior Formal Verification Engineer for GPU Kernels, focused on creating verification tools that ensure correct behavior in various environments. This role involves designing verification tools, integrating AI into workflows, and participating... 

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $143.2k - $186k

    1600 NIO USA, Inc. is seeking a Senior OS / Kernel Engineer for the SkyOS team to design and develop full-domain vehicle operating systems. Candidates should have a strong background in operating system internals and proficiency in languages like C or Rust. The position... 

    1600 NIO USA, Inc.

    San Jose, CA
    4 days ago
  • A leading technology company is seeking a Linux Kernel Software Engineer to develop and optimize the Linux kernel for enterprise storage solutions. This role requires deep experience in kernel development and a strong foundation in computer systems. You will collaborate... 

    Pure Storage, Inc.

    Santa Clara, CA
    1 day ago
  • NVIDIA Gruppe is seeking a Senior Software Engineer to work on system software for datacenter products in Santa Clara, California. This...  ...will have over 10 years of experience, a strong grasp on Linux kernel internals, and expertise in data center architectures. Notably,... 

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

    Overview We are looking for a Senior Formal Verification Engineer for GPU Kernels. NVIDIA's Deep Learning Safety Team is hiring engineers to build verification tools that prove GPU kernels behave correctly, enabling their deployment in a wide range of environments, including... 
    Work experience placement

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $143.2k - $186k

     ...Master's degree in Computer Science, Computer Engineering, Applied Mathematics, Communications,...  .... Strong understanding of GPU/NPU architecture and optimization techniques...  ...Familiar with microkernel architecture, Linux kernel, hypervisor, middleware, and application... 
    Full time
    Temporary work
    Flexible hours

    NIO

    San Jose, CA
    23 hours ago
  • Apple Inc. is seeking a Junior OS Kernel Engineer for their Darwin Scheduler team in Cupertino, California. This role offers the chance to...  ...engineers while contributing to the next generation of Apple's operating system. The ideal candidate will hold a BS in Computer... 

    Apple Inc.

    Cupertino, CA
    3 days ago
  • $167k - $246k

     ...THE ROLE Join a world-class team of engineers building the next generation of...  ...innovation, developing and optimizing the Linux kernel to push the boundaries of performance and...  ...foundation in computer architecture, operating systems, networking and core concepts like... 
    Work at office
    Flexible hours

    Everpure LLC

    Santa Clara, CA
    2 days ago
  • $53 per hour

     ...keeps cutting‑edge innovation moving! We’re seeking a Building Engineer with strong commercial chiller experience to maintain and...  ...governmental agency, and company directives related to building operations and work safety. Maintain an energy management program. Ensure... 
    Hourly pay
    Work at office
    Local area
    Visa sponsorship

    CBRE

    Santa Clara, CA
    1 day ago
  • A leading cloud technology company is seeking a highly skilled HPC Performance Engineer to join their HAVOCK Team in Sunnyvale, California. In this role, you will optimize bare-metal systems and ensure the performance of complex workloads using various technologies including... 

    CoreWeave

    Sunnyvale, CA
    23 hours ago
  • $143.2k - $186k

     ...Master’s degree in Computer Science, Computer Engineering, Applied Mathematics, Communications,...  .... Strong understanding of GPU/NPU architecture and optimization techniques...  ...familiar with microkernel architecture, Linux kernel, hypervisor, middleware, and application... 
    Full time
    Temporary work
    Flexible hours

    1600 NIO USA, Inc.

    San Jose, CA
    4 days ago
  • $147.4k - $272.1k

     ...Software Development Engineer In Test - Kernel Quality Engineering, Core Os The Darwin Kernel organization plays a vital role in Apple's success...  ...for the XNU kernel running at the heart of the operating systems deployed across all iPhone, iPad, Mac, Watch, Apple... 
    Worldwide
    Relocation

    Apple

    Cupertino, CA
    2 days ago
  • $184k - $287.5k

    NVIDIA Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries, code generators, and kernel technologies for NVIDIA's architecture, emphasizing high-impact... 

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $165k - $241.4k

     ...The Cisco Distributed System Engineering (DSE) group owns the development...  ...One ASIC in the area of NPU management and health monitoring...  ...automated test suites for kernel module validation, including...  ...Pytest. Experience with Linux Operating Sytems and debugging tools.... 
    Full time
    Temporary work
    Local area
    Flexible hours

    Cisco

    Milpitas, CA
    4 days ago
  • $184k - $287.5k

     ...We are looking for software engineers to join our development efforts in the area of dense linear algebra kernels for high-performance libraries such as cuSOLVER. Around the world, leading commercial and academic organizations are revolutionizing AI, data analytics, and... 
    Remote work

    NVIDIA

    Santa Clara, CA
    4 days ago
  •  ...Sr. Wi-Fi Engineer VARITE is looking for a qualified Sr. Wi-Fi Engineer in Philadelphia...  ..., implementation, interoperability, and operator deployment Represent the organization...  ..., hostapd, iw, nl80211 Strong Linux kernel knowledge — Wi-Fi driver development, cfg... 
    Full time

    Varite

    Sunnyvale, CA
    3 days ago
  • $184k - $287.5k

     .... We are now looking for an extraordinary Senior Perception Engineer to develop and productize NVIDIA’s autonomous driving solutions...  ...with development in CUDA language. The ability to implement CUDA kernels as part of training or inference pipelines. Your base... 
    Work experience placement

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $275k

     ...Distinguished Engineer – Wireless Architecture Over 50,000 customers globally trust our end-to-end, cloud-driven networking solutions...  ...cloud-native technologies ~ Experience with Linux/FreeBSD kernel, routing stacks and development tools. Preferred Qualifications... 
    Local area
    Work from home
    Worldwide
    Flexible hours

    Extreme Networks

    San Jose, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to NPU Kernel/Operator Engineer. Be the first to apply!