Senior Kernel Engineer: High-Performance ML Kernels
Cerebras
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As a Kernel Engineer on our team, you will develop high-performance software solutions at the intersection of hardware and software, developing high-performance software for cutting-edge AI and HPC workloads. Your focus will be on implementing, optimizing, and scaling deep learning operations to fully leverage our custom, massively parallel processor architecture. You will be part of a world-class team responsible for the design, performance tuning, and validation of foundational ML and HPC kernels. This includes building a library of parallel and distributed algorithms that maximize compute utilization and push the boundaries of training efficiency for state-of-the-art AI models. Your work will be critical to unlocking the full potential of our hardware and accelerating the pace of AI innovation. Responsibilities Develop design specifications for new machine learning and linear algebra kernels and mapping to the Cerebras WSE System using various parallel programming algorithms. Develop and debug kernel library of highly optimized low level assembly instruction and C-like domain specific language routines to implement algorithms targeting the Cerebras hardware system. Develop and debug high-performance kernel routines in low-level assembly and a custom C-like (CSL) language, implementing algorithms optimized for the Cerebras hardware system. Using mathematical models and analysis to measure the software performance and inform design decisions. Develop and integrate unit and system testing methodologies to verify correct functionality and performance of kernel libraries. Study emerging trends in Machine Learning applications and help evolve Kernel library architecture to address computational challenges of the start-of-the-art Neural Networks. Interact with chip and system architects to optimize instruction sets, microarchitecture, and IO of next generation systems. Skills And Qualifications Bachelor’s, Master’s, PhD or foreign equivalents in Computer Science, Computer Engineering, Mathematics, or related fields. Understanding of hardware architecture concepts — must be comfortable learning the details of a new hardware architecture. Skilled in C++ and Python programming languages. Good knowledge of library and/or API development best practices. Strong debugging skills and knowledge of debugging complex software stack. Preferred Skills And Qualifications Experience in kernel development and/or testing. Familiarity with parallel algorithms and distributed memory systems. Experience in programming accelerators such as GPUs and FPGAs. Familiarity with Machine Learning neural networks and frameworks such as TensorFlow and PyTorch. Familiarity with HPC kernels and their optimization. Why Join Cerebras People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras: Build a breakthrough AI platform beyond the constraints of the GPU. Publish and open source their cutting-edge AI research. Work on one of the fastest AI supercomputers in the world. Enjoy job stability with startup vitality. Our simple, non-corporate work culture that respects individual beliefs. Read our blog: Five Reasons to Join Cerebras in 2026. Apply today and become part of the forefront of groundbreaking advancements in AI! Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them. This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice. #J-18808-Ljbffr Cerebras
$184k - $287.5k
...company”. We’re looking for a Senior Performance Compiler Engineer to join our team and work... ...models, agents, and other high‑impact AI applications,... ...MLIR to optimize high‑level kernel descriptions (written in Triton... ..., especially in the AI/ML or compiler space. Familiarity...SeniorPerformance$152k - $241.5k
...includes defining public APIs, performance optimizations and analysis,... ...in Computer Science, Computer Engineering, related field or equivalent... ..., such as PyTorch, JAX. GPU kernel authoring and performance analysis... ...opportunity employer. As we highly value diversity in our...SeniorPerformance$130k - $260k
...teams across verification, CAD, product engineering, and test engineering to create and execute... ...and Experience Proven experience in high‑speed analog IC design using SiGe BiCMOS... ...physics and its influence on analog circuit performance, along with techniques to optimize...SeniorPerformance- ...over a decade. We’re looking for a senior backend engineer to help design and operate high-volume, distributed backend... ...make pragmatic decisions around performance, reliability, and cost. Core skills... ...Kubernetes experience is a plus). ML inference or classification pipelines...SeniorPerformance
$110k - $130k
...Systems at Motivo Are you a high-level individual... ...We serve as the elite engineering engine for everything from... .... We are looking for a Senior Software Engineer to serve... ...record of deploying ML models and high-bandwidth... ...to overall company performance, because we believe in...SeniorPerformance$232k - $290k
...together. We're looking for a Senior Staff Software Engineer to help lead the technical... ...are based on company performance against established financial... ...Webflow. Architect and evolve high-throughput distributed... ...closely with data scientists, ML engineers, and product...SeniorPerformancePermanent employmentFull timeTemporary workFixed term contractImmediate startRemote workFlexible hours$184k - $287.5k
...the current progress. Build high‑performance DC fabrics using InfiniBand... ...workloads and GPU‑dense AI/ML training and inference environments... .../PhD in Electrical/Computer Engineering, Computer Science, Physics,... ...Adapters, Linux OS, and kernel drivers. Superb communication...SeniorPerformance$184k - $287.5k
Join to apply for the Senior High Performance AI Engineer role at NVIDIA NVIDIA has been transforming computer graphics, PC gaming, and accelerated... ...the AI stack—from hardware through compilers/toolchains, kernels/libraries, frameworks, distributed training, and...SeniorPerformance- ...Quartermaster AI is seeking an experienced RF/DSP Engineer to build the digital signal processing... ...on edge compute hardware, and with ML/AI engineers to deliver conditioned, feature... .... Strong proficiency in C/C++ for performance-critical, embedded or near-real-time signal...SeniorPerformance
- ...inference accelerator and looks for experts to help deliver high‑performance processors. Responsibilities Verify the design and implementation... ...process. Qualifications Bachelor's Degree in Electrical Engineering, Computer Science, or Computer Engineering (or equivalent...SeniorPerformance
$168k - $264.5k
...the world.NVIDIA is seeking best-in-class ASIC Verification Engineers to verify the design and implementation of the world’s leading... ...of computing. In this position, you will help to build the high-performance processor elements that implement programmable compute and...SeniorPerformance$192k - $260k
...technology firm is seeking a Staff Database Engineer to support the Lakebase product. You... ...design robust database systems that enhance performance and meet customer requirements. Your... ...5+ years of experience with Postgres in high-volume environments. This position offers...SeniorPerformance$220k - $250k
We’re hiring a Senior Signal Integrity Engineer to lead the design, modelling, and validation of high‑speed optical link platforms for next‑generation AI data centre interconnects... ...package, and photonic interfaces—ensuring performance at state‑of‑the‑art baud rates and tight...SeniorPerformance$155k
...Senior Technical Sales Engineer (REMOTE, CA) Position Summary This technically based sales position will... ...abreast of market conditions, product performance, and make recommendations for future... ...Microsoft Project, etc. Must have a high degree of initiative and the ability...SeniorPerformanceTemporary workRemote workFlexible hours$152k - $241.5k
...the full inference stack to push the boundaries of inference performance. Benchmark state‑of‑the‑art offerings and perform competitive... ...environment and proud to be an equal‑opportunity employer. As we highly value diversity in our current and future employees, we do not...SeniorPerformance$180k - $300k
Yoh Services LLC is looking for experienced Design Verification Engineers in California, USA, to validate high-performance networking silicon and AI fabric technologies. Candidates should have 5-15 years of ASIC/SoC verification experience, strong expertise in SystemVerilog...SeniorPerformance$136k - $218.5k
Senior DFT Engineer Join NVIDIA and lead the charge in revolutionizing AI technology! As a Senior... ...DFT methodologies, ensuring high‑quality products that compete on a global... ...of industry experience in DFT for high‑performance ASICs. Practical experience with SCAN/...SeniorPerformance- A leading audio technology company in Missouri is seeking a Principal Hardware Engineer to design high-performance audio components. You will work within a multi-disciplinary team to ensure superior audio performance and reliability. The ideal candidate must have a Bachelor...SeniorPerformanceFlexible hours
$125k - $135k
...passionate about solving complex engineering challenges, striving for... ...our Sr. Systems Engineers are highly valued and versatile team members... ...of engineering! As a Senior Systems Engineer, you will:... ...integrate effectively and meet performance, cost, and timeline requirements...SeniorPerformance- ...responsibilities We’re looking for a seasoned engineer to design, build, and operate HashiCorp... .... You’ll be responsible for secure, highly available, and compliant secrets... ...logging and tracing for high reliability and performance. Hands‑on experience operating HashiCorp...SeniorPerformance
$141.3k - $226k
Broadcom is seeking an OS Kernel System Software Development Engineer in California. This role involves ownership of CPU and server platform kernel development across Arm and x86. Candidates should have a BS/MS/PHD in Computer Science or Engineering with significant industry...Senior- ...the RoleWe are seeking an experienced Senior ML Inference Engineer to join our team, focusing on... ...pharmaceutical customersArchitect and implement high-performance inference pipelines capable of... ..., including memory optimization, kernel fusion, and mixed-precision inferenceStrong...SeniorPerformanceRemote workWorldwide
- Our Deloitte AI & Engineering team works to transform technology... ...improve financial performance, accelerate new digital... .... Work you'll do As a Senior AWS FDE, you will work... ...prototype and deliver high-impact GenAI-enabled solutions... ..., data modeling or ML/data science background...SeniorPerformanceLocal area
$131.3k - $237.35k
...Business Area is seeking a Senior Digital Engineering and Computational Dynamics... ...Engineer to join the Strategic High-speed Advanced Development... ...aerodynamic analysis and performance prediction. Conduct 6-DoF... ...Underwater Hydrodynamics AI/ML (neuro networks, MDAO)...SeniorPerformanceFlexible hours- Job Description: Senior Core Banking Engineer— XGEN & IBM i Data Specialist Function: Core BankingEngineering... ...data requirements into reliable, performant, and auditable extracts and reports... ...Optimize XGEN extract performance on high-volume banking databases applying logical...SeniorPerformanceRemote jobFull timeBank staff
- Job Title Project Engineer / Senior Project Engineer - Project Farma Location San Diego, CA Job... ...engineering lifecycle. They will deliver high‑quality work, form and maintain long‑... ...that align with Project Farma’s services. Perform tasks to meet strategic objectives such...SeniorPerformanceFull timeFor contractorsWork experience placementVisa sponsorshipWork visa
$97.35k - $155.76k
...a difference at Fiserv. Job Title Senior Systems Engineer (Tandem NonStop) About your role We... ...to provide technical solutions in a highly visible, large enterprise environment... ...commitment to ensuring the reliability, performance, and security of our infrastructure....SeniorPerformanceWork experience placementNight shift$90k - $180k
The Opportunity The Senior Supplier Quality Assurance (SQA) is responsible... ..., and improving supplier performance across a diverse supply base.... ...Bachelor’s degree in engineering, Life Sciences, Quality, or related... ...manufacturing or another highly regulated environment. Experience...SeniorPerformanceRemote work$92k - $118k
Infineon Technologies AG in California is seeking an engineer to conduct EMC testing, perform signal integrity simulations, and collaborate with IC design... .... The ideal candidate has over a year of experience in high-speed SI/PI and EMC fields. A Bachelor's or Master's in...Senior$160k - $188.23k
## Senior System EngineerApplylocations: California - Headquarterstime... ...believe in being humble and highly engaged in the work we do,... ...experienced Senior Systems Engineer to lead system-level engineering... ...integration, functional, performance, and reliability testing* Lead...SeniorPerformance
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Kernel Engineer: High-Performance ML Kernels. Be the first to apply!
- senior game producer California, MO
- senior manager process engineering California, MO
- senior manufacturing engineer California, MO
- senior manager clinical operations California, MO
- senior lead project manager California, MO
- senior manager quality engineering California, MO
- senior device engineer California, MO
- senior full stack developer California, MO
- senior marketer California, MO
- senior planner California, MO

