GPU Kernel Engineer — Fast ML Training
MakerMaker.AI
MakerMaker.AI in San Francisco is seeking a skilled Software Engineer to write and optimize GPU kernels. You will work on deep low-level tasks that directly impact the performance of machine learning models. The ideal candidate has over 4 years of experience with GPU kernels, strong systems expertise, and a proven track record in kernel optimizations. This role requires on-site work in a collaborative environment. #J-18808-Ljbffr MakerMaker.AI
$285k - $315k
...We're looking for a Founding GPU Kernel Engineer who lives right at the boundary... ...-optimize GPU kernels for ML workloads (matmuls, attention... ...Experience with distributed training systems: collective ops like... ...wants to know why things are fast or slow on the hardware. You'...TrainingFull timeWork at officeRelocation package- A leading AI acceleration company in San Francisco is seeking a GPU Kernel Engineer to optimize performance for machine learning models. You will be responsible for designing high-performance GPU kernels and using advanced techniques to boost computation efficiency. Ideal...Suggested
- ...GPU Kernel Engineer Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary... ...to integrating optimized ops into high-level ML frameworks used for large-scale training and inference. This role is ideal for someone...TrainingFlexible hours
- ...ll write and optimize the GPU kernels and supporting systems software that makes our training and inference workloads fast. This is deep, low-level work... ...use. We hire kernel engineers because the gap between "this... ...libraries, compilers, or ML frameworks Experience with...TrainingShift work
$100k - $120k
...robotic foundation models. As training and inference workloads grow, we need kernel‑level innovations to... ...of kernel and system engineers focused on performance-critical... ...for CPU (AVX/ARM NEON), GPU (CUDA/ROCm), and... ...optimizations into distributed ML frameworks (e.g.,...Training- MakerMaker, based in San Francisco, is seeking a highly skilled kernel engineer to write and optimize GPU kernels that enhance performance for training and inference. This role involves deep, low-level work to close the significant performance gap that exists in modern...Training
$285k - $315k
SF Tensor is looking for a Founding GPU Kernel Engineer in San Francisco, specializing in GPU architecture and kernel optimization for machine learning workloads. The ideal candidate has deep expertise, proven capabilities in hand-optimizing performance-critical kernels...Full timeRelocation package- ...than the status quo. As our ML Performance Engineer, you will be the person... ...You will write custom CUDA kernels, push GPU utilization to its limits,... ...for the team: define what fast looks like and build the tooling... ...model inference and post‑training optimization at scale...TrainingFlexible hoursShift work
- ...hiring on behalf of a fast‑growing AI startup recognized... ...searching for a CUDA Kernel Engineer who has hands‑on... .... You will work on the GPU performance layer powering... ...in GPU acceleration for ML frameworks or HPC workloads... ..., compensation, and training. We are committed to a...TrainingRemote jobLocal areaImmediate startRelocation package
$280k
Anthropic is looking for a TPU Kernel Engineer in San Francisco, California. In this role, you will identify and resolve performance issues across ML systems, particularly in research, training, and inference. You will design and optimize TPU kernels and provide critical...Training- ...us and help build the platform engineers turn to to ship AI products. THE ROLE We’re seeking a GPU Kernel Engineer to join our team at the... .... You’ll work in a fast‑paced, intellectually stimulating... ...performance GPU kernels for key ML operations, including matrix multiplications...Flexible hours
$160k - $320k
...deliver excellence. We seek engineers/researchers with strong... ...systems to optimize GPU performance at the... ...Design and optimize GPU kernels and tensor libraries.... ...Familiarity with distributed training/inference frameworks (... ...leaders. Ambitious, fast-paced startup culture where...TrainingFull timeWork at office- ...is looking for a Senior Software Engineer to build scalable infrastructure for large‑scale training and fine-tuning of foundation... ...distributed training systems and optimize GPU utilization while collaborating... ...over 5 years of experience in ML infrastructure and a strong...Training
- Pragmatike is seeking a CUDA Kernel Engineer for a remote position to develop and optimize NVIDIA... ...will have a deep understanding of GPU architecture, performance optimization strategies... ...impacting Fortune 500 clients within a fast-growing AI startup recognized by GTM...Remote workRelocation package
$248.8k - $311k
...Physical AI and developing ML pipelines for processing, training, and fine-tuning on... ...As an ML Systems Engineer on the Physical AI... ..., in a fast-paced, cross-functional... ...environments, including GPU-level algorithm optimizations... ...(e.g., CUDA, kernel tuning). Programming...TrainingFull time$179k - $218k
...Senior Staff Data Center Operations Engineer, GPU Hardware Architecture Crusoe is on a mission... ...Operations & Telemetry: Leverage AI/ML methodologies to analyze fleet-wide telemetry... ...components before they impact customer training runs. Technical Sparing...TrainingTemporary work$280k
...committed researchers, engineers, policy experts, and business... ...innovations in GPU performance and systems... ...techniques from custom kernel development to distributed... ...in production ML systems and will be excited... ...Production Systems: Large-scale training infrastructure, fault...TrainingWork at officeVisa sponsorshipFlexible hours$225k
Magic is hiring a Kernel Engineer in San Francisco to design and maintain high-performance kernels... ...throughput and latency during AI training and inference. The ideal candidate has low... ...Google TPUs, and experience in optimizing GPU kernels. The position offers a competitive...Training- ...AI teams. With instant GPU access, sub‑second container... ...makes it simple to train models, run batch jobs,... ...infrastructure. We're a fast‑growing team based out of... ...medalists, and experienced engineering and product leaders with... ...can transform their AI/ML infrastructure. You will...TrainingContract work
- Senior Site Reliability Engineer - AI Infrastructure... ...platform routes training and inference jobs... ...debug large-scale GPU infrastructure used... ...from network fabric → kernel → framework. What... ...orchestration, and ML frameworks. Drive... ...to narrow it down fast. Strong Candidates...TrainingFull timeRemote work
$285k - $315k
...portable. We are building a Kernel Optimizer that... ...partnering with researchers, engineers, and organizations who... ...'re hiring a Founding GPU Compiler Engineer to... ...for large-scale AI pre‑training. You will own the entire... ...systems Work closely with ML researchers to...TrainingFull timeWork at officeRelocation package$100k
...combines frontier-scale pre-training, domain-specific RL, ultra-long... .... About the role: As a Kernel Engineer, you will design, implement... ...optimize custom high-performance GPU kernels Evaluate porting... ...SF, if possible ~ A small, fast-paced, highly focused team...TrainingRemote jobRelocationVisa sponsorship$220k - $320k
ML Model Serving Engineer Want to build the layer that actually makes AI usable in real time... ...hard problems around batching, GPU efficiency, memory constraints,... ...real-world load. This is not about training models. It’s about making them fast, efficient, and production-ready...Training3 days per week- FriendliAI is seeking a GPU Kernel Engineer in San Francisco to design and optimize GPU kernels for AI inference. This role requires expertise in CUDA, C++, and performance-critical systems. You will work on cutting-edge GPU technology and contribute to a highly collaborative...
$128.7k - $261.3k
...San Francisco is seeking an experienced developer for their AI Kernels & Compilers team to innovate in autonomous driving technology. The role focuses on designing high-performance GPU kernels, optimizing ML performance, and collaborating cross-functionally. The ideal...- ...Senior ML Systems Engineer, Frameworks & Tooling at Cohere Our mission... ...intelligence to serve humanity. We’re training and deploying frontier... ...and work hard and move fast to do what’s best for our customers... ...libraries, or custom kernels/fused ops. Experience with...TrainingFull timeWork at officeRemote workFlexible hours
- Pragmatike is seeking a CUDA Kernel Engineer to work remotely for a rapidly growing AI startup. The ideal candidate will have extensive experience... ...NVIDIA CUDA kernels, with a strong understanding of GPU architecture and performance optimization. Responsibilities include...Remote jobRelocation package
- ...history. When people finance GPU clusters, the datacenters housing... ...shape culture, mentor junior engineers, and learn from our customers.... ...experience, including kernel drivers, RDMA stack tuning, and... ...detection Knowledge of distributed training performance (NCCL, GPUDirect...TrainingLong term contractContract workFixed term contractWork at officeLocal areaVisa sponsorshipShift work3 days per week
- Genesis AI in San Francisco is looking for an experienced professional to optimize and build distributed training systems using PyTorch. The ideal candidate has over 8 years of experience in distributed systems, high-performance computing, and extensive expertise in Python...Training
$300k - $405k
...committed researchers, engineers, policy experts, and business... ...system programming, kernel optimization, and... ...efficiently and reliably for training and serving frontier AI... ...Work with our ML engineers to understand... ...experience with: GPU virtualization and acceleration...TrainingWork at officeVisa sponsorshipFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to GPU Kernel Engineer — Fast ML Training. Be the first to apply!
- machine learning intern San Francisco, CA
- machine learning researcher San Francisco, CA
- machine learning part time San Francisco, CA
- machine learning San Francisco, CA
- intern - quantum machine learning for quantum computing San Francisco, CA
- artificial intelligence - machine learning intern San Francisco, CA
- machine learning research scientist San Francisco, CA
- data engineer machine learning San Francisco, CA
- machine learning scientist San Francisco, CA
- internship machine learning San Francisco, CA

