GPU Kernel Engineer — Fast ML Training

MakerMaker.AI

MakerMaker.AI in San Francisco is seeking a skilled Software Engineer to write and optimize GPU kernels. You will work on deep low-level tasks that directly impact the performance of machine learning models. The ideal candidate has over 4 years of experience with GPU kernels, strong systems expertise, and a proven track record in kernel optimizations. This role requires on-site work in a collaborative environment. #J-18808-Ljbffr MakerMaker.AI

Apply

Vacancy posted 16 hours ago

Similar jobs that could be interesting for youBased on the GPU Kernel Engineer — Fast ML Training in San Francisco, CA vacancy

Founding GPU Kernel Engineer
$285k - $315k
...We're looking for a Founding GPU Kernel Engineer who lives right at the boundary... ...-optimize GPU kernels for ML workloads (matmuls, attention... ...Experience with distributed training systems: collective ops like... ...wants to know why things are fast or slow on the hardware. You'...
Training
Full time
Work at office
Relocation package
SF Tensor
San Francisco, CA
3 days ago
GPU Kernel Engineer: Build Fast AI Inference at Scale
A leading AI acceleration company in San Francisco is seeking a GPU Kernel Engineer to optimize performance for machine learning models. You will be responsible for designing high-performance GPU kernels and using advanced techniques to boost computation efficiency. Ideal...
Suggested
Baseten
San Francisco, CA
3 days ago
GPU Kernel Engineer
...GPU Kernel Engineer Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary... ...to integrating optimized ops into high-level ML frameworks used for large-scale training and inference. This role is ideal for someone...
Training
Flexible hours
Sciforium
San Francisco, CA
16 hours ago
KERNEL ENGINEER
...ll write and optimize the GPU kernels and supporting systems software that makes our training and inference workloads fast. This is deep, low-level work... ...use. We hire kernel engineers because the gap between "this... ...libraries, compilers, or ML frameworks Experience with...
Training
Shift work
MakerMaker.AI
San Francisco, CA
1 day ago
GPU Kernel Engineer
$100k - $120k
...robotic foundation models. As training and inference workloads grow, we need kernel‑level innovations to... ...of kernel and system engineers focused on performance-critical... ...for CPU (AVX/ARM NEON), GPU (CUDA/ROCm), and... ...optimizations into distributed ML frameworks (e.g.,...
Training
Coda Robotics
San Francisco, CA
1 day ago
Senior GPU Kernel Engineer - Accelerate AI Training Systems
MakerMaker, based in San Francisco, is seeking a highly skilled kernel engineer to write and optimize GPU kernels that enhance performance for training and inference. This role involves deep, low-level work to close the significant performance gap that exists in modern...
Training
MakerMaker
San Francisco, CA
1 day ago
Pioneering GPU Kernel Engineer for ML Performance
$285k - $315k
SF Tensor is looking for a Founding GPU Kernel Engineer in San Francisco, specializing in GPU architecture and kernel optimization for machine learning workloads. The ideal candidate has deep expertise, proven capabilities in hand-optimizing performance-critical kernels...
Full time
Relocation package
SF Tensor
San Francisco, CA
3 days ago
Founding Engineer - ML Performance
...than the status quo. As our ML Performance Engineer, you will be the person... ...You will write custom CUDA kernels, push GPU utilization to its limits,... ...for the team: define what fast looks like and build the tooling... ...model inference and post‑training optimization at scale...
Training
Flexible hours
Shift work
URun
San Francisco, CA
3 days ago
CUDA Kernel Engineer (Remote US)
...hiring on behalf of a fast‑growing AI startup recognized... ...searching for a CUDA Kernel Engineer who has hands‑on... .... You will work on the GPU performance layer powering... ...in GPU acceleration for ML frameworks or HPC workloads... ..., compensation, and training. We are committed to a...
Training
Remote job
Local area
Immediate start
Relocation package
Pragmatike
San Francisco, CA
1 day ago
TPU Kernel Engineer — Lead Low-Latency ML Kernels (Hybrid)
$280k
Anthropic is looking for a TPU Kernel Engineer in San Francisco, California. In this role, you will identify and resolve performance issues across ML systems, particularly in research, training, and inference. You will design and optimize TPU kernels and provide critical...
Training
Anthropic
San Francisco, CA
1 day ago
Kernel Engineer- GPU
...us and help build the platform engineers turn to to ship AI products. THE ROLE We’re seeking a GPU Kernel Engineer to join our team at the... .... You’ll work in a fast‑paced, intellectually stimulating... ...performance GPU kernels for key ML operations, including matrix multiplications...
Flexible hours
Baseten
San Francisco, CA
16 hours ago
GPU Systems Engineer - HPC / Parallel Computing
$160k - $320k
...deliver excellence. We seek engineers/researchers with strong... ...systems to optimize GPU performance at the... ...Design and optimize GPU kernels and tensor libraries.... ...Familiarity with distributed training/inference frameworks (... ...leaders. Ambitious, fast-paced startup culture where...
Training
Full time
Work at office
Vast.ai
San Francisco, CA
3 days ago
Senior ML Training Systems Engineer - Distributed GPU Infra
...is looking for a Senior Software Engineer to build scalable infrastructure for large‑scale training and fine-tuning of foundation... ...distributed training systems and optimize GPU utilization while collaborating... ...over 5 years of experience in ML infrastructure and a strong...
Training
BaseTen
San Francisco, CA
2 days ago
Senior CUDA Kernel Engineer - GPU Performance Lead
Pragmatike is seeking a CUDA Kernel Engineer for a remote position to develop and optimize NVIDIA... ...will have a deep understanding of GPU architecture, performance optimization strategies... ...impacting Fortune 500 clients within a fast-growing AI startup recognized by GTM...
Remote work
Relocation package
Pragmatike
San Francisco, CA
1 day ago
ML Systems Engineer, Robotics
$248.8k - $311k
...Physical AI and developing ML pipelines for processing, training, and fine-tuning on... ...As an ML Systems Engineer on the Physical AI... ..., in a fast-paced, cross-functional... ...environments, including GPU-level algorithm optimizations... ...(e.g., CUDA, kernel tuning). Programming...
Training
Full time
Scale AI
San Francisco, CA
10 days ago
Senior Staff Data Center Operations Engineer, GPU Hardware Architecture
$179k - $218k
...Senior Staff Data Center Operations Engineer, GPU Hardware Architecture Crusoe is on a mission... ...Operations & Telemetry: Leverage AI/ML methodologies to analyze fleet-wide telemetry... ...components before they impact customer training runs. Technical Sparing...
Training
Temporary work
Crusoe
San Francisco, CA
6 days ago
Performance Engineer, GPU
$280k
...committed researchers, engineers, policy experts, and business... ...innovations in GPU performance and systems... ...techniques from custom kernel development to distributed... ...in production ML systems and will be excited... ...Production Systems: Large-scale training infrastructure, fault...
Training
Work at office
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
2 days ago
Kernel Engineer for High-Performance AI Kernels
$225k
Magic is hiring a Kernel Engineer in San Francisco to design and maintain high-performance kernels... ...throughput and latency during AI training and inference. The ideal candidate has low... ...Google TPUs, and experience in optimizing GPU kernels. The position offers a competitive...
Training
Magic
San Francisco, CA
3 days ago
Forward Deployed Engineer - ML
...AI teams. With instant GPU access, sub‑second container... ...makes it simple to train models, run batch jobs,... ...infrastructure. We're a fast‑growing team based out of... ...medalists, and experienced engineering and product leaders with... ...can transform their AI/ML infrastructure. You will...
Training
Contract work
Modal Labs
San Francisco, CA
2 days ago
Senior Site Reliability Engineer - AI Infrastructure
Senior Site Reliability Engineer - AI Infrastructure... ...platform routes training and inference jobs... ...debug large-scale GPU infrastructure used... ...from network fabric → kernel → framework. What... ...orchestration, and ML frameworks. Drive... ...to narrow it down fast. Strong Candidates...
Training
Full time
Remote work
Andromeda
San Francisco, CA
1 day ago
Founding GPU Compiler Engineer
$285k - $315k
...portable. We are building a Kernel Optimizer that... ...partnering with researchers, engineers, and organizations who... ...'re hiring a Founding GPU Compiler Engineer to... ...for large-scale AI pre‑training. You will own the entire... ...systems Work closely with ML researchers to...
Training
Full time
Work at office
Relocation package
SF Tensor
San Francisco, CA
2 days ago
Kernel Engineer
$100k
...combines frontier-scale pre-training, domain-specific RL, ultra-long... .... About the role: As a Kernel Engineer, you will design, implement... ...optimize custom high-performance GPU kernels Evaluate porting... ...SF, if possible ~ A small, fast-paced, highly focused team...
Training
Remote job
Relocation
Visa sponsorship
Magic
San Francisco, CA
more than 2 months ago
Real-Time Inference & Model Serving Engineer (Equity)
$220k - $320k
ML Model Serving Engineer Want to build the layer that actually makes AI usable in real time... ...hard problems around batching, GPU efficiency, memory constraints,... ...real-world load. This is not about training models. It’s about making them fast, efficient, and production-ready...
Training
3 days per week
Trades Workforce Solutions
San Francisco, CA
1 day ago
GPU Kernel Engineer for AI Inference & Performance
FriendliAI is seeking a GPU Kernel Engineer in San Francisco to design and optimize GPU kernels for AI inference. This role requires expertise in CUDA, C++, and performance-critical systems. You will work on cutting-edge GPU technology and contribute to a highly collaborative...
FriendliAI
San Francisco, CA
3 days ago
Senior GPU Kernel Engineer for Autonomous Driving
$128.7k - $261.3k
...San Francisco is seeking an experienced developer for their AI Kernels & Compilers team to innovate in autonomous driving technology. The role focuses on designing high-performance GPU kernels, optimizing ML performance, and collaborating cross-functionally. The ideal...
Israelvcforum
San Francisco, CA
4 days ago
Senior ML Systems Engineer, Frameworks & Tooling
...Senior ML Systems Engineer, Frameworks & Tooling at Cohere Our mission... ...intelligence to serve humanity. We’re training and deploying frontier... ...and work hard and move fast to do what’s best for our customers... ...libraries, or custom kernels/fused ops. Experience with...
Training
Full time
Work at office
Remote work
Flexible hours
Cohere
San Francisco, CA
2 days ago
Remote CUDA Kernel Engineer — GPU Performance Expert
Pragmatike is seeking a CUDA Kernel Engineer to work remotely for a rapidly growing AI startup. The ideal candidate will have extensive experience... ...NVIDIA CUDA kernels, with a strong understanding of GPU architecture and performance optimization. Responsibilities include...
Remote job
Relocation package
Pragmatike
San Francisco, CA
1 day ago
HPC/ GPU Hardware Engineer
...history. When people finance GPU clusters, the datacenters housing... ...shape culture, mentor junior engineers, and learn from our customers.... ...experience, including kernel drivers, RDMA stack tuning, and... ...detection Knowledge of distributed training performance (NCCL, GPUDirect...
Training
Long term contract
Contract work
Fixed term contract
Work at office
Local area
Visa sponsorship
Shift work
3 days per week
The San Francisco Compute Company
San Francisco, CA
1 day ago
Staff ML Systems Engineer - GPU & Distributed Training
Genesis AI in San Francisco is looking for an experienced professional to optimize and build distributed training systems using PyTorch. The ideal candidate has over 8 years of experience in distributed systems, high-performance computing, and extensive expertise in Python...
Training
Genesis AI
San Francisco, CA
3 days ago
Software Engineer, Sandboxing (Systems)
$300k - $405k
...committed researchers, engineers, policy experts, and business... ...system programming, kernel optimization, and... ...efficiently and reliably for training and serving frontier AI... ...Work with our ML engineers to understand... ...experience with: GPU virtualization and acceleration...
Training
Work at office
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to GPU Kernel Engineer — Fast ML Training. Be the first to apply!