ML Kernel Engineer - High-Performance GPU Compute
$200k - $350kInception LLC
Inception in San Francisco is seeking engineers and scientists to design and optimize the compute foundations for AI models. The role includes developing high-performance ML kernels for significant operations and ensuring efficient low-precision arithmetic. A strong background in GPU programming and systems is necessary, as well as experience with distributed systems and cloud platforms. The annual salary ranges from $200,000 to $350,000 with equity and benefits included. #J-18808-Ljbffr Inception LLC
$100k - $200k
Voiceflow is seeking a skilled ML-Infrastructure Engineer in San Francisco to architect and operate auto-scaling systems... ...platform. The role includes optimizing GPU and compute infrastructure, ensuring high performance and reliability. Ideal candidates have hands-on...PerformanceWork at office$128.7k - $261.3k
...mobility. For the AI Kernels & Compilers team,... ...kernel development, and performance engineering so that every cycle... ...Kernels team builds high‑performance GPU kernels and custom... ...heart of on‑vehicle ML inference for ADAS and... ..., high performance computing and/ or architecture...PerformanceLocal areaFlexible hours$100k - $120k
...looking for an experienced engineer to join their founding... ..., focusing on low-level compute kernels to enhance robotic... ...assembly), expertise in GPU optimizations, and familiarity with ML framework internals. Responsibilities... ..., and pioneering high-velocity development...Performance- Reactor is looking for an experienced ML Inference Engineer with deep expertise in high-performance ML engineering. This role focuses on optimizing the performance... ...field is required, along with strong knowledge of GPU hardware and modern ML optimization techniques. The...Performance
- ...San Francisco is seeking a specialist to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will have hands-on experience with modern...Performance
- ML Systems Engineer - Robotics & AI We are building the full-stack... ...robots, from high-performance, software-defined hardware... ...are creating a new computing platform for physical... ...identification at different GPU counts. Drive... ...Experience with GPU kernel‑level performance work...Performance
- ...training, from managing GPU/TPU compute and job... ...researchers and model engineers to translate ideas... ...This is a hands‑on, high‑leverage role at... ...the intersection of ML, software... ...friction. Optimize performance: Profile and improve... ...optimization, custom kernels). Experience operating...PerformanceFull time
- We're looking for an ML Inference Engineer with deep expertise in high-performance ML engineering. This is... ....compile, custom CUDA kernels, and specialized inference... ...to identify computational bottlenecks Collaborate... ...Working knowledge of GPU hardware (NVIDIA) Strong...PerformanceFull timeVisa sponsorshipRelocation package
- ...company in San Francisco is seeking a GPU Kernel Engineer to optimize performance for machine learning models. You will be responsible for designing high-performance GPU kernels and using advanced techniques to boost computation efficiency. Ideal candidates have 1–5 years...Performance
$218.4k - $273k
...Vehicles, and Computer Vision. This position... ...and developing ML pipelines for... ...an ML Systems Engineer on the Physical... ...’ll work in a highly collaborative... ...tolerant, high-performance systems for serving... ..., including GPU-level algorithm... ...optimizations (e.g., CUDA, kernel tuning)....PerformanceFull time- ...San Francisco is seeking candidates to develop and optimize GPU-accelerated kernels for machine learning and AI applications. You will work closely with the modeling and algorithm team to enhance the performance of AI systems. The ideal candidate will collaborate with...Performance
$100k - $120k
...Robotics is scaling the compute infrastructure... ...grow, we need kernel‑level innovations... ...and system engineers focused on performance-critical code Design... ...(AVX/ARM NEON), GPU (CUDA/ROCm), and... ...into distributed ML frameworks (e.g.,... ...rollouts Champion a high‑velocity culture...Performance- ...new Machine Learning Engineer opportunities... ...optimize end-to-end ML pipelines encompassing... ...observability, and performance across all AI systems... ...to ensure high performance and low... ...networking, memory, compute scheduling, and I/O... ...latency behavior, and GPU and model-serving platforms...PerformanceFlexible hours
- ...build the platform engineers turn to to ship AI... ...We’re seeking a GPU Kernel Engineer to join our... ...impacts the performance of state‑of‑the‑art... ...every microsecond of computation to enable breakthrough... ...optimization and high‑impact systems... ...GPU kernels for key ML operations, including...PerformanceFlexible hours
$175k - $250k
...Swayable is seeking a Senior Engineer blending Python software... ...development expertise with scientific computing, machine learning, and AI... ..., and architecture for high-performance computing. You will work with... ...constantly evolving toolset for ML and AI Ops. You are...PerformanceFull time- ...FriendliAI is looking for a GPU Kernel Engineer to design, build, and... ...the low-level compute kernels that power our... ...is a deeply technical, high-impact role where you... ...implement, and optimize high-performance GPU kernels for AI... ...to GPU performance or ML acceleration Research...PerformanceFlexible hours
- Reducto, Inc. is hiring a Machine Learning Infra Engineer in San Francisco to build and maintain ML training and inference frameworks. The role focuses on high performance and scaling across multiple nodes and GPUs. The ideal candidate will have strong Python skills and...Performance
- A leading AI infrastructure company is seeking a Senior ML Performance Engineer to design a comprehensive performance testing platform for large... ...years in performance engineering and strong experience with GPU programming and ML inference workloads. Candidates should have...Performance
$285k - $315k
...looking for a Founding GPU Kernel Engineer who lives right at the... ...GPU kernels for ML workloads (matmuls, attention... ..., etc.) to set the performance ceilings Profile at the... ...tools: Nsight Compute, Nsight Systems, rocprof... ...understanding of how high-level ML operations map...PerformanceFull timeWork at officeRelocation package- ...both cloud and edge compute resources. Learn more... ...About the Role As an ML / DevOps Engineer, you will play a pivotal... ...required to process high-throughput video data... ...by managing high-performance Linux environments. You... ...Linux administration, kernel tuning, and system performance...PerformanceWork at office
$200k - $300k
...Foundation Model. As a Senior ML Engineer, Foundation Models,... .... We are a small, high-ownership team. We... ...Machine Learning, Robotics, Computer Science, or a related... ...training (multi‑node GPU clusters, FSDP,... ...TensorRT, quantization, performance profiling) Prior work...PerformanceFlexible hours$128.7k - $261.3k
...mobility. For the AI Kernels & Compilers team, that... ...kernel development, and performance engineering so that every cycle on... ...compiler that turns high‑level models into fast... ...compiler, systems, and GPU engineers who enjoy working... ..., and effortless for ML engineers across the...PerformanceLocal areaFlexible hours- Position: Senior ML Performance Engineer Location: SF Bay Area (US) or Toronto (... ...infrastructure company is building a high-performance, portable... ...optimization on modern GPU architectures. This role sits... ...techniques High-performance computing or systems-level optimization...PerformanceFull time
- ...believe culture can be engineered - but when it falls... ...'re looking for an ML infrastructure... ...areas spanning vehicle compute to data collection... ...us build reliable, performant, and secure... ...curating raw data into high-quality training sets... ...ML training on our GPU clusters Take ownership...PerformanceLocal area
$35 - $45 per hour
Gravity Engineering Services Pvt Ltd. is seeking an MLOps Engineer to guide research and engineering teams in improving AI model performance. Candidates should have over 2 years of experience in ML infrastructure, with production knowledge of JAX or PyTorch. This is a contract...PerformanceRemote jobHourly payWeekly payContract work$144k - $192k
...Machine Learning Systems Engineer to join our ML Acceleration team.... ...research and high-performance systems engineering.... ...data loading, gradient computation, and communication.... ...optimizations like kernel fusion, sharding, and... ...maintain high-performance GPU kernels in Triton or...PerformanceWork at officeRemote work- ...interactive AI: the compute layer that makes real... .... As our ML Infrastructure and Platform Engineer, you will own the architecture... ...and scaling of our GPU compute platform from... ...clusters, ensuring high availability and low... ..., or hardware‑aware performance tuning (CuTe, Triton...PerformanceFlexible hoursShift work
- ...exceptional Infrastructure Engineer to own and build the... ...to manage data, run compute workloads, store... ...data infrastructure, GPU compute, and high-reliability internal... ...discussions with cloud and ML compute providers,... ...capacity planning, performance, reliability, and cost...Performance
- ...new startup in the Brain-Computer Interface (BCI) space, driving... ...advanced hardware engineering and AI solutions. Our mission... ...role in shaping a high-performance, production-grade ML ecosystem to support rapid... ...such model sharding, cross-GPU communication, and real-time...PerformanceFlexible hours
- ...Technical Staff in San Francisco to optimize GPU performance for AI workloads. This role emphasizes high ownership and engineering excellence, ideal for those passionate about... .... The selected candidate will develop kernels, improve execution strategies, and collaborate...Performance
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to ML Kernel Engineer - High-Performance GPU Compute. Be the first to apply!
- graduate machine learning engineer San Francisco, CA
- machine learning engineer San Francisco, CA
- data scientist machine learning engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- senior ml engineer San Francisco, CA
- computer vision machine learning engineer San Francisco, CA
- ai ml engineer San Francisco, CA
- machine learning software engineer San Francisco, CA
- machine learning ai engineer San Francisco, CA
- senior performance engineer San Francisco, CA

