Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

GPU Kernel Engineer — Fast ML Training

MakerMaker.AI

MakerMaker.AI in San Francisco is seeking a skilled Software Engineer to write and optimize GPU kernels. You will work on deep low-level tasks that directly impact the performance of machine learning models. The ideal candidate has over 4 years of experience with GPU kernels, strong systems expertise, and a proven track record in kernel optimizations. This role requires on-site work in a collaborative environment. #J-18808-Ljbffr MakerMaker.AI

Vacancy posted 16 hours ago
Similar jobs that could be interesting for youBased on the GPU Kernel Engineer — Fast ML Training in San Francisco, CA vacancy
  • $285k - $315k

     ...We're looking for a Founding GPU Kernel Engineer who lives right at the boundary...  ...-optimize GPU kernels for ML workloads (matmuls, attention...  ...Experience with distributed training systems: collective ops like...  ...wants to know why things are fast or slow on the hardware. You'... 
    Training
    Full time
    Work at office
    Relocation package

    SF Tensor

    San Francisco, CA
    3 days ago
  • A leading AI acceleration company in San Francisco is seeking a GPU Kernel Engineer to optimize performance for machine learning models. You will be responsible for designing high-performance GPU kernels and using advanced techniques to boost computation efficiency. Ideal... 
    Suggested

    Baseten

    San Francisco, CA
    3 days ago
  •  ...GPU Kernel Engineer Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary...  ...to integrating optimized ops into high-level ML frameworks used for large-scale training and inference. This role is ideal for someone... 
    Training
    Flexible hours

    Sciforium

    San Francisco, CA
    16 hours ago
  •  ...ll write and optimize the GPU kernels and supporting systems software that makes our training and inference workloads fast. This is deep, low-level work...  ...use. We hire kernel engineers because the gap between "this...  ...libraries, compilers, or ML frameworks Experience with... 
    Training
    Shift work

    MakerMaker.AI

    San Francisco, CA
    1 day ago
  • $100k - $120k

     ...robotic foundation models. As training and inference workloads grow, we need kernel‑level innovations to...  ...of kernel and system engineers focused on performance-critical...  ...for CPU (AVX/ARM NEON), GPU (CUDA/ROCm), and...  ...optimizations into distributed ML frameworks (e.g.,... 
    Training

    Coda Robotics

    San Francisco, CA
    1 day ago
  • MakerMaker, based in San Francisco, is seeking a highly skilled kernel engineer to write and optimize GPU kernels that enhance performance for training and inference. This role involves deep, low-level work to close the significant performance gap that exists in modern... 
    Training

    MakerMaker

    San Francisco, CA
    1 day ago
  • $285k - $315k

    SF Tensor is looking for a Founding GPU Kernel Engineer in San Francisco, specializing in GPU architecture and kernel optimization for machine learning workloads. The ideal candidate has deep expertise, proven capabilities in hand-optimizing performance-critical kernels... 
    Full time
    Relocation package

    SF Tensor

    San Francisco, CA
    3 days ago
  •  ...than the status quo. As our ML Performance Engineer, you will be the person...  ...You will write custom CUDA kernels, push GPU utilization to its limits,...  ...for the team: define what fast looks like and build the tooling...  ...model inference and post‑training optimization at scale... 
    Training
    Flexible hours
    Shift work

    URun

    San Francisco, CA
    3 days ago
  •  ...hiring on behalf of a fast‑growing AI startup recognized...  ...searching for a CUDA Kernel Engineer who has hands‑on...  .... You will work on the GPU performance layer powering...  ...in GPU acceleration for ML frameworks or HPC workloads...  ..., compensation, and training. We are committed to a... 
    Training
    Remote job
    Local area
    Immediate start
    Relocation package

    Pragmatike

    San Francisco, CA
    1 day ago
  • $280k

    Anthropic is looking for a TPU Kernel Engineer in San Francisco, California. In this role, you will identify and resolve performance issues across ML systems, particularly in research, training, and inference. You will design and optimize TPU kernels and provide critical... 
    Training

    Anthropic

    San Francisco, CA
    1 day ago
  •  ...us and help build the platform engineers turn to to ship AI products. THE ROLE We’re seeking a GPU Kernel Engineer to join our team at the...  .... You’ll work in a fast‑paced, intellectually stimulating...  ...performance GPU kernels for key ML operations, including matrix multiplications... 
    Flexible hours

    Baseten

    San Francisco, CA
    16 hours ago
  • $160k - $320k

     ...deliver excellence. We seek engineers/researchers with strong...  ...systems to optimize GPU performance at the...  ...Design and optimize GPU kernels and tensor libraries....  ...Familiarity with distributed training/inference frameworks (...  ...leaders. Ambitious, fast-paced startup culture where... 
    Training
    Full time
    Work at office

    Vast.ai

    San Francisco, CA
    3 days ago
  •  ...is looking for a Senior Software Engineer to build scalable infrastructure for large‑scale training and fine-tuning of foundation...  ...distributed training systems and optimize GPU utilization while collaborating...  ...over 5 years of experience in ML infrastructure and a strong... 
    Training

    BaseTen

    San Francisco, CA
    2 days ago
  • Pragmatike is seeking a CUDA Kernel Engineer for a remote position to develop and optimize NVIDIA...  ...will have a deep understanding of GPU architecture, performance optimization strategies...  ...impacting Fortune 500 clients within a fast-growing AI startup recognized by GTM... 
    Remote work
    Relocation package

    Pragmatike

    San Francisco, CA
    1 day ago
  • $248.8k - $311k

     ...Physical AI and developing ML pipelines for processing, training, and fine-tuning on...  ...As an ML Systems Engineer on the Physical AI...  ..., in a fast-paced, cross-functional...  ...environments, including GPU-level algorithm optimizations...  ...(e.g., CUDA, kernel tuning). Programming... 
    Training
    Full time

    Scale AI

    San Francisco, CA
    10 days ago
  • $179k - $218k

     ...Senior Staff Data Center Operations Engineer, GPU Hardware Architecture Crusoe is on a mission...  ...Operations & Telemetry: Leverage AI/ML methodologies to analyze fleet-wide telemetry...  ...components before they impact customer training runs. Technical Sparing... 
    Training
    Temporary work

    Crusoe

    San Francisco, CA
    6 days ago
  • $280k

     ...committed researchers, engineers, policy experts, and business...  ...innovations in GPU performance and systems...  ...techniques from custom kernel development to distributed...  ...in production ML systems and will be excited...  ...Production Systems: Large-scale training infrastructure, fault... 
    Training
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    2 days ago
  • $225k

    Magic is hiring a Kernel Engineer in San Francisco to design and maintain high-performance kernels...  ...throughput and latency during AI training and inference. The ideal candidate has low...  ...Google TPUs, and experience in optimizing GPU kernels. The position offers a competitive... 
    Training

    Magic

    San Francisco, CA
    3 days ago
  •  ...AI teams. With instant GPU access, sub‑second container...  ...makes it simple to train models, run batch jobs,...  ...infrastructure. We're a fast‑growing team based out of...  ...medalists, and experienced engineering and product leaders with...  ...can transform their AI/ML infrastructure. You will... 
    Training
    Contract work

    Modal Labs

    San Francisco, CA
    2 days ago
  • Senior Site Reliability Engineer - AI Infrastructure...  ...platform routes training and inference jobs...  ...debug large-scale GPU infrastructure used...  ...from network fabric → kernel → framework. What...  ...orchestration, and ML frameworks. Drive...  ...to narrow it down fast. Strong Candidates... 
    Training
    Full time
    Remote work

    Andromeda

    San Francisco, CA
    1 day ago
  • $285k - $315k

     ...portable. We are building a Kernel Optimizer that...  ...partnering with researchers, engineers, and organizations who...  ...'re hiring a Founding GPU Compiler Engineer to...  ...for large-scale AI pre‑training. You will own the entire...  ...systems Work closely with ML researchers to... 
    Training
    Full time
    Work at office
    Relocation package

    SF Tensor

    San Francisco, CA
    2 days ago
  • $100k

     ...combines frontier-scale pre-training, domain-specific RL, ultra-long...  .... About the role:  As a Kernel Engineer, you will design, implement...  ...optimize custom high-performance GPU kernels Evaluate porting...  ...SF, if possible ~ A small, fast-paced, highly focused team... 
    Training
    Remote job
    Relocation
    Visa sponsorship

    Magic

    San Francisco, CA
    more than 2 months ago
  • $220k - $320k

    ML Model Serving Engineer Want to build the layer that actually makes AI usable in real time...  ...hard problems around batching, GPU efficiency, memory constraints,...  ...real-world load. This is not about training models. It’s about making them fast, efficient, and production-ready... 
    Training
    3 days per week

    Trades Workforce Solutions

    San Francisco, CA
    1 day ago
  • FriendliAI is seeking a GPU Kernel Engineer in San Francisco to design and optimize GPU kernels for AI inference. This role requires expertise in CUDA, C++, and performance-critical systems. You will work on cutting-edge GPU technology and contribute to a highly collaborative... 

    FriendliAI

    San Francisco, CA
    3 days ago
  • $128.7k - $261.3k

     ...San Francisco is seeking an experienced developer for their AI Kernels & Compilers team to innovate in autonomous driving technology. The role focuses on designing high-performance GPU kernels, optimizing ML performance, and collaborating cross-functionally. The ideal... 

    Israelvcforum

    San Francisco, CA
    4 days ago
  •  ...Senior ML Systems Engineer, Frameworks & Tooling at Cohere Our mission...  ...intelligence to serve humanity. We’re training and deploying frontier...  ...and work hard and move fast to do what’s best for our customers...  ...libraries, or custom kernels/fused ops. Experience with... 
    Training
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    2 days ago
  • Pragmatike is seeking a CUDA Kernel Engineer to work remotely for a rapidly growing AI startup. The ideal candidate will have extensive experience...  ...NVIDIA CUDA kernels, with a strong understanding of GPU architecture and performance optimization. Responsibilities include... 
    Remote job
    Relocation package

    Pragmatike

    San Francisco, CA
    1 day ago
  •  ...history. When people finance GPU clusters, the datacenters housing...  ...shape culture, mentor junior engineers, and learn from our customers....  ...experience, including kernel drivers, RDMA stack tuning, and...  ...detection Knowledge of distributed training performance (NCCL, GPUDirect... 
    Training
    Long term contract
    Contract work
    Fixed term contract
    Work at office
    Local area
    Visa sponsorship
    Shift work
    3 days per week

    The San Francisco Compute Company

    San Francisco, CA
    1 day ago
  • Genesis AI in San Francisco is looking for an experienced professional to optimize and build distributed training systems using PyTorch. The ideal candidate has over 8 years of experience in distributed systems, high-performance computing, and extensive expertise in Python... 
    Training

    Genesis AI

    San Francisco, CA
    3 days ago
  • $300k - $405k

     ...committed researchers, engineers, policy experts, and business...  ...system programming, kernel optimization, and...  ...efficiently and reliably for training and serving frontier AI...  ...Work with our ML engineers to understand...  ...experience with: GPU virtualization and acceleration... 
    Training
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to GPU Kernel Engineer — Fast ML Training. Be the first to apply!