ML Kernel Engineer - High-Performance GPU Compute

$200k - $350k

Inception LLC

Inception in San Francisco is seeking engineers and scientists to design and optimize the compute foundations for AI models. The role includes developing high-performance ML kernels for significant operations and ensuring efficient low-precision arithmetic. A strong background in GPU programming and systems is necessary, as well as experience with distributed systems and cloud platforms. The annual salary ranges from $200,000 to $350,000 with equity and benefits included.

Vacancy posted 1 day ago
Similar jobs that could be interesting for you
Based on the ML Kernel Engineer - High-Performance GPU Compute vacancy in San Francisco, CA
  • $100k - $200k

    Voiceflow is seeking a skilled ML-Infrastructure Engineer in San Francisco to architect and operate auto-scaling systems...  ...platform. The role includes optimizing GPU and compute infrastructure, ensuring high performance and reliability. Ideal candidates have hands-on... 
    Performance
    Work at office

    Voiceflow

    San Francisco, CA
    3 days ago
  • $128.7k - $261.3k

     ...mobility. For the AI Kernels & Compilers team,...  ...kernel development, and performance engineering so that every cycle...  ...Kernels team builds high‑performance GPU kernels and custom...  ...heart of on‑vehicle ML inference for ADAS and...  ..., high performance computing and/ or architecture... 
    Performance
    Local area
    Flexible hours

    Israelvcforum

    San Francisco, CA
    3 days ago
  • $100k - $120k

     ...looking for an experienced engineer to join their founding...  ..., focusing on low-level compute kernels to enhance robotic...  ...assembly), expertise in GPU optimizations, and familiarity with ML framework internals. Responsibilities...  ..., and pioneering high-velocity development... 
    Performance

    Coda Robotics

    San Francisco, CA
    18 hours ago
  • Reactor is looking for an experienced ML Inference Engineer with deep expertise in high-performance ML engineering. This role focuses on optimizing the performance...  ...field is required, along with strong knowledge of GPU hardware and modern ML optimization techniques. The... 
    Performance

    Reactor

    San Francisco, CA
    3 days ago
  •  ...San Francisco is seeking a specialist to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will have hands-on experience with modern... 
    Performance

    Reflection AI

    San Francisco, CA
    3 days ago
  • ML Systems Engineer - Robotics & AI We are building the full-stack...  ...robots, from high-performance, software-defined hardware...  ...are creating a new computing platform for physical...  ...identification at different GPU counts. Drive...  ...Experience with GPU kernel‑level performance work... 
    Performance

    Maxwell Bond

    San Francisco, CA
    18 hours ago
  •  ...training, from managing GPU/TPU compute and job...  ...researchers and model engineers to translate ideas...  ...This is a hands‑on, high‑leverage role at...  ...the intersection of ML, software...  ...friction. Optimize performance: Profile and improve...  ...optimization, custom kernels). Experience operating... 
    Performance
    Full time

    Monograph

    San Francisco, CA
    18 hours ago
  • We're looking for an ML Inference Engineer with deep expertise in high-performance ML engineering. This is...  ....compile, custom CUDA kernels, and specialized inference...  ...to identify computational bottlenecks Collaborate...  ...Working knowledge of GPU hardware (NVIDIA) Strong... 
    Performance
    Full time
    Visa sponsorship
    Relocation package

    Reactor

    San Francisco, CA
    3 days ago
  •  ...company in San Francisco is seeking a GPU Kernel Engineer to optimize performance for machine learning models. You will be responsible for designing high-performance GPU kernels and using advanced techniques to boost computation efficiency. Ideal candidates have 1–5 years... 
    Performance

    Baseten

    San Francisco, CA
    2 days ago
  • $218.4k - $273k

     ...Vehicles, and Computer Vision. This position...  ...and developing ML pipelines for...  ...an ML Systems Engineer on the Physical...  ...’ll work in a highly collaborative...  ...tolerant, high-performance systems for serving...  ..., including GPU-level algorithm...  ...optimizations (e.g., CUDA, kernel tuning).... 
    Performance
    Full time

    Scale AI

    San Francisco, CA
    18 hours ago
  •  ...San Francisco is seeking candidates to develop and optimize GPU-accelerated kernels for machine learning and AI applications. You will work closely with the modeling and algorithm team to enhance the performance of AI systems. The ideal candidate will collaborate with... 
    Performance

    Wilder Wealthy & Wise

    San Francisco, CA
    1 day ago
  • $100k - $120k

     ...Robotics is scaling the compute infrastructure...  ...grow, we need kernel‑level innovations...  ...and system engineers focused on performance-critical code Design...  ...(AVX/ARM NEON), GPU (CUDA/ROCm), and...  ...into distributed ML frameworks (e.g.,...  ...rollouts Champion a high‑velocity culture... 
    Performance

    Coda Robotics

    San Francisco, CA
    1 day ago
  •  ...new Machine Learning Engineer opportunities...  ...optimize end-to-end ML pipelines encompassing...  ...observability, and performance across all AI systems...  ...to ensure high performance and low...  ...networking, memory, compute scheduling, and I/O...  ...latency behavior, and GPU and model-serving platforms... 
    Performance
    Flexible hours

    AI Chopping Block, Inc.

    San Francisco, CA
    3 days ago
  •  ...build the platform engineers turn to to ship AI...  ...We’re seeking a GPU Kernel Engineer to join our...  ...impacts the performance of state‑of‑the‑art...  ...every microsecond of computation to enable breakthrough...  ...optimization and high‑impact systems...  ...GPU kernels for key ML operations, including... 
    Performance
    Flexible hours

    Baseten

    San Francisco, CA
    2 days ago
  • $175k - $250k

     ...Swayable is seeking a Senior Engineer blending Python software...  ...development expertise with scientific computing, machine learning, and AI...  ..., and architecture for high-performance computing. You will work with...  ...constantly evolving toolset for ML and AI Ops. You are... 
    Performance
    Full time

    Swayable

    San Francisco, CA
    1 day ago
  •  ...FriendliAI is looking for a GPU Kernel Engineer to design, build, and...  ...the low-level compute kernels that power our...  ...is a deeply technical, high-impact role where you...  ...implement, and optimize high-performance GPU kernels for AI...  ...to GPU performance or ML acceleration Research... 
    Performance
    Flexible hours

    FriendliAI

    San Francisco, CA
    3 days ago
  • Reducto, Inc. is hiring a Machine Learning Infra Engineer in San Francisco to build and maintain ML training and inference frameworks. The role focuses on high performance and scaling across multiple nodes and GPUs. The ideal candidate will have strong Python skills and... 
    Performance

    Reducto, Inc.

    San Francisco, CA
    2 days ago
  • A leading AI infrastructure company is seeking a Senior ML Performance Engineer to design a comprehensive performance testing platform for large...  ...years in performance engineering and strong experience with GPU programming and ML inference workloads. Candidates should have... 
    Performance

    Amadeus Search

    San Francisco, CA
    18 hours ago
  • $285k - $315k

     ...looking for a Founding GPU Kernel Engineer who lives right at the...  ...GPU kernels for ML workloads (matmuls, attention...  ..., etc.) to set the performance ceilings Profile at the...  ...tools: Nsight Compute, Nsight Systems, rocprof...  ...understanding of how high-level ML operations map... 
    Performance
    Full time
    Work at office
    Relocation package

    SF Tensor

    San Francisco, CA
    3 days ago
  •  ...both cloud and edge compute resources. Learn more...  ...About the Role As an ML / DevOps Engineer, you will play a pivotal...  ...required to process high-throughput video data...  ...by managing high-performance Linux environments. You...  ...Linux administration, kernel tuning, and system performance... 
    Performance
    Work at office

    Zensors

    San Francisco, CA
    1 day ago
  • $200k - $300k

     ...Foundation Model. As a Senior ML Engineer, Foundation Models,...  .... We are a small, high-ownership team. We...  ...Machine Learning, Robotics, Computer Science, or a related...  ...training (multi‑node GPU clusters, FSDP,...  ...TensorRT, quantization, performance profiling) Prior work... 
    Performance
    Flexible hours

    Alumni Ventures

    San Francisco, CA
    1 day ago
  • $128.7k - $261.3k

     ...mobility. For the AI Kernels & Compilers team, that...  ...kernel development, and performance engineering so that every cycle on...  ...compiler that turns high‑level models into fast...  ...compiler, systems, and GPU engineers who enjoy working...  ..., and effortless for ML engineers across the... 
    Performance
    Local area
    Flexible hours

    Israelvcforum

    San Francisco, CA
    3 days ago
  • Position: Senior ML Performance Engineer Location: SF Bay Area (US) or Toronto (...  ...infrastructure company is building a high-performance, portable...  ...optimization on modern GPU architectures. This role sits...  ...techniques High-performance computing or systems-level optimization... 
    Performance
    Full time

    Amadeus Search

    San Francisco, CA
    18 hours ago
  •  ...believe culture can be engineered - but when it falls...  ...'re looking for an ML infrastructure...  ...areas spanning vehicle compute to data collection...  ...us build reliable, performant, and secure...  ...curating raw data into high-quality training sets...  ...ML training on our GPU clusters Take ownership... 
    Performance
    Local area

    Humble Robotics

    San Francisco, CA
    3 days ago
  • $35 - $45 per hour

    Gravity Engineering Services Pvt Ltd. is seeking an MLOps Engineer to guide research and engineering teams in improving AI model performance. Candidates should have over 2 years of experience in ML infrastructure, with production knowledge of JAX or PyTorch. This is a contract... 
    Performance
    Remote job
    Hourly pay
    Weekly pay
    Contract work

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    1 day ago
  • $144k - $192k

     ...Machine Learning Systems Engineer to join our ML Acceleration team....  ...research and high-performance systems engineering....  ...data loading, gradient computation, and communication....  ...optimizations like kernel fusion, sharding, and...  ...maintain high-performance GPU kernels in Triton or... 
    Performance
    Work at office
    Remote work

    Motional AD Inc.

    San Francisco, CA
    1 day ago
  •  ...interactive AI: the compute layer that makes real...  .... As our ML Infrastructure and Platform Engineer, you will own the architecture...  ...and scaling of our GPU compute platform from...  ...clusters, ensuring high availability and low...  ..., or hardware‑aware performance tuning (CuTe, Triton... 
    Performance
    Flexible hours
    Shift work

    U-Run

    San Francisco, CA
    3 days ago
  •  ...exceptional Infrastructure Engineer to own and build the...  ...to manage data, run compute workloads, store...  ...data infrastructure, GPU compute, and high-reliability internal...  ...discussions with cloud and ML compute providers,...  ...capacity planning, performance, reliability, and cost... 
    Performance

    TryApplyNow

    San Francisco, CA
    1 day ago
  •  ...new startup in the Brain-Computer Interface (BCI) space, driving...  ...advanced hardware engineering and AI solutions. Our mission...  ...role in shaping a high-performance, production-grade ML ecosystem to support rapid...  ...such as model sharding, cross-GPU communication, and real-time... 
    Performance
    Flexible hours

    Echo Neurotechnologies

    San Francisco, CA
    18 hours ago
  •  ...Technical Staff in San Francisco to optimize GPU performance for AI workloads. This role emphasizes high ownership and engineering excellence, ideal for those passionate about...  .... The selected candidate will develop kernels, improve execution strategies, and collaborate... 
    Performance

    Acceler8 Talent

    San Francisco, CA
    1 day ago
