Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

GPU Kernel Engineer — High-Performance ML at Scale

The Consensus

The Consensus is looking for a GPU Kernel Engineer to optimize machine learning performance. The ideal candidate will design high-performance GPU kernels and collaborate on cutting-edge projects in the AI field. This role offers substantial growth opportunities in an inclusive environment. You'll be a critical part of shaping AI applications, working with state-of-the-art tools, and contributing to significant advancements in GPU performance. #J-18808-Ljbffr The Consensus

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the GPU Kernel Engineer — High-Performance ML at Scale in San Francisco, CA vacancy
  •  ...San Francisco is seeking candidates to develop and optimize GPU-accelerated kernels for machine learning and AI applications. You will work closely with the modeling and algorithm team to enhance the performance of AI systems. The ideal candidate will collaborate with... 
    Performance

    Wilder Wealthy & Wise

    San Francisco, CA
    2 days ago
  • A leading AI acceleration company in San Francisco is seeking a GPU Kernel Engineer to optimize performance for machine learning models. You will be responsible for designing high-performance GPU kernels and using advanced techniques to boost computation efficiency. Ideal... 
    Performance

    Baseten

    San Francisco, CA
    4 days ago
  • MakerMaker.AI in San Francisco is seeking a skilled Software Engineer to write and optimize GPU kernels. You will work on deep low-level tasks that directly impact the performance of machine learning models. The ideal candidate has over 4 years of experience with GPU kernels... 
    Performance

    MakerMaker.AI

    San Francisco, CA
    1 day ago
  • San Francisco Tensor Company is seeking a Founding GPU Kernel Engineer to enhance GPU performance for AI applications. You will optimize and write kernels while collaborating with compiler teams to improve efficiencies across architectures. The ideal candidate has deep... 
    Performance
    Work at office
    Relocation package

    San Francisco Tensor Company

    San Francisco, CA
    3 days ago
  • $285k - $315k

    SF Tensor is looking for a Founding GPU Kernel Engineer in San Francisco, specializing in GPU architecture and kernel optimization for machine...  ...has deep expertise, proven capabilities in hand-optimizing performance-critical kernels, and strong programming skills in C++ and... 
    Performance
    Full time
    Relocation package

    SF Tensor

    San Francisco, CA
    4 days ago
  • $100k - $120k

    Coda Robotics is scaling the compute infrastructure...  ...grow, we need kernel‑level innovations...  ...and system engineers focused on performance-critical code Design...  ...(AVX/ARM NEON), GPU (CUDA/ROCm), and...  ...into distributed ML frameworks (e.g.,...  ...Champion a high‑velocity culture... 
    Performance

    Coda Robotics

    San Francisco, CA
    2 days ago
  • $285k - $315k

     ...looking for a Founding GPU Kernel Engineer who lives right at the...  ...GPU kernels for ML workloads (matmuls, attention...  ..., etc.) to set the performance ceilings Profile at the...  ...understanding of how high-level ML operations map...  ...experience with large-scale scientific computing,... 
    Performance
    Full time
    Work at office
    Relocation package

    SF Tensor

    San Francisco, CA
    4 days ago
  • $100k - $120k

     ...looking for an experienced engineer to join their founding...  ...focusing on low-level compute kernels to enhance robotic...  ...assembly), expertise in GPU optimizations, and familiarity with ML framework internals. Responsibilities...  ..., and pioneering high-velocity development culture... 
    Performance

    Coda Robotics

    San Francisco, CA
    1 day ago
  • $200k - $350k

    Inception in San Francisco is seeking engineers and scientists to design and optimize...  ...models. The role includes developing high-performance ML kernels for significant operations and ensuring...  ...precision arithmetic. A strong background in GPU programming and systems is necessary,... 
    Performance

    Inception LLC

    San Francisco, CA
    2 days ago
  •  ...help build the platform engineers turn to to ship AI...  ...ROLE We’re seeking a GPU Kernel Engineer to join our team...  ...directly impacts the performance of state‑of‑the‑art machine...  ...optimization and high‑impact systems work. EXAMPLE...  ...GPU kernels for key ML operations, including... 
    Performance
    Flexible hours

    Baseten

    San Francisco, CA
    3 days ago
  • $128.7k - $261.3k

     ...San Francisco is seeking an experienced developer for their AI Kernels & Compilers team to innovate in autonomous driving technology. The role focuses on designing high-performance GPU kernels, optimizing ML performance, and collaborating cross-functionally. The ideal... 
    Performance

    Israelvcforum

    San Francisco, CA
    14 hours ago
  •  ...build the platform engineers turn to to ship...  ...workloads scale, the network is...  ...engineers to lead our GPU Networking...  ...networking performance on bleeding‑edge...  ...behaviors. Optimize Kernels: You will work...  ...with high‑performance networking...  ...to a variety of ML startups,... 
    Performance
    Flexible hours

    Baseten

    San Francisco, CA
    4 days ago
  • $225k

    Magic is hiring a Kernel Engineer in San Francisco to design and maintain high-performance kernels that optimize throughput and latency during AI training and inference....  ...Blackwell and Google TPUs, and experience in optimizing GPU kernels. The position offers a competitive... 
    Performance

    Magic Inc

    San Francisco, CA
    4 days ago
  • FriendliAI is seeking a GPU Kernel Engineer in San Francisco to design and optimize GPU kernels...  ...requires expertise in CUDA, C++, and performance-critical systems. You will work on cutting...  ...GPU technology and contribute to a highly collaborative, supportive work environment... 
    Performance

    FriendliAI

    San Francisco, CA
    4 days ago
  • $167.2k - $209k

     ...DigitalOcean is seeking a Senior Engineer 2 to play a key technical...  ...can offer the industry-leading performance for our inference services....  ...the technical roadmap for our high-performance inference fleet....  ...at the inference engine and GPU kernel layers, ensuring our infrastructure... 
    Performance
    Local area
    Remote work
    Worldwide
    Flexible hours

    DigitalOcean

    San Francisco, CA
    2 days ago
  • Pragmatike is seeking a CUDA Kernel Engineer for a remote position to develop and optimize NVIDIA CUDA kernels for high-performance AI systems. The ideal candidate will have a deep understanding of GPU architecture, performance optimization strategies, and hands-on experience... 
    Performance
    Remote work
    Relocation package

    Pragmatike

    San Francisco, CA
    2 days ago
  •  ...innovative company is seeking a talented software engineer to join their dynamic Inference team. This role involves...  ...and implementing infrastructure for large-scale multimodal models, focusing on high-performance delivery of audio and image inputs. You'll collaborate... 
    Performance

    Jobleads-US

    San Francisco, CA
    4 days ago
  •  ...FriendliAI is looking for a GPU Kernel Engineer to design, build, and...  ...that power our large-scale, GPU-accelerated AI...  ...a deeply technical, high-impact role where you...  ...implement, and optimize high-performance GPU kernels for AI...  ...to GPU performance or ML acceleration Research... 
    Performance
    Flexible hours

    FriendliAI

    San Francisco, CA
    4 days ago
  • $80 - $120 per hour

    Mercor is seeking a CUDA Engineering Expert to analyze and optimize GPU kernels for performance and efficiency. You'll be working remotely, requiring at least 20 hours a week, and fluent in C++ through C++17. This role includes responsibilities like using profiler metrics... 
    Performance
    Remote job
    Hourly pay

    Mercor

    San Francisco, CA
    1 day ago
  •  ...Francisco is seeking a Research Engineer specializing in AI Performance & Kernel Optimization. The role...  ...enhancing the performance of large-scale AI systems, optimizing...  ...strong engineering background in GPU kernel development and experience with ML workloads. Benefits include... 
    Performance

    Zyphra

    San Francisco, CA
    4 days ago
  • $285k - $315k

     ...future of AI and high-performance computing depends...  ...We are building a Kernel Optimizer that automatically...  ...with researchers, engineers, and organizations...  ...hiring a Founding GPU Compiler Engineer...  ...for large-scale AI pre-training. You...  ...Work closely with ML researchers to... 
    Performance
    Full time
    Work at office
    Relocation package

    San Francisco Tensor Company

    San Francisco, CA
    4 days ago
  • $225k

     ...Manufacturing Co is looking for a Software Engineer on the Inference & RL Systems team in...  ...distributed systems, optimizing performance, and ensuring high reliability for RL and post-training...  ...fundamentals and experience with large-scale systems. Compensation includes a... 
    Performance

    Dormont Manufacturing Co

    San Francisco, CA
    3 days ago
  • $218.4k - $273k

     ...Scale's Physical AI business unit is...  ...and developing ML pipelines for processing...  ...an ML Systems Engineer on the Physical...  ...’ll work in a highly collaborative...  ...tolerant, high-performance systems for...  ..., including GPU-level algorithm...  ...optimizations (e.g., CUDA, kernel tuning).... 
    Performance
    Full time

    Scale AI

    San Francisco, CA
    1 day ago
  •  ...Manufacturing Co is looking for a Software Engineer for their Pre-training Systems team in...  ...that trains long-context models at scale, tackling challenges related to memory...  ...towards maintaining critical systems in a high-performance environment. #J-18808-Ljbffr Dormont... 
    Performance

    Dormont Manufacturing Co

    San Francisco, CA
    4 days ago
  • $500 per month

     ...small team of ~10 engineers, former US military...  ...matter experts scaling America’s directed...  ...computer vision, ML inference, controls...  ...real-time systems performance at the hardware boundary...  ...across CPU, GPU, memory, and I/O;...  ...Develop and optimize kernels for high-throughput, low-latency... 
    Performance
    Permanent employment
    Work at office
    Monday to Friday
    Flexible hours
    Night shift
    Weekend work

    Aurelius Systems, Inc

    San Francisco, CA
    3 days ago
  • $300k

     ...Job Description GPU Optimisation Engineer - Real-Time Inference Want to push GPU performance to its limits - not in theory, but in...  ...really lost: memory hierarchy, kernel launch overhead, occupancy limits...  ...layers, profiling large-scale speech and multimodal models... 
    Performance
    Relocation
    Visa sponsorship
    Free visa

    Techire Ai

    San Francisco, CA
    2 days ago
  • $100k - $200k

    Voiceflow is seeking a skilled ML-Infrastructure Engineer in San Francisco to architect and operate auto-scaling systems for our voice AI simulation...  .... The role includes optimizing GPU and compute infrastructure, ensuring high performance and reliability. Ideal... 
    Performance
    Work at office

    Voiceflow

    San Francisco, CA
    4 days ago
  •  ...High Performance Computing Engineer San Francisco Bay Area We are pioneering advanced computational solutions...  ...with CUDA/ROCm, MPI, and large-scale system optimization. Advanced...  ...ROCm implementations for computational kernels Create and maintain MPI-based... 
    Performance

    Polyhedra

    San Francisco, CA
    1 day ago
  •  ...You’ll write and optimize the GPU kernels and supporting systems...  ...This is deep, low-level work (performance counters, memory bandwidth, warp...  ...actually use. We hire kernel engineers because the gap between "this...  ...kernel libraries, compilers, or ML frameworks Experience with... 
    Performance
    Shift work

    MakerMaker.AI

    San Francisco, CA
    3 days ago
  • Anyscale is seeking a Distributed LLM Inference Engineer in San Francisco, California. This pivotal role involves pushing the boundaries of performance for ML inference at scale. You'll work closely with product teams to deliver end-to-end solutions while leveraging open... 
    Performance

    Anyscale

    San Francisco, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to GPU Kernel Engineer — High-Performance ML at Scale. Be the first to apply!