Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior CUDA Kernel Engineer - GPU Performance Lead

Pragmatike

Pragmatike is seeking a CUDA Kernel Engineer for a remote position to develop and optimize NVIDIA CUDA kernels for high-performance AI systems. The ideal candidate will have a deep understanding of GPU architecture, performance optimization strategies, and hands-on experience with profiling tools. This role provides the opportunity to work on significant projects impacting Fortune 500 clients within a fast-growing AI startup recognized by GTM Capital. Competitive salary, sign-on bonus, and health benefits offered. #J-18808-Ljbffr Pragmatike

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Senior CUDA Kernel Engineer - GPU Performance Lead in San Francisco, CA vacancy
  • $128.7k - $261.3k

     ...seeking an experienced developer for their AI Kernels & Compilers team to innovate in...  ...technology. The role focuses on designing high-performance GPU kernels, optimizing ML performance, and...  ...will have a strong background in CUDA programming and C++, with a minimum of 3... 
    Senior

    Israelvcforum

    San Francisco, CA
    5 days ago
  •  ...in San Francisco, is seeking a highly skilled kernel engineer to write and optimize GPU kernels that enhance performance for training and inference. This role involves...  ...ideal candidate will have a strong background in CUDA or similar, with proven experience in kernel optimizations... 
    Senior

    MakerMaker

    San Francisco, CA
    4 days ago
  • $167.2k - $209k

     ...DigitalOcean is seeking a Senior Engineer 2 to play a key technical role...  ...we can offer the industry-leading performance for our inference services....  ...the inference engine and GPU kernel layers, ensuring our infrastructure...  ...and their software stacks (CUDA, ROCm, TensorRT, OpenAI... 
    Senior
    Local area
    Remote work
    Worldwide
    Flexible hours

    DigitalOcean

    San Francisco, CA
    1 day ago
  • $220k - $320k

    inference.net, a growing company in San Francisco, seeks an experienced engineer to optimize AI inference performance. The ideal candidate will have over 2 years of experience in ML systems and GPU programming. Key responsibilities include implementing optimization techniques... 
    Senior

    inference.net

    San Francisco, CA
    5 days ago
  •  ...Sciforium Gpu Kernel Engineer Sciforium is an AI infrastructure company developing next-generation...  ...about pushing the limits of performance on modern accelerators. In this role,...  ...optimize custom GPU kernels using C++, PTX, CUDA, ROCm, Triton, and/or JAX Pallas. Profile... 
    Suggested
    Flexible hours

    Sciforium

    San Francisco, CA
    2 days ago
  •  ...FriendliAI is seeking a GPU Kernel Engineer in San Francisco to design and optimize GPU kernels for AI inference. This role requires expertise in CUDA, C++, and performance-critical systems. You will work on cutting-edge GPU technology and contribute to a highly collaborative... 

    FriendliAI

    San Francisco, CA
    5 days ago
  •  ...and help build the platform engineers turn to to ship AI products. THE ROLE We’re seeking a GPU Kernel Engineer to join our team at...  ...your code directly impacts the performance of state‑of‑the‑art machine learning...  ...and optimize code using CUDA, PTX assembly, and... 
    Flexible hours

    Baseten

    San Francisco, CA
    4 days ago
  • $100k - $120k

     ...inference workloads grow, we need kernel‑level innovations to reduce...  ...cheaper and faster. Responsibilities Lead a team of kernel and system engineers focused on performance-critical code Design, implement...  ...for CPU (AVX/ARM NEON), GPU (CUDA/ROCm), and hardware accelerators... 

    Coda Robotics

    San Francisco, CA
    4 days ago
  •  ...A leading AI acceleration company in San Francisco is seeking a GPU Kernel Engineer to optimize performance for machine learning models. You will be responsible for designing high-performance...  .... Ideal candidates have 1–5 years of CUDA development experience and a strong... 

    Baseten

    San Francisco, CA
    5 days ago
  • Mercor is seeking a CUDA Engineering Expert to analyze and optimize GPU kernels for performance in a remote role. The ideal candidate should be fluent in core C++ features through C++17, with working knowledge of Python and Git, and experience in GPU programming models... 
    Remote job

    Mercor

    San Francisco, CA
    1 day ago
  • $285k - $315k

    SF Tensor is looking for a Founding GPU Kernel Engineer in San Francisco, specializing in GPU architecture...  ...expertise, proven capabilities in hand-optimizing performance-critical kernels, and strong programming skills in C++ and CUDA. This full-time position offers a... 
    Full time
    Relocation package

    SF Tensor

    San Francisco, CA
    5 days ago
  • $285k - $315k

     ...The Role We're looking for a Founding GPU Kernel Engineer who lives right at the boundary between...  ..., normalization, etc.) to set the performance ceilings Profile at the microarchitectural...  ...Solid systems programming in C++ and CUDA (or ROCm/HIP) Good understanding of... 
    Full time
    Work at office
    Relocation package

    SF Tensor

    San Francisco, CA
    5 days ago
  • $220k - $320k

    A tech startup specializing in AI inference seeks a skilled professional to optimize their inference stack. Candidates should have over 2 years of experience in ML systems, fluency in Python, and hands-on experience with LLM frameworks. The role offers competitive compensation...
    Senior
    Local area

    Inference

    San Francisco, CA
    4 days ago
  • $150k - $250k

    DataDirect Networks, Inc. is seeking a Lustre Engineer to contribute to Lustre architecture and implement new features. This role demands expertise in kernel-space C, performance tuning, and collaboration with engineers. Applicants should have 7+ years of experience in... 
    Senior

    DataDirect Networks, Inc.

    San Francisco, CA
    2 days ago
  • MakerMaker.AI in San Francisco is seeking a skilled Software Engineer to write and optimize GPU kernels. You will work on deep low-level tasks that directly impact the performance of machine learning models. The ideal candidate has over 4 years of experience with GPU kernels... 

    MakerMaker.AI

    San Francisco, CA
    2 days ago
  • $220k

    Perplexity is looking for an engineer to join their team in San Francisco. You will work on building and operating the inference engine, supporting new models, migrating GPU kernels, and developing a Rust-based serving runtime. The ideal candidate has 3+ years of experience... 
    Senior

    Perplexity

    San Francisco, CA
    4 days ago
  • $180k - $250k

     ...A leading technology company in San Francisco is seeking a skilled engineer to build custom compute environments, enhancing GPU performance for customer workloads. Candidates should have deep expertise in Linux virtualization and networking fundamentals, along with experience... 
    Senior
    Relocation package

    Fal

    San Francisco, CA
    5 days ago
  •  ...Francisco, is searching for a Sr. Systems Performance Software Engineer to own the architecture and...  ...systems and drive performance across CPU, GPU, and memory boundaries. The ideal candidate...  ...in robotics software, possesses kernel-level coding skills, and a solid understanding... 
    Senior

    Aurelius Systems

    San Francisco, CA
    4 days ago
  •  ...systems for discovering faster kernels Develop backend code...  ...Help shape Luminal’s engineering culture from the ground up...  .../RDNA assembly, or other GPU ISAs Familiarity with ML...  ...tasks will include writing CUDA kernels, conducting model performance reviews. #J-18808-Ljbffr... 
    Senior
    Full time

    Slope

    San Francisco, CA
    4 days ago
  • A technology infrastructure company in San Francisco is seeking an experienced engineer to manage and operate GPU clusters. The role requires over 5 years of hands-on experience, a deep understanding of hardware systems, and a passion for automating fleet operations. You... 
    Senior

    The San Francisco Compute Company

    San Francisco, CA
    4 days ago
  • Sciforium, an AI infrastructure company in San Francisco, is looking for a Senior HPC & GPU Infrastructure Engineer to manage the health and performance of our GPU compute cluster. You will be the primary custodian of a high-density accelerator environment, bridging hardware... 
    Senior
    Flexible hours

    Sciforium

    San Francisco, CA
    4 days ago
  • $180k - $250k

     ...A tech innovation company is looking for a hands-on engineer in San Francisco to manage a vast fleet of GPU servers. You will build systems for tracking server lifecycle, automate provisioning and health checks, and ensure OS-level security. The role requires 5+ years... 
    Senior

    Fal

    San Francisco, CA
    4 days ago
  •  ...researchers. We are searching for a CUDA Kernel Engineer who has hands-on experience...  ...from scratch . You will work on the GPU performance layer powering large-scale, high-throughput...  ...2026. Career growth & influence: Lead AI initiatives, optimize pipelines, and... 
    Remote job
    Local area
    Immediate start
    Relocation package

    PRAGMATIKE

    San Francisco, CA
    a month ago
  •  ...A leading AI technology company in San Francisco is looking for a Senior Software Engineer to build scalable infrastructure for large‑scale training and fine-tuning of foundation...  ...distributed training systems and optimize GPU utilization while collaborating with cross-functional... 
    Senior

    Baseten

    San Francisco, CA
    4 days ago
  •  ...company focused on AI is seeking a Site Reliability Engineer to ensure the reliability and performance of its GPU marketplace. This role involves maintaining...  ...reliability standards, design monitoring systems and lead incident response. Join a forward-thinking environment... 
    Senior

    Hyperbolic Labs

    San Francisco, CA
    4 days ago
  •  ...You’ll write and optimize the GPU kernels and supporting systems...  ...This is deep, low-level work (performance counters, memory bandwidth,...  ...actually use. We hire kernel engineers because the gap between "this...  ...Write and optimize GPU kernels (CUDA, ROCm, Triton, or similar)... 
    Shift work

    MakerMaker.AI

    San Francisco, CA
    4 days ago
  • $500 per month

     ...a small team of engineers, former US military...  ...and performance of our full software...  ...Every inefficient kernel is a target that...  ...accumulates across CPU, GPU, memory, and I/O...  .... This is a senior IC role with subteam lead scope. You\'ll...  ...Develop and optimize CUDA kernels for high... 
    Senior
    Permanent employment
    Work at office
    Monday to Friday
    Night shift
    Weekend work

    Aurelius Systems

    San Francisco, CA
    4 days ago
  • $280k

     ...group of committed researchers, engineers, policy experts, and business...  ...breakthrough innovations in GPU performance and systems engineering. As a...  ...-art techniques from custom kernel development to distributed...  ...GPU Kernel Development: CUDA, Triton, CUTLASS, Flash Attention... 
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    4 days ago
  •  ...GPU Performance Engineer We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking...  ...that achieve 5-10x speedups. From writing custom CUDA kernels to eliminating cold start latency, you'll ensure our infrastructure... 

    Genmo

    San Francisco, CA
    2 days ago
  • $300k

    GPU Optimisation Engineer — Real-Time Inference Want to push GPU performance to its limits — not in theory, but in production systems handling...  ...lost: memory hierarchy, kernel launch overhead, occupancy limits...  ...Writing and tuning custom CUDA / Triton kernels for performance... 
    Relocation
    Visa sponsorship
    Free visa

    Techire Ai

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior CUDA Kernel Engineer - GPU Performance Lead. Be the first to apply!