Senior CUDA Kernel Engineer - GPU Performance Lead

Pragmatike

Pragmatike is seeking a CUDA Kernel Engineer for a remote position to develop and optimize NVIDIA CUDA kernels for high-performance AI systems. The ideal candidate will have a deep understanding of GPU architecture, performance optimization strategies, and hands-on experience with profiling tools. This role provides the opportunity to work on significant projects impacting Fortune 500 clients within a fast-growing AI startup recognized by GTM Capital. A competitive salary, sign-on bonus, and health benefits are offered.

Vacancy posted 2 days ago
Similar jobs that could be interesting for you
Based on the Senior CUDA Kernel Engineer - GPU Performance Lead in San Francisco, CA vacancy
  • Pragmatike is seeking a CUDA Kernel Engineer to work remotely for a rapidly growing AI startup. The ideal candidate will have extensive...  ...NVIDIA CUDA kernels, with a strong understanding of GPU architecture and performance optimization. Responsibilities include designing CUDA... 
    Suggested
    Remote job
    Relocation package

    Pragmatike

    San Francisco, CA
    2 days ago
  • $220k - $320k

    inference.net, a growing company in San Francisco, seeks an experienced engineer to optimize AI inference performance. The ideal candidate will have over 2 years of experience in ML systems and GPU programming. Key responsibilities include implementing optimization techniques... 
    Senior

    inference.net

    San Francisco, CA
    4 days ago
  •  ...GPU Kernel Engineer Sciforium is an AI infrastructure company developing next-generation multimodal...  ...about pushing the limits of performance on modern accelerators. In this role,...  ...optimize custom GPU kernels using C++, PTX, CUDA, ROCm, Triton, and/or JAX Pallas.... 
    Suggested
    Flexible hours

    Sciforium

    San Francisco, CA
    1 day ago
  • FriendliAI is seeking a GPU Kernel Engineer in San Francisco to design and optimize GPU kernels for AI inference. This role requires expertise in CUDA, C++, and performance-critical systems. You will work on cutting-edge GPU technology and contribute to a highly collaborative... 
    Suggested

    FriendliAI

    San Francisco, CA
    4 days ago
  • $285k - $315k

    SF Tensor is looking for a Founding GPU Kernel Engineer in San Francisco, specializing in GPU architecture...  ...expertise, proven capabilities in hand-optimizing performance-critical kernels, and strong programming skills in C++ and CUDA. This full-time position offers a... 
    Suggested
    Full time
    Relocation package

    SF Tensor

    San Francisco, CA
    4 days ago
  • $100k - $120k

     ...inference workloads grow, we need kernel‑level innovations to reduce...  ...cheaper and faster. Responsibilities Lead a team of kernel and system engineers focused on performance-critical code Design, implement...  ...for CPU (AVX/ARM NEON), GPU (CUDA/ROCm), and hardware accelerators... 

    Coda Robotics

    San Francisco, CA
    2 days ago
  •  ...date: ASAP Languages: English (required) We are searching for a CUDA Kernel Engineer who has hands‑on experience developing and optimizing NVIDIA CUDA kernels from scratch. You will work on the GPU performance layer powering large-scale, high-throughput AI systems used by... 
    Remote job
    Local area
    Immediate start
    Relocation package

    Pragmatike

    San Francisco, CA
    2 days ago
  • A leading AI acceleration company in San Francisco is seeking a GPU Kernel Engineer to optimize performance for machine learning models. You will be responsible for designing high-performance...  .... Ideal candidates have 1-5 years of CUDA development experience and a strong... 

    Baseten

    San Francisco, CA
    4 days ago
  • $285k - $315k

     ...The Role We're looking for a Founding GPU Kernel Engineer who lives right at the boundary between...  ..., normalization, etc.) to set the performance ceilings Profile at the microarchitectural...  ...Solid systems programming in C++ and CUDA (or ROCm/HIP) Good understanding of... 
    Full time
    Work at office
    Relocation package

    SF Tensor

    San Francisco, CA
    4 days ago
  •  ...and help build the platform engineers turn to to ship AI products. THE ROLE We’re seeking a GPU Kernel Engineer to join our team at...  ...your code directly impacts the performance of state‑of‑the‑art machine learning...  ...and optimize code using CUDA, PTX assembly, and... 
    Flexible hours

    Baseten

    San Francisco, CA
    1 day ago
  • $166k - $225k

    A leading AI research company in California seeks a Senior GenAI Research Engineer to enhance deep learning techniques. The role...  ...involves designing efficient GPU kernels, optimizing performance, and ensuring successful...  ...have a background in CUDA programming and... 
    Senior

    Databricks Inc.

    San Francisco, CA
    1 day ago
  • $220k - $320k

    A tech startup specializing in AI inference seeks a skilled professional to optimize their inference stack. Candidates should have over 2 years of experience in ML systems, fluency in Python, and hands-on experience with LLM frameworks. The role offers competitive compensation...
    Senior
    Local area

    Inference

    San Francisco, CA
    3 days ago
  • $220k

    Perplexity is looking for an engineer to join their team in San Francisco. You will work on building and operating the inference engine, supporting new models, migrating GPU kernels, and developing a Rust-based serving runtime. The ideal candidate has 3+ years of experience... 
    Senior

    Perplexity

    San Francisco, CA
    3 days ago
  • A technology infrastructure company in San Francisco is seeking an experienced engineer to manage and operate GPU clusters. The role requires over 5 years of hands-on experience, a deep understanding of hardware systems, and a passion for automating fleet operations. You... 
    Senior

    The San Francisco Compute Company

    San Francisco, CA
    3 days ago
  • $180k - $250k

    A leading technology company in San Francisco is seeking a skilled engineer to build custom compute environments, enhancing GPU performance for customer workloads. Candidates should have deep expertise in Linux virtualization and networking fundamentals, along with experience... 
    Senior
    Relocation package

    Fal

    San Francisco, CA
    4 days ago
  • A leading AI technology company in San Francisco is looking for a Senior Software Engineer to build scalable infrastructure for large‑scale training and fine-tuning of foundation...  ...distributed training systems and optimize GPU utilization while collaborating with cross-functional... 
    Senior

    Baseten

    San Francisco, CA
    4 days ago
  • $180k - $250k

    A tech innovation company is looking for a hands-on engineer in San Francisco to manage a vast fleet of GPU servers. You will build systems for tracking server lifecycle, automate provisioning and health checks, and ensure OS-level security. The role requires 5+ years of... 
    Senior

    Fal

    San Francisco, CA
    4 days ago
  •  ...company focused on AI is seeking a Site Reliability Engineer to ensure the reliability and performance of its GPU marketplace. This role involves maintaining...  ...reliability standards, design monitoring systems and lead incident response. Join a forward-thinking environment... 
    Senior

    Hyperbolic Labs

    San Francisco, CA
    4 days ago
  • $300k

    GPU Optimisation Engineer — Real-Time Inference Want to push GPU performance to its limits — not in theory, but in production systems handling...  ...lost: memory hierarchy, kernel launch overhead, occupancy limits...  ...Writing and tuning custom CUDA / Triton kernels for performance... 
    Relocation
    Visa sponsorship
    Free visa

    Techire Ai

    San Francisco, CA
    1 day ago
  • $285k - $315k

     ...future of AI and high-performance computing depends on rethinking...  .... We are building a Kernel Optimizer that...  ...partnering with researchers, engineers, and organizations who...  ...'re hiring a Founding GPU Compiler Engineer to build...  ...‑level optimization (CUDA, ROCm, or equivalent)... 
    Full time
    Work at office
    Relocation package

    SF Tensor

    San Francisco, CA
    3 days ago
  • $160k - $320k

     ...deliver excellence. We seek engineers/researchers with strong...  ...leverage your knowledge of high-performance systems to optimize GPU performance at the...  ...or LA offices Tech Stack CUDA/C++, GPGPU, Python, Linux....  ...Design and optimize GPU kernels and tensor libraries. Translate... 
    Full time
    Work at office

    Vast.ai

    San Francisco, CA
    4 days ago
  •  ...designing and optimizing distributed systems on GPU clusters, implementing efficient low-level code such as CUDA and Triton, and managing workloads to ensure high...  ...systems, a strong Python background, and mastery in kernel optimization. This position is essential for our... 

    Genesis AI

    San Francisco, CA
    4 days ago
  •  ...ABOUT THE ROLE As a senior Robot Perception Engineer on the Smart Robotics team...  ...model inference for GPU deployment, leveraging CUDA, TensorRT, and related...  ...C/C++ experience for performance-critical components...  ...) for hyperscalers and leading Original Equipment Manufacturers... 
    Senior
    Full time

    Bright Machines

    San Francisco, CA
    a month ago
  • $300 per month

     ...strategies, and be part of a high-performing team that believes in each...  ...platform — and Production Engineering sits at the heart of that mission...  ...and performance of Crusoe’s GPU cloud that powers next-...  ...debugging complex issues across kernel and user space ~ Previous experience... 
    Senior
    Temporary work

    Crusoe

    San Francisco, CA
    23 days ago
  • $179k - $218k

     ...strategies, and be part of a high-performing team that believes in each other,...  ...must be bridged. We are seeking a Senior Staff Data Center Operations Engineer, GPU Hardware Architecture to be the...  ...gradients, and error rates). You will lead the transition from reactive... 
    Senior
    Temporary work

    Crusoe

    San Francisco, CA
    6 days ago
  •  ...As a Computer Vision Engineer on the Applied Algorithms...  ...our map of the world. Performance Engineering: Optimize...  ...execution on GPU/CPU. Benchmarking &...  ...Technical Leadership: Lead technical design reviews...  ...). Experience with CUDA or shader programming... 
    Senior
    Work at office
    3 days per week

    Niantic Spatial, Inc

    San Francisco, CA
    6 hours ago
  •  ...Senior HPC & GPU Infrastructure Engineer Sciforium is an AI infrastructure company...  ..., reliability, and performance of our GPU compute...  ...ML software stack (CUDA/ROCm, PyTorch, JAX,...  ...consistent configuration, kernel tuning, and...  ...Deployment & Bring-Up: Lead deployment of new GPU... 
    Senior
    Flexible hours

    Sciforium

    San Francisco, CA
    4 days ago
  •  ...Compute is seeking a talented C++ developer in San Francisco to focus on core systems development with responsibilities including performance optimization, systems debugging, and research. The role requires top-tier C++ skills, a strong background in low-level systems,... 
    Senior
    Full time

    Thunder Compute

    San Francisco, CA
    2 days ago
  • A leading AI research company in San Francisco is seeking a software engineer for its Fleet High Performance Computing team. In this role, you'll ensure the reliability and uptime of the compute fleet, working with automation systems and monitoring tools. Ideal candidates... 
    Senior

    OpenAI

    San Francisco, CA
    3 days ago
  • $128.7k - $261.3k

     ...mobility. For the AI Kernels & Compilers team, that...  ...development, and performance engineering so that every cycle on...  ...builds high-performance GPU kernels and custom libraries...  ..., and iterate on CUDA-based kernels and custom...  ...the responsibility to lead the change that will... 
    Senior
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    San Francisco, CA
    6 days ago
