Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Pioneering GPU Kernel Engineer for ML Performance

$285k - $315k

SF Tensor

SF Tensor is looking for a Founding GPU Kernel Engineer in San Francisco, specializing in GPU architecture and kernel optimization for machine learning workloads. The ideal candidate has deep expertise, proven capabilities in hand-optimizing performance-critical kernels, and strong programming skills in C++ and CUDA. This full-time position offers a competitive salary of $285,000 - $315,000, plus bonus and equity, with relocation assistance available for the right candidate who values in-person collaboration. #J-18808-Ljbffr SF Tensor

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Pioneering GPU Kernel Engineer for ML Performance in San Francisco, CA vacancy
  • MakerMaker.AI in San Francisco is seeking a skilled Software Engineer to write and optimize GPU kernels. You will work on deep low-level tasks that directly impact the performance of machine learning models. The ideal candidate has over 4 years of experience with GPU kernels... 
    Performance

    MakerMaker.AI

    San Francisco, CA
    1 day ago
  • A leading AI acceleration company in San Francisco is seeking a GPU Kernel Engineer to optimize performance for machine learning models. You will be responsible for designing high-performance GPU kernels and using advanced techniques to boost computation efficiency. Ideal... 
    Performance

    Baseten

    San Francisco, CA
    4 days ago
  •  ...Sciforium Gpu Kernel Engineer Sciforium is an AI infrastructure company developing next-generation...  ...passionate about pushing the limits of performance on modern accelerators. In this role,...  ...optimized ops into high-level ML frameworks used for large-scale training... 
    Performance
    Flexible hours

    Sciforium

    San Francisco, CA
    1 day ago
  • $100k - $120k

     ...inference workloads grow, we need kernel‑level innovations to reduce...  ...team of kernel and system engineers focused on performance-critical code Design,...  ...kernels for CPU (AVX/ARM NEON), GPU (CUDA/ROCm), and hardware...  ...into distributed ML frameworks (e.g., PyTorch,... 
    Performance

    Coda Robotics

    San Francisco, CA
    2 days ago
  • FriendliAI is seeking a GPU Kernel Engineer in San Francisco to design and optimize GPU kernels for AI inference. This role requires expertise in CUDA, C++, and performance-critical systems. You will work on cutting-edge GPU technology and contribute to a highly collaborative... 
    Performance

    FriendliAI

    San Francisco, CA
    4 days ago
  • $285k - $315k

    About The Role We're looking for a Founding GPU Kernel Engineer who lives right at the boundary between...  ...Do Write and hand-optimize GPU kernels for ML workloads (matmuls, attention, normalization, etc.) to set the performance ceilings Profile at the microarchitectural... 
    Performance
    Full time
    Work at office
    Relocation package

    SF Tensor

    San Francisco, CA
    4 days ago
  •  ...and help build the platform engineers turn to to ship AI products. THE ROLE We’re seeking a GPU Kernel Engineer to join our team at...  ...your code directly impacts the performance of state‑of‑the‑art machine learning...  ...GPU kernels for key ML operations, including matrix... 
    Performance
    Flexible hours

    Baseten

    San Francisco, CA
    6 days ago
  • $167.2k - $209k

     ...world. DigitalOcean is seeking a Senior Engineer 2 to play a key technical role in our...  ...ensure we can offer the industry-leading performance for our inference services. You will be...  ...at the inference engine and GPU kernel layers, ensuring our infrastructure extracts... 
    Performance
    Local area
    Remote work
    Worldwide
    Flexible hours

    DigitalOcean

    San Francisco, CA
    2 days ago
  • Pragmatike is seeking a CUDA Kernel Engineer for a remote position to develop and optimize NVIDIA CUDA kernels for high-performance AI systems. The ideal candidate will have a deep understanding of GPU architecture, performance optimization strategies, and hands-on experience... 
    Performance
    Remote work
    Relocation package

    Pragmatike

    San Francisco, CA
    2 days ago
  • $100k - $120k

     ...looking for an experienced engineer to join their founding...  ...on low-level compute kernels to enhance robotic foundation...  ...assembly), expertise in GPU optimizations, and familiarity with ML framework internals....  ...kernel optimizations, and pioneering high-velocity... 
    Performance

    Coda Robotics

    San Francisco, CA
    1 day ago
  • $160k - $230k

     ...Systems Research Engineer, GPU Programming San Francisco About the Role As...  ...developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications. Working closely...  ...model architecture to enhance the performance and efficiency of our AI systems.... 
    Performance
    Full time
    Remote work

    Together AI

    San Francisco, CA
    3 days ago
  •  ...this role, you will implement and optimize inference kernels on various hardware, ensuring efficiency and performance. Ideal candidates have over 5 years of systems...  ...with strong C++ skills and a deep understanding of ML fundamentals. The position offers competitive salary... 
    Performance
    Flexible hours

    Liquid AI

    San Francisco, CA
    1 day ago
  • $280k

     ...committed researchers, engineers, policy experts, and...  ...About the role: Pioneering the next generation of...  ...breakthrough innovations in GPU performance and systems...  ...techniques from custom kernel development to distributed...  ...improvements in production ML systems and will be... 
    Performance
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    3 days ago
  •  ...You’ll write and optimize the GPU kernels and supporting systems...  ...This is deep, low-level work (performance counters, memory bandwidth, warp...  ...actually use. We hire kernel engineers because the gap between "this...  ...kernel libraries, compilers, or ML frameworks Experience with... 
    Performance
    Shift work

    MakerMaker.AI

    San Francisco, CA
    2 days ago
  • $280k

    Anthropic is looking for a TPU Kernel Engineer in San Francisco, California. In this role, you will identify and resolve performance issues across ML systems, particularly in research, training, and inference. You will design and optimize TPU kernels and provide critical... 
    Performance

    Anthropic

    San Francisco, CA
    2 days ago
  • About the job FriendliAI is looking for a GPU Kernel Engineer to design, build, and optimize the low-...  ...Design, implement, and optimize high-performance GPU kernels for AI inference (e.g.,...  ...contributions related to GPU performance or ML acceleration Research or conference... 
    Performance
    Flexible hours

    FriendliAI

    San Francisco, CA
    4 days ago
  • Mercor is seeking a CUDA Engineering Expert to analyze and optimize GPU kernels for performance in a remote role. The ideal candidate should be fluent in core C++ features through C++17, with working knowledge of Python and Git, and experience in GPU programming models... 
    Performance
    Remote job

    Mercor

    San Francisco, CA
    5 days ago
  •  ...GPU Performance Engineer We are Genmo, a research lab dedicated to building open, state-of-the-art...  ...10x speedups. From writing custom CUDA kernels to eliminating cold start latency, you'...  ...and GPU utilization Collaborate with ML engineers to optimize model implementations... 
    Performance

    Genmo

    San Francisco, CA
    1 day ago
  • $300k

    GPU Optimisation Engineer — Real-Time Inference Want to push GPU performance to its limits — not in theory, but in production systems handling real-time speech and multimodal...  ...performance is really lost: memory hierarchy, kernel launch overhead, occupancy limits, scheduling... 
    Performance
    Relocation
    Visa sponsorship
    Free visa

    Techire Ai

    San Francisco, CA
    1 day ago
  • Slope is seeking a Founding Compiler Engineer in San Francisco, responsible for designing...  ...AI models. You will write CUDA kernels and conduct performance reviews, contributing to Luminal's mission...  ...should have experience with Rust and GPU ISAs. #J-18808-Ljbffr Slope
    Performance
    Full time

    Slope

    San Francisco, CA
    3 days ago
  • OpenAI is seeking a GPU Inference Engineer based in San Francisco, CA. In this high-impact role, you'll optimize inference performance and scalability for Robotics research, driving engineering...  ...in model performance optimization, kernel-level systems, and low-level... 
    Performance
    Work at office
    Relocation
    Relocation package

    OpenAI

    San Francisco, CA
    2 days ago
  •  ...Technical Staff focused on building and optimizing ML inference systems in San Francisco. The role...  ...-to-end inference pipelines and enhancing performance under real-world workloads. Candidates should have strong software engineering skills, experience with ML inference systems... 
    Performance

    Acceler8 Talent

    San Francisco, CA
    3 days ago
  •  ...history. When people finance GPU clusters, the datacenters...  ...clusters are some of the most performant computers on the planet. Even...  ...shape culture, mentor junior engineers, and learn from our customers...  ...administration experience, including kernel drivers, RDMA stack tuning,... 
    Performance
    Long term contract
    Contract work
    Fixed term contract
    Work at office
    Local area
    Visa sponsorship
    Shift work
    3 days per week

    The San Francisco Compute Company

    San Francisco, CA
    2 days ago
  • $248.8k - $311k

     ...Physical AI and developing ML pipelines for processing...  ...As an ML Systems Engineer on the Physical AI team,...  ...Maintain fault-tolerant, high-performance systems for serving...  ...environments, including GPU-level algorithm optimizations (e.g., CUDA, kernel tuning). Programming:... 
    Performance
    Full time

    Scale AI

    San Francisco, CA
    20 days ago
  • $500 per month

     ...re a small team of ~10 engineers, former US military operators...  ..., computer vision, ML inference, controls,...  ...for is real-time systems performance at the hardware...  ...accumulates across CPU, GPU, memory, and I/O; how bandwidth...  ...Develop and optimize kernels for high-throughput, low... 
    Performance
    Permanent employment
    Work at office
    Monday to Friday
    Flexible hours
    Night shift
    Weekend work

    Aurelius Systems, Inc

    San Francisco, CA
    2 days ago
  •  ...we offer an innovative GPU marketplace and AI inference...  ...for all. As pioneers at the intersection of...  ...seeking a Site Reliability Engineer to ensure Hyperbolic's...  ...exceptional reliability, performance, and security. As an aggregator...  ...GPU infrastructure, AI/ML platforms, or compute... 
    Performance

    deCircle

    San Francisco, CA
    1 day ago
  •  ...Description Machine Learning Engineer, Inference Want to solve...  ...inference, scheduler design, GPU utilisation, concurrency...  ...GPU profiling and identifying kernel-level bottlenecks Optimising...  ...already operates beyond the performance of most publicly available realtime... 
    Performance
    Remote work
    Flexible hours

    techire ai

    San Francisco, CA
    4 days ago
  • $280k

     ...growing group of committed researchers, engineers, policy experts, and business...  ...systems. About the Role As a TPU Kernel Engineer, you'll be responsible for identifying and addressing performance issues across many different ML systems, including research,... 
    Performance
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    3 days ago
  • $220k - $320k

    inference.net, a growing company in San Francisco, seeks an experienced engineer to optimize AI inference performance. The ideal candidate will have over 2 years of experience in ML systems and GPU programming. Key responsibilities include implementing optimization techniques... 
    Performance

    inference.net

    San Francisco, CA
    4 days ago
  • Acceler8 Talent is looking for a Kernel Engineer in San Francisco, California. The role involves designing and optimizing high-performance kernels to enhance throughput and latency for large-scale AI systems. Candidates should have low-level programming experience with... 
    Performance
    Flexible hours

    Acceler8 Talent

    San Francisco, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Pioneering GPU Kernel Engineer for ML Performance. Be the first to apply!