
TPU Kernel Engineer Lead Low-Latency ML Kernels (Hybrid)

$280k

Anthropic

Anthropic is looking for a TPU Kernel Engineer in San Francisco, California. In this role, you will identify and resolve performance issues across ML systems, particularly in research, training, and inference. You will design and optimize TPU kernels and provide critical feedback to researchers. The position offers a competitive salary between $280,000 and $850,000. This role requires a Bachelor’s degree and relevant experience in ML systems.

Vacancy posted 4 days ago
Similar jobs that could be interesting for you
Based on the TPU Kernel Engineer Lead Low-Latency ML Kernels (Hybrid) in San Francisco, CA vacancy
  • $280k

     ...About The Role As a TPU Kernel Engineer, you'll be responsible...  ...across many different ML systems, including research...  ...systems problems and low-level optimization....  ...Projects Implement low-latency, high-throughput sampling...  ...Location-based Hybrid Policy Currently, we expect... 
    Suggested
    Visa sponsorship

    Anthropic

    San Francisco, CA
    5 days ago
  • $315k

     ...committed researchers, engineers, policy experts,...  ...the Role As a TPU Kernel Engineer, you'll be...  ...across many different ML systems, including...  ...problems and low-level optimization...  ...projects: Implement low-latency, high-throughput...  .... Location-based hybrid policy: Currently,... 
    Suggested
    Contract work
    For contractors
    For subcontractor
    Work at office
    Relocation
    Visa sponsorship
    Work visa
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    5 days ago
  • $90 - $125 per hour

    A cutting-edge AI company is looking for Low-Level Engineers to design RL environments that optimize kernel development and systems programming. Candidates should have strong Python skills and a solid understanding of LLMs. This remote contractor role offers an hourly... 
    Suggested
    Remote job
    Hourly pay
    For contractors

    Open Data Science

    San Francisco, CA
    2 days ago
  • MakerMaker.AI in San Francisco is seeking a skilled Software Engineer to write and optimize GPU kernels. You will work on deep low-level tasks that directly impact the performance of machine learning models. The ideal candidate has over 4 years of experience with GPU kernels... 
    Suggested

    MakerMaker.AI

    San Francisco, CA
    2 days ago
  •  ...Wilder Wealthy & Wise in San Francisco is seeking candidates to develop and optimize GPU-accelerated kernels for machine learning and AI applications. You will work closely with the modeling and algorithm team to enhance the performance of AI systems. The ideal candidate... 
    Suggested

    Wilder Wealthy & Wise

    San Francisco, CA
    1 day ago
  •  ...ll write and optimize the GPU kernels and supporting systems software...  ...workloads fast. This is deep, low-level work (performance...  ...actually use. We hire kernel engineers because the gap between "this...  ...kernel libraries, compilers, or ML frameworks Experience with multiple... 
    Shift work

    MakerMaker.AI

    San Francisco, CA
    4 days ago
  •  ...Acceler8 Talent is looking for a Kernel Engineer in San Francisco, California. The role involves designing and...  ...-performance kernels to enhance throughput and latency for large-scale AI systems. Candidates should have low-level programming experience with AI hardware accelerators... 
    Flexible hours

    Acceler8 Talent

    San Francisco, CA
    5 days ago
  • $225k

     ...Magic is hiring a Kernel Engineer in San Francisco to design and maintain high-performance kernels that optimize throughput and latency during AI training and inference. The ideal candidate has low-level programming expertise, particularly for AI accelerators like NVIDIA... 

    Magic Inc

    San Francisco, CA
    4 days ago
  • $167.2k - $209k

     ...DigitalOcean is seeking a Senior Engineer 2 to play a key...  ...can offer the industry-leading performance for our...  ...throughput and minimize latency for the world’s most advanced...  ...engine and GPU kernel layers, ensuring our infrastructure...  ...related to low-level GPU programming -... 
    Local area
    Remote work
    Worldwide
    Flexible hours

    DigitalOcean

    San Francisco, CA
    1 day ago
  •  ...San Francisco Tensor Company is seeking a Founding GPU Kernel Engineer to enhance GPU performance for AI applications. You will optimize and write kernels while collaborating with compiler teams to improve efficiencies across architectures. The ideal candidate has deep... 
    Work at office
    Relocation package

    San Francisco Tensor Company

    San Francisco, CA
    2 days ago
  • $100k - $120k

     ...workloads grow, we need kernel‑level innovations to reduce latency, memory usage, and...  ...architect and optimize low‑level compute kernels,...  ...faster. Responsibilities Lead a team of kernel and system engineers focused on performance...  ...into distributed ML frameworks (e.g., PyTorch... 

    Coda Robotics

    San Francisco, CA
    4 days ago
  •  ...Francisco. In this role, you will implement and optimize inference kernels on various hardware, ensuring efficiency and performance. Ideal...  ...experience with strong C++ skills and a deep understanding of ML fundamentals. The position offers competitive salary, equity, and... 
    Flexible hours

    Liquid AI

    San Francisco, CA
    2 days ago
  • $285k - $315k

    SF Tensor is looking for a Founding GPU Kernel Engineer in San Francisco, specializing in GPU architecture and kernel optimization for machine learning workloads. The ideal candidate has deep expertise, proven capabilities in hand-optimizing performance-critical kernels... 
    Full time
    Relocation package

    SF Tensor

    San Francisco, CA
    5 days ago
  • $100k

     .... About the role:  As a Kernel Engineer, you will design, implement and...  ...to optimize throughput and latency during training and inference...  ...looking for: Experience with low-level programming of AI...  ...Deep understanding of GPU, TPU, and/or CPU architecture Magic... 
    Remote job
    Relocation
    Visa sponsorship

    Magic

    San Francisco, CA
    more than 2 months ago
  •  ...A leading AI acceleration company in San Francisco is seeking a GPU Kernel Engineer to optimize performance for machine learning models. You will be responsible for designing high-performance GPU kernels and using advanced techniques to boost computation efficiency. Ideal... 

    Baseten

    San Francisco, CA
    4 days ago
  •  ...us and help build the platform engineers turn to to ship AI products. THE ROLE We’re seeking a GPU Kernel Engineer to join our team at the...  ...for engineers passionate about low‑level optimization and high‑...  ...performance GPU kernels for key ML operations, including matrix multiplications... 
    Flexible hours

    Baseten

    San Francisco, CA
    4 days ago
  • $285k - $315k

     ...universally portable. We are building a Kernel Optimizer that automatically...  ...partnering with researchers, engineers, and organizations who share...  ...hand‑optimize GPU kernels for ML workloads (matmuls, attention,...  ..., CUTLASS) Strong skills with low‑level profiling tools: Nsight... 
    Full time
    Work at office
    Relocation package

    San Francisco Tensor Company

    San Francisco, CA
    4 days ago
  •  ...Sciforium GPU Kernel Engineer Sciforium is an AI infrastructure company developing next-generation multimodal...  ...across the hardware–software stack, from low-level kernel development to integrating optimized ops into high-level ML frameworks used for large-scale training and... 
    Flexible hours

    Sciforium

    San Francisco, CA
    2 days ago
  •  ...MakerMaker, based in San Francisco, is seeking a highly skilled kernel engineer to write and optimize GPU kernels that enhance performance for training and inference. This role involves deep, low-level work to close the significant performance gap that exists in modern... 

    MakerMaker

    San Francisco, CA
    4 days ago
  • A leading AI research company in San Francisco is seeking a Systems Engineer focused on kernel optimization and AI-assisted workflows. You'll develop tooling to improve performance...  ...performance optimization, particularly in low-level software. Join us in shaping the future... 

    OpenAI

    San Francisco, CA
    4 days ago
  • $342k

    A leading AI company in San Francisco seeks an Engineer for the hardware optimization team. This role...  ...vendors to develop essential kernels. Candidates need strong...  ...a focus on optimizing ML platform code. The...  ...and $555K, along with a hybrid work model and extensive... 

    Slope

    San Francisco, CA
    3 days ago
  • A leading streaming platform in San Francisco is seeking a Software Engineer to design and build scalable distributed systems. The ideal candidate will have over 8 years...  ...AWS. Responsibilities include building low-latency online microservices and collaborating with machine... 
    Flexible hours

    Tubi Tv

    San Francisco, CA
    4 days ago
  •  ...FriendliAI is seeking a GPU Kernel Engineer in San Francisco to design and optimize GPU kernels for AI inference. This role requires expertise in CUDA, C++, and performance-critical systems. You will work on cutting-edge GPU technology and contribute to a highly collaborative... 

    FriendliAI

    San Francisco, CA
    5 days ago
  •  ...Obsidian is seeking GPU kernel optimization experts for a contract-based project with a leading AI lab. Candidates must have strong C++ skills and practical GPU programming experience. The role involves analyzing and optimizing GPU kernels, using profiling metrics to improve... 
    Contract work

    Obsidian

    San Francisco, CA
    1 day ago
  • A leading streaming service is seeking a Staff Software Engineer to enhance ML infrastructure. The role involves designing scalable systems, mentoring engineers, and collaborating...  ...a plus. This position is based in San Francisco and allows for hybrid work.

    Tubi Tv

    San Francisco, CA
    3 days ago
  • $128.7k - $261.3k

     ...San Francisco is seeking an experienced developer for their AI Kernels & Compilers team to innovate in autonomous driving technology. The...  ...focuses on designing high-performance GPU kernels, optimizing ML performance, and collaborating cross-functionally. The ideal candidate... 

    Israelvcforum

    San Francisco, CA
    5 days ago
  • Mercor is seeking a CUDA Engineering Expert to analyze and optimize GPU kernels for performance in a remote role. The ideal candidate should be fluent in core C++ features through C++17, with working knowledge of Python and Git, and experience in GPU programming models... 
    Remote job

    Mercor

    San Francisco, CA
    6 days ago
  •  ...unicorn founders and senior engineers with deep expertise in 3D, generative...  ...for a Founding Engineer, ML Inference with deep expertise...  ...competitive edge in ultra-low-latency, high-throughput environments...  ...using torch.compile, custom CUDA kernels, and specialized inference... 
    Relocation
    Visa sponsorship
    Relocation package

    Reactor

    San Francisco, CA
    6 days ago
  • Mercor is looking for a CUDA Engineering Expert to analyze and optimize GPU kernels, ensuring performance and efficiency in a remote capacity. Candidates should be fluent in C++ through C++17 and have a strong understanding of GPU profiling metrics. The ideal candidate... 
    Remote job

    Mercor

    San Francisco, CA
    4 days ago
  •  ...OpenAI is seeking a GPU Inference Engineer based in San Francisco, CA. In this high-impact role,...  ...expertise in model performance optimization, kernel-level systems, and low-level performance tuning. The position offers a hybrid work model of 3 days in the office per week... 
    Work at office
    Relocation
    Relocation package

    OpenAI

    San Francisco, CA
    1 day ago
