Pioneering GPU Kernel Engineer for ML Performance

$285k - $315k

SF Tensor

SF Tensor is looking for a Founding GPU Kernel Engineer in San Francisco, specializing in GPU architecture and kernel optimization for machine learning workloads. The ideal candidate has deep expertise, proven capabilities in hand-optimizing performance-critical kernels, and strong programming skills in C++ and CUDA. This full-time position offers a competitive salary of $285,000 - $315,000, plus bonus and equity, with relocation assistance available for the right candidate who values in-person collaboration. #J-18808-Ljbffr SF Tensor

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Pioneering GPU Kernel Engineer for ML Performance in San Francisco, CA vacancy

GPU Kernel Engineer — Fast ML Training
MakerMaker.AI in San Francisco is seeking a skilled Software Engineer to write and optimize GPU kernels. You will work on deep low-level tasks that directly impact the performance of machine learning models. The ideal candidate has over 4 years of experience with GPU kernels...
Performance
MakerMaker.AI
San Francisco, CA
1 day ago
GPU Kernel Engineer: Build Fast AI Inference at Scale
A leading AI acceleration company in San Francisco is seeking a GPU Kernel Engineer to optimize performance for machine learning models. You will be responsible for designing high-performance GPU kernels and using advanced techniques to boost computation efficiency. Ideal...
Performance
Baseten
San Francisco, CA
4 days ago
GPU Kernel Engineer
...Sciforium Gpu Kernel Engineer Sciforium is an AI infrastructure company developing next-generation... ...passionate about pushing the limits of performance on modern accelerators. In this role,... ...optimized ops into high-level ML frameworks used for large-scale training...
Performance
Flexible hours
Sciforium
San Francisco, CA
1 day ago
GPU Kernel Engineer
$100k - $120k
...inference workloads grow, we need kernel‑level innovations to reduce... ...team of kernel and system engineers focused on performance-critical code Design,... ...kernels for CPU (AVX/ARM NEON), GPU (CUDA/ROCm), and hardware... ...into distributed ML frameworks (e.g., PyTorch,...
Performance
Coda Robotics
San Francisco, CA
2 days ago
GPU Kernel Engineer for AI Inference & Performance
FriendliAI is seeking a GPU Kernel Engineer in San Francisco to design and optimize GPU kernels for AI inference. This role requires expertise in CUDA, C++, and performance-critical systems. You will work on cutting-edge GPU technology and contribute to a highly collaborative...
Performance
FriendliAI
San Francisco, CA
4 days ago
Founding GPU Kernel Engineer
$285k - $315k
About The Role We're looking for a Founding GPU Kernel Engineer who lives right at the boundary between... ...Do Write and hand-optimize GPU kernels for ML workloads (matmuls, attention, normalization, etc.) to set the performance ceilings Profile at the microarchitectural...
Performance
Full time
Work at office
Relocation package
SF Tensor
San Francisco, CA
4 days ago
Kernel Engineer- GPU
...and help build the platform engineers turn to to ship AI products. THE ROLE We’re seeking a GPU Kernel Engineer to join our team at... ...your code directly impacts the performance of state‑of‑the‑art machine learning... ...GPU kernels for key ML operations, including matrix...
Performance
Flexible hours
Baseten
San Francisco, CA
6 days ago
Senior Engineer 2: GPU Kernel and Performance
$167.2k - $209k
...world. DigitalOcean is seeking a Senior Engineer 2 to play a key technical role in our... ...ensure we can offer the industry-leading performance for our inference services. You will be... ...at the inference engine and GPU kernel layers, ensuring our infrastructure extracts...
Performance
Local area
Remote work
Worldwide
Flexible hours
DigitalOcean
San Francisco, CA
2 days ago
Senior CUDA Kernel Engineer - GPU Performance Lead
Pragmatike is seeking a CUDA Kernel Engineer for a remote position to develop and optimize NVIDIA CUDA kernels for high-performance AI systems. The ideal candidate will have a deep understanding of GPU architecture, performance optimization strategies, and hands-on experience...
Performance
Remote work
Relocation package
Pragmatike
San Francisco, CA
2 days ago
Lead GPU Kernel Engineer for High-Performance ML
$100k - $120k
...looking for an experienced engineer to join their founding... ...on low-level compute kernels to enhance robotic foundation... ...assembly), expertise in GPU optimizations, and familiarity with ML framework internals.... ...kernel optimizations, and pioneering high-velocity...
Performance
Coda Robotics
San Francisco, CA
1 day ago
Systems Research Engineer, GPU Programming
$160k - $230k
...Systems Research Engineer, GPU Programming San Francisco About the Role As... ...developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications. Working closely... ...model architecture to enhance the performance and efficiency of our AI systems....
Performance
Full time
Remote work
Together AI
San Francisco, CA
3 days ago
Edge Inference Engineer: Optimize On-Device AI Kernels
...this role, you will implement and optimize inference kernels on various hardware, ensuring efficiency and performance. Ideal candidates have over 5 years of systems... ...with strong C++ skills and a deep understanding of ML fundamentals. The position offers competitive salary...
Performance
Flexible hours
Liquid AI
San Francisco, CA
1 day ago
Performance Engineer, GPU
$280k
...committed researchers, engineers, policy experts, and... ...About the role: Pioneering the next generation of... ...breakthrough innovations in GPU performance and systems... ...techniques from custom kernel development to distributed... ...improvements in production ML systems and will be...
Performance
Work at office
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
3 days ago
KERNEL ENGINEER
...You’ll write and optimize the GPU kernels and supporting systems... ...This is deep, low-level work (performance counters, memory bandwidth, warp... ...actually use. We hire kernel engineers because the gap between "this... ...kernel libraries, compilers, or ML frameworks Experience with...
Performance
Shift work
MakerMaker.AI
San Francisco, CA
2 days ago
TPU Kernel Engineer — Lead Low-Latency ML Kernels (Hybrid)
$280k
Anthropic is looking for a TPU Kernel Engineer in San Francisco, California. In this role, you will identify and resolve performance issues across ML systems, particularly in research, training, and inference. You will design and optimize TPU kernels and provide critical...
Performance
Anthropic
San Francisco, CA
2 days ago
Software Engineer - GPU Kernel
About the job FriendliAI is looking for a GPU Kernel Engineer to design, build, and optimize the low-... ...Design, implement, and optimize high-performance GPU kernels for AI inference (e.g.,... ...contributions related to GPU performance or ML acceleration Research or conference...
Performance
Flexible hours
FriendliAI
San Francisco, CA
4 days ago
Remote CUDA Kernel Engineer
Mercor is seeking a CUDA Engineering Expert to analyze and optimize GPU kernels for performance in a remote role. The ideal candidate should be fluent in core C++ features through C++17, with working knowledge of Python and Git, and experience in GPU programming models...
Performance
Remote job
Mercor
San Francisco, CA
5 days ago
GPU Performance Engineer
...GPU Performance Engineer We are Genmo, a research lab dedicated to building open, state-of-the-art... ...10x speedups. From writing custom CUDA kernels to eliminating cold start latency, you'... ...and GPU utilization Collaborate with ML engineers to optimize model implementations...
Performance
Genmo
San Francisco, CA
1 day ago
GPU Optimization Engineer
$300k
GPU Optimisation Engineer — Real-Time Inference Want to push GPU performance to its limits — not in theory, but in production systems handling real-time speech and multimodal... ...performance is really lost: memory hierarchy, kernel launch overhead, occupancy limits, scheduling...
Performance
Relocation
Visa sponsorship
Free visa
Techire Ai
San Francisco, CA
1 day ago
Founding Compiler Engineer - AI/ML Model Optimizer
Slope is seeking a Founding Compiler Engineer in San Francisco, responsible for designing... ...AI models. You will write CUDA kernels and conduct performance reviews, contributing to Luminal's mission... ...should have experience with Rust and GPU ISAs. #J-18808-Ljbffr Slope
Performance
Full time
Slope
San Francisco, CA
3 days ago
Robotics GPU Inference Engineer — Hybrid (Relocation)
OpenAI is seeking a GPU Inference Engineer based in San Francisco, CA. In this high-impact role, you'll optimize inference performance and scalability for Robotics research, driving engineering... ...in model performance optimization, kernel-level systems, and low-level...
Performance
Work at office
Relocation
Relocation package
OpenAI
San Francisco, CA
2 days ago
Staff ML Inference Systems Engineer - Scalable GPU Infra (SF)
...Technical Staff focused on building and optimizing ML inference systems in San Francisco. The role... ...-to-end inference pipelines and enhancing performance under real-world workloads. Candidates should have strong software engineering skills, experience with ML inference systems...
Performance
Acceler8 Talent
San Francisco, CA
3 days ago
HPC/ GPU Hardware Engineer
...history. When people finance GPU clusters, the datacenters... ...clusters are some of the most performant computers on the planet. Even... ...shape culture, mentor junior engineers, and learn from our customers... ...administration experience, including kernel drivers, RDMA stack tuning,...
Performance
Long term contract
Contract work
Fixed term contract
Work at office
Local area
Visa sponsorship
Shift work
3 days per week
The San Francisco Compute Company
San Francisco, CA
2 days ago
ML Systems Engineer, Robotics
$248.8k - $311k
...Physical AI and developing ML pipelines for processing... ...As an ML Systems Engineer on the Physical AI team,... ...Maintain fault-tolerant, high-performance systems for serving... ...environments, including GPU-level algorithm optimizations (e.g., CUDA, kernel tuning). Programming:...
Performance
Full time
Scale AI
San Francisco, CA
20 days ago
Robotics Software Engineer
$500 per month
...re a small team of ~10 engineers, former US military operators... ..., computer vision, ML inference, controls,... ...for is real-time systems performance at the hardware... ...accumulates across CPU, GPU, memory, and I/O; how bandwidth... ...Develop and optimize kernels for high-throughput, low...
Performance
Permanent employment
Work at office
Monday to Friday
Flexible hours
Night shift
Weekend work
Aurelius Systems, Inc
San Francisco, CA
2 days ago
Hyperbolic Labs - Senior Site Reliability Engineer
...we offer an innovative GPU marketplace and AI inference... ...for all. As pioneers at the intersection of... ...seeking a Site Reliability Engineer to ensure Hyperbolic's... ...exceptional reliability, performance, and security. As an aggregator... ...GPU infrastructure, AI/ML platforms, or compute...
Performance
deCircle
San Francisco, CA
1 day ago
Inference Engineer
...Description Machine Learning Engineer, Inference Want to solve... ...inference, scheduler design, GPU utilisation, concurrency... ...GPU profiling and identifying kernel-level bottlenecks Optimising... ...already operates beyond the performance of most publicly available realtime...
Performance
Remote work
Flexible hours
techire ai
San Francisco, CA
4 days ago
TPU Kernel Engineer
$280k
...growing group of committed researchers, engineers, policy experts, and business... ...systems. About the Role As a TPU Kernel Engineer, you'll be responsible for identifying and addressing performance issues across many different ML systems, including research,...
Performance
Work at office
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
3 days ago
Senior Inference Performance Engineer - GPU & CUDA
$220k - $320k
inference.net, a growing company in San Francisco, seeks an experienced engineer to optimize AI inference performance. The ideal candidate will have over 2 years of experience in ML systems and GPU programming. Key responsibilities include implementing optimization techniques...
Performance
inference.net
San Francisco, CA
4 days ago
Kernel Engineer for High-Performance AI Systems
Acceler8 Talent is looking for a Kernel Engineer in San Francisco, California. The role involves designing and optimizing high-performance kernels to enhance throughput and latency for large-scale AI systems. Candidates should have low-level programming experience with...
Performance
Flexible hours
Acceler8 Talent
San Francisco, CA
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Pioneering GPU Kernel Engineer for ML Performance. Be the first to apply!