Member of Technical Staff - Kernels & GPU Performance

Gimlet Labs

About Us

Gimlet Labs is building the first heterogeneous neocloud for AI workloads.

As AI systems scale, the industry is hitting fundamental limits in power, capacity, and cost with today's homogeneous, vertically integrated infrastructure. Gimlet addresses this by decoupling AI workloads from the underlying hardware. Our platform intelligently partitions workloads into components and orchestrates each component to hardware that best fits its performance and efficiency needs. This approach enables heterogeneous systems across multi-vendor and multi-generation hardware, including the latest emerging accelerators. These systems unlock step-function improvements in performance and cost efficiency at scale.

On top of this foundation, Gimlet is building a production-grade neocloud for agentic workloads. Customers use Gimlet to deploy and manage their workloads through stable, production-ready APIs, without having to reason about hardware selection, placement, or low-level performance optimization.

Gimlet works with foundation labs, hyperscalers, and AI native companies to power real production workloads built to scale to gigawatt-class AI datacenters.
Mission

Gimlet Labs is seeking a Member of Technical Staff focused on kernels and GPU performance. In this role, you will work close to accelerators and execution hardware to extract maximum performance from AI workloads across diverse and rapidly evolving platforms. You will analyze low-level execution behavior, design and optimize kernels, and ensure performance is reliable across both established and emerging hardware.

This role is ideal for engineers who enjoy deep performance work, reasoning about hardware tradeoffs, and turning theoretical peak performance into real-world results.
Responsibilities

Design, implement, and optimize GPU and accelerator kernels for AI workloads
Analyze and tune performance across the GPU execution stack, including memory access patterns, synchronization, and instruction scheduling
Work with compilers and runtimes to ensure kernels integrate cleanly and perform well in end-to-end systems
Bring up and optimize execution on new or emerging accelerators
Profile, benchmark, and debug performance issues across kernels, runtimes, and hardware
Ensure performance optimizations are robust, correct, and production-ready at scale

Qualifications

Strong software engineering fundamentals
Experience working on performance-critical systems close to hardware
Comfort reasoning about low-level execution behavior, memory hierarchies, and performance tradeoffs

Preferred Qualifications

Experience with CUDA, Triton, CUTLASS, or other accelerator programming models
Deep understanding of GPU execution models (warps/wavefronts, blocks, grids)
Experience optimizing memory access patterns (coalescing, shared memory, cache behavior)
Familiarity with occupancy, latency hiding, and instruction-level parallelism
Experience using profiling and performance analysis tools
Familiarity with multi-GPU or distributed execution is a plus

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the Member of Technical Staff - Kernels & GPU Performance in San Francisco, CA vacancy

Member of Technical Staff - Kernel Engineer
...challenges and the wins. What You'll Do Bring deep kernel expertise to our AI agents that optimize high-performance, mission-critical computing systems. You'll shape... ...or optimizing kernels for ML or other GPU-heavy workloads Fluency in Python and C/C++, and...
Performance
Work at office
Flexible hours
Asari AI
San Francisco, CA
6 days ago
Member of Technical Staff - Image / Video Generation
...Member Of Technical Staff - Image / Video Generation Freiburg (Germany) About... ...models don't fit on one GPU and training decisions impact... ...and backward Triton kernels and ensuring their correctness... ...trace viewers Know the performance characteristics of different...
Performance
Remote work
Worldwide
2 days per week
Black Forest Labs
San Francisco, CA
5 days ago
Member of Technical Staff - Inference
$180k
...Member Of Technical Staff - Inference Palo Alto, CA About Xai Xai's mission is to create... ...Role We are building the high-performance inference platform that serves Grok... ...scaling) to deep low-level optimizations (GPU kernels, quantization, speculative decoding,...
Performance
Temporary work
Xai
San Francisco, CA
9 hours ago
Member of Technical Staff, Model Efficiency
...Member of Technical Staff, Model Efficiency Who are we? Our mission is to scale intelligence... ...inference stack to improve core performance metrics by diving deep into model... ...performance techniques, including GPU/CUDA optimizations, kernel-level improvements, and model execution...
Performance
Full time
Work at office
Remote work
Flexible hours
Cohere
San Francisco, CA
4 days ago
Member of Technical Staff - ML Systems & Inference
...component to hardware that best fits its performance and efficiency needs. This approach... ...Gimlet Labs is seeking a Member of Technical Staff focused on ML systems and inference.... ...boundaries Work closely with compilers, kernels, networking, and distributed systems...
Performance
Gimlet Labs
San Francisco, CA
2 days ago
Member of Technical Staff, Pre-training Systems
$225k
...large-scale model training across massive GPU clusters. You will work at the boundary... ...systems, ensuring that training runs are performant, reliable, and reproducible under extreme... ...training throughput Collaborate with Kernels and Research to align model architecture...
Performance
Relocation
Visa sponsorship
Magic Inc
San Francisco, CA
8 hours ago
Member of Technical Staff - Data & ML Infra Engineer
...The Role We're looking for a Member of Technical Staff - Data & ML Infrastructure Engineer... ...regressions. You'll work across GPU kernels, inference systems, distributed training... ...Production AI deployment Performance engineering This role emerged directly...
Performance
Moonlake AI
San Francisco, CA
4 days ago
Member of Technical Staff, AI Platform & Architecture (Infrastructure)
$256k - $276k
...Postman. The Opportunity As a Member of Technical Staff on AI Infrastructure, you will build and... ...and research teams to ensure performance, scalability, and reliability of critical... ...services Optimize performance for GPU/xPU accelerators and cloud environments...
Performance
Work at office
Flexible hours
3 days per week
Postman
San Francisco, CA
2 days ago
Member of Technical Staff - Training Platform
$150k - $300k
...fine-tuning runs on managed GPU clusters with a single API call... ...runs the jobs. Core Technical Responsibilities Hosted... ...fundamentals: networking, namespaces, performance tuning Programming &... ...and encourage team members to contribute to the broader...
Performance
Work at office
Local area
Remote work
Visa sponsorship
Relocation package
Flexible hours
Prime Intellect
San Francisco, CA
7 hours ago
Member of Technical Staff - Infrastructure Engineer
$180k - $300k
...Member Of Technical Staff - Infrastructure Engineer Freiburg (Germany), San Francisco (USA) About... ...optimizing components to extract peak performance from the system (both on application,... ...Python, Bash, Go Kubernetes Nvidia GPU drivers, and operators OTel,...
Performance
Work at office
Remote work
Worldwide
Relocation
2 days per week
Black Forest Labs
San Francisco, CA
7 hours ago
Member of Technical Staff (AI Inference Engineer)
...scheduling and KV-cache management to support in API Gateway. GPU Kernels Migration to CuTe DSL. Port our in-house CUDA kernels to... ...Python pains and keep up with rapidly growing traffic. Performance Optimization. Profile and fix bottlenecks from network...
Performance
Perplexity AI
San Francisco, CA
9 hours ago
Member of Technical Staff - Distributed Systems
...orchestrates each component to hardware that best fits its performance and efficiency needs. This approach enables heterogeneous... ...AI datacenters. Mission Gimlet Labs is seeking a Member of Technical Staff focused on distributed systems. In this role, you will build...
Performance
Gimlet Labs
San Francisco, CA
2 days ago
Member of Technical Staff - ML Infra
...techniques and numerical precision trade-offs across different model scales Analyze, profile and debug low-level GPU operations to optimize performance Stay up-to-date on research to bring new ideas to work What We're Looking For Strong grasp of state-...
Performance
Causal Labs
San Francisco, CA
9 hours ago
Member of Technical Staff - AI Research (Intern)
...orchestrates each component to hardware that best fits its performance and efficiency needs. This approach enables heterogeneous systems... ...AI datacenters. Mission Gimlet Labs is seeking an Member of Staff focused on AI Research (Intern). As an AI Researcher (Intern...
Performance
Internship
Gimlet Labs
San Francisco, CA
4 days ago
Member of Technical Staff, Supercomputing Platform & Infrastructure
$200k
...Infrastructure team, you will design, build, and operate the large-scale GPU infrastructure that powers Magic's model training and inference... ...Experience operating production GPU infrastructure or high-performance distributed systems Strong understanding of networking and...
Performance
Relocation
Visa sponsorship
Magic Inc
San Francisco, CA
6 hours ago
Member of Technical Staff - RL Infrastructure
$180k
...Member Of Technical Staff - RL Infrastructure Palo Alto, CA xAI's mission is to create AI systems that can accurately understand the universe... ...an efficient and robust environment for the agent to perform actions in? # Evaluations and observability are a core part...
Performance
Temporary work
Xai
San Francisco, CA
8 hours ago
Member of Technical Staff, Infrastructure
...volume data platform for AI applications. We are looking for team members who love building enabling systems that empower our engineers... ...resources and Kubernetes clusters for cost-effectiveness and performance. Enable external customer deployment success through...
Performance
Work at office
LlamaIndex
San Francisco, CA
4 days ago
Member of ML Technical Staff
$200k - $350k
...Member of ML Technical Staff Title of Role: Member of ML Technical Staff Location: San Francisco, onsite Company Stage of Funding... ...continuous improvement of engineering practices. Analyze model performance and implement improvements based on quantitative metrics....
Performance
Work at office
Visa sponsorship
Recruiting from Scratch
San Francisco, CA
1 day ago
Member of Technical Staff
...Member Of Technical Staff @ Lotus AI Lotus AI is a groundbreaking primary care app that integrates your medical records, AI, and real doctors... ...compliance-grade visibility. Instrument model performance tracking in production — monitoring latency, token usage,...
Performance
Lotus Health
San Francisco, CA
5 hours ago
Member of Technical Staff, Training Infra
...Member Of Technical Staff, Training Infra Bay Area Ai Systems Inception creates the world's fastest, most efficient AI models. Our Mercury... ...scale across thousands of GPUs and nodes. Develop high-performance optimizations to maximize throughput and efficiency. Develop...
Performance
Immediate start
Flexible hours
Inception LLC
San Francisco, CA
6 hours ago
Member of Technical Staff - Simulation, Frontier AI Robotics
$150k
...We are seeking a Member of Technical Staff Simulation Engineer to join our AI robotics research team developing foundation models for robotics... ...Experience with transformer model optimization - Background in performance profiling and optimization - Experience working directly...
Performance
Internship
Local area
Amazon
San Francisco, CA
5 days ago
Member of Technical Staff - Research Software Engineer
...training loops and distributed GPU training to massive-scale... ...training stacks Triton / custom kernels Data Infrastructure... ...Distributed systems High-performance computing You care deeply... ...and enjoy solving hard technical problems. What We Offer:...
Performance
Relocation package
Reflection AI
San Francisco, CA
7 hours ago
Member of Technical Staff - Science, Frontier AI & Robotics (FAR)
$150k
...Amazon's Frontier AI & Robotics (FAR) team is seeking a Member of Technical Staff to drive foundational research and build intelligent robotic... ...collaborating with platform teams to ensure your models and algorithms perform robustly in dynamic real-world environments. You'll have...
Performance
Local area
Amazon
San Francisco, CA
3 days ago
Member of the Technical Staff Applied AI, Fullstack
...Member Of Technical Staff – Fullstack Stuut is transforming accounts receivable for B2B companies—making collections smarter and faster for... ...Frontend teams, as well as customers to deliver seamless, high-performance experiences across the entire stack. This is a high-...
Performance
Full time
Flexible hours
Stuut
San Francisco, CA
4 days ago
Member of Technical Staff
$125k - $200k
...agent system from the ground up Making critical technical decisions that will shape our product's future... ...transformations, and infrastructure Understanding of GPU infrastructure and model optimization for performance Our Values We play to dominate -...
Performance
Full time
Temporary work
Currently hiring
Immediate start
Flexible hours
burnt
San Francisco, CA
7 hours ago
Member of Technical Staff
$200k
...Join to apply for the Member of Technical Staff role at Listen Labs . TL;DR: We are seeing strong market demand and an aggressive 6‑month product... ...enterprise wins at Google, Microsoft, Nestlé, and P&G. Performance: 83% win rate on deals with no losses to competitors. Market...
Performance
Flexible hours
Listen Labs
San Francisco, CA
4 days ago
Member of Technical Staff, JAX & Compiler
$180k
...fulfill the need of our high-performance large-scale LLM... ...scale LLMs with JAX (on GPU or TPU) and applying various... ...complex use cases. Kernel Compiler Experience:... ...interview”) during which a member of our team will ask... ...which consists of four technical interviews: # Coding...
Performance
Temporary work
Relocation
xAI
San Francisco, CA
more than 2 months ago
Member of Technical Staff - RL Infrastructure
$300k
...inference and/or RL training. Experience with GPU clusters, distributed training, model... ...observability, testing, debugging, and performance optimization. Ability to work closely... ..., platforms, or services used by other technical users. Strong judgment around technical...
Performance
Full time
Work at office
Local area
Vmax
San Francisco, CA
4 days ago
Member of Technical Staff, Mechanical Engineer
$1,000 per month
...Member of Technical Staff, Mechanical Engineer Title of Role: Member of Technical Staff, Mechanical Engineer Location: San Francisco... ...element analysis (FEA) to validate design integrity and performance. Collaborate with cross-functional teams to ensure seamless...
Performance
Work at office
Recruiting from Scratch
San Francisco, CA
7 hours ago
Backend Engineer (Member of Technical Staff)
$100k - $300k
...ambitious Backend Senior and Staff Engineers who are excited to... ...support and uplevel future team members Participate in, provide... ...as a hands-on engineer and technical leader, overseeing and contributing... ..., to support high-performance, enterprise-level applications...
Performance
Cogent Security, Inc.
San Francisco, CA
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff - Kernels & GPU Performance. Be the first to apply!