Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Engineer: GPU Kernels & AI Performance

Gimlet Labs

A cutting-edge technology company in San Francisco is seeking a Member of Technical Staff focused on kernels and GPU performance. This role involves optimizing GPU and accelerator kernels for AI workloads by analyzing performance across various hardware. Ideal candidates have strong software engineering foundations and experience with performance-critical systems. Familiarity with tools like CUDA and performance profiling is preferred. This position offers a dynamic environment focused on real-world performance optimizations. #J-18808-Ljbffr Gimlet Labs

Vacancy posted 22 hours ago
Similar jobs that could be interesting for youBased on the Staff Engineer: GPU Kernels & AI Performance in San Francisco, CA vacancy
  • $150k - $350k

     ...is seeking a Member of Technical Staff focused on optimizing GPU and accelerator kernels for AI workloads. This role involves analyzing and tuning performance across diverse execution platforms...  ...a strong foundation in software engineering and experience with performance-critical... 
    Performance

    Gimlet Labs, Inc.

    San Francisco, CA
    22 hours ago
  •  ...Build Ai To Co-Invent The Future Our mission...  ...'ll Do Bring deep kernel expertise to our AI agents that optimize high-performance, mission-critical computing...  ...intersection of kernel engineering and applied AI to scale...  ...for ML or other GPU-heavy workloads Fluency... 
    Performance
    Work at office
    Flexible hours

    Asari AI

    San Francisco, CA
    4 days ago
  • A leading data and AI company in San Francisco seeks a Staff Software Engineer to lead kernel-level performance engineering for GenAI workloads. The role involves designing and optimizing high-performance GPU kernels, mentoring engineers, and driving performance roadmaps... 
    Performance

    Databricks

    San Francisco, CA
    1 day ago
  • B Capital is seeking a skilled engineer for GPU infrastructure in San Francisco. This role involves designing and operating high-performance systems for model inference, synthetic data generation...  ...a passion for working in cutting-edge AI. Benefits include top-tier compensation... 
    Performance

    B Capital

    San Francisco, CA
    3 days ago
  • A tech startup focusing on AI optimization is seeking engineers in San Francisco to enhance their GPU kernel optimization framework. Candidates should possess strong technical intuition, be comfortable working across the stack, and be able to deliver production code swiftly... 
    Suggested

    Wafer

    San Francisco, CA
    3 days ago
  • MakerMaker, based in San Francisco, is seeking a highly skilled kernel engineer to write and optimize GPU kernels that enhance performance for training and inference. This role involves deep, low-level work to close the significant performance gap that exists in modern... 
    Performance

    MakerMaker

    San Francisco, CA
    22 hours ago
  • FriendliAI is seeking a GPU Kernel Engineer in San Francisco to design and optimize GPU kernels for AI inference. This role requires expertise in CUDA, C++, and performance-critical systems. You will work on cutting-edge GPU technology and contribute to a highly collaborative... 
    Performance

    FriendliAI

    San Francisco, CA
    2 days ago
  • A leading AI acceleration company in San Francisco is seeking a GPU Kernel Engineer to optimize performance for machine learning models. You will be responsible for designing high-performance GPU kernels and using advanced techniques to boost computation efficiency. Ideal... 
    Performance

    Baseten

    San Francisco, CA
    2 days ago
  •  ...building the next generation of AI infrastructure: large-...  ...-function improvements in performance and efficiency. Customers...  ...a Member of Technical Staff focused on kernels and GPU performance. In this role,...  ...This role is ideal for engineers who enjoy deep performance... 
    Performance

    Gimlet Labs

    San Francisco, CA
    22 hours ago
  • $141k - $249k

     ...Description Waabi, founded by AI visionary Raquel...  ...autonomy and algorithm engineers to scale safe self-...  ...and memory to pinpoint performance bottlenecks. - Identify...  ...limited to efficient CUDA kernels for training,...  ...Skilled in profiling CPU and GPU code using tools such as... 
    Performance
    Work at office
    Work from home
    Flexible hours

    Waabi

    San Francisco, CA
    2 days ago
  • Overview About Liquid AI Spun out of MIT CSAIL, we build...  ...directly impacts model performance on real devices. While San...  ...Implement and optimize inference kernels for CPU, NPU, and GPU architectures across...  ...Experience Embedded software engineering experience or work on... 
    Performance

    Liquid AI

    San Francisco, CA
    4 days ago
  • Genesis AI in San Francisco is looking for an experienced professional to optimize and build distributed training systems using PyTorch...  ...has over 8 years of experience in distributed systems, high-performance computing, and extensive expertise in Python, CUDA, and cuDNN.... 
    Performance

    Genesis AI

    San Francisco, CA
    2 days ago
  •  ...looking for a Member of Technical Staff focused on building and...  ...inference pipelines and enhancing performance under real-world workloads....  ...should have strong software engineering skills, experience with ML...  ...to work at the forefront of AI infrastructure technology. #J... 
    Performance

    Acceler8 Talent

    San Francisco, CA
    1 day ago
  •  ...Inference Engine Engineer We build and run the inference engine behind every Perplexity...  ...to support in API Gateway. GPU Kernels Migration to CuTe DSL. Port our in-house...  ...up with rapidly growing traffic. Performance Optimization. Profile and fix bottlenecks... 
    Performance

    Perplexity AI

    San Francisco, CA
    4 days ago
  • About the job FriendliAI is looking for a GPU Kernel Engineer to design, build, and optimize the low-...  ...power our large-scale, GPU-accelerated AI inference platform. You will be...  ...Design, implement, and optimize high-performance GPU kernels for AI inference (e.g., GEMM... 
    Performance
    Flexible hours

    FriendliAI

    San Francisco, CA
    2 days ago
  • $230k - $360k

     ...Infrastructure and Reliability Engineer (Systems & Scale) A...  ...rapidly scaling 10k+ GPU fleets, pushing...  ...close to the metal: Kernels Containers Schedulers...  ...utilization and performance where small gains materially...  ...a new generation of AI infrastructure. The decisions... 
    Performance
    Immediate start

    Luma AI

    San Francisco, CA
    2 days ago
  • $188k - $275k

     ...Staff Security Engineer, PKI & Secrets Livingston, NJ / New York, NY / Sunnyvale...  ...is The Essential Cloud for AI™. Built for pioneers by...  ...combines superior infrastructure performance with deep technical...  ...design, from data centers and GPU fleets to the platform layers... 
    Performance
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    San Francisco, CA
    1 day ago
  • $225k - $325k

     ...Staff Machine Learning Engineer, Partnerships Company: TwelveLabs Location: San Francisco, CA Remote...  ...technical decisions Experience with GPU-based compute, performance tuning, and capacity planning for...  ...-driven team on cutting-edge AI technology. Full health, dental, and... 
    Performance
    Full time
    H1b
    Work at office
    Remote work
    Visa sponsorship
    Flexible hours

    ChatGPT Jobs

    San Francisco, CA
    7 days ago
  •  ...for the world's most dynamic AI companies, like Cursor, Notion...  ...and help build the platform engineers turn to to ship AI products....  ...THE ROLE We’re seeking a GPU Kernel Engineer to join our team at...  ...your code directly impacts the performance of state-of-the-art machine... 
    Performance
    Full time
    Flexible hours

    Baseten

    San Francisco, CA
    22 hours ago
  •  ...Staff Security Engineer, Infrastructure San Francisco fal is the generative...  ...the next generation of AI products. We build the infrastructure...  ...unified platform where high-performance inference, orchestration,...  ...power fal.ai's platform: GPU compute, multi-cloud environments... 
    Performance
    Shift work

    fal

    San Francisco, CA
    4 days ago
  • About Us We’re building the AI infrastructure powering the...  ...precision, reliability, and performance matter most. About the Role We're looking for a Staff Infrastructure Engineer to architect and own the systems...  ...serving and performance of GPU-intensive workloads. Drive... 
    Performance
    Full time
    Work at office

    Salient

    San Francisco, CA
    22 hours ago
  •  ...pioneering the future of agentic AI in property management. We...  ...and most valuable kind of AI engineering there is. What You'll Do...  ...integration framework, and AI performance analytics infrastructure. Collaborate...  ..., CrewAI, AutoGen, Semantic Kernel, or a comparable... 
    Performance
    Live in
    Remote work

    Zuma

    San Francisco, CA
    3 days ago
  • $225k - $275k

     ...only vertically integrated AI infrastructure company built...  ..., and be part of a high-performing team that believes in each...  ...Cloud is seeking a Senior Staff Network Deployment Engineer to serve as the technical owner...  ...performance compute (HPC) and GPU-based AI infrastructure,... 
    Performance
    Temporary work
    Remote work

    Crusoe

    San Francisco, CA
    3 days ago
  • A tech-focused company in San Francisco seeks candidates with expertise in AI simulation development. The role emphasizes optimizing training efficiency, enhancing GPU performance, and ensuring low-latency inference. Applicants should be proficient in methodologies for... 
    Performance

    Embedding VC

    San Francisco, CA
    22 hours ago
  •  ...company located in California is seeking a Senior Staff Software Engineer specializing in Software Defined Networking....  ...technical leadership, and enhancing network performance using advanced technologies in the Linux Kernel. Ideal candidates will have over 8 years of experience... 
    Performance

    Crusoe Energy Systems LLC

    San Francisco, CA
    12 hours ago
  •  ...tech-first company is seeking a Member of Technical Staff to focus on cutting-edge AI research and development. The role involves...  ...training and inference infrastructure, designing ML kernels, and optimizing performance. Ideal candidates should have a passion for addressing... 
    Performance

    Mirendil

    San Francisco, CA
    2 days ago
  • $150k - $250k

     ...Senior/Staff AI Engineer Job Locations US-CA-San Francisco - Remote | US-NC-Raleigh...  ...part of AI: the systems, storage, and performance infrastructure behind real-world model...  ...environments Improve performance across GPU and CPU pathways Work on KV cache,... 
    Performance
    Full time
    Remote work

    DataDirect Networks Inc

    San Francisco, CA
    4 days ago
  •  ...world. The role: SoFi's Senior Staff AI Engineer is a hands-on AI engineering role in...  ...techniques to maximize model performance and minimize operational costs across the...  ...and managing the underlying Kubernetes/GPU orchestration for custom model deployments... 
    Performance
    Remote work

    SoFi

    San Francisco, CA
    1 day ago
  • $200k - $270k

     ...Staff Agentic AI Engineer / AI Engineer $200,000 - $270,000 San Francisco, CA Onsite, 5x per week in office Full time / Permanent...  ...established benchmark Agent harness build-out and ongoing performance improvement (task success rate, token efficiency, workflow... 
    Performance
    Permanent employment
    Full time
    Live in
    Work at office

    DeepRec.ai

    San Francisco, CA
    17 hours ago
  • $207k - $290k

     ...Description About JazzX AI: Vision:...  ...seeking an experienced AI Engineer with deep expertise in...  ...join our team as a Senior Staff Architect. In this role...  ...infrastructure, and optimize for performance, reproducibility, and...  ...(Kubernetes, GPU/TPU clusters, and cloud... 
    Performance
    Worldwide
    Flexible hours

    JazzX AI

    San Francisco, CA
    14 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Engineer: GPU Kernels & AI Performance. Be the first to apply!