Edge Inference Engineer: Optimize On-Device AI Kernels

Liquid AI

Liquid AI is seeking a Systems Programmer to join its Edge Inference team in San Francisco. In this role, you will implement and optimize inference kernels on various hardware, ensuring efficiency and performance. Ideal candidates have over 5 years of systems programming experience with strong C++ skills and a deep understanding of ML fundamentals. The position offers competitive salary, equity, and comprehensive health benefits. Flexible location options available.

Vacancy posted 2 days ago
Similar jobs that could be interesting for you
Based on the Edge Inference Engineer: Optimize On-Device AI Kernels in San Francisco, CA vacancy
  •  ...FriendliAI is seeking a GPU Kernel Engineer in San Francisco to design and optimize GPU kernels for AI inference. This role requires expertise in CUDA, C++, and performance-critical systems. You will work on cutting-edge GPU technology and contribute to a highly collaborative... 
    Suggested

    FriendliAI

    San Francisco, CA
    5 days ago
  •  ...in the Consumer Devices group focused on...  ...for on-device and edge deployment of...  ...lead a team of engineers responsible for...  ...implementing the low-level inference stack, including kernel development and...  ...designed or optimized high-performance...  ...OpenAI is an AI research and deployment... 
    Suggested
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    5 days ago
  • $90 - $125 per hour

    A cutting-edge AI company is looking for Low-Level Engineers to design RL environments that optimize kernel development and systems programming. Candidates should have strong Python skills and a solid understanding of LLMs. This remote contractor role offers an hourly rate... 
    Suggested
    Remote job
    Hourly pay
    For contractors

    Open Data Science

    San Francisco, CA
    2 days ago
  • Quadric in San Francisco is looking for an experienced AI Kernel Engineer to develop and optimize AI kernels for their innovative neural processing platform. This role involves enhancing performance for various hardware configurations and providing technical support to... 
    Suggested

    Quadric

    San Francisco, CA
    5 days ago
  •  ...A leading AI acceleration company in San Francisco is seeking a GPU Kernel Engineer to optimize performance for machine learning models. You will be responsible for designing high-performance GPU kernels and using advanced techniques to boost computation efficiency. Ideal... 
    Suggested

    Baseten

    San Francisco, CA
    5 days ago
  •  ...Kovari is seeking a skilled robotics engineer in San Francisco to own the perception stack for deploying robots...  ...developing high-reliability manipulation policies and optimizing for real-time inference on edge devices. The ideal candidate will have experience in deploying... 

    Kovari

    San Francisco, CA
    5 days ago
  • $175k - $225k

     ...by veteran operators and engineers, alumni of Sonos, Paypal,...  ...We're looking for an AI Inference Engineer who lives at the...  ...ready engines running on edge devices in homes across the country...  ...you are obsessed with CUDA kernels, TensorRT optimizations, and the challenge of deploying... 
    Local area
    Remote work

    Sauron

    San Francisco, CA
    1 day ago
  •  ...) architecture. Quadric's co-optimized software and hardware is targeted...  ...to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery...  ...and control code. Role The AI Kernel Engineer in Quadric plays the key role... 

    Quadric

    San Francisco, CA
    5 days ago
  •  ...About Us Most AI is frozen in place - it...  ...useful intelligence - the inference services that serve...  ...both. Researchers and ML engineers will hand you...  ...inference systems for LLMs, optimizing throughput, latency, and...  ...don't need to write kernels, but you should know why... 
    Flexible hours

    Adaption

    San Francisco, CA
    20 days ago
  •  ...professional software engineering experience,...  ...deploying, and optimizing large-scale...  ...familiarity with custom‑kernels for diverse...  ...art advanced AI infrastructure...  ...+ on‑device low‑latency deployments...  ...work with cutting‑edge ML models that...  ...training/inference setups, apply roofline... 

    Waymo

    San Francisco, CA
    5 days ago
  • $167.2k - $209k

     ...DigitalOcean is expanding its AI Infrastructure layer to...  .... We are seeking a Senior Engineer 2 to join our AI Inference Data Plane team. In this...  ...resiliency standards. Performance Optimization: Implement and optimise...  ...ll be a part of a cutting‑edge technology company with an... 
    Local area
    Remote work
    Worldwide
    Flexible hours

    DigitalOcean

    San Francisco, CA
    4 days ago
  • $160k - $230k

     ...LLM Inference Frameworks and Optimization Engineer San Francisco, Singapore, Amsterdam About the Role At Together.ai, we are building state-of-the-art infrastructure to enable efficient...  ..., CUDA graph, compiled, efficient kernels Soft Skills: Strong... 
    Full time

    Together AI

    San Francisco, CA
    22 days ago
  •  ...lookout for talented Performance Engineers to join our cutting-edge GenAI team in San Francisco, California...  ...emphasizes low-level systems optimization using C++, Python, and Rust, aimed at elevating the quality of AI training and inference infrastructure. We seek... 

    Obsidian

    San Francisco, CA
    4 days ago
  • $95k

     ...What You’ll Do We’re hiring Edge Engineers to partner closely with our...  ...shipping and assembling edge devices to managing full‑scale rollouts...  ...hardware challenges, optimizing field workflows, and traveling...  ...troubleshooting of cameras, inference pipelines, and data uploads... 
    Remote work
    Work from home
    Relocation package
    Flexible hours

    Roboflow

    San Francisco, CA
    5 days ago
  • $225k

    Magic is hiring a Kernel Engineer in San Francisco to design and maintain high-performance kernels that optimize throughput and latency during AI training and inference. The ideal candidate has low-level programming expertise, particularly for AI accelerators like NVIDIA... 

    Magic

    San Francisco, CA
    5 days ago
  • Gravity Engineering Services Pvt Ltd. is looking for an Inference Frameworks and Optimization Engineer to enhance the performance of AI infrastructure. This role involves designing distributed inference...  ...you're passionate about cutting-edge AI technologies, we want to hear... 

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    2 days ago
  •  ...MakerMaker, based in San Francisco, is seeking a highly skilled kernel engineer to write and optimize GPU kernels that enhance performance for training and inference. This role involves deep, low-level work to close the significant performance gap that exists in modern... 

    MakerMaker

    San Francisco, CA
    4 days ago
  •  ...infrastructure for an edge-first world — a...  ...billions of devices, vehicles, robots,...  ...and satellites. AI is breaking free from...  ...As Lead Edge AI Engineer , you will own Source...  ...and on-device inference to adaptive compute...  ...efficient experiences. Optimize performance across... 
    Local area

    Source, Inc.

    San Francisco, CA
    2 days ago
  •  ...Embedded Software Engineer - Embedded Systems...  ...world of physical AI and robotics. We are...  ...to own the full on-device software stack for...  ...current and future edge devices across a wide...  .... Debugging and optimizing system performance...  ...on experience with kernel driver development... 

    Specter Services LLC

    San Francisco, CA
    4 days ago
  • $180k - $250k

     ...Staff Software Engineer, ML Performance & Systems...  ...generation of AI products. We build...  ...high-performance inference, orchestration, and...  ...identify bottlenecks and optimization opportunities....  ...of cutting edge ML infrastructure...  ...bottlenecks (custom GEMM kernels with CUTLASS for... 
    Currently hiring
    Relocation package

    fal

    San Francisco, CA
    19 days ago
  •  ...Technical Lead for Inference & ML Performance...  ...next generation of AI products. We build...  ...of fal's inference engine and ensure our...  ...rapidly deliver cutting-edge creative solutions...  ...Guide your team (kernels, applied...  ...enhancements and optimizations. - You regularly ship... 

    Fal

    San Francisco, CA
    4 days ago
  • A tech startup focused on AI workloads is seeking a Member...  ...Technical Staff to design and optimize inference systems. The role involves...  ...should have strong software engineering skills and experience with ML...  ...opportunity to contribute to cutting-edge AI technology in a dynamic... 

    Gimlet Labs

    San Francisco, CA
    3 days ago
  • A leading AI research firm in San Francisco is seeking a Technical Lead to join its Future...  ...evaluating silicon platforms and optimizing model architectures while working in a hybrid...  ...and is centered on deploying cutting-edge AI technology responsibly and effectively... 
    Relocation package

    OpenAI

    San Francisco, CA
    4 days ago
  •  ...powers mission‑critical inference for the world's most dynamic AI companies, like...  ...to bring cutting‑edge models into production...  ...build the platform engineers turn to to ship AI...  ...inference optimizations. THE OPPORTUNITY Networking...  ...behaviors. Optimize Kernels: You will work with... 
    Flexible hours

    Baseten

    San Francisco, CA
    5 days ago
  • $342k

     ...demands of advanced AI workloads. The...  ...and enable hardware optimized specifically for AI...  ...the Role As an Engineer on our hardware optimization...  ...work with our kernel, compiler and...  ...efficient training and inference on our models. If...  ...model across devices, dealing with and... 
    Full time
    Work at office
    Local area
    Relocation package
    Flexible hours

    Centaur Labs

    San Francisco, CA
    3 days ago
  • Hayden AI Technologies, Inc. is looking for a Senior Firmware Engineer to join the Device Software team in San Francisco, California...  ...expertise in Linux kernel and device driver...  ...device drivers, optimize performance, and...  ...key role in advancing edge AI systems.

    Hayden AI Technologies, Inc.

    San Francisco, CA
    4 days ago
  •  ...Senior Site Reliability Engineer - AI Infrastructure...  ...platform routes training and inference jobs across global...  ...from network fabric – kernel – framework. What You’...  ...GPU compute clusters optimized for large‑scale training...  ...workloads, including device plugins, topology‑aware... 
    Full time
    Remote work

    Cortes 23

    San Francisco, CA
    4 days ago
  • $293k - $325k

     ..., and validation systems that ensure our device software is reliable, testable, and ready...  ...standards. About the Role As a Software Engineer, Quality and Developer Tools, you will...  .... About OpenAI OpenAI is an AI research and deployment company dedicated... 
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    2 days ago
  •  ...Sciforium Gpu Kernel Engineer Sciforium is an AI infrastructure company developing next...  ...role, you will design and optimize custom GPU kernels that power...  ...large-scale training and inference. This role is ideal for...  ...engineering, and cutting-edge AI workloads, and who... 
    Flexible hours

    Sciforium

    San Francisco, CA
    2 days ago
  • $167.2k - $209k

     ...DigitalOcean is seeking a Senior Engineer 2 to play a key technical role in our AI Inference Optimization team. DigitalOcean aims to be...  ...the inference engine and GPU kernel layers, ensuring our...  ...Proactively implement cutting-edge optimization techniques to keep... 
    Local area
    Remote work
    Worldwide
    Flexible hours

    DigitalOcean

    San Francisco, CA
    23 hours ago
