
Edge Inference Engineer: Optimize On-Device AI Kernels

Liquid AI

Liquid AI is seeking a Systems Programmer to join its Edge Inference team in San Francisco. In this role, you will implement and optimize inference kernels on various hardware, ensuring efficiency and performance. Ideal candidates have over 5 years of systems programming experience with strong C++ skills and a deep understanding of ML fundamentals. The position offers a competitive salary, equity, and comprehensive health benefits. Flexible location options are available.

Vacancy posted 1 day ago
Similar jobs that could be interesting for you
Based on the Edge Inference Engineer: Optimize On-Device AI Kernels vacancy in San Francisco, CA
  • FriendliAI is seeking a GPU Kernel Engineer in San Francisco to design and optimize GPU kernels for AI inference. This role requires expertise in CUDA, C++, and performance-critical systems. You will work on cutting-edge GPU technology and contribute to a highly collaborative... 
    Suggested

    FriendliAI

    San Francisco, CA
    4 days ago
  •  ...in the Consumer Devices group focused on...  ...for on-device and edge deployment of...  ...lead a team of engineers responsible for...  ...implementing the low-level inference stack, including kernel development and...  ...designed or optimized high-performance...  ...OpenAI is an AI research and deployment... 
    Suggested
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    4 days ago
  • Genesis AI is seeking an experienced individual...  ...develop low-latency inference pipelines for on-device deployment in...  ...involves designing and optimizing distributed systems on...  ...background, and mastery in kernel optimization. This...  ...essential for our cutting-edge work in machine... 
    Suggested

    Genesis AI

    San Francisco, CA
    4 days ago
  • Quadric in San Francisco is looking for an experienced AI Kernel Engineer to develop and optimize AI kernels for their innovative neural processing platform. This role involves enhancing performance for various hardware configurations and providing technical support to... 
    Suggested

    Quadric

    San Francisco, CA
    4 days ago
  • $250k

    Edge AI is a production requirement across automotive, robotics, and...  ...deploying models on edge devices rebuilds memory management, platform...  ..., memory managers that optimize dynamically, observability stacks...  ...are doing in the field. Inference latency, memory pressure, thermal... 
    Suggested

    Forum Ventures

    San Francisco, CA
    1 day ago
  • A leading AI acceleration company in San Francisco is seeking a GPU Kernel Engineer to optimize performance for machine learning models. You will be responsible for designing high-performance GPU kernels and using advanced techniques to boost computation efficiency. Ideal... 

    Baseten

    San Francisco, CA
    4 days ago
  •  ...Team Our team analyzes inference stack performance...  ...understanding into performance optimizations and models that...  ...application behavior to kernels, accelerators, networking...  ...collaborating with engineering and research teams to...  ...About OpenAI OpenAI is an AI research and... 

    AI Chopping Block, Inc.

    San Francisco, CA
    4 days ago
  • $175k - $225k

     ...by veteran operators and engineers, alumni of Sonos, Paypal,...  ...We're looking for an AI Inference Engineer who lives at the...  ...ready engines running on edge devices in homes across the country...  ...you are obsessed with CUDA kernels, TensorRT optimizations, and the challenge of deploying... 
    Local area
    Remote work

    Sauron

    San Francisco, CA
    7 hours ago
  • Kovari is seeking a skilled robotics engineer in San Francisco to own the perception stack for deploying robots in...  ...high-reliability manipulation policies and optimizing for real-time inference on edge devices. The ideal candidate will have experience in deploying... 

    Kovari

    San Francisco, CA
    4 days ago
  •  ...) architecture. Quadric's co-optimized software and hardware is targeted...  ...to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery...  ...and control code. Role The AI Kernel Engineer in Quadric plays the key role... 

    Quadric

    San Francisco, CA
    4 days ago
  •  ...About Us Most AI is frozen in place - it...  ...useful intelligence - the inference services that serve...  ...both. Researchers and ML engineers will hand you...  ...inference systems for LLMs, optimizing throughput, latency, and...  ...don't need to write kernels, but you should know why... 
    Flexible hours

    Adaption

    San Francisco, CA
    3 days ago
  • $160k - $230k

     ...LLM Inference Frameworks and Optimization Engineer San Francisco, Singapore, Amsterdam About the Role At Together.ai, we are building state-of-the-art infrastructure to enable efficient...  ..., CUDA graph, compiled, efficient kernels Soft Skills: Strong... 
    Full time

    Together AI

    San Francisco, CA
    1 day ago
  •  ...ML Systems Engineer — Training & Inference Optimization (MBMB) We are building large-scale embodied...  ...infrastructure, and on-device inference systems that...  ...Work across: CUDA kernels and low-level GPU execution...  ...We are a research-driven AI and robotics company focused... 

    Seer

    San Francisco, CA
    7 hours ago
  • $95k

     ...What You’ll Do We’re hiring Edge Engineers to partner closely with our...  ...shipping and assembling edge devices to managing full‑scale rollouts...  ...hardware challenges, optimizing field workflows, and traveling...  ...troubleshooting of cameras, inference pipelines, and data uploads... 
    Remote work
    Work from home
    Relocation package
    Flexible hours

    Roboflow

    San Francisco, CA
    4 days ago
  • Quadric in San Francisco is seeking a Senior Platform Software Engineer to optimize neural networks on their innovative GPNPU architecture. The ideal candidate will have a MS or Ph.D. and at least eight years of industry experience. Responsibilities include driving optimization... 

    Quadric

    San Francisco, CA
    5 days ago
  • $225k

    Magic is hiring a Kernel Engineer in San Francisco to design and maintain high-performance kernels that optimize throughput and latency during AI training and inference. The ideal candidate has low-level programming expertise, particularly for AI accelerators like NVIDIA... 

    Magic

    San Francisco, CA
    4 days ago
  • $160k - $320k

    A leading AI computing firm is seeking a Systems Engineer in San Francisco or Los Angeles to scale AI inference. Candidates should have strong C++ skills, HPC experience, and knowledge...  ...include designing GPU kernels, optimizing performance, and collaborating with technical... 

    Vast.ai

    San Francisco, CA
    4 days ago
  •  ...infrastructure for an edge-first world — a...  ...billions of devices, vehicles, robots,...  ...and satellites. AI is breaking free from...  ...As Lead Edge AI Engineer , you will own Source...  ...and on-device inference to adaptive compute...  ...efficient experiences. Optimize performance across... 
    Local area

    Source, Inc.

    San Francisco, CA
    1 day ago
  •  ...Embedded Software Engineer - Embedded Systems...  ...world of physical AI and robotics. We are...  ...to own the full on-device software stack for...  ...current and future edge devices across a wide...  .... Debugging and optimizing system performance...  ...on experience with kernel driver development... 

    Specter Services LLC

    San Francisco, CA
    3 days ago
  • $180k - $250k

     ...Staff Software Engineer, ML Performance & Systems...  ...generation of AI products. We build...  ...high-performance inference, orchestration, and...  ...identify bottlenecks and optimization opportunities....  ...of cutting edge ML infrastructure...  ...bottlenecks (custom GEMM kernels with CUTLASS for... 
    Currently hiring
    Relocation package

    fal

    San Francisco, CA
    1 day ago
  •  ...Technical Lead for Inference & ML Performance...  ...next generation of AI products. We build...  ...of fal's inference engine and ensure our...  ...rapidly deliver cutting-edge creative solutions...  ...Guide your team (kernels, applied...  ...enhancements and optimizations. - You regularly ship... 

    Fal

    San Francisco, CA
    7 hours ago
  • A tech startup focused on AI workloads is seeking a Member...  ...Technical Staff to design and optimize inference systems. The role involves...  ...should have strong software engineering skills and experience with ML...  ...opportunity to contribute to cutting-edge AI technology in a dynamic... 

    Gimlet Labs

    San Francisco, CA
    2 days ago
  • $293k - $325k

     ...Team The Software Engineering Firmware team...  ...hardware engineers to design, optimize, and ship software that bridges cutting-edge devices and real-world...  ...system bring-up or Linux kernel development. About...  ...OpenAI OpenAI is an AI research and deployment... 
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    7 hours ago
  • A leading AI research firm in San Francisco is seeking a Technical Lead to join its Future...  ...evaluating silicon platforms and optimizing model architectures while working in a hybrid...  ...and is centered on deploying cutting-edge AI technology responsibly and effectively... 
    Relocation package

    OpenAI

    San Francisco, CA
    3 days ago
  • $342k

     ...demands of advanced AI workloads. The...  ...and enable hardware optimized specifically for AI...  ...the Role As an Engineer on our hardware optimization...  ...work with our kernel, compiler and...  ...efficient training and inference on our models. If...  ...model across devices, dealing with and... 
    Full time
    Work at office
    Local area
    Relocation package
    Flexible hours

    Slope

    San Francisco, CA
    2 days ago
  •  ...GPU Kernel Engineer Sciforium is an AI infrastructure company developing next-generation...  ...role, you will design and optimize custom GPU kernels that...  ...large-scale training and inference. This role is ideal for...  ...engineering, and cutting-edge AI workloads, and who wants... 
    Flexible hours

    Sciforium

    San Francisco, CA
    1 day ago
  • $293k - $325k

     ..., and validation systems that ensure our device software is reliable, testable, and ready...  ...standards. About the Role As a Software Engineer, Quality and Developer Tools, you will...  .... About OpenAI OpenAI is an AI research and deployment company dedicated... 
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    1 day ago
  • Hayden AI Technologies, Inc. is looking for a Senior Firmware Engineer to join the Device Software team in San Francisco, California...  ...expertise in Linux kernel and device driver...  ...device drivers, optimize performance, and...  ...key role in advancing edge AI systems.

    Hayden AI Technologies, Inc.

    San Francisco, CA
    3 days ago
  •  ...We are seeking a highly technical Inference Engine Engineer to optimize the performance and efficiency of our...  ...designing, implementing, and optimizing GPU kernels and supporting infrastructure for...  ...-generation generative and agentic AI workloads. Your work will directly... 
    Worldwide
    Flexible hours

    FriendliAI Corp

    San Francisco, CA
    1 day ago
  • $142.2k - $204.6k

     ...This Role As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers...  ...GenAI inference stack - from kernels and runtimes to orchestration...  ...Databricks is the data and AI company. More than 10,000 organizations... 
    Local area
    Worldwide

    Databricks

    San Francisco, CA
    3 days ago
