Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

ML Compiler Engineer, Edge-AI & Low-Latency

femtoAI

A technology company based in San Bruno is seeking a Compiler Engineer to work on a custom ML compiler for their AI accelerator. The role requires a minimum of 2 years experience in compilers or edge-AI, with strong proficiency in Python and/or C++. You will be responsible for building model ingestion pipelines, implementing graph transformations, and debugging. The position offers benefits including medical insurance, 401(k), and paid leave for parents. #J-18808-Ljbffr femtoAI

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the ML Compiler Engineer, Edge-AI & Low-Latency in San Bruno, CA vacancy
  • A tech company specializing in AI solutions, based in San Bruno, is looking for a Compiler Developer. The role involves building...  ...execution for a custom ML compiler. Candidates should have...  ...years of experience in compilers or edge AI, with proficiency in Python and... 
    Suggested

    Femtosense

    San Bruno, CA
    3 days ago
  •  ...You will work on a custom ML compiler that transforms modern ML and...  ...execution under tight memory and latency constraints. What You'll...  ...experience in compilers and/or edge-AI ~ Proficiency in Python and...  ...Exposure to quantization or low-precision computation Interest... 
    Suggested

    femtoAI

    San Bruno, CA
    3 days ago
  •  ...Francisco seeks candidates with expertise in AI simulation development. The role...  ...enhancing GPU performance, and ensuring low-latency inference. Applicants should be proficient...  ...opportunities for innovation and cutting-edge technology implementation. #J-18808-Ljbffr... 
    Suggested

    Embedding VC

    San Francisco, CA
    5 days ago
  •  ...pioneered a high-performance AI accelerator integrated with...  ...embedded AI platform, enabling low‑latency operation with less energy...  ...core components of our ML compiler, owning critical parts of the...  ...compilers, systems, or performance engineering Masters or PhD in Computer... 
    Suggested

    Femtosense

    San Bruno, CA
    3 days ago
  • $128.7k - $261.3k

     ...mobility. For the AI Kernels & Compilers team, that mission shows...  ...: turning cutting-edge perception, prediction...  ..., and performance engineering so that every cycle on...  ..., and effortless for ML engineers across the...  ...fidelity, and on-vehicle latency. Along the way, you'... 
    Suggested
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    San Francisco, CA
    4 days ago
  • $128.7k - $261.3k

     ...mobility. For the AI Kernels & Compilers team, that...  ...turning cutting-edge perception, prediction...  ...and performance engineering so that every cycle...  ...our on-vehicle ML inference for ADAS...  ...meeting strict latency, throughput, and...  ...Experience with low latencyorreal time... 
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    San Francisco, CA
    4 days ago
  •  ...strong foundation in low-level operating...  ...serving state-of-the-art AI models...  ...throughput and low latency at scale Developing...  ...team to bring cutting-edge capabilities into production...  ...(CUDA, Triton) and compiler optimization Are...  ...current with ML infrastructure developments... 
    Work at office

    Reducto

    San Francisco, CA
    4 days ago
  •  ...ZK and ML Compiler Engineer San Francisco Bay Area We are at the forefront of Zero-Knowledge Machine...  ...learning, and compiler optimization. AI Tool Proficiency Requirements...  ...Offer Opportunity to work on cutting-edge zkML technology Competitive compensation... 

    Polyhedra

    San Francisco, CA
    4 days ago
  • A media technology company in San Francisco is seeking a Founding Engineer specializing in ML Inference. This highly technical role requires expertise in the ML infrastructure stack and aims to optimize generative media performance. The ideal candidate will drive innovations... 
    Relocation package

    Reactor

    San Francisco, CA
    3 days ago
  • Reactor seeks an ML Inference Engineer in San Francisco to enhance performance on generative media models. In this role, you'll drive model...  ...Science or a related field, deep expertise in PyTorch and model compilation, and strong knowledge of GPU hardware. Benefits include a... 
    Relocation package

    Reactor

    San Francisco, CA
    2 days ago
  • A leading AI technology firm located in San Bruno is seeking an experienced compiler design engineer with over 4 years of expertise in compilers, systems, or performance engineering. The ideal candidate will have a Master's or PhD in Computer Science, Electrical Engineering... 

    Femtosense

    San Bruno, CA
    4 days ago
  • $180k - $240k

     ...Senior Machine Learning Engineer - this is a fully...  ...been on a mission to use AI to resolve as many conversations...  ...our small but mighty ML team building...  ...Mandatory) Familiarity with low-latency production ML systems (...  ...on genuinely cutting-edge AI problems — low latency... 
    Remote work
    Flexible hours

    Career Renew

    San Francisco, CA
    4 days ago
  •  ...Position: Senior ML Performance Engineer Location: SF Bay Area (US)...  ...Full-Time Industry: AI Infrastructure / Compiler Systems Overview A...  ...anywhere." This includes cloud, edge, and hybrid environments...  ..., and test suites (latency, throughput, memory utilization... 
    Full time

    Amadeus Search

    San Francisco, CA
    2 days ago
  • $320k - $405k

     ...interpretable, and steerable AI systems. We want AI...  ...researchers, engineers, policy experts,...  ...and implement ML infrastructure that...  ...Optimize inference latency and throughput for...  ...handle high-throughput, low-latency workloads...  ...translating cutting-edge research into production... 
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    2 days ago
  •  ...across a wide range of environments—from edge devices to large-scale deployments. Our...  ...We are seeking a Staff-level (or higher) AI/ML engineer with expertise in multimodal systems to...  ...speech) Optimizing multimodal pipelines for latency, efficiency, and deployment across a... 

    PrismML

    San Francisco, CA
    1 day ago
  •  ...building the future of voice AI operating systems for clinicians...  ...point where cutting-edge AI meets real clinical impact...  ...We are seeking an Applied ML Engineer to productionize and scale machine...  ...transforming models into reliable, low-latency, production-grade systems... 

    Knowtex

    San Francisco, CA
    4 days ago
  • Job Overview Department: Engineering Location: San Francisco We're looking for an ML Inference Engineer with deep expertise...  ...Reactor’s competitive edge in ultra‑low‑latency, high‑throughput environments...  ...Implement optimizations using torch.compile, custom CUDA kernels, and... 
    Visa sponsorship
    Relocation package

    Reactor

    San Francisco, CA
    2 days ago
  • $125k - $150k

    A leading AI solutions firm in California is seeking a Senior Engineer, Machine Learning to enhance their ML capabilities. This role involves developing audio and vision models for edge devices, implementing prototypes, and optimizing performance through cutting-edge ML... 

    Syntiant Corp.

    Redwood City, CA
    3 days ago
  •  ...to bring the power of AI to everyone—without leaving...  ...performance, portable compiler that lets developers "...  ..., deploy them to the edge, and everything in...  ...looking for a Senior ML Performance Engineer to architect and lead...  ...test suites that measure latency, throughput, memory... 

    Alumni Ventures

    San Francisco, CA
    1 day ago
  •  ...Zensors Infrastructure Engineer Zensors is the spatial...  ...physical world. Our AI platform provides real-...  ...scale, we rely on cutting-edge optimization to ensure...  ...high throughput, low latency, and reliability for critical...  ...deeply with ML engineers to ensure validation... 
    Work at office

    Zensors

    San Francisco, CA
    4 days ago
  •  ...The Next Frontier Of Ai-Driven Robotics! Dyna...  ...As a Lead ML Infrastructure Engineer, you are the architect...  ...hardware and cutting-edge research, ensuring that...  ...Production Inference: Build low-latency inference pipelines...  ...distillation, and model compilation (TensorRT, Triton) to... 

    DYNA Robotics Inc

    Redwood City, CA
    2 days ago
  • The problem we saw Most AI infrastructure is built for...  ...inference infrastructure. As our ML Infrastructure and Platform Engineer, you will own the...  ...ensuring high availability and low‑latency inference across the fleet...  ...that will give you an edge Exposure to ML infrastructure... 
    Flexible hours
    Shift work

    URun

    San Francisco, CA
    2 days ago
  • $178.64k - $319k

    Samsara seeks a Staff Machine Learning Engineer to develop AI solutions and optimize ML systems for Edge compute constraints. This remote role involves working with large-scale data, leading AI initiatives, and collaborating across teams. With over 8 years of experience... 
    Remote job
    Flexible hours

    Samsara

    San Francisco, CA
    5 days ago
  •  ...AI Chopping Block, Inc. is seeking a talented engineer to lead the development of our high-performance robotics simulation platform. You will design and implement...  ...for optimize performance, focusing on compiler stack advancements, including JIT compilation and GPU... 

    AI Chopping Block, Inc.

    San Carlos, CA
    1 day ago
  • $216k - $270k

     ...Scale's Physical AI business unit is dedicated to...  ...Physical AI and developing ML pipelines for...  ...As an ML Systems Engineer on the Physical AI team...  ...platform powers cutting-edge research and production...  ...models at scale, ensuring low latency for real-time applications... 
    Full time

    Scale AI

    San Francisco, CA
    5 days ago
  • A cutting-edge robotics company is searching for a candidate to manage the full lifecycle of learned components on OG-1, focusing on training...  ...will possess a strong educational background in CS, Robotics, or ML and demonstrate proficiency in Python and PyTorch, as well as... 

    Origin

    San Francisco, CA
    5 days ago
  • A leading streaming service is seeking a Staff Software Engineer to enhance ML infrastructure. The role involves designing scalable systems, mentoring engineers, and collaborating with cross-functional teams. Candidates should have over 8 years of experience in building... 

    Tubi Tv

    San Francisco, CA
    5 days ago
  • $218.4k - $273k

    Scale's Physical AI business unit is dedicated to...  ...Physical AI and developing ML pipelines for...  ...Role As an ML Systems Engineer on the Physical AI team...  ...platform powers cutting-edge research and production...  ...models at scale, ensuring low latency for real-time applications... 
    Full time

    Scale AI, Inc.

    San Francisco, CA
    3 days ago
  • $250k - $350k

     ...Staff AI/ML Engineer San Francisco, California, United States The leader in AI-native...  ...Experience with big-data engines like Spark. Compiler development for languages like SQL,...  .... Optimization of ML models for low-latency, high-throughput production use. Contributions... 
    Work at office
    Remote work
    Flexible hours
    2 days per week
    3 days per week

    Prophecy

    San Francisco, CA
    3 days ago
  • $300k - $400k

     ...Zeta Global (NYSE: ZETA) is the AI-Powered Marketing Cloud that...  ...As a Principal AI/ML Engineer in our AdTech team, you will...  ...operating at large scale and low latency to handle billions of ad events...  ...capabilities on the cutting edge. AI & Agentic Applications:... 

    Zeta Global

    San Francisco, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Compiler Engineer, Edge-AI & Low-Latency. Be the first to apply!