Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

ML Compiler Engineer, Edge-AI & Low-Latency

femtoAI

A technology company based in San Bruno is seeking a Compiler Engineer to work on a custom ML compiler for their AI accelerator. The role requires a minimum of 2 years experience in compilers or edge-AI, with strong proficiency in Python and/or C++. You will be responsible for building model ingestion pipelines, implementing graph transformations, and debugging. The position offers benefits including medical insurance, 401(k), and paid leave for parents. #J-18808-Ljbffr femtoAI

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the ML Compiler Engineer, Edge-AI & Low-Latency in San Bruno, CA vacancy
  • A tech company specializing in AI solutions, based in San Bruno, is looking for a Compiler Developer. The role involves building...  ...execution for a custom ML compiler. Candidates should have...  ...years of experience in compilers or edge AI, with proficiency in Python and... 
    Suggested

    Femtosense

    San Bruno, CA
    3 days ago
  •  ...You will work on a custom ML compiler that transforms modern ML and...  ...execution under tight memory and latency constraints. What You'll...  ...experience in compilers and/or edge-AI ~ Proficiency in Python and...  ...Exposure to quantization or low-precision computation Interest... 
    Suggested

    femtoAI

    San Bruno, CA
    3 days ago
  •  ...Francisco seeks candidates with expertise in AI simulation development. The role...  ...enhancing GPU performance, and ensuring low-latency inference. Applicants should be proficient...  ...opportunities for innovation and cutting-edge technology implementation. #J-18808-Ljbffr... 
    Suggested

    Embedding VC

    San Francisco, CA
    11 hours ago
  •  ...pioneered a high-performance AI accelerator integrated with...  ...embedded AI platform, enabling low‑latency operation with less energy...  ...core components of our ML compiler, owning critical parts of the...  ...compilers, systems, or performance engineering Masters or PhD in Computer... 
    Suggested

    Femtosense

    San Bruno, CA
    3 days ago
  • $128.7k - $261.3k

     ...mobility. For the AI Kernels & Compilers team, that mission shows...  ...: turning cutting-edge perception, prediction...  ..., and performance engineering so that every cycle on...  ..., and effortless for ML engineers across the...  ...fidelity, and on-vehicle latency. Along the way, you'... 
    Suggested
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    San Francisco, CA
    4 days ago
  •  ...ZK and ML Compiler Engineer San Francisco Bay Area We are at the forefront of Zero-Knowledge Machine...  ...learning, and compiler optimization. AI Tool Proficiency Requirements...  ...Offer Opportunity to work on cutting-edge zkML technology Competitive compensation... 

    Polyhedra

    San Francisco, CA
    4 days ago
  • A media technology company in San Francisco is seeking a Founding Engineer specializing in ML Inference. This highly technical role requires expertise in the ML infrastructure stack and aims to optimize generative media performance. The ideal candidate will drive innovations... 
    Relocation package

    Reactor

    San Francisco, CA
    3 days ago
  •  ...strong foundation in low-level operating...  ...-of-the-art AI models Optimizing...  ...and low latency at scale Developing...  ...to bring cutting-edge capabilities into...  ..., Triton ) and compiler optimization Are...  ...staying current with ML infrastructure...  ...requires a large engineering effort dedicated... 
    Work at office

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    3 days ago
  • A leading AI technology firm located in San Bruno is seeking an experienced compiler design engineer with over 4 years of expertise in compilers, systems, or performance engineering. The ideal candidate will have a Master's or PhD in Computer Science, Electrical Engineering... 

    Femtosense

    San Bruno, CA
    4 days ago
  •  ...leading streaming platform in San Francisco is seeking a Software Engineer to design and build scalable distributed systems. The ideal...  ..., particularly AWS. Responsibilities include building low-latency online microservices and collaborating with machine learning engineers... 
    Flexible hours

    Tubi Tv

    San Francisco, CA
    1 day ago
  • About ZETIC.ai ZETIC.ai builds an end-to...  ...We’re hiring an ML Software Engineer (On-Device AI Model...  ...encoders, etc.) onto edge devices,...  ...performance roadmap (latency, memory, power/thermal...  ...engineers to resolve compiler/runtime...  ...C++ (or equivalent low-level performance... 
    Full time

    CAPSA

    San Francisco, CA
    11 hours ago
  •  ...for the physical world. Our AI platform provides real-time...  ...at scale, we rely on cutting-edge optimization to ensure our vision...  ...About the Role As an ML / DevOps Engineer, you will play a pivotal role...  ...will ensure high throughput, low latency, and rock-solid reliability... 
    Work at office

    Zensors

    San Francisco, CA
    7 days ago
  • $180k - $240k

     ...Senior Machine Learning Engineer - this is a fully...  ...been on a mission to use AI to resolve as many conversations...  ...our small but mighty ML team building...  ...Mandatory) Familiarity with low-latency production ML systems (...  ...on genuinely cutting-edge AI problems — low latency... 
    Remote work
    Flexible hours

    Career Renew

    San Francisco, CA
    18 days ago
  • Position: Senior ML Performance Engineer Location: SF Bay Area (US) or Toronto...  ...: Full-Time Industry: AI Infrastructure / Compiler Systems Overview A...  ...anywhere.” This includes cloud, edge, and hybrid environments...  ...metrics, and test suites (latency, throughput, memory... 
    Full time

    Amadeus Search

    San Francisco, CA
    4 days ago
  • $320k - $405k

     ...interpretable, and steerable AI systems. We want AI...  ...researchers, engineers, policy experts,...  ...and implement ML infrastructure that...  ...Optimize inference latency and throughput for...  ...handle high-throughput, low-latency workloads...  ...translating cutting-edge research into production... 
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    3 days ago
  • ZETIC.ai is seeking an ML Software Engineer in San Francisco to optimize AI models for edge devices. You will lead strategies for deploying models, ensuring high performance while collaborating with engineers to achieve reliable production. The ideal candidate has 3+ years... 

    CAPSA

    San Francisco, CA
    11 hours ago
  •  ...Zensors Infrastructure Engineer Zensors is the spatial...  ...physical world. Our AI platform provides real-...  ...scale, we rely on cutting-edge optimization to ensure...  ...high throughput, low latency, and reliability for critical...  ...deeply with ML engineers to ensure validation... 
    Work at office

    Zensors

    San Francisco, CA
    9 days ago
  • A cutting-edge robotics company is searching for a candidate to manage the full lifecycle of learned components on OG-1, focusing on training...  ...will possess a strong educational background in CS, Robotics, or ML and demonstrate proficiency in Python and PyTorch, as well as... 

    Origin

    San Francisco, CA
    11 hours ago
  • $248.8k - $311k

     ...Scale's Physical AI business unit is dedicated to...  ...Physical AI and developing ML pipelines for...  ...Role As an ML Systems Engineer on the Physical AI team...  ...platform powers cutting-edge research and production...  ...models at scale, ensuring low latency for real-time applications... 
    Full time

    Scale AI

    San Francisco, CA
    18 days ago
  • A leading streaming service is seeking a Staff Software Engineer to enhance ML infrastructure. The role involves designing scalable systems, mentoring engineers, and collaborating with cross-functional teams. Candidates should have over 8 years of experience in building... 

    Tubi Tv

    San Francisco, CA
    11 hours ago
  • $300k - $400k

     ...Zeta Global (NYSE: ZETA) is the AI-Powered Marketing Cloud that...  ...As a Principal AI/ML Engineer in our AdTech team, you will...  ...operating at large scale and low latency to handle billions of ad events...  ...capabilities on the cutting edge. ~ AI & Agentic Applications... 

    Zeta Global

    San Francisco, CA
    4 days ago
  • $250k - $350k

     ...Staff AI/ML Engineer San Francisco, California, United States The leader in AI-native...  ...Experience with big-data engines like Spark. Compiler development for languages like SQL,...  .... Optimization of ML models for low-latency, high-throughput production use. Contributions... 
    Work at office
    Remote work
    Flexible hours
    2 days per week
    3 days per week

    Prophecy

    San Francisco, CA
    3 days ago
  •  ...we’re not just building AI models—we’re redefining...  ...’t: on-device, at the edge, under real-time constraints...  ...of world-class engineers, researchers, and builders...  ...debug models in popular ML frameworks, and...  ...others stall: on CPUs, with low latency, minimal memory, and maximum... 

    Liquid AI

    San Francisco, CA
    2 days ago
  • $150k - $220k

     ...Founding Machine Learning Engineer San Francisco...  ...browser understanding, and low-latency systems, shipping models...  ..., or consumer-focused "AI browsers," we run AI...  ...architecture creates unique ML challenges. This is...  ..., Cloudflare Workers edge proxy, and inference providers... 
    H1b
    Work at office
    Visa sponsorship
    Sleeping nights

    Composite.ai

    San Francisco, CA
    4 days ago
  • $200k

     ...building the world's most trusted AI work companion for...  ...Gain exposure to cutting-edge AI for Pro tools and play a...  ...deploying high-throughput, ultra-low-latency inference engines for large language models or...  ...intersection between the core ML training team and the backend... 
    Full time
    Work at office
    Worldwide

    Plaud

    San Francisco, CA
    2 days ago
  •  ...Data Science & ML Ops Engineer Location: Bay Area, CA Tax Term (W2, C2C): W2, C2...  ...Leverage AutoML tools (e.g., Vertex AI AutoML, H2O Driverless AI) for low-code/no-code model development,...  ...containerized environments and support model scoring via low-latency APIs... 

    Apolis

    San Francisco, CA
    4 days ago
  • $152k - $228k

     ...Job Description Job Description Senior ML Engineer About Invoca Invoca is an AI-powered revenue execution platform that brings together marketing...  ...-based GPU infrastructure. Profile and tune for low latency and high throughput, and build robust, scalable APIs... 
    Currently hiring
    Remote work
    Flexible hours

    Invoca

    San Francisco, CA
    29 days ago
  • $190k - $222k

    Wheel the World in San Francisco seeks a Senior Machine Learning Engineer to lead the perception stack for their Archimedes project. This role requires expertise in developing and deploying perception systems under real-world conditions. The successful candidate will have... 

    Wheel the World

    San Francisco, CA
    1 day ago
  •  ...at the intersection of AI, robotics, and healthcare...  ...Senior Machine Learning Engineer, you will build the intelligence...  ...‑world data, ambiguous edge cases, and high‑leverage...  ..., error handling, and low‑quality input recovery....  .... Optimize cost, latency, and reliability across... 
    Work at office

    Hike-Medical

    San Francisco, CA
    3 days ago
  • Check out 1962 new Machine Learning Engineer opportunities posted on AI Chopping Block Design, build, and maintain...  .... Develop and optimize end-to-end ML pipelines encompassing data...  ...environments to ensure high performance and low latency. Machine Learning Enginer, Core... 
    Flexible hours

    AI Chopping Block, Inc.

    San Francisco, CA
    11 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Compiler Engineer, Edge-AI & Low-Latency. Be the first to apply!