ML Compiler Engineer, Edge-AI & Low-Latency

femtoAI

A technology company based in San Bruno is seeking a Compiler Engineer to work on a custom ML compiler for their AI accelerator. The role requires a minimum of 2 years experience in compilers or edge-AI, with strong proficiency in Python and/or C++. You will be responsible for building model ingestion pipelines, implementing graph transformations, and debugging. The position offers benefits including medical insurance, 401(k), and paid leave for parents. #J-18808-Ljbffr femtoAI

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the ML Compiler Engineer, Edge-AI & Low-Latency in San Bruno, CA vacancy

ML Compiler Engineer, Edge-AI & Low-Latency
A tech company specializing in AI solutions, based in San Bruno, is looking for a Compiler Developer. The role involves building... ...execution for a custom ML compiler. Candidates should have... ...years of experience in compilers or edge AI, with proficiency in Python and...
Suggested
Femtosense
San Bruno, CA
3 days ago
ML Compiler Engineer
...You will work on a custom ML compiler that transforms modern ML and... ...execution under tight memory and latency constraints. What You'll... ...experience in compilers and/or edge-AI ~ Proficiency in Python and... ...Exposure to quantization or low-precision computation Interest...
Suggested
femtoAI
San Bruno, CA
3 days ago
Staff ML Engineer: Efficient ML & Low-Latency AI
...Francisco seeks candidates with expertise in AI simulation development. The role... ...enhancing GPU performance, and ensuring low-latency inference. Applicants should be proficient... ...opportunities for innovation and cutting-edge technology implementation. #J-18808-Ljbffr...
Suggested
Embedding VC
San Francisco, CA
11 hours ago
Senior ML Compiler Engineer
...pioneered a high-performance AI accelerator integrated with... ...embedded AI platform, enabling low‑latency operation with less energy... ...core components of our ML compiler, owning critical parts of the... ...compilers, systems, or performance engineering Masters or PhD in Computer...
Suggested
Femtosense
San Bruno, CA
3 days ago
Senior ML Compiler Engineer
$128.7k - $261.3k
...mobility. For the AI Kernels & Compilers team, that mission shows... ...: turning cutting-edge perception, prediction... ..., and performance engineering so that every cycle on... ..., and effortless for ML engineers across the... ...fidelity, and on-vehicle latency. Along the way, you'...
Suggested
Local area
Work from home
Relocation package
Flexible hours
General Motors
San Francisco, CA
4 days ago
ZK and ML Compiler Engineer
...ZK and ML Compiler Engineer San Francisco Bay Area We are at the forefront of Zero-Knowledge Machine... ...learning, and compiler optimization. AI Tool Proficiency Requirements... ...Offer Opportunity to work on cutting-edge zkML technology Competitive compensation...
Polyhedra
San Francisco, CA
4 days ago
Founding ML Inference Engineer — Ultra-Low Latency AI
A media technology company in San Francisco is seeking a Founding Engineer specializing in ML Inference. This highly technical role requires expertise in the ML infrastructure stack and aims to optimize generative media performance. The ideal candidate will drive innovations...
Relocation package
Reactor
San Francisco, CA
3 days ago
LLM/ML Engineer (Inference)
...strong foundation in low-level operating... ...-of-the-art AI models Optimizing... ...and low latency at scale Developing... ...to bring cutting-edge capabilities into... ..., Triton ) and compiler optimization Are... ...staying current with ML infrastructure... ...requires a large engineering effort dedicated...
Work at office
Gravity Engineering Services Pvt Ltd.
San Francisco, CA
3 days ago
Senior ML Compiler Engineer — Real-Time, Hardware-Aware
A leading AI technology firm located in San Bruno is seeking an experienced compiler design engineer with over 4 years of expertise in compilers, systems, or performance engineering. The ideal candidate will have a Master's or PhD in Computer Science, Electrical Engineering...
Femtosense
San Bruno, CA
4 days ago
Staff ML Infra Engineer: Low-Latency Cloud Systems
...leading streaming platform in San Francisco is seeking a Software Engineer to design and build scalable distributed systems. The ideal... ..., particularly AWS. Responsibilities include building low-latency online microservices and collaborating with machine learning engineers...
Flexible hours
Tubi Tv
San Francisco, CA
1 day ago
ML Software Engineer
About ZETIC.ai ZETIC.ai builds an end-to... ...We’re hiring an ML Software Engineer (On-Device AI Model... ...encoders, etc.) onto edge devices,... ...performance roadmap (latency, memory, power/thermal... ...engineers to resolve compiler/runtime... ...C++ (or equivalent low-level performance...
Full time
CAPSA
San Francisco, CA
11 hours ago
ML Ops / Dev Ops Engineer
...for the physical world. Our AI platform provides real-time... ...at scale, we rely on cutting-edge optimization to ensure our vision... ...About the Role As an ML / DevOps Engineer, you will play a pivotal role... ...will ensure high throughput, low latency, and rock-solid reliability...
Work at office
Zensors
San Francisco, CA
7 days ago
Senior ML Engineer
$180k - $240k
...Senior Machine Learning Engineer - this is a fully... ...been on a mission to use AI to resolve as many conversations... ...our small but mighty ML team building... ...Mandatory) Familiarity with low-latency production ML systems (... ...on genuinely cutting-edge AI problems — low latency...
Remote work
Flexible hours
Career Renew
San Francisco, CA
18 days ago
Senior ML Performance Engineer
Position: Senior ML Performance Engineer Location: SF Bay Area (US) or Toronto... ...: Full-Time Industry: AI Infrastructure / Compiler Systems Overview A... ...anywhere.” This includes cloud, edge, and hybrid environments... ...metrics, and test suites (latency, throughput, memory...
Full time
Amadeus Search
San Francisco, CA
4 days ago
ML Infrastructure Engineer, Safeguards
$320k - $405k
...interpretable, and steerable AI systems. We want AI... ...researchers, engineers, policy experts,... ...and implement ML infrastructure that... ...Optimize inference latency and throughput for... ...handle high-throughput, low-latency workloads... ...translating cutting-edge research into production...
Work at office
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
3 days ago
Edge ML Engineer: On-Device AI Optimization Lead
ZETIC.ai is seeking an ML Software Engineer in San Francisco to optimize AI models for edge devices. You will lead strategies for deploying models, ensuring high performance while collaborating with engineers to achieve reliable production. The ideal candidate has 3+ years...
CAPSA
San Francisco, CA
11 hours ago
ML/Dev Ops Systems Engineer
...Zensors Infrastructure Engineer Zensors is the spatial... ...physical world. Our AI platform provides real-... ...scale, we rely on cutting-edge optimization to ensure... ...high throughput, low latency, and reliability for critical... ...deeply with ML engineers to ensure validation...
Work at office
Zensors
San Francisco, CA
9 days ago
Robotics ML Engineer: Edge AI for Production Robots
A cutting-edge robotics company is searching for a candidate to manage the full lifecycle of learned components on OG-1, focusing on training... ...will possess a strong educational background in CS, Robotics, or ML and demonstrate proficiency in Python and PyTorch, as well as...
Origin
San Francisco, CA
11 hours ago
ML Systems Engineer, Robotics
$248.8k - $311k
...Scale's Physical AI business unit is dedicated to... ...Physical AI and developing ML pipelines for... ...Role As an ML Systems Engineer on the Physical AI team... ...platform powers cutting-edge research and production... ...models at scale, ensuring low latency for real-time applications...
Full time
Scale AI
San Francisco, CA
18 days ago
Staff ML Infra Engineer - Low-Latency Distributed Systems
A leading streaming service is seeking a Staff Software Engineer to enhance ML infrastructure. The role involves designing scalable systems, mentoring engineers, and collaborating with cross-functional teams. Candidates should have over 8 years of experience in building...
Tubi Tv
San Francisco, CA
11 hours ago
Principal AI/ML Engineer - AdTech
$300k - $400k
...Zeta Global (NYSE: ZETA) is the AI-Powered Marketing Cloud that... ...As a Principal AI/ML Engineer in our AdTech team, you will... ...operating at large scale and low latency to handle billions of ad events... ...capabilities on the cutting edge. ~ AI & Agentic Applications...
Zeta Global
San Francisco, CA
4 days ago
Staff AI/ML Engineer
$250k - $350k
...Staff AI/ML Engineer San Francisco, California, United States The leader in AI-native... ...Experience with big-data engines like Spark. Compiler development for languages like SQL,... .... Optimization of ML models for low-latency, high-throughput production use. Contributions...
Work at office
Remote work
Flexible hours
2 days per week
3 days per week
Prophecy
San Francisco, CA
3 days ago
Member of Technical Staff - ML Research Engineer, Data
...we’re not just building AI models—we’re redefining... ...’t: on-device, at the edge, under real-time constraints... ...of world-class engineers, researchers, and builders... ...debug models in popular ML frameworks, and... ...others stall: on CPUs, with low latency, minimal memory, and maximum...
Liquid AI
San Francisco, CA
2 days ago
Founding Machine Learning Engineer
$150k - $220k
...Founding Machine Learning Engineer San Francisco... ...browser understanding, and low-latency systems, shipping models... ..., or consumer-focused "AI browsers," we run AI... ...architecture creates unique ML challenges. This is... ..., Cloudflare Workers edge proxy, and inference providers...
H1b
Work at office
Visa sponsorship
Sleeping nights
Composite.ai
San Francisco, CA
4 days ago
Machine Learning Engineer, Inference & Serving (Speech LLM) - San Francisco
$200k
...building the world's most trusted AI work companion for... ...Gain exposure to cutting-edge AI for Pro tools and play a... ...deploying high-throughput, ultra-low-latency inference engines for large language models or... ...intersection between the core ML training team and the backend...
Full time
Work at office
Worldwide
Plaud
San Francisco, CA
2 days ago
Data Science & ML Ops Engineer
...Data Science & ML Ops Engineer Location: Bay Area, CA Tax Term (W2, C2C): W2, C2... ...Leverage AutoML tools (e.g., Vertex AI AutoML, H2O Driverless AI) for low-code/no-code model development,... ...containerized environments and support model scoring via low-latency APIs...
Apolis
San Francisco, CA
4 days ago
Senior ML Engineer
$152k - $228k
...Job Description Job Description Senior ML Engineer About Invoca Invoca is an AI-powered revenue execution platform that brings together marketing... ...-based GPU infrastructure. Profile and tune for low latency and high throughput, and build robust, scalable APIs...
Currently hiring
Remote work
Flexible hours
Invoca
San Francisco, CA
29 days ago
Senior ML Engineer, Perception — Real-Time Edge AI
$190k - $222k
Wheel the World in San Francisco seeks a Senior Machine Learning Engineer to lead the perception stack for their Archimedes project. This role requires expertise in developing and deploying perception systems under real-world conditions. The successful candidate will have...
Wheel the World
San Francisco, CA
1 day ago
Senior Machine Learning Engineer, Multimodal AI
...at the intersection of AI, robotics, and healthcare... ...Senior Machine Learning Engineer, you will build the intelligence... ...‑world data, ambiguous edge cases, and high‑leverage... ..., error handling, and low‑quality input recovery.... .... Optimize cost, latency, and reliability across...
Work at office
Hike-Medical
San Francisco, CA
3 days ago
ML Engineer
Check out 1962 new Machine Learning Engineer opportunities posted on AI Chopping Block Design, build, and maintain... .... Develop and optimize end-to-end ML pipelines encompassing data... ...environments to ensure high performance and low latency. Machine Learning Enginer, Core...
Flexible hours
AI Chopping Block, Inc.
San Francisco, CA
11 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Compiler Engineer, Edge-AI & Low-Latency. Be the first to apply!