ML Compiler Engineer, Edge-AI & Low-Latency
femtoAI
A technology company based in San Bruno is seeking a Compiler Engineer to work on a custom ML compiler for their AI accelerator. The role requires a minimum of 2 years experience in compilers or edge-AI, with strong proficiency in Python and/or C++. You will be responsible for building model ingestion pipelines, implementing graph transformations, and debugging. The position offers benefits including medical insurance, 401(k), and paid leave for parents. #J-18808-Ljbffr femtoAI
- A tech company specializing in AI solutions, based in San Bruno, is looking for a Compiler Developer. The role involves building... ...execution for a custom ML compiler. Candidates should have... ...years of experience in compilers or edge AI, with proficiency in Python and...Suggested
- ...You will work on a custom ML compiler that transforms modern ML and... ...execution under tight memory and latency constraints. What You'll... ...experience in compilers and/or edge-AI ~ Proficiency in Python and... ...Exposure to quantization or low-precision computation Interest...Suggested
- ...Francisco seeks candidates with expertise in AI simulation development. The role... ...enhancing GPU performance, and ensuring low-latency inference. Applicants should be proficient... ...opportunities for innovation and cutting-edge technology implementation. #J-18808-Ljbffr...Suggested
- ...pioneered a high-performance AI accelerator integrated with... ...embedded AI platform, enabling low‑latency operation with less energy... ...core components of our ML compiler, owning critical parts of the... ...compilers, systems, or performance engineering Masters or PhD in Computer...Suggested
$128.7k - $261.3k
...mobility. For the AI Kernels & Compilers team, that mission shows... ...: turning cutting-edge perception, prediction... ..., and performance engineering so that every cycle on... ..., and effortless for ML engineers across the... ...fidelity, and on-vehicle latency. Along the way, you'...SuggestedLocal areaWork from homeRelocation packageFlexible hours$128.7k - $261.3k
...mobility. For the AI Kernels & Compilers team, that... ...turning cutting-edge perception, prediction... ...and performance engineering so that every cycle... ...our on-vehicle ML inference for ADAS... ...meeting strict latency, throughput, and... ...Experience with low latencyorreal time...Local areaWork from homeRelocation packageFlexible hours- ...strong foundation in low-level operating... ...serving state-of-the-art AI models... ...throughput and low latency at scale Developing... ...team to bring cutting-edge capabilities into production... ...(CUDA, Triton) and compiler optimization Are... ...current with ML infrastructure developments...Work at office
- ...ZK and ML Compiler Engineer San Francisco Bay Area We are at the forefront of Zero-Knowledge Machine... ...learning, and compiler optimization. AI Tool Proficiency Requirements... ...Offer Opportunity to work on cutting-edge zkML technology Competitive compensation...
- A media technology company in San Francisco is seeking a Founding Engineer specializing in ML Inference. This highly technical role requires expertise in the ML infrastructure stack and aims to optimize generative media performance. The ideal candidate will drive innovations...Relocation package
- Reactor seeks an ML Inference Engineer in San Francisco to enhance performance on generative media models. In this role, you'll drive model... ...Science or a related field, deep expertise in PyTorch and model compilation, and strong knowledge of GPU hardware. Benefits include a...Relocation package
- A leading AI technology firm located in San Bruno is seeking an experienced compiler design engineer with over 4 years of expertise in compilers, systems, or performance engineering. The ideal candidate will have a Master's or PhD in Computer Science, Electrical Engineering...
$180k - $240k
...Senior Machine Learning Engineer - this is a fully... ...been on a mission to use AI to resolve as many conversations... ...our small but mighty ML team building... ...Mandatory) Familiarity with low-latency production ML systems (... ...on genuinely cutting-edge AI problems — low latency...Remote workFlexible hours- ...Position: Senior ML Performance Engineer Location: SF Bay Area (US)... ...Full-Time Industry: AI Infrastructure / Compiler Systems Overview A... ...anywhere." This includes cloud, edge, and hybrid environments... ..., and test suites (latency, throughput, memory utilization...Full time
$320k - $405k
...interpretable, and steerable AI systems. We want AI... ...researchers, engineers, policy experts,... ...and implement ML infrastructure that... ...Optimize inference latency and throughput for... ...handle high-throughput, low-latency workloads... ...translating cutting-edge research into production...Work at officeVisa sponsorshipFlexible hours- ...across a wide range of environments—from edge devices to large-scale deployments. Our... ...We are seeking a Staff-level (or higher) AI/ML engineer with expertise in multimodal systems to... ...speech) Optimizing multimodal pipelines for latency, efficiency, and deployment across a...
- ...building the future of voice AI operating systems for clinicians... ...point where cutting-edge AI meets real clinical impact... ...We are seeking an Applied ML Engineer to productionize and scale machine... ...transforming models into reliable, low-latency, production-grade systems...
- Job Overview Department: Engineering Location: San Francisco We're looking for an ML Inference Engineer with deep expertise... ...Reactor’s competitive edge in ultra‑low‑latency, high‑throughput environments... ...Implement optimizations using torch.compile, custom CUDA kernels, and...Visa sponsorshipRelocation package
$125k - $150k
A leading AI solutions firm in California is seeking a Senior Engineer, Machine Learning to enhance their ML capabilities. This role involves developing audio and vision models for edge devices, implementing prototypes, and optimizing performance through cutting-edge ML...- ...to bring the power of AI to everyone—without leaving... ...performance, portable compiler that lets developers "... ..., deploy them to the edge, and everything in... ...looking for a Senior ML Performance Engineer to architect and lead... ...test suites that measure latency, throughput, memory...
- ...Zensors Infrastructure Engineer Zensors is the spatial... ...physical world. Our AI platform provides real-... ...scale, we rely on cutting-edge optimization to ensure... ...high throughput, low latency, and reliability for critical... ...deeply with ML engineers to ensure validation...Work at office
- ...The Next Frontier Of Ai-Driven Robotics! Dyna... ...As a Lead ML Infrastructure Engineer, you are the architect... ...hardware and cutting-edge research, ensuring that... ...Production Inference: Build low-latency inference pipelines... ...distillation, and model compilation (TensorRT, Triton) to...
- The problem we saw Most AI infrastructure is built for... ...inference infrastructure. As our ML Infrastructure and Platform Engineer, you will own the... ...ensuring high availability and low‑latency inference across the fleet... ...that will give you an edge Exposure to ML infrastructure...Flexible hoursShift work
$178.64k - $319k
Samsara seeks a Staff Machine Learning Engineer to develop AI solutions and optimize ML systems for Edge compute constraints. This remote role involves working with large-scale data, leading AI initiatives, and collaborating across teams. With over 8 years of experience...Remote jobFlexible hours- ...AI Chopping Block, Inc. is seeking a talented engineer to lead the development of our high-performance robotics simulation platform. You will design and implement... ...for optimize performance, focusing on compiler stack advancements, including JIT compilation and GPU...
$216k - $270k
...Scale's Physical AI business unit is dedicated to... ...Physical AI and developing ML pipelines for... ...As an ML Systems Engineer on the Physical AI team... ...platform powers cutting-edge research and production... ...models at scale, ensuring low latency for real-time applications...Full time- A cutting-edge robotics company is searching for a candidate to manage the full lifecycle of learned components on OG-1, focusing on training... ...will possess a strong educational background in CS, Robotics, or ML and demonstrate proficiency in Python and PyTorch, as well as...
- A leading streaming service is seeking a Staff Software Engineer to enhance ML infrastructure. The role involves designing scalable systems, mentoring engineers, and collaborating with cross-functional teams. Candidates should have over 8 years of experience in building...
$218.4k - $273k
Scale's Physical AI business unit is dedicated to... ...Physical AI and developing ML pipelines for... ...Role As an ML Systems Engineer on the Physical AI team... ...platform powers cutting-edge research and production... ...models at scale, ensuring low latency for real-time applications...Full time$250k - $350k
...Staff AI/ML Engineer San Francisco, California, United States The leader in AI-native... ...Experience with big-data engines like Spark. Compiler development for languages like SQL,... .... Optimization of ML models for low-latency, high-throughput production use. Contributions...Work at officeRemote workFlexible hours2 days per week3 days per week$300k - $400k
...Zeta Global (NYSE: ZETA) is the AI-Powered Marketing Cloud that... ...As a Principal AI/ML Engineer in our AdTech team, you will... ...operating at large scale and low latency to handle billions of ad events... ...capabilities on the cutting edge. AI & Agentic Applications:...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to ML Compiler Engineer, Edge-AI & Low-Latency. Be the first to apply!
- machine learning San Bruno, CA
- machine learning ai engineer
- lead machine learning engineer
- machine learning engineer
- entry level machine learning engineer
- staff machine learning engineer
- junior machine learning research engineer
- junior machine learning engineer
- machine learning software engineer
- ai ml engineer

