ML Compiler Engineer, Edge-AI & Low-Latency
femtoAI
A technology company based in San Bruno is seeking a Compiler Engineer to work on a custom ML compiler for their AI accelerator. The role requires a minimum of 2 years experience in compilers or edge-AI, with strong proficiency in Python and/or C++. You will be responsible for building model ingestion pipelines, implementing graph transformations, and debugging. The position offers benefits including medical insurance, 401(k), and paid leave for parents. #J-18808-Ljbffr femtoAI
- A tech company specializing in AI solutions, based in San Bruno, is looking for a Compiler Developer. The role involves building... ...execution for a custom ML compiler. Candidates should have... ...years of experience in compilers or edge AI, with proficiency in Python and...Suggested
- ...You will work on a custom ML compiler that transforms modern ML and... ...execution under tight memory and latency constraints. What You'll... ...experience in compilers and/or edge-AI ~ Proficiency in Python and... ...Exposure to quantization or low-precision computation Interest...Suggested
- ...Francisco seeks candidates with expertise in AI simulation development. The role... ...enhancing GPU performance, and ensuring low-latency inference. Applicants should be proficient... ...opportunities for innovation and cutting-edge technology implementation. #J-18808-Ljbffr...Suggested
- ...pioneered a high-performance AI accelerator integrated with... ...embedded AI platform, enabling low‑latency operation with less energy... ...core components of our ML compiler, owning critical parts of the... ...compilers, systems, or performance engineering Masters or PhD in Computer...Suggested
$128.7k - $261.3k
...mobility. For the AI Kernels & Compilers team, that mission shows... ...: turning cutting-edge perception, prediction... ..., and performance engineering so that every cycle on... ..., and effortless for ML engineers across the... ...fidelity, and on-vehicle latency. Along the way, you'...SuggestedLocal areaWork from homeRelocation packageFlexible hours- ...ZK and ML Compiler Engineer San Francisco Bay Area We are at the forefront of Zero-Knowledge Machine... ...learning, and compiler optimization. AI Tool Proficiency Requirements... ...Offer Opportunity to work on cutting-edge zkML technology Competitive compensation...
- A media technology company in San Francisco is seeking a Founding Engineer specializing in ML Inference. This highly technical role requires expertise in the ML infrastructure stack and aims to optimize generative media performance. The ideal candidate will drive innovations...Relocation package
- ...strong foundation in low-level operating... ...-of-the-art AI models Optimizing... ...and low latency at scale Developing... ...to bring cutting-edge capabilities into... ..., Triton ) and compiler optimization Are... ...staying current with ML infrastructure... ...requires a large engineering effort dedicated...Work at office
- A leading AI technology firm located in San Bruno is seeking an experienced compiler design engineer with over 4 years of expertise in compilers, systems, or performance engineering. The ideal candidate will have a Master's or PhD in Computer Science, Electrical Engineering...
- ...leading streaming platform in San Francisco is seeking a Software Engineer to design and build scalable distributed systems. The ideal... ..., particularly AWS. Responsibilities include building low-latency online microservices and collaborating with machine learning engineers...Flexible hours
- About ZETIC.ai ZETIC.ai builds an end-to... ...We’re hiring an ML Software Engineer (On-Device AI Model... ...encoders, etc.) onto edge devices,... ...performance roadmap (latency, memory, power/thermal... ...engineers to resolve compiler/runtime... ...C++ (or equivalent low-level performance...Full time
- ...for the physical world. Our AI platform provides real-time... ...at scale, we rely on cutting-edge optimization to ensure our vision... ...About the Role As an ML / DevOps Engineer, you will play a pivotal role... ...will ensure high throughput, low latency, and rock-solid reliability...Work at office
$180k - $240k
...Senior Machine Learning Engineer - this is a fully... ...been on a mission to use AI to resolve as many conversations... ...our small but mighty ML team building... ...Mandatory) Familiarity with low-latency production ML systems (... ...on genuinely cutting-edge AI problems — low latency...Remote workFlexible hours- Position: Senior ML Performance Engineer Location: SF Bay Area (US) or Toronto... ...: Full-Time Industry: AI Infrastructure / Compiler Systems Overview A... ...anywhere.” This includes cloud, edge, and hybrid environments... ...metrics, and test suites (latency, throughput, memory...Full time
$320k - $405k
...interpretable, and steerable AI systems. We want AI... ...researchers, engineers, policy experts,... ...and implement ML infrastructure that... ...Optimize inference latency and throughput for... ...handle high-throughput, low-latency workloads... ...translating cutting-edge research into production...Work at officeVisa sponsorshipFlexible hours- ZETIC.ai is seeking an ML Software Engineer in San Francisco to optimize AI models for edge devices. You will lead strategies for deploying models, ensuring high performance while collaborating with engineers to achieve reliable production. The ideal candidate has 3+ years...
- ...Zensors Infrastructure Engineer Zensors is the spatial... ...physical world. Our AI platform provides real-... ...scale, we rely on cutting-edge optimization to ensure... ...high throughput, low latency, and reliability for critical... ...deeply with ML engineers to ensure validation...Work at office
- A cutting-edge robotics company is searching for a candidate to manage the full lifecycle of learned components on OG-1, focusing on training... ...will possess a strong educational background in CS, Robotics, or ML and demonstrate proficiency in Python and PyTorch, as well as...
$248.8k - $311k
...Scale's Physical AI business unit is dedicated to... ...Physical AI and developing ML pipelines for... ...Role As an ML Systems Engineer on the Physical AI team... ...platform powers cutting-edge research and production... ...models at scale, ensuring low latency for real-time applications...Full time- A leading streaming service is seeking a Staff Software Engineer to enhance ML infrastructure. The role involves designing scalable systems, mentoring engineers, and collaborating with cross-functional teams. Candidates should have over 8 years of experience in building...
$300k - $400k
...Zeta Global (NYSE: ZETA) is the AI-Powered Marketing Cloud that... ...As a Principal AI/ML Engineer in our AdTech team, you will... ...operating at large scale and low latency to handle billions of ad events... ...capabilities on the cutting edge. ~ AI & Agentic Applications...$250k - $350k
...Staff AI/ML Engineer San Francisco, California, United States The leader in AI-native... ...Experience with big-data engines like Spark. Compiler development for languages like SQL,... .... Optimization of ML models for low-latency, high-throughput production use. Contributions...Work at officeRemote workFlexible hours2 days per week3 days per week- ...we’re not just building AI models—we’re redefining... ...’t: on-device, at the edge, under real-time constraints... ...of world-class engineers, researchers, and builders... ...debug models in popular ML frameworks, and... ...others stall: on CPUs, with low latency, minimal memory, and maximum...
$150k - $220k
...Founding Machine Learning Engineer San Francisco... ...browser understanding, and low-latency systems, shipping models... ..., or consumer-focused "AI browsers," we run AI... ...architecture creates unique ML challenges. This is... ..., Cloudflare Workers edge proxy, and inference providers...H1bWork at officeVisa sponsorshipSleeping nights$200k
...building the world's most trusted AI work companion for... ...Gain exposure to cutting-edge AI for Pro tools and play a... ...deploying high-throughput, ultra-low-latency inference engines for large language models or... ...intersection between the core ML training team and the backend...Full timeWork at officeWorldwide- ...Data Science & ML Ops Engineer Location: Bay Area, CA Tax Term (W2, C2C): W2, C2... ...Leverage AutoML tools (e.g., Vertex AI AutoML, H2O Driverless AI) for low-code/no-code model development,... ...containerized environments and support model scoring via low-latency APIs...
$152k - $228k
...Job Description Job Description Senior ML Engineer About Invoca Invoca is an AI-powered revenue execution platform that brings together marketing... ...-based GPU infrastructure. Profile and tune for low latency and high throughput, and build robust, scalable APIs...Currently hiringRemote workFlexible hours$190k - $222k
Wheel the World in San Francisco seeks a Senior Machine Learning Engineer to lead the perception stack for their Archimedes project. This role requires expertise in developing and deploying perception systems under real-world conditions. The successful candidate will have...- ...at the intersection of AI, robotics, and healthcare... ...Senior Machine Learning Engineer, you will build the intelligence... ...‑world data, ambiguous edge cases, and high‑leverage... ..., error handling, and low‑quality input recovery.... .... Optimize cost, latency, and reliability across...Work at office
- Check out 1962 new Machine Learning Engineer opportunities posted on AI Chopping Block Design, build, and maintain... .... Develop and optimize end-to-end ML pipelines encompassing data... ...environments to ensure high performance and low latency. Machine Learning Enginer, Core...Flexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to ML Compiler Engineer, Edge-AI & Low-Latency. Be the first to apply!


