ML Software Engineer

CAPSA

About ZETIC.ai ZETIC.ai builds an end-to-end on-device AI deployment and benchmarking platform that helps companies run their existing AI models efficiently on real consumer devices—without relying on expensive cloud GPU infrastructure.We specialize in hardware-aware optimization and deployment across heterogeneous mobile accelerators (NPU/GPU/CPU), enabling fast iteration, clear performance decisions, and controlled production rollout at scale. Our mission is to make high-performance on-device AI practical and shippable for every team that already has models. Job Description We’re hiring an ML Software Engineer (On-Device AI Model Optimizations) to drive the end-to-end effort of porting and optimizing LLMs and multimodal models (ASR, TTS, Vision encoders, etc.) onto edge devices, especially mobile NPUs. The Role You will own the performance roadmap (latency, memory, power/thermal), lead model-side optimization strategy, and collaborate closely with runtime/SDK and app engineers to ship real deployments. Responsibilities Lead model-side optimization and deployment for LLM + multimodal workloads (ASR/TTS/Vision encoders, etc.) on NPU/GPU/CPU paths. Own performance targets and trade-offs across latency / memory / accuracy / battery. quantization (PTQ/QAT), pruning, distillation, operator fusion, KV-cache strategies, attention optimizations, speculative decoding (where applicable), etc. Build and maintain evaluation + profiling pipelines: on-device benchmarks, regression tracking, correctness checks, and performance dashboards. Collaborate with runtime/SDK engineers to resolve compiler/runtime constraints (ops coverage, precision, layout, scheduling). Work with product/engineering to define “ready-to-ship” criteria and ensure reliable production deployment across device variants. Qualifications 3+ years (or equivalent) building and shipping ML systems, with substantial hands-on experience optimizing models for real-world deployment. Strong understanding of deep learning fundamentals and performance bottlenecks (compute, memory bandwidth, cache behavior). Practical experience with at least one of: LLM inference optimization (quantization, attention/KV cache, decode-time performance) ASR/TTS deployment (streaming, latency constraints, audio pre/post) Vision encoder optimization (image preprocessing, feature extraction performance) Solid software engineering skills in Python + C/C++ (or equivalent low-level performance language). Experience debugging numerical issues and ensuring correctness across mixed precision / quantized inference. Comfortable working across ambiguous constraints and turning “it should be faster” into measurable engineering work. Preferred Qualifications Direct experience deploying to mobile/edge accelerators (NPU/DSP/GPU) and/or working with hardware vendor stacks. Experience with model compilation toolchains and performance tooling (profilers, operator-level tracing, memory analysis). Experience shipping SDKs or inference runtimes used by external developers. Familiarity with multi-device deployment realities: device fragmentation, fallback paths, capability detection, and reproducibility. Required Skillset Edge/On-device ML optimization mindset (latency, memory, power, thermal) Quantization & mixed-precision inference (PTQ/QAT; int8/fp16 strategies) Performance profiling + debugging (numerical + system-level) Preferred Skills Model architecture understanding across transformers / conformers / diffusion-vocoders (as applicable) Cross-functional collaboration (runtime/compiler/app/product) Required Toolset C/C++ (performance-critical components / integration work) Benchmarking & profiling tools (device profilers, operator-level tracing, memory tools) Must Have Proven ability to make models materially faster/smaller on real devices (not just on GPU Server) Can lead optimization efforts end-to-end with clear metrics and deliverables Comfortable with heterogeneous execution (NPU/GPU/CPU fallbacks) Compensation Range Equity: meaningful early-stage option grant (role & level dependent) Benefits: standard US benefits package (details shared during process) Job Information Company ZETIC.ai Location San Francisco, CA Seoul, South Korea Employment Type Full-Time Workplace Type On-Site #J-18808-Ljbffr

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the ML Software Engineer in San Francisco, CA vacancy

ML Software Engineer
Job Title: ML Software Engineer About Xterra Xterra is a Khosla Ventures-backed company building AI agents that reason about complex scientific problems. We’re not a wrapper around existing models, we’re training our own foundation models on top of large-scale proprietary...
Suggested
Xterraai
San Francisco, CA
3 days ago
ML Software Engineer Intern — Build Production Models
Rippling is seeking a Machine Learning Software Engineer Intern for Winter 2027 in San Francisco. The role involves developing innovative products, working with a mentor, and experiencing a full-time engineer's responsibilities. Interns collaborate on impactful projects...
Suggested
Full time
Internship
Rippling
San Francisco, CA
4 days ago
AI/ML Software Engineer — AI-Powered Requirements
A tech company specializing in AI solutions is seeking an AI/ML-focused Software Engineer to build innovative AI-powered features for requirements management. The ideal candidate will have over 3 years of experience in applied ML and strong software engineering skills....
Suggested
Flexible hours
Flow Engineering
San Francisco, CA
1 day ago
Staff ML Software Engineer
$140k - $210k
...innovation and creating the best experience for job seekers. (*Comscore, Total Visits, March 2025) Day to Day As a Software Engineer IV (ML) on the Machine Learning Model Platform team at Indeed, you will be responsible for leading and executing key objectives for...
Suggested
Temporary work
Work experience placement
Local area
Indeed Inc.
San Francisco, CA
9 days ago
Senior AI/ML Engineer: Python & Scientific Computing
$175k - $250k
...GraphQL. ABOUT THE ROLE Swayable is seeking a Senior Engineer blending Python software development expertise with scientific computing, machine... .... * You keep up with the constantly evolving toolset for ML and AI Ops. * You are knowledgeable about software architecture...
Suggested
Swayable
San Francisco, CA
6 days ago
Machine Learning Software Engineer (Part-Time | $80 -$120/hr)
$80 - $120 per hour
Machine Learning Software Engineer (Part-Time | $80 -$120/hr) Join to apply for the Machine Learning Software Engineer (Part-Time | $80 -$120/... ...workloads. If you’re an early-career Machine Learning Engineer or an ML-focused graduate student/PhD who values innovation, rigor, and...
Hourly pay
Part time
Remote work
Call For Referral
San Francisco, CA
6 days ago
Machine Learning Software Engineer Intern - Winter 2027
Machine Learning Software Engineer Intern - Winter 2027 About this position About Rippling Rippling gives businesses one place to run HR, IT,... ...model training. Stay up‑to‑date with the latest research in ML and related fields, and apply this knowledge to improve Rippling...
Full time
Internship
Work at office
3 days per week
Rippling
San Francisco, CA
4 days ago
Intern - Software/ML Engineering
...will be responsible for helping build the software and machine learning systems that power... ...closely with the team to develop and deploy ML models, build real‑time software systems,... ...intelligence Collaborate with robotics engineers to build integrated robotic systems You...
Internship
Immediate start
Human Computer Lab
San Francisco, CA
4 days ago
ML/AI Engineers
...salary range: Based on experience and market value Role: ML/AI Engineers (This role is open to US Citizens, Green Card holders, GC-EAD... ..., Apple, Spotify, US Bank, FedEx, and more. We're not just a software consulting company we're a dynamic force shaping the future...
Remote work
Visa sponsorship
Relocation package
Adidev Technologies Inc
Daly City, CA
13 days ago
AI/ML Engineer
Job Title Disabled veteran A veteran who served on active duty in the U.S. military and is entitled to disability compensation (or who but for the receipt of military retired pay would be entitled to disability compensation) under laws administered by the Secretary of...
GEMÜ
San Francisco, CA
2 days ago
Founding AI/ML Engineer ($200-250K + Equity) at Generalcatalyst.com
$200k - $250k
...customers. She will pick the best candidates from Jack's network The next step is to speak to Jack. Job Title: Founding AI/ML Engineer Salary: $200-250K + Equity Company Description: Generalcatalyst.com - AI startup pioneered by Princeton researchers and a...
Jack and Jill AI
San Francisco, CA
4 days ago
AI/ML Engineer
...can do in education. About the role We’re looking for an AI/ML Engineer to join our product engineering and applied research team. Our... ...the job, whether that’s a novel machine learning model or plain software engineering fundamentals. You’re excited to build an AI...
Work at office
Worldwide
Shift work
Ello Technology, Inc
San Francisco, CA
4 days ago
AI/ML Engineer
$125k
...AI/ML Engineer San Francisco, CA, USA About the role Chime's AI/ML Trust & Safety team is building models, insights, and decisioning systems that help protect millions of members while enabling safe, reliable financial progress. We are looking for an AI/ML Engineer...
Full time
Internship
Work at office
Local area
Remote work
Night shift
Chime
San Francisco, CA
4 days ago
AI/ML Engineer (Computer Vision)
...AI/ML Engineer (Computer Vision) Location: On site, Bay Area, CA A fast-growing applied AI company is expanding its engineering... ...opportunity for someone who enjoys turning research into reliable software, working close to product decisions, and shipping models that...
Blue Signal LLC
San Francisco, CA
1 day ago
AI / ML Engineer - Known
...team (product, platform, and design) to shape both the data and ML foundations and the user-facing experiences that differentiate Known... ...quality. Collaborate cross-functionally with platform engineers and product designers to integrate AI seamlessly into the Known...
Pear VC
San Francisco, CA
7 days ago
AI / ML Engineer
...Known - Founding Machine Learning Engineer ~ San Francisco, CA (In-Person) ~200k-375k Cash + Equity Known is a matchmaker... ...compatibility. You'll work directly with Chen Peng, former head of ML at Uber Eats and Faire. What you'll do It's up to you to...
Known, Inc
San Francisco, CA
19 hours ago
AI/ML Engineering Junior Engineer
...Overview: AI/ML Engineering Junior Engineer Location: Silicon Valley (Onsite) | Type: Full-time | Visa Sponsorship: No What you'll do • Work embedded in the AI engineering team on a scoped project: RAG pipeline optimization, agentic workflow tooling...
Full time
Internship
Visa sponsorship
Swift Pace Solutions Inc
San Francisco, CA
1 day ago
AI/ML Engineer
...that impact millions of people. About The Role As an AI/ML Engineer at Brain Co., you will play a crucial role in deploying state-... ...alongside experienced ex. Founders, AI researchers, and software engineers to understand complex business challenges and deliver...
Worldwide
Brainco
San Francisco, CA
3 days ago
Senior AI/ML Engineer
$172k
...Senior AI/ML Engineer Chicago, IL, USA; New York, NY, USA; San Francisco, CA, USA; Seattle, WA, USA About the role We're hiring for a Senior AI/ML Engineer, Growth & Marketing AI to help us build the next generation of AI-powered growth and marketing capabilities...
Full time
Work at office
Local area
Remote work
Night shift
CHIME INC.
San Francisco, CA
4 days ago
Embedded ML Engineer - Gesture Recognition
...Apple, with deep expertise spanning hardware and software. Join us in shaping a future where computers truly... ...alive. About the Role We are seeking an engineer living at the intersection of embedded systems and ML to enable rich, reliable interactions on wearable...
Full time
Contract work
Flexible hours
SESAME
San Francisco, CA
19 hours ago
Senior AI/ML Engineer for Production Recruiting AI
...A technology recruiting platform is seeking a senior or staff AI/ML Engineer in San Francisco. In this pivotal role, you will build innovative AI features to enhance recruiting processes. You will collaborate closely with a talented team to create cutting-edge solutions...
Flexible hours
Clutch Canada
San Francisco, CA
4 days ago
Senior AI/ML Engineer - Build Production Hiring AI
$190k - $260k
...Kindredventures is seeking a senior or staff AI/ML Engineer in San Francisco to design and deliver cutting-edge AI features that transform the hiring process. You will engage in multi-step reasoning systems and intelligent searches to help recruiters efficiently find the...
Flexible hours
Kindredventures
San Francisco, CA
19 hours ago
ML/AI Founding Engineer
...Navi AI Pilot Debrief Intelligence Engineer Navi captures everything a pilot sees and hears and turns it into automated debrief intelligence... ...CFI would catch. About the Role This is a founding AI/ML role. You'll own the intelligence layer that sits at the core of...
Navi Ai
San Francisco, CA
19 hours ago
AI/ML Engineer
...from 100+ labor databases. We are now building our Silicon Valley engineering team — a small, senior group focused on next-generation AI... ...Computer Science, Machine Learning, or related field. 3–5 years of AI/ML engineering; minimum 2 years building LLM-powered systems...
Work at office
Visa sponsorship
Flexible hours
3 days per week
HopHR
San Francisco, CA
4 days ago
ML/AI Research Engineer Agentic AI Lab (Founding Team)
...ML/AI Research Engineer — Agentic AI Lab (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation: Competitive salary + meaningful equity (founding tier) Backed by 8VC, we're building a world-class team to tackle one of the industry's most critical...
Full time
Fabrion
San Francisco, CA
2 days ago
AI/ML Research Engineer ($150k-$350k + Equity) at Fast-growing enterprise AI startup
$150k - $350k
...network The next step is to speak to Jack. Job Title: AI/ML Research Engineer Salary: $150k-$350k + Equity Company Description:... ...multi-agent systems and vision-language models that navigate software interfaces. This role offers the unique opportunity to shape...
Jack and Jill AI
San Francisco, CA
4 days ago
AI/ML Engineer Personalization & Matchmaking (SF)
...A consumer AI startup seeks an AI/ML Engineer to design and implement core matchmaking systems from scratch. You will lead a small technical team, scale systems to support millions of users, and work closely with industry veterans. Ideal candidates have experience with...
Jack & Jill/External ATS
San Francisco, CA
4 days ago
Lead ML Engineer, Fitness AI — End-to-End Systems
$198k - $221.5k
Alumni Ventures is seeking an AI-focused individual to join Strava in San Francisco. The role involves building and optimizing ML systems for a well-loved consumer product. Candidates should have experience in data analysis and model deployment, with proficiency in tools...
Alumni Ventures
San Francisco, CA
3 days ago
Principal Applied AI / ML Engineer
$308k - $423.5k
...the shop local movement. If you believe in community, come join ours. About this role: We are seeking a Principal AI / ML Engineer to be a company-level technical thought leader and practitioner to help shape the future of Data and AI at Faire. This is a rare...
Work experience placement
Work at office
Local area
Remote work
Monday to Friday
Flexible hours
3 days per week
Faire Inc
San Francisco, CA
2 days ago
Lead ML Compiler Engineer for Autonomous Driving
Israelvcforum is looking for a Senior Compiler Engineer to join their AI Kernels & Compilers team in San Francisco, California. In this role... ...teams. Candidates should have a strong background in compilers, Python/C++, and ML frameworks. #J-18808-Ljbffr Israelvcforum
Israelvcforum
San Francisco, CA
19 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Software Engineer. Be the first to apply!