Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

ML Software Engineer

CAPSA

About ZETIC.ai ZETIC.ai builds an end-to-end on-device AI deployment and benchmarking platform that helps companies run their existing AI models efficiently on real consumer devices—without relying on expensive cloud GPU infrastructure.We specialize in hardware-aware optimization and deployment across heterogeneous mobile accelerators (NPU/GPU/CPU), enabling fast iteration, clear performance decisions, and controlled production rollout at scale. Our mission is to make high-performance on-device AI practical and shippable for every team that already has models. Job Description We’re hiring an ML Software Engineer (On-Device AI Model Optimizations) to drive the end-to-end effort of porting and optimizing LLMs and multimodal models (ASR, TTS, Vision encoders, etc.) onto edge devices, especially mobile NPUs. The Role You will own the performance roadmap (latency, memory, power/thermal), lead model-side optimization strategy, and collaborate closely with runtime/SDK and app engineers to ship real deployments. Responsibilities Lead model-side optimization and deployment for LLM + multimodal workloads (ASR/TTS/Vision encoders, etc.) on NPU/GPU/CPU paths. Own performance targets and trade-offs across latency / memory / accuracy / battery. quantization (PTQ/QAT), pruning, distillation, operator fusion, KV-cache strategies, attention optimizations, speculative decoding (where applicable), etc. Build and maintain evaluation + profiling pipelines: on-device benchmarks, regression tracking, correctness checks, and performance dashboards. Collaborate with runtime/SDK engineers to resolve compiler/runtime constraints (ops coverage, precision, layout, scheduling). Work with product/engineering to define “ready-to-ship” criteria and ensure reliable production deployment across device variants. Qualifications 3+ years (or equivalent) building and shipping ML systems, with substantial hands-on experience optimizing models for real-world deployment. Strong understanding of deep learning fundamentals and performance bottlenecks (compute, memory bandwidth, cache behavior). Practical experience with at least one of: LLM inference optimization (quantization, attention/KV cache, decode-time performance) ASR/TTS deployment (streaming, latency constraints, audio pre/post) Vision encoder optimization (image preprocessing, feature extraction performance) Solid software engineering skills in Python + C/C++ (or equivalent low-level performance language). Experience debugging numerical issues and ensuring correctness across mixed precision / quantized inference. Comfortable working across ambiguous constraints and turning “it should be faster” into measurable engineering work. Preferred Qualifications Direct experience deploying to mobile/edge accelerators (NPU/DSP/GPU) and/or working with hardware vendor stacks. Experience with model compilation toolchains and performance tooling (profilers, operator-level tracing, memory analysis). Experience shipping SDKs or inference runtimes used by external developers. Familiarity with multi-device deployment realities: device fragmentation, fallback paths, capability detection, and reproducibility. Required Skillset Edge/On-device ML optimization mindset (latency, memory, power, thermal) Quantization & mixed-precision inference (PTQ/QAT; int8/fp16 strategies) Performance profiling + debugging (numerical + system-level) Preferred Skills Model architecture understanding across transformers / conformers / diffusion-vocoders (as applicable) Cross-functional collaboration (runtime/compiler/app/product) Required Toolset C/C++ (performance-critical components / integration work) Benchmarking & profiling tools (device profilers, operator-level tracing, memory tools) Must Have Proven ability to make models materially faster/smaller on real devices (not just on GPU Server) Can lead optimization efforts end-to-end with clear metrics and deliverables Comfortable with heterogeneous execution (NPU/GPU/CPU fallbacks) Compensation Range Equity: meaningful early-stage option grant (role & level dependent) Benefits: standard US benefits package (details shared during process) Job Information Company ZETIC.ai Location San Francisco, CA Seoul, South Korea Employment Type Full-Time Workplace Type On-Site #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the ML Software Engineer in San Francisco, CA vacancy
  • Job Title: ML Software Engineer About Xterra Xterra is a Khosla Ventures-backed company building AI agents that reason about complex scientific problems. We’re not a wrapper around existing models, we’re training our own foundation models on top of large-scale proprietary... 
    Suggested

    Xterraai

    San Francisco, CA
    3 days ago
  • Rippling is seeking a Machine Learning Software Engineer Intern for Winter 2027 in San Francisco. The role involves developing innovative products, working with a mentor, and experiencing a full-time engineer's responsibilities. Interns collaborate on impactful projects... 
    Suggested
    Full time
    Internship

    Rippling

    San Francisco, CA
    4 days ago
  • A tech company specializing in AI solutions is seeking an AI/ML-focused Software Engineer to build innovative AI-powered features for requirements management. The ideal candidate will have over 3 years of experience in applied ML and strong software engineering skills.... 
    Suggested
    Flexible hours

    Flow Engineering

    San Francisco, CA
    1 day ago
  • $140k - $210k

     ...innovation and creating the best experience for job seekers. (*Comscore, Total Visits, March 2025) Day to Day As a Software Engineer IV (ML) on the Machine Learning Model Platform team at Indeed, you will be responsible for leading and executing key objectives for... 
    Suggested
    Temporary work
    Work experience placement
    Local area

    Indeed Inc.

    San Francisco, CA
    9 days ago
  • $175k - $250k

     ...GraphQL. ABOUT THE ROLE Swayable is seeking a Senior Engineer blending Python software development expertise with scientific computing, machine...  .... * You keep up with the constantly evolving toolset for ML and AI Ops. * You are knowledgeable about software architecture... 
    Suggested

    Swayable

    San Francisco, CA
    6 days ago
  • $80 - $120 per hour

    Machine Learning Software Engineer (Part-Time | $80 -$120/hr) Join to apply for the Machine Learning Software Engineer (Part-Time | $80 -$120/...  ...workloads. If you’re an early-career Machine Learning Engineer or an ML-focused graduate student/PhD who values innovation, rigor, and... 
    Hourly pay
    Part time
    Remote work

    Call For Referral

    San Francisco, CA
    6 days ago
  • Machine Learning Software Engineer Intern - Winter 2027 About this position About Rippling Rippling gives businesses one place to run HR, IT,...  ...model training. Stay up‑to‑date with the latest research in ML and related fields, and apply this knowledge to improve Rippling... 
    Full time
    Internship
    Work at office
    3 days per week

    Rippling

    San Francisco, CA
    4 days ago
  •  ...will be responsible for helping build the software and machine learning systems that power...  ...closely with the team to develop and deploy ML models, build real‑time software systems,...  ...intelligence Collaborate with robotics engineers to build integrated robotic systems You... 
    Internship
    Immediate start

    Human Computer Lab

    San Francisco, CA
    4 days ago
  •  ...salary range: Based on experience and market value Role: ML/AI Engineers (This role is open to US Citizens, Green Card holders, GC-EAD...  ..., Apple, Spotify, US Bank, FedEx, and more. We're not just a software consulting company we're a dynamic force shaping the future... 
    Remote work
    Visa sponsorship
    Relocation package

    Adidev Technologies Inc

    Daly City, CA
    13 days ago
  • Job Title Disabled veteran A veteran who served on active duty in the U.S. military and is entitled to disability compensation (or who but for the receipt of military retired pay would be entitled to disability compensation) under laws administered by the Secretary of...

    GEMÜ

    San Francisco, CA
    2 days ago
  • $200k - $250k

     ...customers. She will pick the best candidates from Jack's network The next step is to speak to Jack. Job Title: Founding AI/ML Engineer Salary: $200-250K + Equity Company Description: Generalcatalyst.com - AI startup pioneered by Princeton researchers and a... 

    Jack and Jill AI

    San Francisco, CA
    4 days ago
  •  ...can do in education. About the role We’re looking for an AI/ML Engineer to join our product engineering and applied research team. Our...  ...the job, whether that’s a novel machine learning model or plain software engineering fundamentals. You’re excited to build an AI... 
    Work at office
    Worldwide
    Shift work

    Ello Technology, Inc

    San Francisco, CA
    4 days ago
  • $125k

     ...AI/ML Engineer San Francisco, CA, USA About the role Chime's AI/ML Trust & Safety team is building models, insights, and decisioning systems that help protect millions of members while enabling safe, reliable financial progress. We are looking for an AI/ML Engineer... 
    Full time
    Internship
    Work at office
    Local area
    Remote work
    Night shift

    Chime

    San Francisco, CA
    4 days ago
  •  ...AI/ML Engineer (Computer Vision) Location: On site, Bay Area, CA A fast-growing applied AI company is expanding its engineering...  ...opportunity for someone who enjoys turning research into reliable software, working close to product decisions, and shipping models that... 

    Blue Signal LLC

    San Francisco, CA
    1 day ago
  •  ...team (product, platform, and design) to shape both the data and ML foundations and the user-facing experiences that differentiate Known...  ...quality. Collaborate cross-functionally with platform engineers and product designers to integrate AI seamlessly into the Known... 

    Pear VC

    San Francisco, CA
    7 days ago
  •  ...Known - Founding Machine Learning Engineer ~ San Francisco, CA (In-Person) ~200k-375k Cash + Equity Known is a matchmaker...  ...compatibility. You'll work directly with Chen Peng, former head of ML at Uber Eats and Faire. What you'll do It's up to you to... 

    Known, Inc

    San Francisco, CA
    19 hours ago
  •  ...Overview: AI/ML Engineering Junior Engineer Location: Silicon Valley (Onsite) | Type: Full-time | Visa Sponsorship: No What you'll do • Work embedded in the AI engineering team on a scoped project: RAG pipeline optimization, agentic workflow tooling... 
    Full time
    Internship
    Visa sponsorship

    Swift Pace Solutions Inc

    San Francisco, CA
    1 day ago
  •  ...that impact millions of people. About The Role As an AI/ML Engineer at Brain Co., you will play a crucial role in deploying state-...  ...alongside experienced ex. Founders, AI researchers, and software engineers to understand complex business challenges and deliver... 
    Worldwide

    Brainco

    San Francisco, CA
    3 days ago
  • $172k

     ...Senior AI/ML Engineer Chicago, IL, USA; New York, NY, USA; San Francisco, CA, USA; Seattle, WA, USA About the role We're hiring for a Senior AI/ML Engineer, Growth & Marketing AI to help us build the next generation of AI-powered growth and marketing capabilities... 
    Full time
    Work at office
    Local area
    Remote work
    Night shift

    CHIME INC.

    San Francisco, CA
    4 days ago
  •  ...Apple, with deep expertise spanning hardware and software. Join us in shaping a future where computers truly...  ...alive. About the Role We are seeking an engineer living at the intersection of embedded systems and ML to enable rich, reliable interactions on wearable... 
    Full time
    Contract work
    Flexible hours

    SESAME

    San Francisco, CA
    19 hours ago
  •  ...A technology recruiting platform is seeking a senior or staff AI/ML Engineer in San Francisco. In this pivotal role, you will build innovative AI features to enhance recruiting processes. You will collaborate closely with a talented team to create cutting-edge solutions... 
    Flexible hours

    Clutch Canada

    San Francisco, CA
    4 days ago
  • $190k - $260k

     ...Kindredventures is seeking a senior or staff AI/ML Engineer in San Francisco to design and deliver cutting-edge AI features that transform the hiring process. You will engage in multi-step reasoning systems and intelligent searches to help recruiters efficiently find the... 
    Flexible hours

    Kindredventures

    San Francisco, CA
    19 hours ago
  •  ...Navi AI Pilot Debrief Intelligence Engineer Navi captures everything a pilot sees and hears and turns it into automated debrief intelligence...  ...CFI would catch. About the Role This is a founding AI/ML role. You'll own the intelligence layer that sits at the core of... 

    Navi Ai

    San Francisco, CA
    19 hours ago
  •  ...from 100+ labor databases. We are now building our Silicon Valley engineering team — a small, senior group focused on next-generation AI...  ...Computer Science, Machine Learning, or related field. 3–5 years of AI/ML engineering; minimum 2 years building LLM-powered systems... 
    Work at office
    Visa sponsorship
    Flexible hours
    3 days per week

    HopHR

    San Francisco, CA
    4 days ago
  •  ...ML/AI Research Engineer — Agentic AI Lab (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation: Competitive salary + meaningful equity (founding tier) Backed by 8VC, we're building a world-class team to tackle one of the industry's most critical... 
    Full time

    Fabrion

    San Francisco, CA
    2 days ago
  • $150k - $350k

     ...network The next step is to speak to Jack. Job Title: AI/ML Research Engineer Salary: $150k-$350k + Equity Company Description:...  ...multi-agent systems and vision-language models that navigate software interfaces. This role offers the unique opportunity to shape... 

    Jack and Jill AI

    San Francisco, CA
    4 days ago
  •  ...A consumer AI startup seeks an AI/ML Engineer to design and implement core matchmaking systems from scratch. You will lead a small technical team, scale systems to support millions of users, and work closely with industry veterans. Ideal candidates have experience with... 

    Jack & Jill/External ATS

    San Francisco, CA
    4 days ago
  • $198k - $221.5k

    Alumni Ventures is seeking an AI-focused individual to join Strava in San Francisco. The role involves building and optimizing ML systems for a well-loved consumer product. Candidates should have experience in data analysis and model deployment, with proficiency in tools... 

    Alumni Ventures

    San Francisco, CA
    3 days ago
  • $308k - $423.5k

     ...the shop local movement. If you believe in community, come join ours. About this role: We are seeking a  Principal AI / ML Engineer to be a  company-level technical thought leader and practitioner to help shape the future of Data and AI at Faire. This is a rare... 
    Work experience placement
    Work at office
    Local area
    Remote work
    Monday to Friday
    Flexible hours
    3 days per week

    Faire Inc

    San Francisco, CA
    2 days ago
  • Israelvcforum is looking for a Senior Compiler Engineer to join their AI Kernels & Compilers team in San Francisco, California. In this role...  ...teams. Candidates should have a strong background in compilers, Python/C++, and ML frameworks. #J-18808-Ljbffr Israelvcforum

    Israelvcforum

    San Francisco, CA
    19 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Software Engineer. Be the first to apply!