ML Software Engineer
CAPSA
About ZETIC.ai

ZETIC.ai builds an end-to-end on-device AI deployment and benchmarking platform that helps companies run their existing AI models efficiently on real consumer devices, without relying on expensive cloud GPU infrastructure. We specialize in hardware-aware optimization and deployment across heterogeneous mobile accelerators (NPU/GPU/CPU), enabling fast iteration, clear performance decisions, and controlled production rollout at scale. Our mission is to make high-performance on-device AI practical and shippable for every team that already has models.

Job Description

We’re hiring an ML Software Engineer (On-Device AI Model Optimizations) to drive the end-to-end effort of porting and optimizing LLMs and multimodal models (ASR, TTS, vision encoders, etc.) onto edge devices, especially mobile NPUs.

The Role

You will own the performance roadmap (latency, memory, power/thermal), lead model-side optimization strategy, and collaborate closely with runtime/SDK and app engineers to ship real deployments.

Responsibilities

- Lead model-side optimization and deployment for LLM + multimodal workloads (ASR/TTS/vision encoders, etc.) on NPU/GPU/CPU paths.
- Own performance targets and trade-offs across latency / memory / accuracy / battery.
- Relevant techniques include quantization (PTQ/QAT), pruning, distillation, operator fusion, KV-cache strategies, attention optimizations, speculative decoding (where applicable), etc.
- Build and maintain evaluation + profiling pipelines: on-device benchmarks, regression tracking, correctness checks, and performance dashboards.
- Collaborate with runtime/SDK engineers to resolve compiler/runtime constraints (ops coverage, precision, layout, scheduling).
- Work with product/engineering to define “ready-to-ship” criteria and ensure reliable production deployment across device variants.

Qualifications

- 3+ years (or equivalent) building and shipping ML systems, with substantial hands-on experience optimizing models for real-world deployment.
- Strong understanding of deep learning fundamentals and performance bottlenecks (compute, memory bandwidth, cache behavior).
- Practical experience with at least one of:
  - LLM inference optimization (quantization, attention/KV cache, decode-time performance)
  - ASR/TTS deployment (streaming, latency constraints, audio pre/post-processing)
  - Vision encoder optimization (image preprocessing, feature extraction performance)
- Solid software engineering skills in Python + C/C++ (or equivalent low-level performance language).
- Experience debugging numerical issues and ensuring correctness across mixed-precision / quantized inference.
- Comfortable working across ambiguous constraints and turning “it should be faster” into measurable engineering work.

Preferred Qualifications

- Direct experience deploying to mobile/edge accelerators (NPU/DSP/GPU) and/or working with hardware vendor stacks.
- Experience with model compilation toolchains and performance tooling (profilers, operator-level tracing, memory analysis).
- Experience shipping SDKs or inference runtimes used by external developers.
- Familiarity with multi-device deployment realities: device fragmentation, fallback paths, capability detection, and reproducibility.

Required Skillset

- Edge/on-device ML optimization mindset (latency, memory, power, thermal)
- Quantization & mixed-precision inference (PTQ/QAT; int8/fp16 strategies)
- Performance profiling + debugging (numerical + system-level)

Preferred Skills

- Model architecture understanding across transformers / conformers / diffusion-vocoders (as applicable)
- Cross-functional collaboration (runtime/compiler/app/product)

Required Toolset

- C/C++ (performance-critical components / integration work)
- Benchmarking & profiling tools (device profilers, operator-level tracing, memory tools)

Must Have

- Proven ability to make models materially faster/smaller on real devices (not just on a GPU server)
- Can lead optimization efforts end-to-end with clear metrics and deliverables
- Comfortable with heterogeneous execution (NPU/GPU/CPU fallbacks)

Compensation Range

- Equity: meaningful early-stage option grant (role & level dependent)
- Benefits: standard US benefits package (details shared during process)

Job Information

- Company: ZETIC.ai
- Location: San Francisco, CA; Seoul, South Korea
- Employment Type: Full-Time
- Workplace Type: On-Site


