Staff Machine Learning Engineer - Model Optimization & Quantization

$158.4k - $237.6k

Nutanix

Company:Qualcomm Technologies, Inc.Job Area:Engineering Group, Engineering Group > Machine Learning EngineeringGeneral Summary:About the RoleJoin the Qualcomm AI Hub team and help developers integrate machine learning into their products and experiences: .In this role you will develop tools to help developers optimize and deploy machine learning models on edge and mobile hardware. AIMET is Qualcomm's open-source library for state-of-the-art model quantization, and compression techniques. You will develop and support cutting-edge model optimization workflows — pushing the boundary of what's possible on resource-constrained hardware. Applications range from quantizing large language models (LLMs) and generative AI models to compressing latency-critical vision, audio, and multimodal networks for deployment on Qualcomm Snapdragon and other edge SoCs.For this role we are seeking a talented and motivated Staff Software Engineer with expertise in the optimizing and deploying ML models – especially for edge devices .What You'll DoDesign, develop, and maintain quantization algorithms and compression pipelines within the AIMET framework (PTQ, QAT, mixed-precision, AdaScale etc.)Implement advanced quantization techniques including weight-only quantization, activation quantization, KV-cache quantization, and sub-4-bit quantization for LLMs and generative AI modelsBuild tooling to analyze, profile, and debug model accuracy degradation caused by quantizationIntegrate AIMET workflows with popular ML frameworks — PyTorch and ONNXDevelop APIs and developer-facing tooling to make AIMET accessible and easy to use for external customers and design partnersIntegrate AIMET in AI Hub Workbench Quantize job to enable Quantization at large scale.Own end-to-end quantization and optimization of models published on Qualcomm AI Hub, ensuring they meet accuracy, latency, and power targets on Qualcomm hardwareQuantize and validate a broad range of model families — vision transformers, LLMs, diffusion models, speech, and multimodal architectures — for deployment via AI HubDevelop and maintain automated quantization pipelines and evaluation harnesses to scale model onboarding across AI Hub's growing model catalogMinimum Qualifications:Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.ORMaster's degree in Computer Science, Engineering, Information Systems, or related field and 3+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.ORPhD in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.Preferred Qualifications:3+ years of industry experience in machine learning, deep learning, or AI infrastructureStrong proficiency in Python, with hands-on experience in PyTorch , ONNX and/or TensorFlowSolid understanding of neural network architectures — CNNs, Transformers, LLMs, diffusion models, multimodal modelsExperience with model quantization techniques — PTQ, QAT, weight-only quantization, mixed-precision, sub-4-bit methodsHands-on experience quantizing LLMs (GPT, LLaMA , Mistral, Falcon, or similar families) for inference optimizationFamiliarity with AIMET, GPTQ, AWQ, SmoothQuant , or similar quantization frameworks is a strong plusExperience working with ONNX, TFLite / LiteRT , or other model interchange formatsUnderstanding of hardware constraints: memory bandwidth, compute precision (INT4/INT8/FP16/BF16), and NPU/DSP executionExperience collaborating across teams or BUs to drive technical alignment and model deliveryProficiency with git and software development best practicesStrong written and verbal communication skills — ability to write clean APIs, documentation, and engage directly with external developersExperience with C++ for performance-critical components is a bonusFamiliarity with ARM processors and mobile SoC architecture (Snapdragon) is a plusExperience with automated evaluation pipelines and model benchmarking at scale is a plusLevel of ResponsibilityWorks independently with minimal supervisionProvides technical guidance and mentorship to other team membersDecision-making is significant and affects work beyond the immediate teamRequires strong communication skills to convey complex quantization concepts to varied audiences — from hardware engineers and BU partners to external researchers and application developersHas meaningful influence on the AIMET product roadmap, AI Hub model catalog, and cross-BU quantization strategyTasks are open-ended; planning, prioritization, and problem-solving are core to the roleQualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail View email address on click.appcast.io or call Qualcomm's toll-free number found here . Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries).To all Staffing and Recruiting Agencies : Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.EEO Employer: Qualcomm is an equal opportunity employer; all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or any other protected classification.Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.Pay range and Other Compensation & Benefits :$158,400.00 - $237,600.00The above pay scale reflects the broad, minimum to maximum, pay scale for this job code for the location for which it has been posted. Even more importantly, please note that salary is only one component of total compensation at Qualcomm. We also offer a competitive annual discretionary bonus program and opportunity for annual RSU grants (employees on sales-incentive plans are not eligible for our annual bonus). In addition, our highly competitive benefits package is designed to support your success at work, at home, and at play. Your recruiter will be happy to discuss all that Qualcomm has to offer – and you can review more details about our US benefits at this link .If you would like more information about this role, please contact Qualcomm Careers . #J-18808-Ljbffr

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the Staff Machine Learning Engineer - Model Optimization & Quantization in San Diego, CA vacancy

Senior ML Engineer - Edge AI & Model Quantization
$178.4k - $267.6k
...A leading technology company in San Diego seeks a Sr. Staff Engineer to join their Machine Learning Engineering team, focusing on model optimization and enabling on-device AI. Candidates should have strong experience in software engineering and AI frameworks, as well...
Suggested
Qualcomm
San Diego, CA
2 days ago
Senior Staff ML Engineer - Edge AI & Model Optimization
$178.4k - $267.6k
...Qualcomm Technologies, Inc. is seeking a Machine Learning Engineer in San Diego, California, to work with cutting-edge AI technologies and... ...generative AI workflows. Responsibilities include architecting model optimization techniques and collaborating with various teams. The...
Suggested
Stryker
San Diego, CA
2 days ago
Sr. Staff Engineer, Machine Learning Engineering (Quantization SW)
$178.4k - $267.6k
..., Inc. Job Area: Engineering Group, Engineering Group Machine Learning Engineering General Summary... ...the Edge – including model fine tuning, hardware acceleration, model quantization, edge inference and... ...develop and test model optimization techniques that include...
Suggested
Work experience placement
Work from home
Qualcomm
San Diego, CA
2 days ago
Sr. Staff Engineer, Machine Learning Engineering (Quantization SW)
$178.4k - $267.6k
..., Inc. Job Area: Engineering Group, Engineering Group Machine Learning Engineering General Summary... ...for the Edge including model fine tuning, hardware acceleration, model quantization, edge inference and... ...develop and test model optimization techniques that include...
Suggested
Work experience placement
Work from home
Qualcomm
San Diego, CA
5 days ago
Machine Learning Engineer - Gurobi Scheduling Optimization (Hybrid)
$75 - $95 per hour
...Machine Learning Engineer – Scheduling Optimization (Greenfield Project) SPONSORSHIP NOT AVAILABLE- MUST BE US CITIZEN/ GREEN CARD HOLDER LOCATION: Irvine,... ...Optimization Engine: Design and implement robust optimization models using Gurobi, CPLEX, OR-Tools, or similar solvers....
Suggested
Full time
Contract work
Remote work
Worldwide
Monday to Friday
Monday to Thursday
Shift work
Match-Made-Tech
San Diego, CA
2 days ago
Senior AI Inference Engineer - Model Optimization & Deployment
$242k - $290k
...multi-modality foundation model to drive the next... ...intelligence. As a Model Optimization & Deployment Engineer, you will focus on bringing... ..., VLMs) using advanced quantization (PTQ, QAT), pruning,... ...intersection of robotics, machine learning, and design, Zoox aims to...
Temporary work
Remote work
Relocation package
Zoox
San Diego, CA
5 days ago
Machine Learning Engineer - College Graduate
$122.8k - $184.2k
...the Edge – including model fine tuning, hardware acceleration, model quantization, edge inference and related... ...and software engineers who work with cutting... ...skills. Knowledge of deep learning and ML frameworks (i.e... ...neural network model optimization and on‑target deployment...
Work from home
Qualcomm
San Diego, CA
2 days ago
Modem Machine Learning Engineer
$104k - $156k
...Technologies, Inc. Job Area: Engineering Group Modem... ...General Summary The Modem Machine Learning Engineer applies... ...across data engineering, model development,... .... Deploy and heavily optimize ML models for on‑device... ...device ML deployment, quantization, and neural network optimization...
Work from home
Qualcomm
San Diego, CA
2 days ago
Machine Learning Engineer - College Graduate
$122.8k - $184.2k
...Technologies, Inc. Job Area Engineering Group, Engineering Group Machine Learning Engineering General... ...the Edge – including model fine tuning, hardware acceleration, model quantization, edge inference and... ...neural network model optimization and on-target...
Work from home
Qualcomm
San Diego, CA
6 days ago
Sr Engineer, Machine Learning Engineering (ML Apps)
$140.8k - $211.2k
...Technologies, Inc. Job Area: Engineering Group Machine Learning Engineering General... ...for the Edge – including model fine tuning, hardware acceleration, model quantization, edge inference and related... ...Knowledge of neural network model optimization and on‑target deployment...
Work experience placement
Work from home
Qualcomm
San Diego, CA
3 days ago
Staff Engineer, Machine Learning Engineering (Heterogenous SW)
$158.4k - $237.6k
...Join to apply for the Staff Engineer, Machine Learning Engineering (Heterogenous SW... ...Technology for the Edge – including model fine tuning, hardware acceleration, model quantization, edge inference and related... ...device frameworks enabling optimization. Minimum Qualifications...
Full time
Work experience placement
Work from home
Qualcomm
San Diego, CA
2 days ago
Sr Engineer, Machine Learning Engineering (ML Apps)
$140.8k - $211.2k
...Inc. Job Area: Engineering Group, Engineering Group Machine Learning Engineering General... ...for the Edge - including model fine tuning, hardware acceleration, model quantization, edge inference and... ...of neural network model optimization and on-target deployment...
Work experience placement
Work from home
Qualcomm
San Diego, CA
4 days ago
Machine Learning Engineer III
$140k - $195k
...and the generation of governed, machine-consumable data. From strategic... ...Position Overview The Machine Learning Engineer III designs and implements sophisticated models and algorithms tailored for naval... ...processing approaches to optimize ML model performance. Participate...
Contract work
Work at office
The Marlin Alliance
San Diego, CA
3 days ago
Machine Learning Engineer- Gen AI
$139.5k - $210.1k
...Operations partners with a variety of engineering and operations teams, leading development of machine learning solutions. We deliver... ...engineering, and data mining models with an emphasis on large... ...manufacturing, testing, or hardware optimization is a major plus. Proven...
Relocation package
Apple
San Diego, CA
2 days ago
Machine Learning Algorithm Engineer - Auto Focus
$139.5k - $258.1k
...Machine Learning Algorithm Engineer - Auto Focus San Diego, California, United States Machine Learning and AI Are you passionate about shaping the... ...components impacting auto-focus, and the firmware team to optimize system-level flows for machine learning algorithms. The...
Worldwide
Relocation
Apple
San Diego, CA
2 days ago
Video Machine Learning Engineer
$139.5k - $258.1k
...very quickly. We are seeking a passionate and innovative machine learning engineer to join a team that is shaping the future of video... ...understanding — from crafting novel neural network architectures to optimizing models for on‑device performance. Your work will span the full...
Worldwide
Relocation
Apple
San Diego, CA
2 days ago
Sr Machine Learning Engineer
$165k - $195k
...seeking a talented and experienced Senior Machine Learning Engineer to join our team. The successful... ...and implement advanced machine learning models and algorithms in support of naval applications... ...and parallel processing approaches to optimize ML model performance. Participate in...
Contract work
Work at office
The Marlin Alliance
San Diego, CA
3 days ago
Machine Learning Engineer - Generative AI
$104k - $156k
...Summary We are seeking an experienced Machine Learning Engineer specializing in Generative AI to join... ...solutions, with a focus on Large Language Models (LLMs), Retrieval‑Augmented Generation... ...knowledge sources Develop and optimize LLM fine‑tuning strategies for specific...
Qualcomm
San Diego, CA
2 days ago
Machine Learning Engineer
$120k - $190k
...ABOUT THE JOB We are looking for a Machine Learning Engineer to help build and develop our ML capabilities... ...engineering, training, prediction and model serving using tools including Airflow,... ...Accelerate ML development: Optimize feature engineering pipelines for performance...
Remote work
Flexible hours
Radar
San Diego, CA
2 hours ago
Machine Learning Engineer
$122.8k - $184.2k
...Qualcomm Technologies, Inc. Job Area Engineering Group, Engineering Group Machine Learning Engineering General Summary As a... ...embedded system development and optimization with application to a specific... ...or runtime frameworks or model efficiency software tools with new...
Work from home
Qualcomm
San Diego, CA
5 days ago
Machine Learning Engineer Camera & Photos, Creative Foundations
$171.6k - $302.2k
...Machine Learning Engineer — Camera & Photos, Creative Foundations San Diego, California, United States... ...this role, you won't just implement models — you'll invent them. You'll work at... ...efficient neural network design and optimization. Proficiency in ML frameworks such as...
Relocation
Apple
San Diego, CA
2 days ago
Machine Learning Engineer- Gen AI
$139.5k - $210.1k
...Machine Learning Engineer- Gen AI Product Operations partners with a variety of different engineering... ...engineering, and data mining models with an emphasis on large language models... ...manufacturing, testing, or hardware optimization is a major plus. Proven experience...
Relocation
Apple
San Diego, CA
6 days ago
Senior Machine Learning Engineer
$110k - $180k
...Marlin Alliance, Inc. is seeking a Senior Machine Learning Engineer to design, develop, and implement advanced machine learning models and algorithms in support of naval applications... ...environments. Deploy, monitor, and optimize data pipelines to ensure high performance...
Contract work
The Marlin Alliance
San Diego, CA
2 days ago
Machine Learning Engineer
...Machine Learning Engineer San Diego, California, United States Engineering Protogon Research builds AI models with a deep understanding of the world, monetizing them through proprietary... ...in feature engineering, model optimization, and evaluating model performance....
Temporary work
Relocation
Flexible hours
Protogon Holdings, Inc
San Diego, CA
5 days ago
Machine Learning Engineer
$105.78k - $189.35k
...communities better everyday! Learn more about why you want to... ...PURPOSE OF THE JOB The Machine Learning Engineer (MLE) selected for this role... ...machine learning models across the enterprise. As an... ...learning, deep learning, and optimization. Experience with Python,...
Full time
Work at office
Local area
Relocation
3 days per week
ICW Group
San Diego, CA
3 days ago
Machine Learning Engineer
$139.5k - $258.1k
...Apple’s Health Sensing team is seeking a versatile Machine Learning Engineer to develop next-generation health algorithms that deliver... ...full algorithm lifecycle including data strategy, modeling, evaluation, optimization, and deployment. Responsibilities Develop and...
Relocation package
Night shift
Apple
San Diego, CA
2 days ago
Machine Learning Systems Engineer, Staff - XR Labs
$158.4k - $237.6k
...Technologies, Inc. Job Area: Engineering Group, Engineering Group Machine Learning Engineering General... ...to deliver ultra-optimized, power-efficient software... ...development of deep learning models (PyTorch/TensorFlow)... ...-device optimization (quantization, acceleration, profiling...
Work experience placement
Work from home
Relocation package
Qualcomm
San Diego, CA
2 days ago
Senior/Staff Machine Learning Engineer - Offline Driving Intelligence
$242k - $333k
...service. We are looking for a Senior Machine Learning Engineer to join our team to help find rare events... ...and embedding space used by our models to better identify and cluster interesting... ...learning models, evaluation, and optimization. Strong programming skills in Python...
Zoox
San Diego, CA
5 days ago
Principal Machine Learning Engineer
$264k - $330k
...Looking For We’re seeking a Principal Machine Learning Engineer to help define and lead the next... ...assistance into execution, automation, and optimization. This role is for someone who doesn’t... ...‑to‑end ML systems: data collection, model training, evaluation, deployment, and...
AppFolio
San Diego, CA
2 days ago
Machine Learning Engineer II, Logistics AI
...regular in-person events. Learn more about our flexible approach... ...About the Role: As a Machine Learning Engineer, you will have the... ...crucial areas such as routing optimization, pricing, dispatch, and mapping... ...advancing our algorithms and models. About You: Minimum...
Permanent employment
Full time
Work at office
Remote work
Work from home
Flexible hours
Instacart
San Diego, CA
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Machine Learning Engineer - Model Optimization & Quantization. Be the first to apply!