Staff Machine Learning Engineer - Model Optimization & Quantization
$158.4k - $237.6kNutanix
Company:Qualcomm Technologies, Inc.Job Area:Engineering Group, Engineering Group > Machine Learning EngineeringGeneral Summary:About the RoleJoin the Qualcomm AI Hub team and help developers integrate machine learning into their products and experiences: .In this role you will develop tools to help developers optimize and deploy machine learning models on edge and mobile hardware. AIMET is Qualcomm's open-source library for state-of-the-art model quantization, and compression techniques. You will develop and support cutting-edge model optimization workflows — pushing the boundary of what's possible on resource-constrained hardware. Applications range from quantizing large language models (LLMs) and generative AI models to compressing latency-critical vision, audio, and multimodal networks for deployment on Qualcomm Snapdragon and other edge SoCs.For this role we are seeking a talented and motivated Staff Software Engineer with expertise in the optimizing and deploying ML models – especially for edge devices .What You'll DoDesign, develop, and maintain quantization algorithms and compression pipelines within the AIMET framework (PTQ, QAT, mixed-precision, AdaScale etc.)Implement advanced quantization techniques including weight-only quantization, activation quantization, KV-cache quantization, and sub-4-bit quantization for LLMs and generative AI modelsBuild tooling to analyze, profile, and debug model accuracy degradation caused by quantizationIntegrate AIMET workflows with popular ML frameworks — PyTorch and ONNXDevelop APIs and developer-facing tooling to make AIMET accessible and easy to use for external customers and design partnersIntegrate AIMET in AI Hub Workbench Quantize job to enable Quantization at large scale.Own end-to-end quantization and optimization of models published on Qualcomm AI Hub, ensuring they meet accuracy, latency, and power targets on Qualcomm hardwareQuantize and validate a broad range of model families — vision transformers, LLMs, diffusion models, speech, and multimodal architectures — for deployment via AI HubDevelop and maintain automated quantization pipelines and evaluation harnesses to scale model onboarding across AI Hub's growing model catalogMinimum Qualifications:Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.ORMaster's degree in Computer Science, Engineering, Information Systems, or related field and 3+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.ORPhD in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.Preferred Qualifications:3+ years of industry experience in machine learning, deep learning, or AI infrastructureStrong proficiency in Python, with hands-on experience in PyTorch , ONNX and/or TensorFlowSolid understanding of neural network architectures — CNNs, Transformers, LLMs, diffusion models, multimodal modelsExperience with model quantization techniques — PTQ, QAT, weight-only quantization, mixed-precision, sub-4-bit methodsHands-on experience quantizing LLMs (GPT, LLaMA , Mistral, Falcon, or similar families) for inference optimizationFamiliarity with AIMET, GPTQ, AWQ, SmoothQuant , or similar quantization frameworks is a strong plusExperience working with ONNX, TFLite / LiteRT , or other model interchange formatsUnderstanding of hardware constraints: memory bandwidth, compute precision (INT4/INT8/FP16/BF16), and NPU/DSP executionExperience collaborating across teams or BUs to drive technical alignment and model deliveryProficiency with git and software development best practicesStrong written and verbal communication skills — ability to write clean APIs, documentation, and engage directly with external developersExperience with C++ for performance-critical components is a bonusFamiliarity with ARM processors and mobile SoC architecture (Snapdragon) is a plusExperience with automated evaluation pipelines and model benchmarking at scale is a plusLevel of ResponsibilityWorks independently with minimal supervisionProvides technical guidance and mentorship to other team membersDecision-making is significant and affects work beyond the immediate teamRequires strong communication skills to convey complex quantization concepts to varied audiences — from hardware engineers and BU partners to external researchers and application developersHas meaningful influence on the AIMET product roadmap, AI Hub model catalog, and cross-BU quantization strategyTasks are open-ended; planning, prioritization, and problem-solving are core to the roleQualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail View email address on click.appcast.io or call Qualcomm's toll-free number found here . Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries).To all Staffing and Recruiting Agencies : Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.EEO Employer: Qualcomm is an equal opportunity employer; all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or any other protected classification.Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.Pay range and Other Compensation & Benefits :$158,400.00 - $237,600.00The above pay scale reflects the broad, minimum to maximum, pay scale for this job code for the location for which it has been posted. Even more importantly, please note that salary is only one component of total compensation at Qualcomm. We also offer a competitive annual discretionary bonus program and opportunity for annual RSU grants (employees on sales-incentive plans are not eligible for our annual bonus). In addition, our highly competitive benefits package is designed to support your success at work, at home, and at play. Your recruiter will be happy to discuss all that Qualcomm has to offer – and you can review more details about our US benefits at this link .If you would like more information about this role, please contact Qualcomm Careers . #J-18808-Ljbffr
$178.4k - $267.6k
...A leading technology company in San Diego seeks a Sr. Staff Engineer to join their Machine Learning Engineering team, focusing on model optimization and enabling on-device AI. Candidates should have strong experience in software engineering and AI frameworks, as well...Suggested$178.4k - $267.6k
...Qualcomm Technologies, Inc. is seeking a Machine Learning Engineer in San Diego, California, to work with cutting-edge AI technologies and... ...generative AI workflows. Responsibilities include architecting model optimization techniques and collaborating with various teams. The...Suggested$178.4k - $267.6k
..., Inc. Job Area: Engineering Group, Engineering Group Machine Learning Engineering General Summary... ...the Edge – including model fine tuning, hardware acceleration, model quantization, edge inference and... ...develop and test model optimization techniques that include...SuggestedWork experience placementWork from home$178.4k - $267.6k
..., Inc. Job Area: Engineering Group, Engineering Group Machine Learning Engineering General Summary... ...for the Edge including model fine tuning, hardware acceleration, model quantization, edge inference and... ...develop and test model optimization techniques that include...SuggestedWork experience placementWork from home$75 - $95 per hour
...Machine Learning Engineer – Scheduling Optimization (Greenfield Project) SPONSORSHIP NOT AVAILABLE- MUST BE US CITIZEN/ GREEN CARD HOLDER LOCATION: Irvine,... ...Optimization Engine: Design and implement robust optimization models using Gurobi, CPLEX, OR-Tools, or similar solvers....SuggestedFull timeContract workRemote workWorldwideMonday to FridayMonday to ThursdayShift work$242k - $290k
...multi-modality foundation model to drive the next... ...intelligence. As a Model Optimization & Deployment Engineer, you will focus on bringing... ..., VLMs) using advanced quantization (PTQ, QAT), pruning,... ...intersection of robotics, machine learning, and design, Zoox aims to...Temporary workRemote workRelocation package$122.8k - $184.2k
...the Edge – including model fine tuning, hardware acceleration, model quantization, edge inference and related... ...and software engineers who work with cutting... ...skills. Knowledge of deep learning and ML frameworks (i.e... ...neural network model optimization and on‑target deployment...Work from home$104k - $156k
...Technologies, Inc. Job Area: Engineering Group Modem... ...General Summary The Modem Machine Learning Engineer applies... ...across data engineering, model development,... .... Deploy and heavily optimize ML models for on‑device... ...device ML deployment, quantization, and neural network optimization...Work from home$122.8k - $184.2k
...Technologies, Inc. Job Area Engineering Group, Engineering Group Machine Learning Engineering General... ...the Edge – including model fine tuning, hardware acceleration, model quantization, edge inference and... ...neural network model optimization and on-target...Work from home$140.8k - $211.2k
...Technologies, Inc. Job Area: Engineering Group Machine Learning Engineering General... ...for the Edge – including model fine tuning, hardware acceleration, model quantization, edge inference and related... ...Knowledge of neural network model optimization and on‑target deployment...Work experience placementWork from home$158.4k - $237.6k
...Join to apply for the Staff Engineer, Machine Learning Engineering (Heterogenous SW... ...Technology for the Edge – including model fine tuning, hardware acceleration, model quantization, edge inference and related... ...device frameworks enabling optimization. Minimum Qualifications...Full timeWork experience placementWork from home$140.8k - $211.2k
...Inc. Job Area: Engineering Group, Engineering Group Machine Learning Engineering General... ...for the Edge - including model fine tuning, hardware acceleration, model quantization, edge inference and... ...of neural network model optimization and on-target deployment...Work experience placementWork from home$140k - $195k
...and the generation of governed, machine-consumable data. From strategic... ...Position Overview The Machine Learning Engineer III designs and implements sophisticated models and algorithms tailored for naval... ...processing approaches to optimize ML model performance. Participate...Contract workWork at office$139.5k - $210.1k
...Operations partners with a variety of engineering and operations teams, leading development of machine learning solutions. We deliver... ...engineering, and data mining models with an emphasis on large... ...manufacturing, testing, or hardware optimization is a major plus. Proven...Relocation package$139.5k - $258.1k
...Machine Learning Algorithm Engineer - Auto Focus San Diego, California, United States Machine Learning and AI Are you passionate about shaping the... ...components impacting auto-focus, and the firmware team to optimize system-level flows for machine learning algorithms. The...WorldwideRelocation$139.5k - $258.1k
...very quickly. We are seeking a passionate and innovative machine learning engineer to join a team that is shaping the future of video... ...understanding — from crafting novel neural network architectures to optimizing models for on‑device performance. Your work will span the full...WorldwideRelocation$165k - $195k
...seeking a talented and experienced Senior Machine Learning Engineer to join our team. The successful... ...and implement advanced machine learning models and algorithms in support of naval applications... ...and parallel processing approaches to optimize ML model performance. Participate in...Contract workWork at office$104k - $156k
...Summary We are seeking an experienced Machine Learning Engineer specializing in Generative AI to join... ...solutions, with a focus on Large Language Models (LLMs), Retrieval‑Augmented Generation... ...knowledge sources Develop and optimize LLM fine‑tuning strategies for specific...$120k - $190k
...ABOUT THE JOB We are looking for a Machine Learning Engineer to help build and develop our ML capabilities... ...engineering, training, prediction and model serving using tools including Airflow,... ...Accelerate ML development: Optimize feature engineering pipelines for performance...Remote workFlexible hours$122.8k - $184.2k
...Qualcomm Technologies, Inc. Job Area Engineering Group, Engineering Group Machine Learning Engineering General Summary As a... ...embedded system development and optimization with application to a specific... ...or runtime frameworks or model efficiency software tools with new...Work from home$171.6k - $302.2k
...Machine Learning Engineer — Camera & Photos, Creative Foundations San Diego, California, United States... ...this role, you won't just implement models — you'll invent them. You'll work at... ...efficient neural network design and optimization. Proficiency in ML frameworks such as...Relocation$139.5k - $210.1k
...Machine Learning Engineer- Gen AI Product Operations partners with a variety of different engineering... ...engineering, and data mining models with an emphasis on large language models... ...manufacturing, testing, or hardware optimization is a major plus. Proven experience...Relocation$110k - $180k
...Marlin Alliance, Inc. is seeking a Senior Machine Learning Engineer to design, develop, and implement advanced machine learning models and algorithms in support of naval applications... ...environments. Deploy, monitor, and optimize data pipelines to ensure high performance...Contract work- ...Machine Learning Engineer San Diego, California, United States Engineering Protogon Research builds AI models with a deep understanding of the world, monetizing them through proprietary... ...in feature engineering, model optimization, and evaluating model performance....Temporary workRelocationFlexible hours
$105.78k - $189.35k
...communities better everyday! Learn more about why you want to... ...PURPOSE OF THE JOB The Machine Learning Engineer (MLE) selected for this role... ...machine learning models across the enterprise. As an... ...learning, deep learning, and optimization. Experience with Python,...Full timeWork at officeLocal areaRelocation3 days per week$139.5k - $258.1k
...Apple’s Health Sensing team is seeking a versatile Machine Learning Engineer to develop next-generation health algorithms that deliver... ...full algorithm lifecycle including data strategy, modeling, evaluation, optimization, and deployment. Responsibilities Develop and...Relocation packageNight shift$158.4k - $237.6k
...Technologies, Inc. Job Area: Engineering Group, Engineering Group Machine Learning Engineering General... ...to deliver ultra-optimized, power-efficient software... ...development of deep learning models (PyTorch/TensorFlow)... ...-device optimization (quantization, acceleration, profiling...Work experience placementWork from homeRelocation package$242k - $333k
...service. We are looking for a Senior Machine Learning Engineer to join our team to help find rare events... ...and embedding space used by our models to better identify and cluster interesting... ...learning models, evaluation, and optimization. Strong programming skills in Python...$264k - $330k
...Looking For We’re seeking a Principal Machine Learning Engineer to help define and lead the next... ...assistance into execution, automation, and optimization. This role is for someone who doesn’t... ...‑to‑end ML systems: data collection, model training, evaluation, deployment, and...- ...regular in-person events. Learn more about our flexible approach... ...About the Role: As a Machine Learning Engineer, you will have the... ...crucial areas such as routing optimization, pricing, dispatch, and mapping... ...advancing our algorithms and models. About You: Minimum...Permanent employmentFull timeWork at officeRemote workWork from homeFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Machine Learning Engineer - Model Optimization & Quantization. Be the first to apply!
- software engineer staff San Diego, CA
- assistant engineer San Diego, CA
- project engineer assistant project manager San Diego, CA
- technology administrator San Diego, CA
- staff data engineer San Diego, CA
- senior staff systems engineer San Diego, CA
- staff engineer San Diego, CA
- senior staff engineer San Diego, CA
- assistant mechanical engineer San Diego, CA
- engineering aide San Diego, CA


