AI Model Optimization Architect

$158.4k - $237.6k

Qualcomm

Company: Qualcomm Technologies, Inc.
Job Area: Engineering Group, Engineering Group > Machine Learning Engineering

General Summary
Qualcomm is leveraging its strengths in compute, connectivity, and AI acceleration to play a central role in the evolution of Cloud AI. The Qualcomm Cloud AI team develops hardware and software platforms enabling efficient inference of large-scale foundation models.

Position Overview
We are seeking a Staff Engineer – AI Model Optimization Architect to lead end-to-end model transformation and optimization for LLMs, VLMs, diffusion, and multimodal models on Qualcomm inference accelerators. This role works closely with compiler, performance, and accuracy teams to translate models into accelerator-efficient execution while balancing throughput, latency, memory, and quality. The scope spans Day 0 enablement through production deployment, with a strong emphasis on scaling optimizations to future architectures.

Key Responsibilities
Architect and deliver model optimization strategies that transform PyTorch models for efficient inference on Qualcomm accelerators.
Drive graph capture and deployment using PyTorch, ONNX, and torch.compile, including model rewrites and graph-level transformations.
Design and implement fusion kernels using DSL-based approaches (e.g., Triton), enabling fused operations and performance-critical algorithmic rewrites.
Partner deeply with compiler, performance, and accuracy teams to co-design lowering strategies, kernel fusion, layout decisions, and runtime integration.
Profile and optimize LLM/VLM/diffusion inference for throughput and latency across batch sizes, sequence lengths, and serving modes.
Own transformer-specific optimizations including KV cache management, decoding behavior, and long-context performance.
Enable and optimize continuous batching (dynamic/iteration-level scheduling), understanding its impact on memory, scheduling, and tail latency.
Architect and scale distributed inference strategies (e.g., sharding and parallelism) across multi-core and multi-device systems.
Establish reusable approaches to scale model optimizations to new hardware architectures, creating robust patterns and tooling.
Debug complex performance or stability issues to root cause and drive production-ready solutions.

Required Qualifications
Expert-level proficiency in PyTorch and inference-focused model optimization; strong Python engineering skills.
Hands-on experience with torch.compile / TorchDynamo or related graph capture and compilation workflows.
Deep understanding of transformer architectures, attention mechanisms, MoEs, and performance trade-offs.
Practical experience with KV cache behavior, serving-time optimizations, and memory/performance trade-offs.
Strong foundation in computer architecture, ML accelerators, and distributed systems.
Proven ability to lead cross-functional technical efforts and influence design decisions.
MS in Computer Science, Machine Learning, Computer Engineering, or Electrical Engineering, or equivalent experience.

Preferred / Bonus Qualifications
Experience developing fusion kernels using Triton or similar DSLs, and collaborating with ML compiler teams.
Familiarity with LLM serving stacks and continuous batching systems.
Background in numerical methods, performance/accuracy trade-off analysis, or evaluation frameworks.
PhD in a relevant field.

Minimum Qualifications
Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
Master's degree in Computer Science, Engineering, Information Systems, or related field and 3+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
PhD in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.

Pay Range and Benefits
$158,400.00 – $237,600.00. Salary is one component of total compensation. Competitive annual discretionary bonus program, opportunity for annual RSU grants. Highly competitive benefits package. For more details, refer to Qualcomm U.S. benefits information.

Equal Opportunity Employer
Qualcomm is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or any other protected classification. Qualcomm is committed to providing reasonable accommodations for individuals with disabilities.

Vacancy posted 4 days ago

Similar jobs that could be interesting for you
Based on the AI Model Optimization Architect in San Diego, CA vacancy

Staff AI Model Optimization Architect
$158.4k - $237.6k
Qualcomm is seeking a Staff Engineer - AI Model Optimization Architect in San Diego, California. This position involves leading end-to-end model transformation and optimization for large-scale models on Qualcomm accelerators, collaborating with compiler and performance...
Suggested
Qualcomm
San Diego, CA
5 days ago
Senior Staff ML Engineer - Edge AI & Model Optimization
$178.4k - $267.6k
...Engineer in San Diego, California, to work with cutting-edge AI technologies and frameworks. The ideal candidate will have... ...with generative AI workflows. Responsibilities include architecting model optimization techniques and collaborating with various teams. The role...
Suggested
Stryker Corporation
San Diego, CA
4 days ago
Physical AI Model Optimization Engineer - Qualcomm Advanced Robotics Team
$158.4k - $237.6k
...Qualcomm Robotics Qualcomm Advanced Robotics Team is building the AI first stack and platform for the next generation general... ...emerging market. About The Role We are seeking a Physical AI Model Optimization Engineer to help bring cutting‑edge robotic AI models onto Qualcomm...
Suggested
Work experience placement
Immediate start
Work from home
Qualcomm
San Diego, CA
5 days ago
Senior AI Inference Engineer - Model Optimization & Deployment
...a multi-modality foundation model to drive the next generation... ...intelligence. As a Model Optimization & Deployment Engineer, you will... ...-tuning (LoRA, QLoRA). Architect and implement model conversion... ...maximize memory bandwidth on AI accelerators. Write production...
Suggested
Temporary work
Relocation package
Zoox
San Diego, CA
11 days ago
AI Accuracy Architect
$158.4k - $237.6k
...leadership in compute, connectivity, and AI acceleration to play a central role... ...inference of large scale foundation models. We are seeking a Staff Engineer - AI Accuracy Architect to lead accuracy centric architecture and optimization for LLMs, VLMs, and emerging...
Suggested
Work experience placement
Work from home
Nutanix
San Diego, CA
1 day ago
AI Architect
$105.78k - $189.35k
...we are today. Purpose of the Job The AI Architect II is a senior technical leadership role... ...covering RAG pipelines, agentic workflows, model serving, and AI gateway design. Lead... ...ensure consistency, security, and cost optimization. Collaborate with business and...
Full time
Work at office
Local area
ICW Group
San Diego, CA
3 days ago
AI Accuracy Architect for LLMs & Multimodal Inference
Qualcomm is seeking a Staff Engineer - AI Accuracy Architect to lead accuracy-centric architecture for LLMs and multimodal models. This position encompasses hardware enablement... ...proven capabilities in design and optimization in a fast-paced environment, with responsibilities...
Qualcomm
San Diego, CA
5 days ago
Advanced AI Architect
$94.4k - $293.8k
...transformative era where Data & AI are reshaping industries,... ...You Are: As an Advanced AI Architect, you will be responsible for... ..., Generative AI, Foundation Models, Knowledge & Data Engineering... ...solutions. Recommend design optimizations and improvements for performance...
Work experience placement
Live in
Work at office
Local area
Accenture
San Diego, CA
3 days ago
Silicon Architect, AI Power and Performance
$192k - $279k
...hardware/software co-design, or power and performance modeling. Experience in memory architecture, interface, and... ...architecture, next-generation memory systems, or AI hardware accelerators. Experience in optimizing hardware and software architectures for Mixture of Experts...
Worldwide
Google
San Diego, CA
5 days ago
Senior AI Hardware Architect for Edge & Training
$126.7k - $217.9k
A leading technology company is seeking AI Accelerator Architecture Engineers to design and optimize hardware for AI models. The role involves understanding ML trends, collaborating with customers, and enhancing systems for power efficiency. Candidates should have a Master...
Qualcomm
San Diego, CA
5 days ago
Senior ML Engineer - Edge AI & Model Quantization
$178.4k - $267.6k
...technology company in San Diego seeks a Sr. Staff Engineer to join their Machine Learning Engineering team, focusing on model optimization and enabling on-device AI. Candidates should have strong experience in software engineering and AI frameworks, as well as a relevant...
Qualcomm
San Diego, CA
3 days ago
ServiceNow - ServiceNow AI Architect Senior Manager - Tech Cons - Open Location
$171.6k - $392.1k
...working world. ServiceNow – ServiceNow AI Architect Senior Manager In the digital economy,... ...Experience in waterfall and agile delivery models – including supporting management... ...models. Proficiency in developing and optimizing data pipelines for AI‑driven solutions....
Summer holiday
Worldwide
Flexible hours
EY
San Diego, CA
3 days ago
CX Platform Architect & AI/Automation Lead
...and digital channels for inbound/outbound calls. The role includes leading the design of call flows, overseeing integrations, and optimizing system performance. The ideal candidate will have experience with telephony systems and cloud migration consulting, ensuring...
Kaleidoscope Innovation
San Diego, CA
5 days ago
AI Hardware Architect: Power & Performance Lead
...develop custom silicon solutions that will enhance the performance and efficiency of their products. This role involves optimizing hardware architectures for AI and ML applications, thereby influencing the next-generation products that impact millions of users worldwide. The...
Worldwide
Google
San Diego, CA
5 days ago
Senior AI Architect — TMT Innovation & Strategy
Join EY as a Principal AI Architect to influence AI strategy in the Technology, Media, and Telecommunications sectors. This role demands expertise in machine learning and a track record of impactful AI solutions. You will lead project teams, architect scalable solutions...
Flexible hours
EY
San Diego, CA
5 days ago
IT Mergers & Acquisitions (M&A) Architect
$124k - $186k
...Information Technology Group, Information Technology Group > IT Architect General Summary Qualcomm is seeking a highly skilled and experienced... ...successful M&A integrations, preferably in the Semiconductor, AI, Automotive, Datacenter, or IoT sectors. Strong understanding...
Full time
Work at office
Shift work
Qualcomm
San Diego, CA
5 days ago
GPU Research Architect: AI/ML/GPGPU Hardware Innovation
Qualcomm in San Diego is looking for an innovative GPU architect to advance capabilities in AI, ML, and GPGPU computing. This role involves designing next-generation GPU architectures for mobile devices and data center GPUs. The ideal candidate should have a Bachelor's...
Qualcomm
San Diego, CA
5 days ago
Senior GCP Agentic AI Delivery Architect
Accenture seeks a GCP Agentic AI Delivery Senior Engineer to innovate and customize Google Cloud solutions addressing client needs. The ideal candidate has 5+ years in technology roles and experience with Google Cloud tools like Vertex AI and BigQuery. This role focuses...
Accenture
San Diego, CA
5 days ago
Senior architecture architect
$179k - $286k
MediaTek in San Diego, California is seeking an experienced professional for a role focused on GPU compiler software architecture and development. The ideal candidate will possess 10+ years of experience in product compiler tools such as assembler and linker development...
MediaTek
San Diego, CA
5 days ago
Principal AI Workflow Architect — Enterprise-Scale
Intuit’s AI Transformation Org is transforming how 18,000+ employees work by embedding AI-enabled workflows across Finance, Legal,... ...production-ready implementations, and partner with HR Craft Leaders, Enterprise Solution Architect, and senior leaders to...
Intuit
San Diego, CA
1 day ago
Strategic AI Value Architect for Enterprise Growth
A leading technology consulting firm in San Diego is searching for an AI & Data Strategy Manager. The role involves shaping a vision for AI, building strong client relationships, and defining strategies for organizations to leverage AI and data for growth. Ideal candidates...
Accenture
San Diego, CA
5 days ago
AI Transformation Architect
A global technology leader is seeking a Cloud Architect in San Diego, California. This role focuses on solving complex enterprise AI transformation challenges, designing multi-agent systems, and leveraging cutting-edge AI technologies. Candidates should have a minimum...
Accenture
San Diego, CA
2 days ago
Principal AI-Enabled L&D Architect
...redefine executive and manager development. Lead a shift from episodic programs to in-the-flow, personalized learning using microlearning, AI guidance, and analytics-driven delivery. You will define vision, roadmap, and execution while prototyping scalable solutions across...
Shift work
Intuit Inc.
San Diego, CA
3 days ago
AI-Powered Commerce Architect
Accenture is seeking an Agentic Commerce & AI Consultant to design and implement AI-powered commerce solutions. You will work at the forefront of Agentic Commerce and Generative AI, collaborating with cross-functional teams to drive transformation in digital commerce ecosystems...
Work at office
Accenture
San Diego, CA
5 days ago
Autonomous Lab Automation Architect (Agentic AI)
A leading pharmaceutical company in San Diego is seeking a motivated engineer to advance laboratory automation through innovative AI integration. The role involves building connections between AI systems and lab infrastructure, aiming to enhance the Design-Make-Test-Analyze...
Eli Lilly
San Diego, CA
5 days ago
AI IAM Architect
$153.47k - $255.75k
...seeking an experienced Identity and Access Management (IAM) Architect with a strong AI and agent-integration focus to lead the design, proof-of-... ...Assess existing SSO, MFA, federation, and API authorization models; identify gaps in delegation, token lifecycle, scopes, secrets...
Work from home
LPL Financial LLC
San Diego, CA
4 days ago
AI Workflow Architect: Enterprise-Scale Adoption Lead
Intuit Inc. is seeking a Principal for its AI Transformation Org to scale redesigned, production‑ready workflows across finance, legal, marketing, customer success, and people operations. You will partner with AI Champions, apply enterprise blueprints, and set baselines...
Intuit
San Diego, CA
4 days ago
AI IAM Architect: Secure Identities for AI Agents
LPL Financial is seeking an experienced Identity and Access Management (IAM) Architect to lead the design and implementation of identity solutions for AI workflows. This role combines deep IAM expertise with engineering skills to produce secure identity standards in collaboration...
LPL Financial
San Diego, CA
4 days ago
Knowledge Graph Architect & AI Semantics Lead
Accenture seeks a Knowledge Engineer who formulates practical AI and Knowledge Graph solutions. Responsibilities include designing and implementing innovative solutions, leading a team, and collaborating with various stakeholders to deliver successful AI projects. The...
Accenture
San Diego, CA
4 days ago
Staff Firmware Architect: Power & BMC for Cloud AI Systems
...have at least 6 years of experience in embedded systems, proficiency in C/C++, and hands-on experience with OpenBMC and Redfish APIs. The position offers a competitive salary and benefits, fostering career growth in cloud and AI infrastructure.
Qualcomm
San Diego, CA
5 days ago
