Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Model Optimization Architect

$158.4k - $237.6k

Qualcomm

Company Qualcomm Technologies, Inc. Job Area Engineering Group, Engineering Group > Machine Learning Engineering General Summary Qualcomm is leveraging its strengths in compute, connectivity, and AI acceleration to play a central role in the evolution of Cloud AI. The Qualcomm Cloud AI team develops hardware and software platforms enabling efficient inference of large-scale foundation models. Position Overview We are seeking a Staff Engineer – AI Model Optimization Architect to lead end-to-end model transformation and optimization for LLMs, VLMs, diffusion, and multimodal models on Qualcomm inference accelerators. This role works closely with compiler, performance, and accuracy teams to translate models into accelerator efficient execution while balancing throughput, latency, memory, and quality. The scope spans Day0 enablement through production deployment, with a strong emphasis on scaling optimizations to future architectures. Key Responsibilities Architect and deliver model optimization strategies that transform PyTorch models for efficient inference on Qualcomm accelerators. Drive graph capture and deployment using PyTorch, ONNX, and torch.compile, including model rewrites and graph-level transformations. Design and implement fusion kernels using DSL based approaches (e.g., Triton), enabling fused operations and performance critical algorithmic rewrites. Partner deeply with compiler, performance, and accuracy teams to co-design lowering strategies, kernel fusion, layout decisions, and runtime integration. Profile and optimize LLM/VLM/diffusion inference for throughput and latency across batch sizes, sequence lengths, and serving modes. Own transformer specific optimizations including KVcache management, decoding behavior, and long context performance. Enable and optimize continuous batching (dynamic/iteration-level scheduling), understanding its impact on memory, scheduling, and tail latency. Architect and scale distributed inference strategies (e.g., sharding and parallelism) across multi-core and multi-device systems. Establish reusable approaches to scale model optimizations to new hardware architectures, creating robust patterns and tooling. Debug complex performance or stability issues to root cause and drive production ready solutions. Required Qualifications Expert level expertise in PyTorch and inference focused model optimization; strong Python engineering skills. Hands on experience with torch.compile / TorchDynamo or related graph capture and compilation workflows. Deep understanding of transformer architectures, attention mechanisms, MoEs, and performance trade-offs. Practical experience with KVcache behavior, serving time optimizations, and memory/performance tradeoffs. Strong foundation in computer architecture, ML accelerators, and distributed systems. Proven ability to lead cross-functional technical efforts and influence design decisions. MS in Computer Science, Machine Learning, Computer Engineering, or Electrical Engineering, or equivalent experience. Preferred / Bonus Qualifications Experience developing fusion kernels using Triton or similar DSLs, and collaborating with ML compiler teams. Familiarity with LLM serving stacks and continuous batching systems. Background in numerical methods, performance/accuracy trade-off analysis, or evaluation frameworks. PhD in a relevant field. Minimum Qualifications Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience. Master's degree in Computer Science, Engineering, Information Systems, or related field and 3+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience. PhD in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience. Pay Range and Benefits $158,400.00 – $237,600.00. Salary is one component of total compensation. Competitive annual discretionary bonus program, opportunity for annual RSU grants. Highly competitive benefits package. For more details, refer to Qualcomm U.S. benefits information. Equal Opportunity Employer Qualcomm is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or any other protected classification. Qualcomm is committed to providing reasonable accommodations for individuals with disabilities. #J-18808-Ljbffr Qualcomm

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the AI Model Optimization Architect in San Diego, CA vacancy
  • $158.4k - $237.6k

    Qualcomm is seeking a Staff Engineer - AI Model Optimization Architect in San Diego, California. This position involves leading end-to-end model transformation and optimization for large-scale models on Qualcomm accelerators, collaborating with compiler and performance... 
    Suggested

    Qualcomm

    San Diego, CA
    4 days ago
  • $158.4k - $237.6k

     ...Summary: About the Role Join the Qualcomm AI Hub team and help developers integrate machine learning...  ...In this role you will develop tools to help developers optimize and deploy machine learning models on edge and mobile hardware. AIMET is Qualcomm's open-... 
    Suggested
    Work experience placement
    Immediate start
    Work from home

    Qualcomm

    San Diego, CA
    2 days ago
  • $178.4k - $267.6k

     ...Engineer in San Diego, California, to work with cutting-edge AI technologies and frameworks. The ideal candidate will have...  ...with generative AI workflows. Responsibilities include architecting model optimization techniques and collaborating with various teams. The role... 
    Suggested

    Stryker Corporation

    San Diego, CA
    4 days ago
  •  ...a multi-modality foundation model to drive the next generation...  ...intelligence. As a Model Optimization & Deployment Engineer, you will...  ...-tuning (LoRA, QLoRA). Architect and implement model conversion...  ...bandwidth and minimize latency on AI accelerators. Write... 
    Suggested
    Temporary work
    Relocation package

    Zoox

    San Diego, CA
    3 days ago
  • Qualcomm is looking for a Physical AI Model Optimization Engineer in San Diego to optimize AI models for robotic applications. You will work with advanced models, applying Qualcomm’s toolchains to ensure optimal deployment on Snapdragon chipsets. This role demands a strong... 
    Suggested

    Nutanix

    San Diego, CA
    4 days ago
  • $158.4k - $237.6k

     ...Qualcomm RoboticsQualcomm Advanced Robotics Team is building the AI first stack and platform for the next generation general...  ...this emerging market.About the RoleWe are seeking a Physical AI Model Optimization Engineer to help bring cutting‑edge robotic AI models onto Qualcomm... 
    Work experience placement
    Immediate start
    Work from home

    Nutanix

    San Diego, CA
    3 days ago
  • $158.4k - $237.6k

     ...leadership in compute, connectivity, and AI acceleration to play a central role...  ...inference of large scale foundation models. We are seeking a Staff Engineer – AI Accuracy Architect to lead accuracy centric architecture and optimization for LLMs, VLMs, and emerging... 
    Work experience placement
    Work from home

    Qualcomm

    San Diego, CA
    6 days ago
  • A leading IT solutions provider in California is seeking an experienced AI Transformation Consultant to optimize internal systems through AI. The consultant will assess workflows, identify AI opportunities, and implement scalable automation solutions. Candidates should... 
    Remote work

    GigaKOM

    San Diego, CA
    1 day ago
  • $126.7k - $217.9k

     ...AI Accelerator Architecture Engineer Today, more intelligence...  ...the largest of today's models. The AI Architecture team...  ...algorithm development, kernel optimization, down to hardware accelerator...  ...accelerator and GPU architectures Architect enhancements required for... 
    Work experience placement
    Work from home
    Flexible hours
    Night shift

    Qualcomm

    San Diego, CA
    3 days ago
  • $124k - $208.4k

    A global technology leader seeks a Senior Engineer, GPU Architect to optimize and analyze mobile GPU performance. Within a creative and supportive environment, you'll leverage over 5 years of experience in GPU architecture, performance analysis, and programming with C/C++... 

    Samsung Semiconductor

    San Diego, CA
    2 days ago
  • $105.78k - $189.35k

     ...want to be here! PURPOSE OF THE JOB The AI Architect II is a senior technical leadership...  ...covering RAG pipelines, agentic workflows, model serving, and AI gateway design. Lead...  ...ensure consistency, security, and cost optimization. Collaborate with business and technology... 
    Full time
    Work at office
    Local area

    ICW Group

    San Diego, CA
    3 days ago
  • Intuit Inc. is looking for an AI Marketing Manager to join its Marketing Futures Team. This role will own the strategy and execution...  ...marketing campaigns, aiming to enhance creative production and optimize performance metrics. The ideal candidate will have 4-6 years of... 

    Intuit Inc.

    San Diego, CA
    4 days ago
  • $73k - $107k

     ...Will Do Act as a customer experience architect; design and optimize the end‑to‑end self‑service journey across...  ...smarter self‑service. Leverage AI tools to streamline and scale content...  ...Scribd, Inc. Scribd Flex (flexible work model) Comprehensive health, dental, and vision... 
    Local area
    Home office
    Flexible hours

    Scribd, Inc.

    San Diego, CA
    2 days ago
  • $200.8k - $301.2k

    A leading technology firm in San Diego seeks a Senior Windows Platform Architect to enhance AI systems on the Snapdragon platform. You will drive performance and power optimizations, ensuring competitive developer experiences. The ideal candidate has extensive AI architecture... 

    Qualcomm

    San Diego, CA
    1 day ago
  • Qualcomm is seeking a Staff Engineer - AI Accuracy Architect to lead accuracy-centric architecture for LLMs and multimodal models. This position encompasses hardware enablement...  ...proven capabilities in design and optimization in a fast-paced environment, with responsibilities... 

    Qualcomm

    San Diego, CA
    4 days ago
  • $171.6k - $392.1k

     ...working world. ServiceNow – ServiceNow AI Architect Senior Manager ​In the digital...  ...Experience in waterfall and agile delivery models – including supporting management...  ...models Proficiency in developing and optimizing data pipelines for AI-driven solutions... 
    Summer holiday
    Worldwide
    Flexible hours

    EY

    San Diego, CA
    2 days ago
  •  ...Description Title: Platform Architect/AWS solution Architect...  ...Practical experience with AWS AI/ML services (SageMaker, Bedrock...  ...Databricks Al capabilities (Model Serving, Feature Store, MLflow...  ..., re-architecting). • Cost Optimization: Ability to design cost-effective... 
    Shift work

    Krest Global Solutions

    San Diego, CA
    3 days ago
  • $171.6k - $392.1k

     ...working world. ServiceNow - ServiceNow AI Architect Senior Manager In the digital economy,...  ...in waterfall and agile delivery models - including supporting management activities...  ...models Proficiency in developing and optimizing data pipelines for AI‑driven solutions... 
    Summer holiday
    Worldwide
    Flexible hours

    Ernst & Young Oman

    San Diego, CA
    22 hours ago
  • $13k

     ...At G2 Ops, MBSE Engineers specialize in digital system modeling , using SysML, Cameo, and MagicDraw to create interactive...  ...engineering, leveraging automation, simulation, and AI-driven efficiencies to optimize defense and cybersecurity solutions. What Makes G2... 
    Full time
    Temporary work
    For contractors
    Work at office
    Remote work
    Flexible hours

    G2 Ops, Inc.

    San Diego, CA
    22 hours ago
  • $146.88k - $220.32k

     ...Job Description As a Senior Agentic AI Architect/Engineer , you will be a technical leader...  .... You have experience leading projects, optimizing ML systems for performance, and building...  ...Performance Optimization: Increase model serving layers and high-throughput data... 
    Work experience placement
    Remote work
    Visa sponsorship
    Work visa
    Flexible hours

    Rockwell Automation

    San Diego, CA
    1 day ago
  • $122.8k - $184.2k

     ...Summary:We are looking for an AI Performance System Software...  ...evolve the benchmarking and optimization of reference AI networks that...  ...leads, software, and hardware architects.Ideal candidate has knowledge...  ...methodsKnowledge of state of the art in AI models for one or more of the... 
    Work from home

    Nutanix

    San Diego, CA
    22 hours ago
  • $153k - $187k

     ...is the industry leader and innovator in AI and machine learning-powered Conversation...  ...innovative, and forward-thinking Sr. GTM AI Architect to lead the design and deployment of AI-...  ..., ensuring alignment with data models, governance standards, and long-term architecture... 
    Currently hiring
    Remote work
    Flexible hours

    Invoca

    San Diego, CA
    2 days ago
  • $126k - $229.8k

     ...Group, Engineering Group ASICS Engineering General Summary: Qualcomm is looking for an experienced SoC architect to work on the next generation AI products in the datacenter. We are looking for a data center engineer whose expertise spans security, RAS and/... 
    Work experience placement
    Work from home

    Qualcomm

    San Diego, CA
    22 hours ago
  •  ...approach. Teradata delivers real business value with AI. What You'll Do As the Principal AI Architect for Teradata AI Studio, you will define the...  ...integrates with Teradata Vantage's query engine, model registry, feature store, and agent harness. You will... 
    Permanent employment
    Flexible hours

    Teradata

    San Diego, CA
    4 days ago
  •  ...4  Remote Position: Yes  Region: Americas  Country: USA Summary This position is for a Senior Principal Engineer, AI/ML System Architect. As system architect,one will define the architecture of leading and competitive AI systems, lead new technologyresearch, study... 
    Local area
    Remote work

    Celestica

    San Diego, CA
    5 days ago
  • A global professional services company in San Diego is seeking a Marketing Services Campaign Manager to oversee campaign execution. The candidate will develop strategies within marketing automation platforms and liaise with clients to enhance marketing effectiveness. Ideally...

    Accenture

    San Diego, CA
    3 days ago
  • $185k - $215k

     ...healthcare technology company in San Diego is seeking an experienced AI Architect to design and implement AI platforms that streamline complex...  .... This role demands significant expertise in large language models and distributed AI systems. Ideal candidates will possess... 

    XiFin, Inc.

    San Diego, CA
    2 days ago
  • Accenture is hiring an MDM Architect in San Diego, CA to define and deliver enterprise-scale Master Data Management solutions for global clients...  ...and collaborative work across business units, offering a dynamic environment within AI & Data innovation. #J-18808-Ljbffr Accenture

    Accenture

    San Diego, CA
    2 days ago
  • $141k - $228.8k

     ...Engineering Advisor to develop innovative software solutions at the Lilly Biotechnology Center in San Diego. The role focuses on designing AI-integrated lab environments that enhance biotherapeutics discovery through automation. Qualified candidates will hold advanced... 

    Initial Therapeutics, Inc.

    San Diego, CA
    3 days ago
  • $185k - $215k

     ...interested in harnessing technology and AI to transform healthcare? At XiFin , we believe...  ...real difference. About The Role The AI Architect will lead the design and implementation...  ...that combine large language models (LLMs), knowledge graphs, retrieval systems... 
    Flexible hours

    XiFin, Inc.

    San Diego, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Model Optimization Architect. Be the first to apply!