Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Principal AI Performance Engineer

Advanced Micro Devices , Inc.

WHAT YOU DO AT AMD CHANGES EVERYTHING


At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

PRINCIPAL AI PERFORMANCE ENGINEER

THE ROLE:

AMD is looking for a performance-obsessed engineer to drive AI inference performance to the absolute limit on AMD GPUs. You will lead a small, highly technical team and work end-to-end across the stack: profiling, diagnosing, and optimizing leading models on customer-relevant serving configurations (e.g. agentic coding, long-context, high-throughput serving). You move from challenge to challenge, tackling the hardest performance problems across our most strategic customer engagements and leaving behind measurable uplifts and reusable methodology. This is not a sustaining role: every engagement is different, every optimization leaves a lasting impact.

THE PERSON:

You can take any AI workload, understand it top to bottom, and make it faster. You are equally comfortable profiling a distributed serving deployment, diagnosing a kernel-level bottleneck, and presenting optimization results to a customer's VP of Engineering. You understand GPU kernel performance deeply: not just how to use profiling tools, but how to reason about occupancy, cache behavior, memory coalescing, and instruction-level bottlenecks from first principles. You lead through technical depth: you set the standard for your team by doing the hardest work yourself and pulling others up along the way. You are AI-fluent, not just in the models you optimize, but in how you work: you leverage AI agents and tools daily to accelerate your workflows, and you actively define new ways of using them to make yourself and your team more effective. You thrive under pressure, move fast, and measure everything.

KEY RESPONSIBILITIES:
  • Drive performance optimization end-to-end across the stack on leading models and customer-relevant serving configurations, closing competitive gaps through kernel and systems-level optimizations
  • Profile, diagnose, and resolve the hardest cross-stack performance bottlenecks, from GPU kernels and operator dispatch to framework-level scheduling and multi-node communication
  • Diagnose kernel-level performance issues using profiling tools: identify occupancy limitations, L2 cache thrashing, register pressure, memory coalescing issues, etc, and translate findings into actionable optimizations
  • Lead customer-facing technical engagements: present findings, recommend optimizations, and deliver measurable performance uplifts
  • Integrate and optimize custom kernels (Triton, Gluon, CK, PyDSL, ASM, AITER) within serving frameworks, understanding dispatch paths, shape extraction, and backend selection
  • Optimize multi-node distributed inference: communication-compute overlap, parallelism strategies, and scale-out performance
  • Develop and refine shared performance optimization methodology that raises the bar across the broader team
  • Leverage AI agents to accelerate daily work and define best practices for AI-assisted performance engineering
  • Upstream optimizations into open-source frameworks such as vLLM, SGLang, and PyTorch
PREFERRED EXPERIENCE:
  • 7+ years of software development experience in GPU computing, AI systems, or high-performance computing
  • Deep hands-on experience with AI serving frameworks (vLLM, SGLang, TensorRT-LLM, or similar) and their internals
  • Strong background in end-to-end workload profiling and bottleneck diagnosis: you can trace from user request to GPU kernel and back
  • Understanding of GPU kernel performance characteristics: occupancy, register and LDS pressure, memory coalescing, cache utilization, wavefront scheduling, and instruction-level bottlenecks
  • Ability to read and reason about kernel-level profiling data and translate it into concrete optimization actions. You may not write kernels from scratch daily, but you can tell exactly why one is slow and what needs to change
  • Understanding of model architectures (transformers, MoE, diffusion), inference paradigms (speculative decoding, prefill-decode disaggregation, continuous batching), and how they map to hardware
  • Experience with custom kernel development or integration (HIP, CUDA, Triton, CK, or similar)
  • Understanding of multi-GPU and multi-node distributed systems: scale-up and scale-out topologies, RCCL/NCCL, RDMA, and communication-compute overlap
  • System and rack-level design awareness: understanding performance tradeoffs across the full deployment stack
  • Strong proficiency in Python and C++
  • Customer-facing technical leadership experience: ability to engage with customers, present findings, and drive decisions
  • Fluent in AI-assisted development: daily user of AI agents and tools, with a mindset toward defining new AI-powered workflows
  • Strong Linux systems knowledge
  • Excellent written and verbal English communication skills

ACADEMIC CREDENTIALS:

Bachelor's, Master's, or PhD in Computer Science, Computer Engineering, Electrical Engineering, or equivalent. Advanced degree preferred but exceptional industry experience valued equally.

LOCATION:

San Jose, CA, preferred

#LI-TC1

#LI-HYBRID

Benefits offered are described: AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.

AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD's "Responsible AI Policy" is available here.

This posting is for an existing vacancy.
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Principal AI Performance Engineer in San Jose, CA vacancy
  • $272k - $431.25k

     ...NVIDIA Corporation seeks a Principal AI and ML Infra Software Engineer in Santa Clara, California, to enhance the efficiency of AI/ML research on GPU...  ...collaboration with various teams, monitoring infrastructure performance, and implementing improvements based on AI/ML... 
    Principal
    Performance

    Jobleads-US

    Santa Clara, CA
    4 days ago
  • $272k - $431.25k

     ...NVIDIA Gruppe is looking for a Principal Software Engineer to advance open-source AI inference. This hands-on role emphasizes running high-performance inference on NVIDIA platforms and involves collaboration across various teams. Key responsibilities include optimizing... 
    Principal
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    12 hours ago
  •  ...AI Compiler Engineer Locations available: San Diego and San Jose, California or Austin, Texas NXP is searching for a hands-on AI Compiler...  ...silicon and code for maximum impact. Diagnose and crush performance bottlenecks with AI-enabled profiling and diagnostics,... 
    Principal
    Performance
    Local area

    NXP Semiconductors

    San Jose, CA
    12 hours ago
  • $262.7k - $355.4k

     ...Job Title Are you passionate about optimizing AI workloads and delivering real-world performance improvements on edge devices? We're looking for an experienced engineer to help customers achieve best-in-class inference performance for production AI models running... 
    Principal
    Performance

    ARM

    San Jose, CA
    5 hours ago
  •  ...About the Opportunity We are seeking a Principal Engineer with a deep expertise in autonomous AI agent architecture and deployment, to spearhead the design,...  ...simulations, and continuous learning pipelines for agent performance optimization. Stay ahead of the curve on... 
    Principal
    Performance

    United States Digital Space LLC

    San Jose, CA
    4 days ago
  •  ...Principal AI Performance Engineer US - Milpitas About Us Graphcore is one of the world's leading innovators in Artificial Intelligence compute. It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs... 
    Principal
    Performance
    Flexible hours

    Graphcore

    Milpitas, CA
    3 days ago
  •  ...that accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems....  ...future of AI and beyond. Together, we advance your career. PRINCIPAL AI PERFORMANCE ENGINEER THE ROLE: AMD is looking for a performance‑obsessed engineer... 
    Principal
    Performance

    Advanced Micro Devices , Inc.

    San Jose, CA
    4 days ago
  • $223k - $306.5k

     ...Execution, Integrity, and Inclusion. We weave AI into the fabric of everything we do...  ...world. Job Summary As a Sr Principal AI Engineer, you will join a dynamic team to pioneer...  ...outputs, and model latency. Cost & Performance Optimization: Create architecture that... 
    Principal
    Performance
    Full time
    Work at office

    Palo Alto Networks

    Santa Clara, CA
    8 days ago
  • $185.2k - $299.48k

     ...Execution, Integrity, and Inclusion. We weave AI into the fabric of everything we do...  ...best work. Your Career As a Principal AI Engineer for the Enterprise AI Platform, you...  ..., distributed AI/ML systems, ensuring performance, reliability, and security. Proactively... 
    Principal
    Performance
    Full time
    Work at office

    Palo Alto Networks

    Santa Clara, CA
    9 days ago
  • $123.24k - $200k

     ...Overview of Role As a Sr./Principal AI Engineer within TSMC's Artificial Intelligence for Business Intelligence Innovation (AI4BII) Center...  ..., machine learning engineering, or related fields in high-performance environments. This should include: ~7+ years of hands-on... 
    Principal
    Performance
    Work at office

    Taiwan Semiconductor Manufacturing Company Limited

    San Jose, CA
    12 hours ago
  • $313.06k

     ...for every OK‑er. About the Opportunity We are looking for a Principal AI Engineer to lead the architecture and deployment of large‑scale, LLM...  ...simulations, and continuous learning pipelines for Chatbot performance optimization. Design multi‑level intent routing, classifier... 
    Principal
    Performance

    United States Digital Space LLC

    San Jose, CA
    4 days ago
  • $167k - $270.5k

     ...Integrity, and Inclusion. We weave AI into the fabric of everything...  ...experience motion. The Sr. Principal/Principal person will have...  ..., observability and performance across all AI/ML/Agentic components...  ...validation. Partner with data engineering to design high-quality... 
    Principal
    Performance
    Full time
    Work at office

    Palo Alto Networks

    Santa Clara, CA
    6 days ago
  •  ...Principal Ai Engineer TENEX is an AI-native, automation-first, built-for-scale Managed Detection and Response (MDR) provider. We are a force...  ...for designing, developing, and optimizing scalable, high-performance AI systems. You will play a crucial role in shaping the... 
    Principal
    Performance
    Work from home

    TenEx

    San Jose, CA
    2 days ago
  •  ...accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded...  ...ROLE: AMD is looking for a Senior Staff AI Infra Engineer who is passionate about improving the performance of key applications and benchmarks, with a special focus... 
    Principal
    Performance

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    4 days ago
  •  ...Senior Principal AI Agent / ML Software Engineer The Senior Principal AI Agent / ML Software Engineer is a Senior Staff-level, hands-on technical...  ...critical production outcomes, including reliability, performance, security posture, cost efficiency, and supportability... 
    Principal
    Performance

    Oracle

    Santa Clara, CA
    1 day ago
  • $170k - $240k

     ...there’s no better time to join the Extreme team. Position: AI Staff Machine Learning Engineer - Gen AI, Machine Learning, Graph ML, Big Data...  ...businesses manage networks by building intelligent, high-performance multi-agent systems that perceive, learn, and act in real... 
    Principal
    Performance
    Remote work
    Shift work

    Extreme Networks

    San Jose, CA
    6 hours ago
  • $99.6k - $234.6k

     ...Principal AI Agent / ML Software Engineer The Principal AI Agent / ML Software Engineer is a Senior Staff-level, hands-on technical leadership role...  ...critical production outcomes, including reliability, performance, security posture, cost efficiency, and supportability... 
    Principal
    Performance
    Temporary work
    Flexible hours

    Oracle

    Santa Clara, CA
    1 day ago
  • $275.8k - $340.5k

     ...to meet the unique demands of AI and ML innovation, supporting...  ...the productivity of ML engineers, and drive the adoption of cutting...  ...Inference: Ensures robust model performance by running large-scale...  ...Position Overview: The Principal AI/ML Engineer will lead a growing... 
    Principal
    Performance
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    4 days ago
  • $296.3k

     ...to meet the unique demands of AI and ML innovation, supporting...  ...the productivity of ML engineers, and drive the adoption of cutting...  ...Inference: Ensures robust model performance by running large-scale...  ...Position Overview: The Principal AI/ML Engineer will lead a growing... 
    Principal
    Performance
    Local area
    Work from home
    Flexible hours

    General Motors

    Sunnyvale, CA
    6 days ago
  •  ...NVIDIA Gruppe is seeking a Principal AI and ML Infra Software Engineer to join our Hardware Infrastructure team in Santa Clara, CA. In this role, you'll work closely with AI research teams to enhance efficiency by addressing infrastructure deficiencies for GPU Clusters... 
    Principal

    Jobleads-US

    Santa Clara, CA
    4 days ago
  • $180k - $300k

     ...unleashing the potential of generative AI to power the transformation of...  ...possibilities of AI.  AI Software Application Engineer – Technical lead / Principal d-Matrix is seeking an...  ...a focus on inference solutions and performance optimization for data center environments... 
    Principal
    Performance

    d-Matrix

    Santa Clara, CA
    more than 2 months ago
  • $171k - $256k

     ...Possible™. Learn more at and on  LinkedIn and  Twitter (X) . Principal AI Engineer, Intelligent Sensors  Role Overview: The Edge AI team...  ...of hardware-aware AI, including benchmarking and performance evaluation on embedded platforms.  ~ Proficiency in Python... 
    Principal
    Performance
    Permanent employment
    Work at office
    Shift work
    Day shift

    Analog Devices

    San Jose, CA
    more than 2 months ago
  • $170k - $277k

     ...A leading cybersecurity company is seeking an experienced engineer to join their Cloud Network and AI Security team. The role involves designing and implementing next-generation virtualization features for their firewalls and AI solutions, across both public and private... 
    Principal

    Palo Alto Networks

    Santa Clara, CA
    12 hours ago
  • $190k - $270k

     ...Celestica Inc. is looking for a Principal, Design Engineering to lead the hands-on design and coding of AI architectures, focusing on high accuracy and optimization. Located in San Jose, CA, this role requires experience in leading AI projects and knowledge of tools like... 
    Principal

    Celestica

    San Jose, CA
    1 day ago
  •  ...leading semiconductor organization seeking a PrincipalApplications Engineer with deep expertise in DC/DC power conversion.You will work...  ...teams to supportadvanced power solutions used in high-performance systems. Responsibilities Support customers with DC/DC power solutions... 
    Principal
    Performance

    Charles Brown Recruitment

    San Jose, CA
    4 days ago
  • $272k - $425.5k

    Principal Software Engineer – Large-Scale LLM Memory and Storage Systems page is loaded## Principal Software...  ...framework for serving generative AI and reasoning models across multi-...  ...distributed environments. Built in Rust for performance and Python for extensibility, Dynamo... 
    Principal
    Performance
    Local area
    Remote work

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $175k - $350k

     ...Power Architecture Lead for PMIC in San Jose, CA. In this role, you will define and oversee the IVR architecture, ensure optimal performance across power domain specifications, and collaborate with foundry partners. Qualified candidates should possess over 15 years of experience... 
    Principal
    Performance

    TylSemi

    San Jose, CA
    12 hours ago
  • $167k - $270.5k

    Palo Alto Networks, Inc. is seeking a Technical Leader to develop AI applications within the GTM/CX domain. This role involves defining the architecture for scalable AI/ML systems and leading the design of intelligent agents. Ideal candidates will have 15+ years of experience... 
    Principal

    Palo Alto Networks, Inc.

    Santa Clara, CA
    12 hours ago
  •  ...d-Matrix inc. is seeking a Principal Software Engineer specializing in kernels at our headquarters in Santa Clara, CA. In this...  ...maintaining software kernels for next-generation AI hardware, ensuring optimized performance. The ideal candidate has a strong background in computer... 
    Principal
    Performance
    3 days per week

    d-Matrix inc.

    Santa Clara, CA
    4 days ago
  •  ...A leading automotive company is seeking a Principal AI Engineer to lead the design and optimization of its AI platform. The successful candidate will guide the infrastructure for large-scale training and cloud inference, working closely with data scientists and engineers... 
    Principal
    Remote work

    General Motors

    Sunnyvale, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Principal AI Performance Engineer. Be the first to apply!