Principal AI Performance Engineer

Advanced Micro Devices , Inc.

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

PRINCIPAL AI PERFORMANCE ENGINEER

THE ROLE:

AMD is looking for a performance-obsessed engineer to drive AI inference performance to the absolute limit on AMD GPUs. You will lead a small, highly technical team and work end-to-end across the stack: profiling, diagnosing, and optimizing leading models on customer-relevant serving configurations (e.g. agentic coding, long-context, high-throughput serving). You move from challenge to challenge, tackling the hardest performance problems across our most strategic customer engagements and leaving behind measurable uplifts and reusable methodology. This is not a sustaining role: every engagement is different, every optimization leaves a lasting impact.

THE PERSON:

You can take any AI workload, understand it top to bottom, and make it faster. You are equally comfortable profiling a distributed serving deployment, diagnosing a kernel-level bottleneck, and presenting optimization results to a customer's VP of Engineering. You understand GPU kernel performance deeply: not just how to use profiling tools, but how to reason about occupancy, cache behavior, memory coalescing, and instruction-level bottlenecks from first principles. You lead through technical depth: you set the standard for your team by doing the hardest work yourself and pulling others up along the way. You are AI-fluent, not just in the models you optimize, but in how you work: you leverage AI agents and tools daily to accelerate your workflows, and you actively define new ways of using them to make yourself and your team more effective. You thrive under pressure, move fast, and measure everything.

KEY RESPONSIBILITIES:

Drive performance optimization end-to-end across the stack on leading models and customer-relevant serving configurations, closing competitive gaps through kernel and systems-level optimizations
Profile, diagnose, and resolve the hardest cross-stack performance bottlenecks, from GPU kernels and operator dispatch to framework-level scheduling and multi-node communication
Diagnose kernel-level performance issues using profiling tools: identify occupancy limitations, L2 cache thrashing, register pressure, memory coalescing issues, etc, and translate findings into actionable optimizations
Lead customer-facing technical engagements: present findings, recommend optimizations, and deliver measurable performance uplifts
Integrate and optimize custom kernels (Triton, Gluon, CK, PyDSL, ASM, AITER) within serving frameworks, understanding dispatch paths, shape extraction, and backend selection
Optimize multi-node distributed inference: communication-compute overlap, parallelism strategies, and scale-out performance
Develop and refine shared performance optimization methodology that raises the bar across the broader team
Leverage AI agents to accelerate daily work and define best practices for AI-assisted performance engineering
Upstream optimizations into open-source frameworks such as vLLM, SGLang, and PyTorch

PREFERRED EXPERIENCE:

7+ years of software development experience in GPU computing, AI systems, or high-performance computing
Deep hands-on experience with AI serving frameworks (vLLM, SGLang, TensorRT-LLM, or similar) and their internals
Strong background in end-to-end workload profiling and bottleneck diagnosis: you can trace from user request to GPU kernel and back
Understanding of GPU kernel performance characteristics: occupancy, register and LDS pressure, memory coalescing, cache utilization, wavefront scheduling, and instruction-level bottlenecks
Ability to read and reason about kernel-level profiling data and translate it into concrete optimization actions. You may not write kernels from scratch daily, but you can tell exactly why one is slow and what needs to change
Understanding of model architectures (transformers, MoE, diffusion), inference paradigms (speculative decoding, prefill-decode disaggregation, continuous batching), and how they map to hardware
Experience with custom kernel development or integration (HIP, CUDA, Triton, CK, or similar)
Understanding of multi-GPU and multi-node distributed systems: scale-up and scale-out topologies, RCCL/NCCL, RDMA, and communication-compute overlap
System and rack-level design awareness: understanding performance tradeoffs across the full deployment stack
Strong proficiency in Python and C++
Customer-facing technical leadership experience: ability to engage with customers, present findings, and drive decisions
Fluent in AI-assisted development: daily user of AI agents and tools, with a mindset toward defining new AI-powered workflows
Strong Linux systems knowledge
Excellent written and verbal English communication skills

ACADEMIC CREDENTIALS:

Bachelor's, Master's, or PhD in Computer Science, Computer Engineering, Electrical Engineering, or equivalent. Advanced degree preferred but exceptional industry experience valued equally.

LOCATION:

San Jose, CA, preferred

#LI-TC1

#LI-HYBRID

Benefits offered are described: AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.

AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD's "Responsible AI Policy" is available here.

This posting is for an existing vacancy.

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the Principal AI Performance Engineer in San Jose, CA vacancy

Principal AI/ML Infra Engineer GPU Clusters & HPC
$272k - $431.25k
...NVIDIA Corporation seeks a Principal AI and ML Infra Software Engineer in Santa Clara, California, to enhance the efficiency of AI/ML research on GPU... ...collaboration with various teams, monitoring infrastructure performance, and implementing improvements based on AI/ML...
Principal
Performance
Jobleads-US
Santa Clara, CA
4 days ago
Principal AI Inference Engineer Open-Source & GPU-Focused
$272k - $431.25k
...NVIDIA Gruppe is looking for a Principal Software Engineer to advance open-source AI inference. This hands-on role emphasizes running high-performance inference on NVIDIA platforms and involves collaboration across various teams. Key responsibilities include optimizing...
Principal
Performance
NVIDIA Gruppe
Santa Clara, CA
12 hours ago
Principal AI Compiler Engineer
...AI Compiler Engineer Locations available: San Diego and San Jose, California or Austin, Texas NXP is searching for a hands-on AI Compiler... ...silicon and code for maximum impact. Diagnose and crush performance bottlenecks with AI-enabled profiling and diagnostics,...
Principal
Performance
Local area
NXP Semiconductors
San Jose, CA
12 hours ago
Principal AI Performance Engineer
$262.7k - $355.4k
...Job Title Are you passionate about optimizing AI workloads and delivering real-world performance improvements on edge devices? We're looking for an experienced engineer to help customers achieve best-in-class inference performance for production AI models running...
Principal
Performance
ARM
San Jose, CA
5 hours ago
Principal AI Engineer, AI Agent Development
...About the Opportunity We are seeking a Principal Engineer with a deep expertise in autonomous AI agent architecture and deployment, to spearhead the design,... ...simulations, and continuous learning pipelines for agent performance optimization. Stay ahead of the curve on...
Principal
Performance
United States Digital Space LLC
San Jose, CA
4 days ago
Principal AI Performance Engineer
...Principal AI Performance Engineer US - Milpitas About Us Graphcore is one of the world's leading innovators in Artificial Intelligence compute. It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs...
Principal
Performance
Flexible hours
Graphcore
Milpitas, CA
3 days ago
Principal AI Performance Engineer
...that accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems.... ...future of AI and beyond. Together, we advance your career. PRINCIPAL AI PERFORMANCE ENGINEER THE ROLE: AMD is looking for a performance‑obsessed engineer...
Principal
Performance
Advanced Micro Devices , Inc.
San Jose, CA
4 days ago
Senior Principal AI Software Engineer - IT Go-To-Market & Customer Experience
$223k - $306.5k
...Execution, Integrity, and Inclusion. We weave AI into the fabric of everything we do... ...world. Job Summary As a Sr Principal AI Engineer, you will join a dynamic team to pioneer... ...outputs, and model latency. Cost & Performance Optimization: Create architecture that...
Principal
Performance
Full time
Work at office
Palo Alto Networks
Santa Clara, CA
8 days ago
Principal AI Engineer - Enterprise AI Solutions
$185.2k - $299.48k
...Execution, Integrity, and Inclusion. We weave AI into the fabric of everything we do... ...best work. Your Career As a Principal AI Engineer for the Enterprise AI Platform, you... ..., distributed AI/ML systems, ensuring performance, reliability, and security. Proactively...
Principal
Performance
Full time
Work at office
Palo Alto Networks
Santa Clara, CA
9 days ago
Senior / Principal AI Engineer for Business Intelligence (7063)
$123.24k - $200k
...Overview of Role As a Sr./Principal AI Engineer within TSMC's Artificial Intelligence for Business Intelligence Innovation (AI4BII) Center... ..., machine learning engineering, or related fields in high-performance environments. This should include: ~7+ years of hands-on...
Principal
Performance
Work at office
Taiwan Semiconductor Manufacturing Company Limited
San Jose, CA
12 hours ago
Principal AI Engineer, Chatbot Development
$313.06k
...for every OK‑er. About the Opportunity We are looking for a Principal AI Engineer to lead the architecture and deployment of large‑scale, LLM... ...simulations, and continuous learning pipelines for Chatbot performance optimization. Design multi‑level intent routing, classifier...
Principal
Performance
United States Digital Space LLC
San Jose, CA
4 days ago
Principal AI Software Engineer
$167k - $270.5k
...Integrity, and Inclusion. We weave AI into the fabric of everything... ...experience motion. The Sr. Principal/Principal person will have... ..., observability and performance across all AI/ML/Agentic components... ...validation. Partner with data engineering to design high-quality...
Principal
Performance
Full time
Work at office
Palo Alto Networks
Santa Clara, CA
6 days ago
Principal AI Engineer
...Principal Ai Engineer TENEX is an AI-native, automation-first, built-for-scale Managed Detection and Response (MDR) provider. We are a force... ...for designing, developing, and optimizing scalable, high-performance AI systems. You will play a crucial role in shaping the...
Principal
Performance
Work from home
TenEx
San Jose, CA
2 days ago
Principal AI Inference Systems Engineer
...accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded... ...ROLE: AMD is looking for a Senior Staff AI Infra Engineer who is passionate about improving the performance of key applications and benchmarks, with a special focus...
Principal
Performance
Advanced Micro Devices , Inc.
Santa Clara, CA
4 days ago
Senior Principal AI Agent / ML Engineer (OCI)
...Senior Principal AI Agent / ML Software Engineer The Senior Principal AI Agent / ML Software Engineer is a Senior Staff-level, hands-on technical... ...critical production outcomes, including reliability, performance, security posture, cost efficiency, and supportability...
Principal
Performance
Oracle
Santa Clara, CA
1 day ago
AI Principal Machine Learning Engineer (10189)
$170k - $240k
...there’s no better time to join the Extreme team. Position: AI Staff Machine Learning Engineer - Gen AI, Machine Learning, Graph ML, Big Data... ...businesses manage networks by building intelligent, high-performance multi-agent systems that perceive, learn, and act in real...
Principal
Performance
Remote work
Shift work
Extreme Networks
San Jose, CA
6 hours ago
Principal AI Agent / ML Software Engineer (OCI)
$99.6k - $234.6k
...Principal AI Agent / ML Software Engineer The Principal AI Agent / ML Software Engineer is a Senior Staff-level, hands-on technical leadership role... ...critical production outcomes, including reliability, performance, security posture, cost efficiency, and supportability...
Principal
Performance
Temporary work
Flexible hours
Oracle
Santa Clara, CA
1 day ago
Principal AI/ML Engineer, AV ML Infra
$275.8k - $340.5k
...to meet the unique demands of AI and ML innovation, supporting... ...the productivity of ML engineers, and drive the adoption of cutting... ...Inference: Ensures robust model performance by running large-scale... ...Position Overview: The Principal AI/ML Engineer will lead a growing...
Principal
Performance
Local area
Remote work
Work from home
Relocation
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
4 days ago
Principal Staff AI/ML Engineer - AV ML Infra
$296.3k
...to meet the unique demands of AI and ML innovation, supporting... ...the productivity of ML engineers, and drive the adoption of cutting... ...Inference: Ensures robust model performance by running large-scale... ...Position Overview: The Principal AI/ML Engineer will lead a growing...
Principal
Performance
Local area
Work from home
Flexible hours
General Motors
Sunnyvale, CA
6 days ago
Principal AI/ML Infra Engineer for GPU Clusters
...NVIDIA Gruppe is seeking a Principal AI and ML Infra Software Engineer to join our Hardware Infrastructure team in Santa Clara, CA. In this role, you'll work closely with AI research teams to enhance efficiency by addressing infrastructure deficiencies for GPU Clusters...
Principal
Jobleads-US
Santa Clara, CA
4 days ago
AI Software Application Engineer, Technical Lead / Principal
$180k - $300k
...unleashing the potential of generative AI to power the transformation of... ...possibilities of AI. AI Software Application Engineer – Technical lead / Principal d-Matrix is seeking an... ...a focus on inference solutions and performance optimization for data center environments...
Principal
Performance
d-Matrix
Santa Clara, CA
more than 2 months ago
Principal AI Engineer, Intelligent Sensors
$171k - $256k
...Possible™. Learn more at and on LinkedIn and Twitter (X) . Principal AI Engineer, Intelligent Sensors Role Overview: The Edge AI team... ...of hardware-aware AI, including benchmarking and performance evaluation on embedded platforms. ~ Proficiency in Python...
Principal
Performance
Permanent employment
Work at office
Shift work
Day shift
Analog Devices
San Jose, CA
more than 2 months ago
Principal Cloud Network & AI Security Engineer
$170k - $277k
...A leading cybersecurity company is seeking an experienced engineer to join their Cloud Network and AI Security team. The role involves designing and implementing next-generation virtualization features for their firewalls and AI solutions, across both public and private...
Principal
Palo Alto Networks
Santa Clara, CA
12 hours ago
Principal AI Design Engineer - Hardware Systems
$190k - $270k
...Celestica Inc. is looking for a Principal, Design Engineering to lead the hands-on design and coding of AI architectures, focusing on high accuracy and optimization. Located in San Jose, CA, this role requires experience in leading AI projects and knowledge of tools like...
Principal
Celestica
San Jose, CA
1 day ago
Principal Applications Engineer - DC/DC
...leading semiconductor organization seeking a PrincipalApplications Engineer with deep expertise in DC/DC power conversion.You will work... ...teams to supportadvanced power solutions used in high-performance systems. Responsibilities Support customers with DC/DC power solutions...
Principal
Performance
Charles Brown Recruitment
San Jose, CA
4 days ago
Principal Software Engineer - Large-Scale LLM Memory and Storage Systems
$272k - $425.5k
Principal Software Engineer – Large-Scale LLM Memory and Storage Systems page is loaded## Principal Software... ...framework for serving generative AI and reasoning models across multi-... ...distributed environments. Built in Rust for performance and Python for extensibility, Dynamo...
Principal
Performance
Local area
Remote work
NVIDIA Corporation
Santa Clara, CA
1 day ago
Principal PMIC Architect: AI-Optimized Power Delivery
$175k - $350k
...Power Architecture Lead for PMIC in San Jose, CA. In this role, you will define and oversee the IVR architecture, ensure optimal performance across power domain specifications, and collaborate with foundry partners. Qualified candidates should possess over 15 years of experience...
Principal
Performance
TylSemi
San Jose, CA
12 hours ago
Principal AI Engineer: Scalable LLM Agents for GTM
$167k - $270.5k
Palo Alto Networks, Inc. is seeking a Technical Leader to develop AI applications within the GTM/CX domain. This role involves defining the architecture for scalable AI/ML systems and leading the design of intelligent agents. Ideal candidates will have 15+ years of experience...
Principal
Palo Alto Networks, Inc.
Santa Clara, CA
12 hours ago
Principal Kernel Software Engineer for AI Compute
...d-Matrix inc. is seeking a Principal Software Engineer specializing in kernels at our headquarters in Santa Clara, CA. In this... ...maintaining software kernels for next-generation AI hardware, ensuring optimized performance. The ideal candidate has a strong background in computer...
Principal
Performance
3 days per week
d-Matrix inc.
Santa Clara, CA
4 days ago
Remote Principal AI Engineer - Scalable ML Platform
...A leading automotive company is seeking a Principal AI Engineer to lead the design and optimization of its AI platform. The successful candidate will guide the infrastructure for large-scale training and cloud inference, working closely with data scientists and engineers...
Principal
Remote work
General Motors
Sunnyvale, CA
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Principal AI Performance Engineer. Be the first to apply!