Principal AI Performance Engineer
Advanced Micro Devices , Inc.
WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. PRINCIPAL AI PERFORMANCE ENGINEER THE ROLE: AMD is looking for a performance-obsessed engineer to drive AI inference performance to the absolute limit on AMD GPUs. You will lead a small, highly technical team and work end-to-end across the stack: profiling, diagnosing, and optimizing leading models on customer-relevant serving configurations (e.g. agentic coding, long-context, high-throughput serving). You move from challenge to challenge, tackling the hardest performance problems across our most strategic customer engagements and leaving behind measurable uplifts and reusable methodology. This is not a sustaining role: every engagement is different, every optimization leaves a lasting impact. THE PERSON: You can take any AI workload, understand it top to bottom, and make it faster. You are equally comfortable profiling a distributed serving deployment, diagnosing a kernel-level bottleneck, and presenting optimization results to a customer's VP of Engineering. You understand GPU kernel performance deeply: not just how to use profiling tools, but how to reason about occupancy, cache behavior, memory coalescing, and instruction-level bottlenecks from first principles. You lead through technical depth: you set the standard for your team by doing the hardest work yourself and pulling others up along the way. You are AI-fluent, not just in the models you optimize, but in how you work: you leverage AI agents and tools daily to accelerate your workflows, and you actively define new ways of using them to make yourself and your team more effective. You thrive under pressure, move fast, and measure everything. KEY RESPONSIBILITIES:
At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. PRINCIPAL AI PERFORMANCE ENGINEER THE ROLE: AMD is looking for a performance-obsessed engineer to drive AI inference performance to the absolute limit on AMD GPUs. You will lead a small, highly technical team and work end-to-end across the stack: profiling, diagnosing, and optimizing leading models on customer-relevant serving configurations (e.g. agentic coding, long-context, high-throughput serving). You move from challenge to challenge, tackling the hardest performance problems across our most strategic customer engagements and leaving behind measurable uplifts and reusable methodology. This is not a sustaining role: every engagement is different, every optimization leaves a lasting impact. THE PERSON: You can take any AI workload, understand it top to bottom, and make it faster. You are equally comfortable profiling a distributed serving deployment, diagnosing a kernel-level bottleneck, and presenting optimization results to a customer's VP of Engineering. You understand GPU kernel performance deeply: not just how to use profiling tools, but how to reason about occupancy, cache behavior, memory coalescing, and instruction-level bottlenecks from first principles. You lead through technical depth: you set the standard for your team by doing the hardest work yourself and pulling others up along the way. You are AI-fluent, not just in the models you optimize, but in how you work: you leverage AI agents and tools daily to accelerate your workflows, and you actively define new ways of using them to make yourself and your team more effective. You thrive under pressure, move fast, and measure everything. KEY RESPONSIBILITIES:
- Drive performance optimization end-to-end across the stack on leading models and customer-relevant serving configurations, closing competitive gaps through kernel and systems-level optimizations
- Profile, diagnose, and resolve the hardest cross-stack performance bottlenecks, from GPU kernels and operator dispatch to framework-level scheduling and multi-node communication
- Diagnose kernel-level performance issues using profiling tools: identify occupancy limitations, L2 cache thrashing, register pressure, memory coalescing issues, etc, and translate findings into actionable optimizations
- Lead customer-facing technical engagements: present findings, recommend optimizations, and deliver measurable performance uplifts
- Integrate and optimize custom kernels (Triton, Gluon, CK, PyDSL, ASM, AITER) within serving frameworks, understanding dispatch paths, shape extraction, and backend selection
- Optimize multi-node distributed inference: communication-compute overlap, parallelism strategies, and scale-out performance
- Develop and refine shared performance optimization methodology that raises the bar across the broader team
- Leverage AI agents to accelerate daily work and define best practices for AI-assisted performance engineering
- Upstream optimizations into open-source frameworks such as vLLM, SGLang, and PyTorch
- 7+ years of software development experience in GPU computing, AI systems, or high-performance computing
- Deep hands-on experience with AI serving frameworks (vLLM, SGLang, TensorRT-LLM, or similar) and their internals
- Strong background in end-to-end workload profiling and bottleneck diagnosis: you can trace from user request to GPU kernel and back
- Understanding of GPU kernel performance characteristics: occupancy, register and LDS pressure, memory coalescing, cache utilization, wavefront scheduling, and instruction-level bottlenecks
- Ability to read and reason about kernel-level profiling data and translate it into concrete optimization actions. You may not write kernels from scratch daily, but you can tell exactly why one is slow and what needs to change
- Understanding of model architectures (transformers, MoE, diffusion), inference paradigms (speculative decoding, prefill-decode disaggregation, continuous batching), and how they map to hardware
- Experience with custom kernel development or integration (HIP, CUDA, Triton, CK, or similar)
- Understanding of multi-GPU and multi-node distributed systems: scale-up and scale-out topologies, RCCL/NCCL, RDMA, and communication-compute overlap
- System and rack-level design awareness: understanding performance tradeoffs across the full deployment stack
- Strong proficiency in Python and C++
- Customer-facing technical leadership experience: ability to engage with customers, present findings, and drive decisions
- Fluent in AI-assisted development: daily user of AI agents and tools, with a mindset toward defining new AI-powered workflows
- Strong Linux systems knowledge
- Excellent written and verbal English communication skills
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Principal AI Performance Engineer in San Jose, CA vacancy
$272k - $431.25k
...NVIDIA Corporation seeks a Principal AI and ML Infra Software Engineer in Santa Clara, California, to enhance the efficiency of AI/ML research on GPU... ...collaboration with various teams, monitoring infrastructure performance, and implementing improvements based on AI/ML...PrincipalPerformance$272k - $431.25k
...NVIDIA Gruppe is looking for a Principal Software Engineer to advance open-source AI inference. This hands-on role emphasizes running high-performance inference on NVIDIA platforms and involves collaboration across various teams. Key responsibilities include optimizing...PrincipalPerformance- ...AI Compiler Engineer Locations available: San Diego and San Jose, California or Austin, Texas NXP is searching for a hands-on AI Compiler... ...silicon and code for maximum impact. Diagnose and crush performance bottlenecks with AI-enabled profiling and diagnostics,...PrincipalPerformanceLocal area
$262.7k - $355.4k
...Job Title Are you passionate about optimizing AI workloads and delivering real-world performance improvements on edge devices? We're looking for an experienced engineer to help customers achieve best-in-class inference performance for production AI models running...PrincipalPerformance- ...About the Opportunity We are seeking a Principal Engineer with a deep expertise in autonomous AI agent architecture and deployment, to spearhead the design,... ...simulations, and continuous learning pipelines for agent performance optimization. Stay ahead of the curve on...PrincipalPerformance
- ...Principal AI Performance Engineer US - Milpitas About Us Graphcore is one of the world's leading innovators in Artificial Intelligence compute. It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs...PrincipalPerformanceFlexible hours
- ...that accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems.... ...future of AI and beyond. Together, we advance your career. PRINCIPAL AI PERFORMANCE ENGINEER THE ROLE: AMD is looking for a performance‑obsessed engineer...PrincipalPerformance
$223k - $306.5k
...Execution, Integrity, and Inclusion. We weave AI into the fabric of everything we do... ...world. Job Summary As a Sr Principal AI Engineer, you will join a dynamic team to pioneer... ...outputs, and model latency. Cost & Performance Optimization: Create architecture that...PrincipalPerformanceFull timeWork at office$185.2k - $299.48k
...Execution, Integrity, and Inclusion. We weave AI into the fabric of everything we do... ...best work. Your Career As a Principal AI Engineer for the Enterprise AI Platform, you... ..., distributed AI/ML systems, ensuring performance, reliability, and security. Proactively...PrincipalPerformanceFull timeWork at office$123.24k - $200k
...Overview of Role As a Sr./Principal AI Engineer within TSMC's Artificial Intelligence for Business Intelligence Innovation (AI4BII) Center... ..., machine learning engineering, or related fields in high-performance environments. This should include: ~7+ years of hands-on...PrincipalPerformanceWork at office$313.06k
...for every OK‑er. About the Opportunity We are looking for a Principal AI Engineer to lead the architecture and deployment of large‑scale, LLM... ...simulations, and continuous learning pipelines for Chatbot performance optimization. Design multi‑level intent routing, classifier...PrincipalPerformance$167k - $270.5k
...Integrity, and Inclusion. We weave AI into the fabric of everything... ...experience motion. The Sr. Principal/Principal person will have... ..., observability and performance across all AI/ML/Agentic components... ...validation. Partner with data engineering to design high-quality...PrincipalPerformanceFull timeWork at office- ...Principal Ai Engineer TENEX is an AI-native, automation-first, built-for-scale Managed Detection and Response (MDR) provider. We are a force... ...for designing, developing, and optimizing scalable, high-performance AI systems. You will play a crucial role in shaping the...PrincipalPerformanceWork from home
- ...accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded... ...ROLE: AMD is looking for a Senior Staff AI Infra Engineer who is passionate about improving the performance of key applications and benchmarks, with a special focus...PrincipalPerformance
- ...Senior Principal AI Agent / ML Software Engineer The Senior Principal AI Agent / ML Software Engineer is a Senior Staff-level, hands-on technical... ...critical production outcomes, including reliability, performance, security posture, cost efficiency, and supportability...PrincipalPerformance
$170k - $240k
...there’s no better time to join the Extreme team. Position: AI Staff Machine Learning Engineer - Gen AI, Machine Learning, Graph ML, Big Data... ...businesses manage networks by building intelligent, high-performance multi-agent systems that perceive, learn, and act in real...PrincipalPerformanceRemote workShift work$99.6k - $234.6k
...Principal AI Agent / ML Software Engineer The Principal AI Agent / ML Software Engineer is a Senior Staff-level, hands-on technical leadership role... ...critical production outcomes, including reliability, performance, security posture, cost efficiency, and supportability...PrincipalPerformanceTemporary workFlexible hours$275.8k - $340.5k
...to meet the unique demands of AI and ML innovation, supporting... ...the productivity of ML engineers, and drive the adoption of cutting... ...Inference: Ensures robust model performance by running large-scale... ...Position Overview: The Principal AI/ML Engineer will lead a growing...PrincipalPerformanceLocal areaRemote workWork from homeRelocationRelocation packageFlexible hours$296.3k
...to meet the unique demands of AI and ML innovation, supporting... ...the productivity of ML engineers, and drive the adoption of cutting... ...Inference: Ensures robust model performance by running large-scale... ...Position Overview: The Principal AI/ML Engineer will lead a growing...PrincipalPerformanceLocal areaWork from homeFlexible hours- ...NVIDIA Gruppe is seeking a Principal AI and ML Infra Software Engineer to join our Hardware Infrastructure team in Santa Clara, CA. In this role, you'll work closely with AI research teams to enhance efficiency by addressing infrastructure deficiencies for GPU Clusters...Principal
$180k - $300k
...unleashing the potential of generative AI to power the transformation of... ...possibilities of AI. AI Software Application Engineer – Technical lead / Principal d-Matrix is seeking an... ...a focus on inference solutions and performance optimization for data center environments...PrincipalPerformance$171k - $256k
...Possible™. Learn more at and on LinkedIn and Twitter (X) . Principal AI Engineer, Intelligent Sensors Role Overview: The Edge AI team... ...of hardware-aware AI, including benchmarking and performance evaluation on embedded platforms. ~ Proficiency in Python...PrincipalPerformancePermanent employmentWork at officeShift workDay shift$170k - $277k
...A leading cybersecurity company is seeking an experienced engineer to join their Cloud Network and AI Security team. The role involves designing and implementing next-generation virtualization features for their firewalls and AI solutions, across both public and private...Principal$190k - $270k
...Celestica Inc. is looking for a Principal, Design Engineering to lead the hands-on design and coding of AI architectures, focusing on high accuracy and optimization. Located in San Jose, CA, this role requires experience in leading AI projects and knowledge of tools like...Principal- ...leading semiconductor organization seeking a PrincipalApplications Engineer with deep expertise in DC/DC power conversion.You will work... ...teams to supportadvanced power solutions used in high-performance systems. Responsibilities Support customers with DC/DC power solutions...PrincipalPerformance
$272k - $425.5k
Principal Software Engineer – Large-Scale LLM Memory and Storage Systems page is loaded## Principal Software... ...framework for serving generative AI and reasoning models across multi-... ...distributed environments. Built in Rust for performance and Python for extensibility, Dynamo...PrincipalPerformanceLocal areaRemote work$175k - $350k
...Power Architecture Lead for PMIC in San Jose, CA. In this role, you will define and oversee the IVR architecture, ensure optimal performance across power domain specifications, and collaborate with foundry partners. Qualified candidates should possess over 15 years of experience...PrincipalPerformance$167k - $270.5k
Palo Alto Networks, Inc. is seeking a Technical Leader to develop AI applications within the GTM/CX domain. This role involves defining the architecture for scalable AI/ML systems and leading the design of intelligent agents. Ideal candidates will have 15+ years of experience...Principal- ...d-Matrix inc. is seeking a Principal Software Engineer specializing in kernels at our headquarters in Santa Clara, CA. In this... ...maintaining software kernels for next-generation AI hardware, ensuring optimized performance. The ideal candidate has a strong background in computer...PrincipalPerformance3 days per week
- ...A leading automotive company is seeking a Principal AI Engineer to lead the design and optimization of its AI platform. The successful candidate will guide the infrastructure for large-scale training and cloud inference, working closely with data scientists and engineers...PrincipalRemote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Principal AI Performance Engineer. Be the first to apply!
Related searches
- principal network engineer San Jose, CA
- senior director engineering San Jose, CA
- engineering director San Jose, CA
- principal engineer San Jose, CA
- principal application developer San Jose, CA
- director systems engineering San Jose, CA
- director software engineering San Jose, CA
- project engineer assistant project manager San Jose, CA
- general engineer San Jose, CA
- director data engineering San Jose, CA


