Lead AI Inference Performance Engineer (GPU)

Advanced Micro Devices

A leading technology company is looking for a Principal AI Performance Engineer to optimize AI inference performance on GPUs. In this role, you will lead a team driving performance optimization across various configurations, diagnose complex performance issues, and interact with customers. Ideal candidates possess extensive experience in GPU computing, strong analytical skills, and a background in AI systems. The position is based in San Jose, CA, offering an innovative environment where you can make a significant impact.

Vacancy posted 14 hours ago

Similar jobs that could be interesting for you

Based on the Lead AI Inference Performance Engineer (GPU) in San Jose, CA vacancy

Compiler Engineer - AI Inference
$152k - $241.5k
...invention of the GPU in 1999 sparked the growth... ...ignited modern AI — the next era of... ...-tier AI Compiler Engineers to drive innovation... ...is possible in AI performance and help build the... ...workloads (both inference and training) and... ...design: Partner with leading experts across our...
Performance
NVIDIA
Santa Clara, CA
2 days ago
Senior Compiler Engineer, AI Inference Performance
$152k - $241.5k
...NVIDIA's invention of the GPU in 1999 sparked the... ...learning ignited modern AI — the next era of... ...Deep Learning Compiler Engineer. NVIDIA is hiring... ...backbone of NVIDIA’s inference engine, spanning across... ...compiler must deliver leading inference performance, fast build time, reduced...
Performance
NVIDIA
Santa Clara, CA
2 days ago
GenAI Inference Optimization Lead — GPU Performance
A leading technology company is looking for a Principal GenAI Inference Optimization Engineer in San Jose, CA. This role will focus on optimizing performance and efficiency of generative AI on AMD GPU platforms. The ideal candidate will have significant expertise in GPU...
Performance
Advanced Micro Devices
San Jose, CA
14 hours ago
Senior Software Engineer, AI Inference Systems
$184k - $287.5k
...skilled and motivated software engineers to join us and build AI inference systems that serve large-... ...architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive... ...submissions to the industry-leading MLPerf Inference benchmarking...
Performance
NVIDIA
Santa Clara, CA
2 days ago
Senior Technical Marketing Engineer - GPU and System Architecture
$160k - $253k
...centers are transforming into AI factories, and NVIDIA... ...center platforms integrate high performance compute, networking, and a full... ...and efficiency for AI inference & training. What you’ll be...
Performance
NVIDIA Corporation
Santa Clara, CA
3 days ago
Principal Software Quality Engineer - GPU & Machine Learning
...experiences, from AI and data centers,... ...Software Quality Engineer to serve as the senior... ...on AMD Instinct™ GPU platforms. You will... ..., workload, performance, stress, stability... ...meet them. Lead system-level testing... ...LLM training and inference (PyTorch, vLLM, Triton...
Performance
Contract work
Shift work
Advanced Micro Devices, Inc.
San Jose, CA
4 days ago
Senior Compiler Engineer: GPU Performance & AI
A leading technology company based in California is seeking a Senior Compiler Engineer to shape the future of compiler technologies. This role... ...design for next-generation GPU architectures. Ideal candidates... ...and GPU computing, driving performance and efficiency in high-...
Performance
Intel Corporation
Santa Clara, CA
2 days ago
Senior LLM Performance Engineer - GPU Inference
$184k - $356.5k
A leading AI computing company in California is seeking a Senior Deep Learning Software Engineer focused on performance optimization of LLM models. You will analyze and enhance LLM inference performance, working in cross-collaborative teams to implement cutting-edge algorithms...
Performance
Full time
NVIDIA Corporation
Santa Clara, CA
1 day ago
Senior GPU Performance Engineer for AI Acceleration
$207k - $300k
Google is seeking an experienced AI/ML Software Engineer to enhance GPU architectures and optimize performance benchmarks. The role involves collaborating with teams to solve ML model challenges and architect transformative AI solutions, contributing to Google's machine...
Performance
Google
Sunnyvale, CA
14 hours ago
Principal Engineer, Solutions Architect Lead - Industrial & Embedded IoT, Edge AI On‑Prem Appliance
$220.2k - $330.4k
...Technologies, Inc. Job Area: Engineering Group, Engineering... ...edge, focusing on AI, edge computing and connectivity... ...IoT (IE‑IoT) BU leads the transformation of... ...for generative AI inference and computer vision workloads... ...the accessibility and performance of a datacenter...
Performance
Work experience placement
Work at office
Qualcomm
Santa Clara, CA
3 days ago
Burn-In Test Engineer for AI Inference Hardware
$150k - $275k
...cutting-edge tech company in San Jose is seeking a Supercomputing Engineer to ensure the reliability of its inference servers. This role involves designing and executing test suites, analyzing performance, and collaborating with engineering teams. Ideal candidates will...
Performance
Etched
San Jose, CA
2 days ago
Principal Software Engineer (AI Inference / Distributed Systems)
...generation computing experiences-from AI and data centers, to PCs,... ...for a strategic software engineering lead who is passionate about improving the performance of key applications and benchmarks... ...optimizing scale-up and scale-out inference. Develop methods and tooling...
Performance
Advanced Micro Devices, Inc.
Santa Clara, CA
14 hours ago
Senior Multi‑GPU Signal Processing and System Architecture Engineer
$200k - $322k
...seeking a self‑motivated senior engineer for the Aerial Omniverse... ...you will design and implement GPU kernels that apply time‑varying... ...What we need to see: PhD in high‑performance computing, computer architecture... ...existing vacancy. NVIDIA uses AI tools in its recruiting processes...
Performance
NVIDIA Corporation
Santa Clara, CA
1 day ago
Senior System Software Engineer - GPU Power Management
$184k - $287.5k
...innovation that’s fueled by leading technology—and amazing people... ...the unlimited potential of AI to define the next era of... ...components with focus on power and performance. Drive end-to-end GPU power...
Performance
Work experience placement
Local area
Remote work
NVIDIA Corporation
Santa Clara, CA
3 days ago
Senior Software Engineer, Deep Learning Inference - Automotive Safety
$152k - $241.5k
...eager to work on cutting-edge AI technology for safety-... ...TensorRT team as a Senior Software Engineer, and be at the forefront of technology, enabling high-performance AI inference solutions for automotive safety... ...! What you'll be doing: Lead the design and development of...
Performance
NVIDIA
Santa Clara, CA
14 hours ago
Quality Assurance Software Developer Engineer in Test, GeForce GPU
$140k - $224.25k
...NVIDIA Software Development Engineer in Test Are you passionate about... ...substantial experience with AI technologies for automation of... ...the testing workflows in GPU domain. Write maintainable,... ...reviews including feedback on performance, scalability, and correctness...
Performance
NVIDIA
Santa Clara, CA
1 day ago
Senior AI Inference Engineer — GPU DL, Equity Eligible
$184k - $356.5k
...seeking a Senior Deep Learning Software Engineer specializing in Inference to join their growing team in Santa... ..., CA. The role involves optimizing GPU-accelerated software for advanced AI applications, including developing high-performance deep learning frameworks like...
Performance
NVIDIA Corporation
Santa Clara, CA
14 hours ago
Senior AI Systems Engineer: GPU Infra & LLM Ops
...Jose, California is seeking a skilled AI Systems Engineer to lead the development and support of AI... ...infrastructure. The role requires managing GPU clusters, deploying advanced AI... ...technical experience, focusing on high-performance computing or AI systems. The position...
Performance
Cadence
San Jose, CA
14 hours ago
Senior AI Systems Engineer — SGLang & Inference on GPUs
A leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal candidate excels in collaborative and independent work,... ...innovative solutions and advancing AI technologies.
Performance
Advanced Micro Devices
Santa Clara, CA
2 days ago
GPU Design Verification Engineer
$90 - $100 per hour
Job Title: GPU Design Verification Engineer. Position Description: Protingent Staffing has an exciting contract... ...for verifying the functionality and performance of our cutting‑edge GPU designs.... ...innovation — from Software and Aerospace to AI, Clean Tech, Medical Devices, and...
Performance
Contract work
Protingent
San Jose, CA
4 days ago
Principal Engineer - System Software GPU Server
$272k - $431.25k
...We are seeking software engineers to work on next-generation high-speed interconnect... ...demanding high-speed IO applications a GPU or high-performance computing server will encounter in its... ...interpersonal skills. Capability to use AI prompt tools. Your base salary...
Performance
NVIDIA
Santa Clara, CA
14 hours ago
Application Engineer Low Power Edge Inference (DIB Focus)
$120k - $180k
...redefining the future of AI with our groundbreaking innovations... ..., enabling superior performance and energy efficiency... ...centers. Our talented team of engineers and industry‑leading executives drives this... ...architecting ultra‑low‑power edge inference solutions around the SoC...
Performance
For contractors
TETRAMEM INC
San Jose, CA
14 hours ago
Senior DL Inference & Performance Engineer
$184k - $356.5k
A leading technology company in California is seeking a Senior DL Algorithms Engineer to drive inference performance for Deep Learning workloads. The role involves implementing advanced model... ...of experience in deep learning and GPU programming. This position offers a competitive...
Performance
NVIDIA Corporation
Santa Clara, CA
3 days ago
Application Engineer - Low Power Edge Inference (DIB Focus)
$120k - $180k
About this role: We are seeking an Application Engineer... ...the SoC Profile and improve system performance (latency, energy per inference,... ...in deploying cutting‑edge edge AI silicon into real‑world, resource‑constrained...
Performance
For contractors
Internship
TetraMem Inc
San Jose, CA
2 days ago
Senior DL Algorithms Engineer - Inference Performance
$184k - $287.5k
...of the hardware/software stack from GPU architecture to Deep Learning... ...fast-growing technology company that leads the AI revolution. What you will be doing:...
Performance
NVIDIA Corporation
Santa Clara, CA
1 day ago
Inference Engineer
...Principal Machine Learning Engineer - Inference Serving Frameworks... ...building rack-level AI inference systems. Our... ...source software to serve leading‑edge models with... ...experts to architect high‑performance inference stacks and design... ...‑level debugging. GPU kernel development...
Performance
Full time
Acceler8 Talent
Santa Clara, CA
2 days ago
Senior Performance Engineer, Inference
...builds the world's largest AI chip, 56 times larger than... ...to deliver industry‑leading training and inference speeds and empowers machine... ...Role We are hiring a Senior Performance Engineer to join our Product team.... ...vLLM, SGLang, TensorRT‑LLM), GPU kernel‑level optimization...
Performance
Contract work
Shift work
Cerebras
Sunnyvale, CA
1 day ago
Senior Deep Learning Engineer - Model Evaluation & AI Systems
$224k - $356.5k
...unlimited potential of AI to define the next era... ...computing. An era in which our GPU acts as the brains of... ...of AI and high-performance computing. As a Senior... ...Principal Deep Learning Engineer — Model Evaluation & AI... ...alongside model training, inference, and product divisions...
Performance
NVIDIA
Santa Clara, CA
3 days ago
Developer Experience Engineer
$2,000 per month
...building the world’s first AI inference system purpose‑built for transformers... ...delivering over 10x higher performance and dramatically lower cost... ...investors and staffed by leading engineers, Etched is redefining the... ...workflows and monitoring GPU‑accelerated workloads....
Performance
Work at office
Relocation package
Etched
San Jose, CA
4 days ago
ML Systems Performance Engineer
...builds the world's largest AI chip, 56 times larger... ...Cerebras to deliver industry-leading training and inference speeds and empowers... ...over 10 times faster than GPU-based hyperscale cloud... ...About the role: Engineers on the inference performance team operate at the intersection...
Performance
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
2 days ago
