Senior Compiler Engineer, AI Inference Performance

$152k - $241.5k

NVIDIA

NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company”.

We are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers for its Deep Learning & AI Compiler (DLC) team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in many areas, e.g. large language models, generative AI, recommendation systems, image classification, speech recognition, etc. With the rapid advancement of AI, our DLC has been the backbone of NVIDIA’s inference engine, spanning across data centers, personal devices, automotive, and robotics. The compiler must deliver leading inference performance, fast build time, reduced memory footprints, and ease of use in the forms of both Ahead-of-Time and Just-in-Time. Join the team building the DLC which will be used by the entire deep learning community.

What you’ll be doing:

Analyzing deep learning networks and developing compiler optimization algorithms.
Collaborating with members of the deep learning software framework teams and the GPU architecture teams to accelerate the next generation of deep learning software.
Scope of these efforts includes defining public APIs, performance optimizations and analysis, crafting and implementing compiler techniques for AI workloads and future NVIDIA GPUs.

What we need to see:

Bachelor’s, Master’s or Ph.D. in Computer Science, Computer Engineering, related field or equivalent experience.
3+ years of relevant work or research experience in performance analysis and compiler optimizations.
Experience with compiler technologies (e.g., MLIR, LLVM, XLA, Triton, etc.).
Excellent C/C++ and Python programming and software design skills, including debugging, performance analysis, and test design.
Ability to work independently, define project goals and scope, and lead your own development efforts.
Strong interpersonal skills are required along with the ability to work in a dynamic product-oriented team.

Ways to stand out from the crowd:

Proficient in CPU and/or GPU architecture. CUDA or OpenCL programming experience.
Understanding of deep learning models, algorithms and frameworks, such as PyTorch, JAX.
GPU kernel authoring and performance analysis using tools such as Nsight Compute.
A track record of success in mentoring early-career engineers and interns is a bonus.
Track record on new hardware bring-up is a plus.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD.

You will also be eligible for equity and benefits ( .

Applications for this job will be accepted at least until February 28, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Apply

Vacancy posted 3 days ago

Similar jobs that could be interesting for youBased on the Senior Compiler Engineer, AI Inference Performance in Santa Clara, CA vacancy

Senior Software Engineer, AI Inference Systems
$184k - $287.5k
...seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models... .... You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale...
Senior
Performance
NVIDIA
Santa Clara, CA
3 days ago
Senior Software Engineer, Deep Learning Inference - Automotive Safety
$152k - $241.5k
...eager to work on cutting-edge AI technology for safety-... ...NVIDIA's TensorRT team as a Senior Software Engineer, and be at the forefront... ...technology, enabling high-performance AI inference solutions for automotive safety... ...into TensorRT's compiler and runtime for specialized...
Senior
Performance
NVIDIA
Santa Clara, CA
1 day ago
Compiler Engineer - AI Inference
$152k - $241.5k
...deep learning ignited modern AI — the next era of computing... ...NVIDIA is seeking top-tier AI Compiler Engineers to drive innovation within... ...boundaries of what is possible in AI performance and help build the... ...problems for AI workloads (both inference and training) and...
Performance
NVIDIA
Santa Clara, CA
3 days ago
Senior Compiler Engineer: GPU Performance & AI
...leading technology company based in California is seeking a Senior Compiler Engineer to shape the future of compiler technologies. This role... ...passion for both compiler technology and GPU computing, driving performance and efficiency in high-performance computing applications....
Senior
Performance
Intel Corporation
Santa Clara, CA
3 days ago
Senior Compiler Engineer - AI Inference & MLIR
$152k - $241.5k
NVIDIA in Santa Clara is seeking a Compiler Engineer to drive technical innovation in AI workloads and optimize NVIDIA GPUs. The role involves participating in hands-on development, collaborating across divisions, and solving complex compilation problems. Applicants should...
Senior
NVIDIA
Santa Clara, CA
4 days ago
Senior Research Engineer, On-Device Inference, Robotics, DeepMind
$207k - $300k
Senior Research Engineer, On-Device Inference, Robotics, DeepMind corporate_fare DeepMind place... ...focused on high-performance inference. Understanding... ...model architectures with AI accelerators (e.g., distillation... ...analysis, improving compilers for mobile platforms, as...
Senior
Performance
Full time
Google Inc.
Mountain View, CA
2 days ago
Senior Kernel & Compiler Performance Engineer (GPU/AI)
A cutting-edge AI company in California is looking for a Member of Technical Staff for Kernel/Compiler/Communication. This critical role requires strong expertise in... ..., along with 5+ years of experience in performance engineering. The ideal candidate will design high-performance...
Senior
Performance
RadixArk
Palo Alto, CA
12 hours ago
Senior Deep Learning Compiler Engineer - XLA
$152k - $241.5k
...learning ignited modern AI — the next era of... ...for versatile software engineers for our XLA team. NVIDIA... ...join us to build high-performance, production-grade software... ...In this role, develop compiler optimization... ...workloads. You will optimize inference and training...
Senior
Performance
NVIDIA
Santa Clara, CA
2 days ago
Senior Performance Compiler Engineer - Triton
$184k - $287.5k
...computing. We are increasingly known as “the AI computing company”. We are looking for a Senior Performance Compiler Engineer to join our team and work on the open-source... ..., accelerating both training and inference. You will be immersed in a diverse, supportive...
Senior
Performance
NVIDIA AI
Santa Clara, CA
3 days ago
Senior AI Systems Engineer — SGLang & Inference on GPUs
...leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal candidate excels in collaborative... ...a focus on innovative solutions and advancing AI technologies. #J-18808-Ljbffr Advanced Micro Devices
Senior
Performance
Advanced Micro Devices
Santa Clara, CA
3 days ago
Senior ML Systems Engineer
...builds the world's largest AI chip, 56 times larger... ...-leading training and inference speeds and empowers... ...and experienced engineer to join our SOTA Training... ...unprecedented levels of performance, efficiency, and scalability... ..., graph lowering, compiler optimizations, runtime...
Senior
Performance
Internship
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
2 days ago
Senior Backend Engineer, ML Inference Systems
$135.8k - $237.05k
...Mountain View, CA, USA Senior Backend Engineer, ML Inference Systems Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ-26... ...daily decisions, with a focus on the performance, reliability, and scalability of inference...
Senior
Performance
Work at office
Worldwide
Relocation package
Unity Technologies
Mountain View, CA
1 day ago
Senior Inference Performance Engineer
Cerebras seeks a Senior Performance Engineer to join the Product team in Sunnyvale, CA. The role involves developing benchmarks to measure inference performance and creating competitive pricing models... ...culture in a groundbreaking AI environment, along with job stability...
Senior
Performance
Cerebras
Sunnyvale, CA
1 day ago
Senior DL Inference & Performance Engineer
$184k - $356.5k
A leading technology company in California is seeking a Senior DL Algorithms Engineer to drive inference performance for Deep Learning workloads. The role involves implementing advanced model inference and collaborating with co-design teams to optimize performance across...
Senior
Performance
NVIDIA Corporation
Santa Clara, CA
4 days ago
Senior LLM Performance Engineer - GPU Inference
$184k - $356.5k
A leading AI computing company in California is seeking a Senior Deep Learning Software Engineer focused on performance optimization of LLM models. You will analyze and enhance LLM inference performance, working in cross-collaborative teams to implement cutting-edge algorithms...
Senior
Performance
Full time
NVIDIA Corporation
Santa Clara, CA
2 days ago
Senior DL Inference Engineer - GPU Optimization Equity
NVIDIA is seeking a Senior DL Algorithms Engineer to optimize LLM/Omni models and enhance performance across its software stack. The ideal candidate... ...learning, specifically in inference. This role involves profiling... ...with teams to advance AI solutions. A strong understanding...
Senior
Performance
NVIDIA
Santa Clara, CA
4 days ago
Senior Software Engineer, Machine Learning Inference
$152k - $241.5k
...driving advancements in AI and machine... ...talented and motivated engineers to join our... ...leading deep learning inference software for... ...accelerators. As a Senior Software Engineer... ...Learning Frameworks, Compilers, or System... ...of close-to-metal performance analysis, optimization...
Senior
Performance
NVIDIA
Santa Clara, CA
3 days ago
Senior DL Compiler Engineer- CUDA Tile
$152k - $241.5k
...deep learning ignited modern AI — the next era of computing —... ...”. We are hiring software engineers for the CUDA Tile team. NVIDIA... ...will design and implement compiler transformations, develop MLIR... ...lowering passes, and optimize the performance of tile-based kernels to...
Senior
Performance
NVIDIA
Santa Clara, CA
3 days ago
Senior LLVM Compiler Engineer
$184k - $287.5k
...We are seeking for an expert Senior Compiler Engineer to join our Compute Compiler Team, with a focus... ...effectively Partner with architecture, performance, and product teams to translate NVIDIA... ...an existing vacancy. NVIDIA uses AI tools in its recruiting processes....
Senior
Performance
NVIDIA
Santa Clara, CA
2 days ago
Senior Compiler Engineer - DPU
$152k - $241.5k
...into our future offerings, our Compiler team is growing and seeking top-tier compiler engineers who want an exciting and engaging... ...work or research experience in performance analysis, compiler... ...existing vacancy. NVIDIA uses AI tools in its recruiting processes...
Senior
Performance
NVIDIA
Santa Clara, CA
12 hours ago
Senior Compiler Engineer
...shaping the future of cutting-edge compiler technologies at Intel. As a Senior Compiler Engineer, you will play a critical role... ...you will enable transformative performance and efficiency gains, empowering groundbreaking applications in AI and high-performance computing....
Senior
Performance
Internship
Intel Corporation
Santa Clara, CA
2 days ago
Senior Machine Learning Applications and Compiler Engineer
$152k - $241.5k
...are now looking for a Senior Machine Learning Applications and Compiler Engineer! NVIDIA is seeking engineers... ...for our LPX inference and compiler stack. You... ...develop, and maintain high-performance runtime and compiler components... ...with large-scale AI distributed inference...
Senior
Performance
NVIDIA
Santa Clara, CA
3 days ago
Senior Software Engineer, DL Compilers
$184k - $287.5k
...Senior Software Engineer For Compiler Team NVIDIA's GPUs are at the core of modern AI infrastructure, from training large-scale models to running inference in production. That position depends on software... ...execution stack, targeting high-performance kernel generation for...
Senior
Performance
Work experience placement
NVIDIA
Santa Clara, CA
3 days ago
Senior DL Algorithms Engineer - Inference Performance
$152k - $241.5k
We are looking for a Senior DL Algorithms Engineer for LLM/Omni model optimizations... ...who are mindful of performance analysis and optimization... ...technology company that leads the AI revolution. What you will... ...) on NVIDIA’s accelerated inference SW stack. Contribute new...
Senior
Performance
NVIDIA
Santa Clara, CA
4 days ago
Senior Software Engineer, Deep Learning Inference - TensorRT
$152k - $241.5k
...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make... ...platforms for functionality and performance Develop components of... .... Background in compiler development... ...vacancy. NVIDIA uses AI tools in its recruiting...
Senior
Performance
NVIDIA
Santa Clara, CA
3 days ago
Senior Software Development Engineer - SGLang and Inference Stack
...computing experiences-from AI and data centers, to... ...enhancing GPU kernel performance, accelerating deep... ...SOTA LLM and Multimodal inference at scale across multi-... ...optimize cutting-edge compiler technologies and drive... ...PERSON: Skilled engineer with strong technical...
Senior
Performance
Advanced Micro Devices , Inc.
Santa Clara, CA
2 days ago
Senior Deep Learning Framework Communications Engineer
$152k - $241.5k
...Artificial Intelligence, High Performance Computing and... ...motivated Deep Learning engineer to bring advanced... ...communication technologies into AI stacks, including... ...up to 100K GPUs to inference down at microsecond latency... ...models. Improve AI compilers to hide communications...
Senior
Performance
NVIDIA
Santa Clara, CA
1 day ago
Senior Deep Learning Engineer - Model Evaluation & AI Systems
$224k - $356.5k
...into the unlimited potential of AI to define the next era of... ...at the forefront of AI and high-performance computing. As a Senior / Principal Deep Learning Engineer — Model Evaluation & AI Systems... ...Work alongside model training, inference, and product divisions to provide...
Senior
Performance
NVIDIA
Santa Clara, CA
4 days ago
Senior Deep Learning Compiler Verification Engineer
$140k - $224.25k
...into the unlimited potential of AI to define the next era of... ...building the next generation of compiler technologies to accelerate... ...workloads. We are looking for an engineer to implement compiler... ...guarantee functional quality and performance as models, compiler stacks, and...
Senior
Performance
NVIDIA
Santa Clara, CA
4 days ago
Senior AI Compiler Engineer, Algorithms and Code-Generation
$152k - $241.5k
...the unlimited potential of AI to define the next era of computing... ...for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software... ...the backbone of NVIDIA’s inference engine, spanning across data... ...must deliver leading inference performance, fast build time, reduced...
Senior
Performance
NVIDIA
Santa Clara, CA
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Compiler Engineer, AI Inference Performance. Be the first to apply!