Senior AI Inference Compiler Engineer

$152k - $241.5k

NVIDIA Gruppe

NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company”. We are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers for its Deep Learning & AI Compiler (DLC) team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in many areas, e.g. large language models, generative AIs, recommendation systems, image classification, speech recognition, etc. With the rapid advancement of AI, our DLC has been the backbone of NVIDIA’s inference engine, spanning across data centers, personal devices, automotive, and robotics. The compiler must deliver leading inference performance, fast build time, reduced memory footprints, and ease of use in the forms of both Ahead-of-Time and Just-in-Time. Join the team building the DLC which will be used by the entire deep learning community. What you’ll be doing: Develop compiler IR, programming model and optimizations for future GPU architectures. Collaborating with members of the deep learning software framework teams and the hardware architecture teams to accelerate the next generation of deep learning software. Scope of these efforts includes defining public APIs, performance optimizations and analysis, crafting and implementing compiler optimizations and kernel generation for neural networks, and other general software engineering work. What we need to see: Bachelors, Masters or Ph.D. in Computer Science, Computer Engineering, related field or equivalent experience. 3+ years of relevant work or research experience in performance analysis and compiler optimizations. Experience with compiler technologies (e.g., MLIR, XLA, and LLVM etc.). Excellent C/C++ and Python programming and software design skills, including debugging, performance analysis, and test design. Ability to work independently, define project goals and scope, and lead your own development efforts. Strong interpersonal skills are required along with the ability to work in a fast moving & dynamic product-oriented team. Ways to stand out from the crowd: Understanding of deep learning models, algorithms and frameworks, such as PyTorch, XLA etc. Understanding of LLM inference optimizations and techniques. GPU kernel generation with high performance and fast build time. Proficient in GPU architecture. CUDA or OpenCL programming experience. Track record on new hardware bring-up is a plus. Benefits and Compensation: Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until February 28, 2026. This posting is for an existing vacancy. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr NVIDIA Gruppe

Apply

Vacancy posted 8 hours ago

Similar jobs that could be interesting for youBased on the Senior AI Inference Compiler Engineer in Santa Clara, CA vacancy

Senior Compiler Engineer, AI Inference Performance
$152k - $241.5k
...recently, GPU deep learning ignited modern AI — the next era of computing — with the... ...are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers... ...DLC has been the backbone of NVIDIA’s inference engine, spanning across data centers, personal...
Senior
NVIDIA
Santa Clara, CA
17 hours ago
Senior Compiler Engineer - AI Inference & MLIR
$152k - $241.5k
NVIDIA in Santa Clara is seeking a Compiler Engineer to drive technical innovation in AI workloads and optimize NVIDIA GPUs. The role involves participating in hands-on development, collaborating across divisions, and solving complex compilation problems. Applicants should...
Senior
NVIDIA
Santa Clara, CA
17 hours ago
Senior Compiler Engineer, AI Inference Performance
$152k - $241.5k
NVIDIA is hiring an AI & Deep Learning Compiler Engineer for its Deep Learning & AI Compiler (DLC) team. What you’ll be doing Analyzing deep learning networks and developing compiler optimization algorithms. Collaborating with members of the deep learning software framework...
Senior
NVIDIA Gruppe
Santa Clara, CA
17 hours ago
Senior ML Compiler & Inference Systems Engineer
$152k - $287.5k
NVIDIA Gruppe is seeking a Senior Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role involves developing algorithms for their LPX inference and compiler stack, optimizing the performance of neural network workloads on NVIDIA platforms...
Senior
NVIDIA Gruppe
Santa Clara, CA
17 hours ago
Senior Software Engineer, AI Inference Systems
$184k - $287.5k
...We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency... ...inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale workloads across...
Senior
NVIDIA
Santa Clara, CA
17 hours ago
Senior Software Engineer, Deep Learning Inference - Automotive Safety
$152k - $241.5k
...eager to work on cutting-edge AI technology for safety-... ...NVIDIA's TensorRT team as a Senior Software Engineer, and be at the forefront of... ...enabling high-performance AI inference solutions for automotive... ...functionalities into TensorRT's compiler and runtime for specialized...
Senior
NVIDIA
Santa Clara, CA
2 days ago
Compiler Engineer - AI Inference
$152k - $241.5k
...recently, GPU deep learning ignited modern AI — the next era of computing — with the... ...”. NVIDIA is seeking top-tier AI Compiler Engineers to drive innovation within our world-class... ...problems for AI workloads (both inference and training) and successfully transition...
NVIDIA
Santa Clara, CA
17 hours ago
Senior ML Systems Engineer: Compiler & Performance
...Systems builds the world’s largest AI chip, 56 times larger than... ...-leading training and inference speeds and empowers machine learning... ...a versatile and experienced engineer to join our SOTA Training... ...translation, graph lowering, compiler optimizations, runtime integration...
Senior
Internship
Dormont Manufacturing Co
Sunnyvale, CA
8 hours ago
Senior AI Inference Compiler Engineer - Drive Next-Gen DL
NVIDIA Gruppe in Santa Clara is seeking an AI & Deep Learning Compiler Engineer to join its Deep Learning & AI Compiler team. This role involves developing compiler IR and collaborating with various teams to enhance deep learning software. The ideal candidate will have...
Senior
NVIDIA Gruppe
Santa Clara, CA
17 hours ago
Senior AI Inference Compiler Engineer — Equity & Impact
$152k - $241.5k
NVIDIA Gruppe is hiring an AI & Deep Learning Compiler Engineer for the Deep Learning & AI Compiler team. This role involves analyzing deep learning networks and developing optimization algorithms while collaborating with software and GPU architecture teams. The ideal...
Senior
NVIDIA Gruppe
Santa Clara, CA
17 hours ago
Senior GPU Compiler Engineer — Hybrid, AI/ML Performance
Intel Corporation is seeking a Senior Compiler Engineer to develop and optimize compiler software for next-generation GPU architectures. The role... ...collaborating on cutting-edge compiler technologies that enhance AI and high-performance computing performance. The ideal...
Senior
Intel Corporation
Santa Clara, CA
17 hours ago
Senior Quantized Inference Engineer - AI Throughput
A leading technology firm is seeking a Senior Software Engineer for Quantized Inference to implement quantized recipes for advanced model optimization. This role demands strong skills in Python and C++, alongside experience in ML accelerators and software engineering fundamentals...
Senior
NVIDIA Corporation
Santa Clara, CA
1 day ago
Senior AI Systems Engineer: Inference Kernels & Runtimes
$184k - $287.5k
NVIDIA Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries, code generators, and kernel technologies for NVIDIA's architecture, emphasizing high-impact...
Senior
NVIDIA Gruppe
Santa Clara, CA
17 hours ago
Senior AI Inference Systems Engineer: GPU-Optimized, Cloud
$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...
Senior
NVIDIA Gruppe
Santa Clara, CA
17 hours ago
Principal AI Inference Systems Engineer
...computing experiences-from AI and data centers, to PCs, gaming... ...AMD is looking for a Senior Staff AI Infra Engineer who is passionate about improving... ...LLM training and inference on AMD GPUs, improving kernel... ...architecture, memory hierarchy, and compiler-level optimization (e.g.,...
Advanced Micro Devices , Inc.
Santa Clara, CA
1 day ago
Senior/Staff Analog IC Design Engineer - AI Inference
...unleashing the potential of generative AI to power the transformation of technology... ...days per week. The role: Analog Design Engineer, Senior / Staff /Sr. Staff What You Will Do: Analog... ...engine for Artificial Intelligence Inference Accelerator and High-Speed Die-2-Die Interface...
Senior
3 days per week
d-Matrix
Santa Clara, CA
8 hours ago
Senior Research Engineer, On-Device Inference, Robotics, DeepMind
$207k - $300k
Senior Research Engineer, On-Device Inference, Robotics, DeepMind corporate_fare DeepMind place Mountain View, CA, USA... ...to align model architectures with AI accelerators (e.g., distillation).... ...software performance analysis, improving compilers for mobile platforms, as well as...
Senior
Full time
Google Inc.
Mountain View, CA
3 days ago
Senior Software Engineer, Machine Learning Inference
$152k - $241.5k
...driving advancements in AI and machine learning to... ...talented and motivated engineers to join our TensorRT... ...-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer in the... ...Deep Learning Frameworks, Compilers, or System Software....
Senior
NVIDIA
Santa Clara, CA
4 days ago
Senior Software Engineering - Machine Learning
$184k - $287.5k
...GPUs are at the core of modern AI infrastructure, from training... ...-scale models to running inference in production. That position... ...software as much as hardware, and compiler engineering is a big part of what makes it work. We're hiring senior software engineers for a...
Senior
Work experience placement
NVIDIA
Santa Clara, CA
17 hours ago
Senior Software Development Engineer - SGLang and Inference Stack
...generation computing experiences-from AI and data centers, to PCs,... ...and SOTA LLM and Multimodal inference at scale across multi-GPU and... ...and optimize cutting-edge compiler technologies and drive... ...THE PERSON: Skilled engineer with strong technical and analytical...
Senior
Advanced Micro Devices , Inc.
Santa Clara, CA
3 days ago
Senior Software Engineer, Deep Learning Inference - TensorRT
$152k - $241.5k
...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep... ...optimizations. Background in compiler development Experience in working... ...existing vacancy. NVIDIA uses AI tools in its recruiting processes....
Senior
NVIDIA
Santa Clara, CA
4 days ago
Senior Software Development Engineer - LLM Inference Framework
...computing experiences-from AI and data centers, to PCs,... ...career. THE ROLE: As a senior member of the LLM inference framework team, you will... ...intersection of inference engines, distributed systems, and... ...with kernel, compiler, and networking teams to close...
Senior
Advanced Micro Devices , Inc.
Santa Clara, CA
3 days ago
Senior Kernel & Compiler Performance Engineer (GPU/AI)
A cutting-edge AI company in California is looking for a Member of Technical Staff for Kernel/Compiler/Communication. This critical role requires strong expertise in CUDA and... ...5+ years of experience in performance engineering. The ideal candidate will design high-performance...
Senior
RadixArk
Palo Alto, CA
1 day ago
Senior AI Inference Kernel Engineer
$184k - $287.5k
NVIDIA Gruppe in Santa Clara is seeking an AI Systems Engineer to innovate and develop cutting-edge technologies in the AI inference software stack. Candidates should hold a... ...role involves building efficient kernels and compilers for AI workloads while actively...
Senior
NVIDIA Gruppe
Santa Clara, CA
17 hours ago
Sr. AI Inference Systems Engineer
$120.1k - $225.7k
...Role Entails End-to-End Inference Optimization: Lead the... ...inference technology (e.g., compiler optimization, model compression... ...team members to build a robust AI inference technical ecosystem... ...Computer Science, Electronic Engineering, AI, or related fields; significant...
Senior
Relocation package
Tencent
Palo Alto, CA
3 days ago
Senior AI Systems Engineer — SGLang & Inference on GPUs
A leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal... ...dynamic work environment with a focus on innovative solutions and advancing AI technologies. #J-18808-Ljbffr Advanced Micro Devices
Senior
Advanced Micro Devices
Santa Clara, CA
4 days ago
Senior Performance Compiler Engineer - Triton
$184k - $287.5k
...computing. We are increasingly known as “the AI computing company”. We are looking for a Senior Performance Compiler Engineer to join our team and work on the open-source... ..., accelerating both training and inference. You will be immersed in a diverse, supportive...
Senior
NVIDIA AI
Santa Clara, CA
4 days ago
Senior AI Compiler Engineer, Algorithms and Code-Generation
$152k - $241.5k
...tapping into the unlimited potential of AI to define the next era of computing. An... ...are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers for... ...DLC has been the backbone of NVIDIA’s inference engine, spanning across data centers, personal...
Senior
Remote work
NVIDIA
Santa Clara, CA
4 days ago
Senior Deep Learning Compiler Engineer - XLA
$152k - $241.5k
...deep learning ignited modern AI — the next era of computing —... ...looking for versatile software engineers for our XLA team. NVIDIA is... ...Responsibilities In this role, develop compiler optimization algorithms for... ...workloads. You will optimize inference and training performance for...
Senior
NVIDIA Gruppe
Santa Clara, CA
17 hours ago
Senior On-Device Inference Engineer - Robotics & AI
$207k - $300k
Google Inc. in Mountain View is seeking a Senior Research Engineer to focus on optimizing on-device inference for robotics at DeepMind. This role requires a Bachelor's degree in a relevant field and 8 years' experience in machine learning. Ideal candidates will have expertise...
Senior
Google Inc.
Mountain View, CA
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior AI Inference Compiler Engineer. Be the first to apply!