Senior AI Inference Compiler Engineer
$152k - $241.5k
NVIDIA Gruppe
NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI, the next era of computing, with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company”.

We are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers for its Deep Learning & AI Compiler (DLC) team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in many areas, e.g. large language models, generative AI, recommendation systems, image classification, and speech recognition. With the rapid advancement of AI, our DLC has been the backbone of NVIDIA’s inference engine, spanning data centers, personal devices, automotive, and robotics. The compiler must deliver leading inference performance, fast build time, a reduced memory footprint, and ease of use in both Ahead-of-Time and Just-in-Time forms. Join the team building the DLC, which will be used by the entire deep learning community.

What you’ll be doing:
- Develop compiler IR, programming model, and optimizations for future GPU architectures.
- Collaborate with members of the deep learning software framework teams and the hardware architecture teams to accelerate the next generation of deep learning software.
- The scope of these efforts includes defining public APIs, performance optimization and analysis, crafting and implementing compiler optimizations and kernel generation for neural networks, and other general software engineering work.

What we need to see:
- Bachelor's, Master's, or Ph.D. in Computer Science, Computer Engineering, or a related field, or equivalent experience.
- 3+ years of relevant work or research experience in performance analysis and compiler optimizations.
- Experience with compiler technologies (e.g., MLIR, XLA, LLVM).
- Excellent C/C++ and Python programming and software design skills, including debugging, performance analysis, and test design.
- Ability to work independently, define project goals and scope, and lead your own development efforts.
- Strong interpersonal skills, along with the ability to work in a fast-moving, dynamic, product-oriented team.

Ways to stand out from the crowd:
- Understanding of deep learning models, algorithms, and frameworks, such as PyTorch and XLA.
- Understanding of LLM inference optimizations and techniques.
- Experience with GPU kernel generation that achieves high performance and fast build time.
- Proficiency in GPU architecture.
- CUDA or OpenCL programming experience.
- A track record of new hardware bring-up.

Benefits and Compensation:
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD. You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until February 28, 2026. This posting is for an existing vacancy.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
$152k - $241.5k
...recently, GPU deep learning ignited modern AI — the next era of computing — with the... ...are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers... ...DLC has been the backbone of NVIDIA’s inference engine, spanning across data centers, personal...Senior$152k - $241.5k
NVIDIA in Santa Clara is seeking a Compiler Engineer to drive technical innovation in AI workloads and optimize NVIDIA GPUs. The role involves participating in hands-on development, collaborating across divisions, and solving complex compilation problems. Applicants should...Senior$152k - $241.5k
NVIDIA is hiring an AI & Deep Learning Compiler Engineer for its Deep Learning & AI Compiler (DLC) team. What you’ll be doing Analyzing deep learning networks and developing compiler optimization algorithms. Collaborating with members of the deep learning software framework...Senior$152k - $287.5k
NVIDIA Gruppe is seeking a Senior Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role involves developing algorithms for their LPX inference and compiler stack, optimizing the performance of neural network workloads on NVIDIA platforms...Senior$184k - $287.5k
...We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency... ...inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale workloads across...Senior$152k - $241.5k
...eager to work on cutting-edge AI technology for safety-... ...NVIDIA's TensorRT team as a Senior Software Engineer, and be at the forefront of... ...enabling high-performance AI inference solutions for automotive... ...functionalities into TensorRT's compiler and runtime for specialized...Senior$152k - $241.5k
...recently, GPU deep learning ignited modern AI — the next era of computing — with the... ...”. NVIDIA is seeking top-tier AI Compiler Engineers to drive innovation within our world-class... ...problems for AI workloads (both inference and training) and successfully transition...- ...Systems builds the world’s largest AI chip, 56 times larger than... ...-leading training and inference speeds and empowers machine learning... ...a versatile and experienced engineer to join our SOTA Training... ...translation, graph lowering, compiler optimizations, runtime integration...SeniorInternship
- NVIDIA Gruppe in Santa Clara is seeking an AI & Deep Learning Compiler Engineer to join its Deep Learning & AI Compiler team. This role involves developing compiler IR and collaborating with various teams to enhance deep learning software. The ideal candidate will have...Senior
$152k - $241.5k
NVIDIA Gruppe is hiring an AI & Deep Learning Compiler Engineer for the Deep Learning & AI Compiler team. This role involves analyzing deep learning networks and developing optimization algorithms while collaborating with software and GPU architecture teams. The ideal...Senior- Intel Corporation is seeking a Senior Compiler Engineer to develop and optimize compiler software for next-generation GPU architectures. The role... ...collaborating on cutting-edge compiler technologies that enhance AI and high-performance computing performance. The ideal...Senior
- A leading technology firm is seeking a Senior Software Engineer for Quantized Inference to implement quantized recipes for advanced model optimization. This role demands strong skills in Python and C++, alongside experience in ML accelerators and software engineering fundamentals...Senior
$184k - $287.5k
NVIDIA Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries, code generators, and kernel technologies for NVIDIA's architecture, emphasizing high-impact...Senior$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...Senior- ...computing experiences-from AI and data centers, to PCs, gaming... ...AMD is looking for a Senior Staff AI Infra Engineer who is passionate about improving... ...LLM training and inference on AMD GPUs, improving kernel... ...architecture, memory hierarchy, and compiler-level optimization (e.g.,...
- ...unleashing the potential of generative AI to power the transformation of technology... ...days per week. The role: Analog Design Engineer, Senior / Staff /Sr. Staff What You Will Do: Analog... ...engine for Artificial Intelligence Inference Accelerator and High-Speed Die-2-Die Interface...Senior3 days per week
$207k - $300k
Senior Research Engineer, On-Device Inference, Robotics, DeepMind corporate_fare DeepMind place Mountain View, CA, USA... ...to align model architectures with AI accelerators (e.g., distillation).... ...software performance analysis, improving compilers for mobile platforms, as well as...SeniorFull time$152k - $241.5k
...driving advancements in AI and machine learning to... ...talented and motivated engineers to join our TensorRT... ...-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer in the... ...Deep Learning Frameworks, Compilers, or System Software....Senior$184k - $287.5k
...GPUs are at the core of modern AI infrastructure, from training... ...-scale models to running inference in production. That position... ...software as much as hardware, and compiler engineering is a big part of what makes it work. We're hiring senior software engineers for a...SeniorWork experience placement- ...generation computing experiences-from AI and data centers, to PCs,... ...and SOTA LLM and Multimodal inference at scale across multi-GPU and... ...and optimize cutting-edge compiler technologies and drive... ...THE PERSON: Skilled engineer with strong technical and analytical...Senior
$152k - $241.5k
...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep... ...optimizations. Background in compiler development Experience in working... ...existing vacancy. NVIDIA uses AI tools in its recruiting processes....Senior- ...computing experiences-from AI and data centers, to PCs,... ...career. THE ROLE: As a senior member of the LLM inference framework team, you will... ...intersection of inference engines, distributed systems, and... ...with kernel, compiler, and networking teams to close...Senior
- A cutting-edge AI company in California is looking for a Member of Technical Staff for Kernel/Compiler/Communication. This critical role requires strong expertise in CUDA and... ...5+ years of experience in performance engineering. The ideal candidate will design high-performance...Senior
$184k - $287.5k
NVIDIA Gruppe in Santa Clara is seeking an AI Systems Engineer to innovate and develop cutting-edge technologies in the AI inference software stack. Candidates should hold a... ...role involves building efficient kernels and compilers for AI workloads while actively...Senior$120.1k - $225.7k
...Role Entails End-to-End Inference Optimization: Lead the... ...inference technology (e.g., compiler optimization, model compression... ...team members to build a robust AI inference technical ecosystem... ...Computer Science, Electronic Engineering, AI, or related fields; significant...SeniorRelocation package- A leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal... ...dynamic work environment with a focus on innovative solutions and advancing AI technologies. #J-18808-Ljbffr Advanced Micro DevicesSenior
$184k - $287.5k
...computing. We are increasingly known as “the AI computing company”. We are looking for a Senior Performance Compiler Engineer to join our team and work on the open-source... ..., accelerating both training and inference. You will be immersed in a diverse, supportive...Senior$152k - $241.5k
...tapping into the unlimited potential of AI to define the next era of computing. An... ...are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers for... ...DLC has been the backbone of NVIDIA’s inference engine, spanning across data centers, personal...SeniorRemote work$152k - $241.5k
...deep learning ignited modern AI — the next era of computing —... ...looking for versatile software engineers for our XLA team. NVIDIA is... ...Responsibilities In this role, develop compiler optimization algorithms for... ...workloads. You will optimize inference and training performance for...Senior$207k - $300k
Google Inc. in Mountain View is seeking a Senior Research Engineer to focus on optimizing on-device inference for robotics at DeepMind. This role requires a Bachelor's degree in a relevant field and 8 years' experience in machine learning. Ideal candidates will have expertise...Senior

