Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior AI Inference Compiler Engineer

$152k - $241.5k

NVIDIA Gruppe

NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company”. We are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers for its Deep Learning & AI Compiler (DLC) team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in many areas, e.g. large language models, generative AIs, recommendation systems, image classification, speech recognition, etc. With the rapid advancement of AI, our DLC has been the backbone of NVIDIA’s inference engine, spanning across data centers, personal devices, automotive, and robotics. The compiler must deliver leading inference performance, fast build time, reduced memory footprints, and ease of use in the forms of both Ahead-of-Time and Just-in-Time. Join the team building the DLC which will be used by the entire deep learning community. What you’ll be doing: Develop compiler IR, programming model and optimizations for future GPU architectures. Collaborating with members of the deep learning software framework teams and the hardware architecture teams to accelerate the next generation of deep learning software. Scope of these efforts includes defining public APIs, performance optimizations and analysis, crafting and implementing compiler optimizations and kernel generation for neural networks, and other general software engineering work. What we need to see: Bachelors, Masters or Ph.D. in Computer Science, Computer Engineering, related field or equivalent experience. 3+ years of relevant work or research experience in performance analysis and compiler optimizations. Experience with compiler technologies (e.g., MLIR, XLA, and LLVM etc.). Excellent C/C++ and Python programming and software design skills, including debugging, performance analysis, and test design. Ability to work independently, define project goals and scope, and lead your own development efforts. Strong interpersonal skills are required along with the ability to work in a fast moving & dynamic product-oriented team. Ways to stand out from the crowd: Understanding of deep learning models, algorithms and frameworks, such as PyTorch, XLA etc. Understanding of LLM inference optimizations and techniques. GPU kernel generation with high performance and fast build time. Proficient in GPU architecture. CUDA or OpenCL programming experience. Track record on new hardware bring-up is a plus. Benefits and Compensation: Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until February 28, 2026. This posting is for an existing vacancy. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr NVIDIA Gruppe

Vacancy posted 8 hours ago
Similar jobs that could be interesting for youBased on the Senior AI Inference Compiler Engineer in Santa Clara, CA vacancy
  • $152k - $241.5k

     ...recently, GPU deep learning ignited modern AI — the next era of computing — with the...  ...are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers...  ...DLC has been the backbone of NVIDIA’s inference engine, spanning across data centers, personal... 
    Senior

    NVIDIA

    Santa Clara, CA
    17 hours ago
  • $152k - $241.5k

    NVIDIA in Santa Clara is seeking a Compiler Engineer to drive technical innovation in AI workloads and optimize NVIDIA GPUs. The role involves participating in hands-on development, collaborating across divisions, and solving complex compilation problems. Applicants should... 
    Senior

    NVIDIA

    Santa Clara, CA
    17 hours ago
  • $152k - $241.5k

    NVIDIA is hiring an AI & Deep Learning Compiler Engineer for its Deep Learning & AI Compiler (DLC) team. What you’ll be doing Analyzing deep learning networks and developing compiler optimization algorithms. Collaborating with members of the deep learning software framework... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    17 hours ago
  • $152k - $287.5k

    NVIDIA Gruppe is seeking a Senior Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role involves developing algorithms for their LPX inference and compiler stack, optimizing the performance of neural network workloads on NVIDIA platforms... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    17 hours ago
  • $184k - $287.5k

     ...We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency...  ...inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale workloads across... 
    Senior

    NVIDIA

    Santa Clara, CA
    17 hours ago
  • $152k - $241.5k

     ...eager to work on cutting-edge AI technology for safety-...  ...NVIDIA's TensorRT team as a Senior Software Engineer, and be at the forefront of...  ...enabling high-performance AI inference solutions for automotive...  ...functionalities into TensorRT's compiler and runtime for specialized... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

     ...recently, GPU deep learning ignited modern AI — the next era of computing — with the...  ...”. NVIDIA is seeking top-tier AI Compiler Engineers to drive innovation within our world-class...  ...problems for AI workloads (both inference and training) and successfully transition... 

    NVIDIA

    Santa Clara, CA
    17 hours ago
  •  ...Systems builds the world’s largest AI chip, 56 times larger than...  ...-leading training and inference speeds and empowers machine learning...  ...a versatile and experienced engineer to join our SOTA Training...  ...translation, graph lowering, compiler optimizations, runtime integration... 
    Senior
    Internship

    Dormont Manufacturing Co

    Sunnyvale, CA
    8 hours ago
  • NVIDIA Gruppe in Santa Clara is seeking an AI & Deep Learning Compiler Engineer to join its Deep Learning & AI Compiler team. This role involves developing compiler IR and collaborating with various teams to enhance deep learning software. The ideal candidate will have... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    17 hours ago
  • $152k - $241.5k

    NVIDIA Gruppe is hiring an AI & Deep Learning Compiler Engineer for the Deep Learning & AI Compiler team. This role involves analyzing deep learning networks and developing optimization algorithms while collaborating with software and GPU architecture teams. The ideal... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    17 hours ago
  • Intel Corporation is seeking a Senior Compiler Engineer to develop and optimize compiler software for next-generation GPU architectures. The role...  ...collaborating on cutting-edge compiler technologies that enhance AI and high-performance computing performance. The ideal... 
    Senior

    Intel Corporation

    Santa Clara, CA
    17 hours ago
  • A leading technology firm is seeking a Senior Software Engineer for Quantized Inference to implement quantized recipes for advanced model optimization. This role demands strong skills in Python and C++, alongside experience in ML accelerators and software engineering fundamentals... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $184k - $287.5k

    NVIDIA Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries, code generators, and kernel technologies for NVIDIA's architecture, emphasizing high-impact... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    17 hours ago
  • $184k - $356.5k

    NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    17 hours ago
  •  ...computing experiences-from AI and data centers, to PCs, gaming...  ...AMD is looking for a Senior Staff AI Infra Engineer who is passionate about improving...  ...LLM training and inference on AMD GPUs, improving kernel...  ...architecture, memory hierarchy, and compiler-level optimization (e.g.,... 

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    1 day ago
  •  ...unleashing the potential of generative AI to power the transformation of technology...  ...days per week. The role: Analog Design Engineer, Senior / Staff /Sr. Staff What You Will Do: Analog...  ...engine for Artificial Intelligence Inference Accelerator and High-Speed Die-2-Die Interface... 
    Senior
    3 days per week

    d-Matrix

    Santa Clara, CA
    8 hours ago
  • $207k - $300k

    Senior Research Engineer, On-Device Inference, Robotics, DeepMind corporate_fare DeepMind place Mountain View, CA, USA...  ...to align model architectures with AI accelerators (e.g., distillation)....  ...software performance analysis, improving compilers for mobile platforms, as well as... 
    Senior
    Full time

    Google Inc.

    Mountain View, CA
    3 days ago
  • $152k - $241.5k

     ...driving advancements in AI and machine learning to...  ...talented and motivated engineers to join our TensorRT...  ...-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer in the...  ...Deep Learning Frameworks, Compilers, or System Software.... 
    Senior

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

     ...GPUs are at the core of modern AI infrastructure, from training...  ...-scale models to running inference in production. That position...  ...software as much as hardware, and compiler engineering is a big part of what makes it work. We're hiring senior software engineers for a... 
    Senior
    Work experience placement

    NVIDIA

    Santa Clara, CA
    17 hours ago
  •  ...generation computing experiences-from AI and data centers, to PCs,...  ...and SOTA LLM and Multimodal inference at scale across multi-GPU and...  ...and optimize cutting-edge compiler technologies and drive...  ...THE PERSON: Skilled engineer with strong technical and analytical... 
    Senior

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep...  ...optimizations. Background in compiler development Experience in working...  ...existing vacancy. NVIDIA uses AI tools in its recruiting processes.... 
    Senior

    NVIDIA

    Santa Clara, CA
    4 days ago
  •  ...computing experiences-from AI and data centers, to PCs,...  ...career. THE ROLE: As a senior member of the LLM inference framework team, you will...  ...intersection of inference engines, distributed systems, and...  ...with kernel, compiler, and networking teams to close... 
    Senior

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    3 days ago
  • A cutting-edge AI company in California is looking for a Member of Technical Staff for Kernel/Compiler/Communication. This critical role requires strong expertise in CUDA and...  ...5+ years of experience in performance engineering. The ideal candidate will design high-performance... 
    Senior

    RadixArk

    Palo Alto, CA
    1 day ago
  • $184k - $287.5k

    NVIDIA Gruppe in Santa Clara is seeking an AI Systems Engineer to innovate and develop cutting-edge technologies in the AI inference software stack. Candidates should hold a...  ...role involves building efficient kernels and compilers for AI workloads while actively... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    17 hours ago
  • $120.1k - $225.7k

     ...Role Entails End-to-End Inference Optimization: Lead the...  ...inference technology (e.g., compiler optimization, model compression...  ...team members to build a robust AI inference technical ecosystem...  ...Computer Science, Electronic Engineering, AI, or related fields; significant... 
    Senior
    Relocation package

    Tencent

    Palo Alto, CA
    3 days ago
  • A leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal...  ...dynamic work environment with a focus on innovative solutions and advancing AI technologies. #J-18808-Ljbffr Advanced Micro Devices
    Senior

    Advanced Micro Devices

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

     ...computing. We are increasingly known as “the AI computing company”. We are looking for a Senior Performance Compiler Engineer to join our team and work on the open-source...  ..., accelerating both training and inference. You will be immersed in a diverse, supportive... 
    Senior

    NVIDIA AI

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...tapping into the unlimited potential of AI to define the next era of computing. An...  ...are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers for...  ...DLC has been the backbone of NVIDIA’s inference engine, spanning across data centers, personal... 
    Senior
    Remote work

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...deep learning ignited modern AI — the next era of computing —...  ...looking for versatile software engineers for our XLA team. NVIDIA is...  ...Responsibilities In this role, develop compiler optimization algorithms for...  ...workloads. You will optimize inference and training performance for... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    17 hours ago
  • $207k - $300k

    Google Inc. in Mountain View is seeking a Senior Research Engineer to focus on optimizing on-device inference for robotics at DeepMind. This role requires a Bachelor's degree in a relevant field and 8 years' experience in machine learning. Ideal candidates will have expertise... 
    Senior

    Google Inc.

    Mountain View, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior AI Inference Compiler Engineer. Be the first to apply!