Compiler Engineer - AI Inference

$152k - $241.5k

NVIDIA Gruppe

NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self‑driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company”. NVIDIA is seeking top‑tier AI Compiler Engineers to drive innovation within our world‑class compiler organization. In this role, you will push the boundaries of what is possible in AI performance and help build the technology that powers the next generation of computing. Join us and make a tangible impact on a global scale. What you’ll be doing: Drive technical innovation: Participating in hands‑on development focusing on kernel generation and computational graph optimizations for next‑generation NVIDIA GPUs. Advance the state‑of‑the‑art: Solve complex compilation problems for AI workloads (both inference and training) and successfully transition these breakthroughs into enterprise and consumer products. Collaborate on hardware/software co‑design: Partner with leading experts across our software, hardware, and research divisions to architect and co‑design future silicon. Scale AI to the datacenter: Participating in the advancement and optimization of datacenter‑scale AI workload deployments. What we need to see: BS or MS in Computer Science, Computer Engineering, or a related field (or equivalent experience). A PhD is strongly preferred. Compiler Experience: 3+ years of relevant industry experience specializing in compiler optimizations, synthesis, and placement. MLIR Knowledge: Demonstrated, hands‑on experience working with MLIR. Programming Excellence: Exceptional C/C++ and Python programming and software design skills, including rigorous debugging, performance analysis, and test design. Team Dynamics: Strong communication and interpersonal skills, with the ability to collaborate effectively in a dynamic, fast‑paced, and product‑oriented environment. Ways to stand out from the crowd: Hardware Implementation: Hands‑on experience implementing complex AI workloads on CPU, GPU, and/or custom AI accelerator architectures. LLM Knowledge: Deep understanding of Large Language Model (LLM) inference and its profound implications on computer architecture. Architecture & Design: Demonstrated understanding in the designing and architecting of comprehensive compiler frameworks from the ground up. With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward‑thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you’re a creative and autonomous engineer with a real passion for technology, we want to hear from you. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is $152,000 USD - $241,500 USD. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until April 28, 2026. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr NVIDIA Gruppe

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the Compiler Engineer - AI Inference in Santa Clara, CA vacancy

Senior Software Engineer, AI Inference Systems
$184k - $287.5k
...We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency... ...inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale workloads across...
Suggested
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior ML Compiler & Inference Systems Engineer
$152k - $287.5k
NVIDIA Gruppe is seeking a Senior Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role involves developing algorithms for their LPX inference and compiler stack, optimizing the performance of neural network workloads on NVIDIA platforms...
Suggested
NVIDIA Gruppe
Santa Clara, CA
1 day ago
AI Inference Compiler Engineer — MLIR & Kernel Optimizer
NVIDIA Gruppe in Santa Clara, California is seeking AI Compiler Engineers to drive technological innovation within their compiler organization. The role involves working on kernel generation and optimization for next-generation NVIDIA GPUs and solving complex compilation...
Suggested
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior AI Inference Compiler Engineer — Equity Eligible
$152k - $241.5k
NVIDIA Gruppe is seeking an AI & Deep Learning Compiler Engineer for its Deep Learning & AI Compiler team in Santa Clara, California. This role involves analyzing and optimizing deep learning networks, as well as developing compiler algorithms to enhance performance on...
Suggested
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior Compiler Engineer, AI Inference Platforms
$152k - $241.5k
Overview AI & Deep Learning Compiler Engineer for NVIDIA’s Deep Learning & AI Compiler (DLC) team. What you’ll be doing Analyze deep learning networks and develop compiler optimization algorithms. Collaborate with deep learning software framework teams and GPU architecture...
Suggested
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior AI Systems Engineer: Inference Kernels & Runtimes
$184k - $287.5k
NVIDIA Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries, code generators, and kernel technologies for NVIDIA's architecture, emphasizing high-impact...
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior Software Development Engineer - SGLang and Inference Stack
...generation computing experiences—from AI and data centers, to PCs,... ...and SOTA LLM and Multimodal inference at scale across multi-GPU and... .... THE PERSON: Skilled engineer with strong technical and analyticalexpertisein... ...with GPU Library and Compiler Teams: Work closely with...
Advanced Micro Devices
Santa Clara, CA
3 days ago
Senior Software Engineer, Deep Learning Inference - TensorRT
$152k - $241.5k
## Senior Software Engineer, Deep Learning Inference - TensorRTApplylocations: US, CA, Santa Claratime type:... ...profiling, and optimizations.* Background in compiler development* Experience in working... ...for an existing vacancy.NVIDIA uses AI tools in its recruiting processes....
NVIDIA Corporation
Santa Clara, CA
1 day ago
Senior Compiler CodeGen Engineer for AI Hardware
Lemurian Labs in Santa Clara is seeking a Compiler Code Generation Engineer to design the core code generation capabilities of our AI compiler. This role involves translating high-level ML computations into optimized machine code across various hardware platforms. Ideal...
Lemurian Labs
Santa Clara, CA
2 days ago
Tungsten Compiler Engineer for Wafer-Scale AI
Cerebras Systems, Inc. in Sunnyvale, California is seeking Compiler Engineers for their innovative Tungsten programming language. This role involves designing efficient compilers for wafer-scale AI hardware, collaborating with various technical teams to shape the future...
Cerebras Systems, Inc.
Sunnyvale, CA
4 days ago
Edge Inference Engineer: Local AI Latency Optimizer
Intel in Santa Clara, California is seeking a talented individual to optimize inference engines for local environments, impacting the future of AI. Applicants should have a strong background in C++ and software development, with experience in profiling performance issues...
Local area
Intel
Santa Clara, CA
16 hours ago
Graph Optimization Engineer — AI Compiler Stack
A pioneering AI technology company in Santa Clara is seeking a Graph Optimization Compiler Engineer to enhance their AI compiler stack. This role focuses on developing graph-level optimizations to deliver significant performance improvements. The ideal candidate should...
Lemurian Labs
Santa Clara, CA
3 days ago
Senior AI/ML Systems Engineer - Scalable Inference
$165k - $242k
Dormont Manufacturing Co is seeking a Senior Engineer to lead designs and improve engineering standards. The role focuses on evolving our Kubernetes-native inference platform and ensuring reliability across multiple services. Qualified candidates should have 5-8 years...
Dormont Manufacturing Company
Sunnyvale, CA
3 days ago
Senior AI Inference Systems Engineer: GPU-Optimized, Cloud
$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior AI Compiler Engineer - Applied Research
$152k - $241.5k
NVIDIA's GPUs are at the core of modern AI infrastructure, from training large-scale models to running inference in production. That position depends on software as much as hardware, and compiler engineering is a big part of what makes it work. What You'll Be Doing Design...
NVIDIA
Santa Clara, CA
2 days ago
Senior AI Inference Kernel Engineer
$184k - $287.5k
NVIDIA Gruppe in Santa Clara is seeking an AI Systems Engineer to innovate and develop cutting-edge technologies in the AI inference software stack. Candidates should hold a... ...role involves building efficient kernels and compilers for AI workloads while actively...
NVIDIA Gruppe
Santa Clara, CA
1 day ago
AI Inference Performance Engineer
$152k - $241.5k
...optimize and benchmark GenAI inference on NVIDIA's latest accelerators... ...of GPU performance engineering and public accountability. What... ...workflows, and other emerging AI use cases. Collaborate with framework... ...architecture, kernel, and compiler teams to shape GPU roadmaps based...
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Production AI Inference Engineering Lead
...NVIDIA Corporation is seeking a Manager, Software Engineering to lead production AI inference for NVIDIA Inference Microservices. This role involves managing a team responsible for the deployment of optimized AI inference solutions and ensuring high-quality software releases...
Jobleads-US
Santa Clara, CA
7 hours ago
Senior Software Engineer, CUDA Deep Learning Systems
$184k - $287.5k
...hardware performance for emerging AI workloads. You will be a... ...bottlenecks in both training and inference pipelines. Collaborate... ...and SW architects, kernel and compiler authors and CUDA driver experts... ...in Computer Science, Computer Engineering, Electrical Engineering, or...
NVIDIA
Santa Clara, CA
4 days ago
Senior Machine Learning Applications and Compiler Engineer, LPX
$152k - $241.5k
...Senior Machine Learning Applications and Compiler Engineer! NVIDIA is seeking engineers to develop... ...and optimizations for our LPX inference and compiler stack. You will work at the... ...or similar. Experience with large-scale AI distributed inference or training systems...
NVIDIA Gruppe
Santa Clara, CA
1 day ago
ML Systems Engineer: Production-Scale LLM Inference
ScOp Venture Capital is looking for an ML Systems Engineer to optimize LLM inference systems crucial for their AI platform. The role focuses on enhancing performance and efficiency via low-level systems optimization, directly impacting industry leader processes in semiconductor...
ScOp Venture Capital
Santa Clara, CA
1 day ago
AI and FSI Developer Technology Engineer - New College Grad 2026
$124k - $195.5k
...model focused on visual and AI computing. For two decades, NVIDIA... ...an AI Developer Technology Engineer to push the limits of... ...with NVIDIA research, hardware, compiler, and tools teams. What we need... ...related field. Experience with inference optimization techniques and deploying...
Internship
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior Engineering Manager AI Inference Platform, Distributed Cloud
$262k - $365k
Senior Engineering Manager AI Inference Platform, Distributed Cloud Location: Sunnyvale, CA, USA Pay US: $262,000 - $365,000 (USD) + 25% bonus target... ...serving, continuous batching, or specialized compiler technologies (e.g., XLA). 4+ years of experience utilizing...
Google Inc.
Sunnyvale, CA
16 hours ago
AI Inference Co-Design Engineer for Real-Time HW
$132k - $330k
Software Engineer, AI Inference Codesign The AI inference co-design team's goal is to take research models and make them run efficiently on our... ...This unique role lies at the intersection of AI research, compiler development, kernel optimization, math and HW design. You...
Hourly pay
Full time
Temporary work
Flexible hours
Tesla Motors, Inc.
Palo Alto, CA
2 days ago
Senior AI Systems Engineer — SGLang & Inference on GPUs
A leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal... ...dynamic work environment with a focus on innovative solutions and advancing AI technologies. #J-18808-Ljbffr Advanced Micro Devices
Advanced Micro Devices
Santa Clara, CA
16 hours ago
AI Inference Systems Engineer: High-Throughput
$135k
United States Digital Space LLC in Palo Alto seeks an Application Software Engineer to develop high-performance AI inference systems. This role emphasizes the design and optimization of large-scale systems used for mission-critical applications. The ideal candidate possesses...
Remote work
United States Digital Space LLC
Palo Alto, CA
2 days ago
Senior Software Engineer, Inference
About the Role We are seeking a Senior Inference Engineer to accelerate the performance of Pika's AI-driven products. In this highly technical role, you will operate... ..., attention acceleration, and deep learning compiler stacks. GPU & Parallelism : Deep knowledge of GPU...
Work at office
3 days per week
PIKA Inc
Palo Alto, CA
16 hours ago
Senior Compiler Engineer for AI Accelerators & MLIR
IC Resources is seeking a Staff / Principal Compiler Engineer to join a stealth-mode AI hardware company in Palo Alto. This hybrid role involves developing software stacks for next-generation ML accelerators, focusing on low-level compiler development and various optimisation...
IC Resources
Palo Alto, CA
1 day ago
Senior Compiler Engineer - DL
$152k - $241.5k
## Senior Compiler Engineer - DLApplylocations: US, CA, Santa Clara: US, TX, Austin: US, TX, Remote... ...into the unlimited potential of AI to define the next era of computing. An... ...Our DLC has been the backbone of NVIDIA inference engine, spanning across data centers, personal...
Remote work
NVIDIA Corporation
Santa Clara, CA
2 days ago
Senior Deep Learning Compiler Engineer - XLA
$152k - $241.5k
...deep learning ignited modern AI — the next era of computing —... ...looking for versatile software engineers for our XLA team. NVIDIA is... ...Responsibilities In this role, develop compiler optimization algorithms for... ...workloads. You will optimize inference and training performance for...
NVIDIA Gruppe
Santa Clara, CA
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Compiler Engineer - AI Inference. Be the first to apply!