Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Compiler Engineer - AI Inference

$152k - $241.5k

NVIDIA Gruppe

NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self‑driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company”. NVIDIA is seeking top‑tier AI Compiler Engineers to drive innovation within our world‑class compiler organization. In this role, you will push the boundaries of what is possible in AI performance and help build the technology that powers the next generation of computing. Join us and make a tangible impact on a global scale. What you’ll be doing: Drive technical innovation: Participating in hands‑on development focusing on kernel generation and computational graph optimizations for next‑generation NVIDIA GPUs. Advance the state‑of‑the‑art: Solve complex compilation problems for AI workloads (both inference and training) and successfully transition these breakthroughs into enterprise and consumer products. Collaborate on hardware/software co‑design: Partner with leading experts across our software, hardware, and research divisions to architect and co‑design future silicon. Scale AI to the datacenter: Participating in the advancement and optimization of datacenter‑scale AI workload deployments. What we need to see: BS or MS in Computer Science, Computer Engineering, or a related field (or equivalent experience). A PhD is strongly preferred. Compiler Experience: 3+ years of relevant industry experience specializing in compiler optimizations, synthesis, and placement. MLIR Knowledge: Demonstrated, hands‑on experience working with MLIR. Programming Excellence: Exceptional C/C++ and Python programming and software design skills, including rigorous debugging, performance analysis, and test design. Team Dynamics: Strong communication and interpersonal skills, with the ability to collaborate effectively in a dynamic, fast‑paced, and product‑oriented environment. Ways to stand out from the crowd: Hardware Implementation: Hands‑on experience implementing complex AI workloads on CPU, GPU, and/or custom AI accelerator architectures. LLM Knowledge: Deep understanding of Large Language Model (LLM) inference and its profound implications on computer architecture. Architecture & Design: Demonstrated understanding in the designing and architecting of comprehensive compiler frameworks from the ground up. With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward‑thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you’re a creative and autonomous engineer with a real passion for technology, we want to hear from you. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is $152,000 USD - $241,500 USD. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until April 28, 2026. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr NVIDIA Gruppe

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Compiler Engineer - AI Inference in Santa Clara, CA vacancy
  • $152k - $241.5k

    NVIDIA in Santa Clara is seeking a Compiler Engineer to drive technical innovation in AI workloads and optimize NVIDIA GPUs. The role involves participating in hands-on development, collaborating across divisions, and solving complex compilation problems. Applicants should... 
    Suggested

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

    NVIDIA is hiring an AI & Deep Learning Compiler Engineer for its Deep Learning & AI Compiler (DLC) team. What you’ll be doing Analyzing deep learning networks and developing compiler optimization algorithms. Collaborating with members of the deep learning software framework... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...recently, GPU deep learning ignited modern AI — the next era of computing — with the...  ...are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers...  ...DLC has been the backbone of NVIDIA’s inference engine, spanning across data centers, personal... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...generation computing experiences—from AI and data centers, to PCs,...  ...As a senior member of the LLM inference framework team, you will be...  ...the intersection of inference engines, distributed systems, and GPU...  ...and collaborating with kernel, compiler, and networking teams to close... 
    Suggested

    Advanced Micro Devices

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency...  ...inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale workloads across... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $152k - $287.5k

    NVIDIA Gruppe is seeking a Senior Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role involves developing algorithms for their LPX inference and compiler stack, optimizing the performance of neural network workloads on NVIDIA platforms... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...generation computing experiences-from AI and data centers, to PCs,...  ...for a Senior Staff AI Infra Engineer who is passionate about...  ...accelerate LLM training and inference on AMD GPUs, improving kernel...  ...architecture, memory hierarchy, and compiler-level optimization (e.g.,... 

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

    NVIDIA Gruppe is seeking an AI & Deep Learning Compiler Engineer for its Deep Learning & AI Compiler team in Santa Clara, California. This role involves analyzing and optimizing deep learning networks, as well as developing compiler algorithms to enhance performance on... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

    NVIDIA Gruppe is hiring an AI & Deep Learning Compiler Engineer for the Deep Learning & AI Compiler team. This role involves analyzing deep learning networks and developing optimization algorithms while collaborating with software and GPU architecture teams. The ideal... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • NVIDIA Gruppe in Santa Clara, California is seeking AI Compiler Engineers to drive technological innovation within their compiler organization. The role involves working on kernel generation and optimization for next-generation NVIDIA GPUs and solving complex compilation... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...next-generation computing experiences-from AI and data centers, to PCs, gaming and...  ...AMD is looking for a strategic software engineering lead who is passionate about improving the...  ...for optimizing scale-up and scale-out inference. Develop methods and tooling to utilize... 

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    18 hours ago
  •  ...Systems builds the world’s largest AI chip, 56 times larger than...  ...-leading training and inference speeds and empowers machine learning...  ...a versatile and experienced engineer to join our SOTA Training...  ...translation, graph lowering, compiler optimizations, runtime integration... 
    Internship

    Dormont Manufacturing Co

    Sunnyvale, CA
    3 days ago
  • $120.1k - $225.7k

     ...Role Entails End-to-End Inference Optimization: Lead the...  ...inference technology (e.g., compiler optimization, model compression...  ...team members to build a robust AI inference technical ecosystem...  ...Computer Science, Electronic Engineering, AI, or related fields; significant... 
    Relocation package

    Tencent

    Palo Alto, CA
    1 day ago
  •  ...generation computing experiences-from AI and data centers, to PCs,...  ...and SOTA LLM and Multimodal inference at scale across multi-GPU and...  ...and optimize cutting-edge compiler technologies and drive...  ...THE PERSON: Skilled engineer with strong technical and analytical... 

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    1 day ago
  • Advanced Micro Devices is seeking a strategic software engineering lead in Santa Clara, California. This role involves improving application...  .... Key responsibilities include developing techniques for inference optimization and supporting the ROCm ecosystem expansion. A Bachelor... 

    Advanced Micro Devices

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

    NVIDIA Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries, code generators, and kernel technologies for NVIDIA's architecture, emphasizing high-impact... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $224k - $356.5k

     ...people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts...  ...lasting impact on the world. We are looking for a Raytracing Compiler Engineer to join as a member of our international engineering team.... 
    Full time

    NVIDIA

    Santa Clara, CA
    11 hours ago
  • A pioneering AI technology company in Santa Clara is seeking a Graph Optimization Compiler Engineer to enhance their AI compiler stack. This role focuses on developing graph-level optimizations to deliver significant performance improvements. The ideal candidate should... 

    Lemurian Labs

    Santa Clara, CA
    18 hours ago
  • $184k - $287.5k

    NVIDIA's GPUs are at the core of modern AI infrastructure, from training large-scale models to running inference in production. That position depends on software as much as hardware, and compiler engineering is a big part of what makes it work. What you’ll be doing: Designing... 
    Work experience placement

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact...  ..., and optimizations.Background in compiler developmentExperience in working with...  ...for an existing vacancy.NVIDIA uses AI tools in its recruiting processes.... 

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • A leading technology firm is seeking a Senior Software Engineer for Quantized Inference to implement quantized recipes for advanced model optimization. This role demands strong skills in Python and C++, alongside experience in ML accelerators and software engineering fundamentals... 

    NVIDIA

    Santa Clara, CA
    4 days ago
  • Intel Corporation is seeking a Senior Compiler Engineer to develop and optimize compiler software for next-generation GPU architectures. The role...  ...on cutting-edge compiler technologies that enhance AI and high-performance computing performance. The ideal candidate... 

    Intel Corporation

    Santa Clara, CA
    3 days ago
  • $184k - $356.5k

    NVIDIA Gruppe is looking for a Senior Software Engineer specializing in Deep Learning Inference in Santa Clara, California. You will design and optimize GPU-accelerated software critical for advanced AI applications, contributing to libraries like vLLM and SGLang. Ideal... 

    NVIDIA Gruppe

    Santa Clara, CA
    18 hours ago
  • $184k - $356.5k

    NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...Corporation is looking for a passionate Software Engineer to join the TensorRT team in Santa Clara, California...  ...in deep learning and work with cutting-edge AI technology, contributing to high-performance AI inference solutions. Your role involves designing and developing... 

    NVIDIA Corporation

    Santa Clara, CA
    18 hours ago
  • $152k - $241.5k

     ...optimize and benchmark GenAI inference on NVIDIA's latest accelerators...  ...of GPU performance engineering and public accountability. What...  ...workflows, and other emerging AI use cases. Collaborate with framework...  ...architecture, kernel, and compiler teams to shape GPU roadmaps based... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $184k - $287.5k

    NVIDIA Gruppe in Santa Clara is seeking an AI Systems Engineer to innovate and develop cutting-edge technologies in the AI inference software stack. Candidates should hold a...  ...role involves building efficient kernels and compilers for AI workloads while actively... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...Senior Machine Learning Applications and Compiler Engineer! NVIDIA is seeking engineers to develop...  ...and optimizations for our LPX inference and compiler stack. You will work at the...  ...or similar. Experience with large-scale AI distributed inference or training systems... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $207k - $300k

    Senior Research Engineer, On-Device Inference, Robotics, DeepMind corporate_fare DeepMind place Mountain...  ...techniques to align model architectures with AI accelerators (e.g., distillation)....  ...performance analysis, improving compilers for mobile platforms, as well as core... 
    Full time

    Google Inc.

    Mountain View, CA
    1 day ago
  • $124k - $195.5k

     ...model focused on visual and AI computing. For two decades, NVIDIA...  ...an AI Developer Technology Engineer to push the limits of...  ...with NVIDIA research, hardware, compiler, and tools teams. What we need...  ...related field. Experience with inference optimization techniques and deploying... 
    Internship

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Compiler Engineer - AI Inference. Be the first to apply!