Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Compiler Engineer, AI Inference Performance

$152k - $241.5k

NVIDIA

NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company”.

We are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers for its Deep Learning & AI Compiler (DLC) team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in many areas, e.g. large language models, generative AI, recommendation systems, image classification, speech recognition, etc. With the rapid advancement of AI, our DLC has been the backbone of NVIDIA’s inference engine, spanning across data centers, personal devices, automotive, and robotics. The compiler must deliver leading inference performance, fast build time, reduced memory footprints, and ease of use in the forms of both Ahead-of-Time and Just-in-Time. Join the team building the DLC which will be used by the entire deep learning community.

What you’ll be doing:

  • Analyzing deep learning networks and developing compiler optimization algorithms.

  • Collaborating with members of the deep learning software framework teams and the GPU architecture teams to accelerate the next generation of deep learning software.

  • Scope of these efforts includes defining public APIs, performance optimizations and analysis, crafting and implementing compiler techniques for AI workloads and future NVIDIA GPUs.

What we need to see:

  • Bachelor’s, Master’s or Ph.D. in Computer Science, Computer Engineering, related field or equivalent experience.

  • 3+ years of relevant work or research experience in performance analysis and compiler optimizations.

  • Experience with compiler technologies (e.g., MLIR, LLVM, XLA, Triton, etc.).

  • Excellent C/C++ and Python programming and software design skills, including debugging, performance analysis, and test design.

  • Ability to work independently, define project goals and scope, and lead your own development efforts.

  • Strong interpersonal skills are required along with the ability to work in a dynamic product-oriented team.

Ways to stand out from the crowd:

  • Proficient in CPU and/or GPU architecture. CUDA or OpenCL programming experience.

  • Understanding of deep learning models, algorithms and frameworks, such as PyTorch, JAX.

  • GPU kernel authoring and performance analysis using tools such as Nsight Compute.

  • A track record of success in mentoring early-career engineers and interns is a bonus.

  • Track record on new hardware bring-up is a plus.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD.

You will also be eligible for equity and benefits ( .

Applications for this job will be accepted at least until February 28, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Senior Compiler Engineer, AI Inference Performance in Santa Clara, CA vacancy
  • $184k - $287.5k

     ...seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models...  .... You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...eager to work on cutting-edge AI technology for safety-...  ...NVIDIA's TensorRT team as a Senior Software Engineer, and be at the forefront...  ...technology, enabling high-performance AI inference solutions for automotive safety...  ...into TensorRT's compiler and runtime for specialized... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $152k - $241.5k

     ...deep learning ignited modern AI — the next era of computing...  ...NVIDIA is seeking top-tier AI Compiler Engineers to drive innovation within...  ...boundaries of what is possible in AI performance and help build the...  ...problems for AI workloads (both inference and training) and... 
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  •  ...leading technology company based in California is seeking a Senior Compiler Engineer to shape the future of compiler technologies. This role...  ...passion for both compiler technology and GPU computing, driving performance and efficiency in high-performance computing applications.... 
    Senior
    Performance

    Intel Corporation

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

    NVIDIA in Santa Clara is seeking a Compiler Engineer to drive technical innovation in AI workloads and optimize NVIDIA GPUs. The role involves participating in hands-on development, collaborating across divisions, and solving complex compilation problems. Applicants should... 
    Senior

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $207k - $300k

    Senior Research Engineer, On-Device Inference, Robotics, DeepMind corporate_fare DeepMind place...  ...focused on high-performance inference. Understanding...  ...model architectures with AI accelerators (e.g., distillation...  ...analysis, improving compilers for mobile platforms, as... 
    Senior
    Performance
    Full time

    Google Inc.

    Mountain View, CA
    2 days ago
  • A cutting-edge AI company in California is looking for a Member of Technical Staff for Kernel/Compiler/Communication. This critical role requires strong expertise in...  ..., along with 5+ years of experience in performance engineering. The ideal candidate will design high-performance... 
    Senior
    Performance

    RadixArk

    Palo Alto, CA
    12 hours ago
  • $152k - $241.5k

     ...learning ignited modern AI — the next era of...  ...for versatile software engineers for our XLA team. NVIDIA...  ...join us to build high-performance, production-grade software...  ...In this role, develop compiler optimization...  ...workloads. You will optimize inference and training... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...computing. We are increasingly known as “the AI computing company”. We are looking for a Senior Performance Compiler Engineer to join our team and work on the open-source...  ..., accelerating both training and inference. You will be immersed in a diverse, supportive... 
    Senior
    Performance

    NVIDIA AI

    Santa Clara, CA
    3 days ago
  •  ...leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal candidate excels in collaborative...  ...a focus on innovative solutions and advancing AI technologies. #J-18808-Ljbffr Advanced Micro Devices
    Senior
    Performance

    Advanced Micro Devices

    Santa Clara, CA
    3 days ago
  •  ...builds the world's largest AI chip, 56 times larger...  ...-leading training and inference speeds and empowers...  ...and experienced engineer to join our SOTA Training...  ...unprecedented levels of performance, efficiency, and scalability...  ..., graph lowering, compiler optimizations, runtime... 
    Senior
    Performance
    Internship

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    2 days ago
  • $135.8k - $237.05k

     ...Mountain View, CA, USA Senior Backend Engineer, ML Inference Systems Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ-26...  ...daily decisions, with a focus on the performance, reliability, and scalability of inference... 
    Senior
    Performance
    Work at office
    Worldwide
    Relocation package

    Unity Technologies

    Mountain View, CA
    1 day ago
  • Cerebras seeks a Senior Performance Engineer to join the Product team in Sunnyvale, CA. The role involves developing benchmarks to measure inference performance and creating competitive pricing models...  ...culture in a groundbreaking AI environment, along with job stability... 
    Senior
    Performance

    Cerebras

    Sunnyvale, CA
    1 day ago
  • $184k - $356.5k

    A leading technology company in California is seeking a Senior DL Algorithms Engineer to drive inference performance for Deep Learning workloads. The role involves implementing advanced model inference and collaborating with co-design teams to optimize performance across... 
    Senior
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $184k - $356.5k

    A leading AI computing company in California is seeking a Senior Deep Learning Software Engineer focused on performance optimization of LLM models. You will analyze and enhance LLM inference performance, working in cross-collaborative teams to implement cutting-edge algorithms... 
    Senior
    Performance
    Full time

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • NVIDIA is seeking a Senior DL Algorithms Engineer to optimize LLM/Omni models and enhance performance across its software stack. The ideal candidate...  ...learning, specifically in inference. This role involves profiling...  ...with teams to advance AI solutions. A strong understanding... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...driving advancements in AI and machine...  ...talented and motivated engineers to join our...  ...leading deep learning inference software for...  ...accelerators. As a Senior Software Engineer...  ...Learning Frameworks, Compilers, or System...  ...of close-to-metal performance analysis, optimization... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...deep learning ignited modern AI — the next era of computing —...  ...”. We are hiring software engineers for the CUDA Tile team. NVIDIA...  ...will design and implement compiler transformations, develop MLIR...  ...lowering passes, and optimize the performance of tile-based kernels to... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $184k - $287.5k

     ...We are seeking for an expert Senior Compiler Engineer to join our Compute Compiler Team, with a focus...  ...effectively Partner with architecture, performance, and product teams to translate NVIDIA...  ...an existing vacancy. NVIDIA uses AI tools in its recruiting processes.... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

     ...into our future offerings, our Compiler team is growing and seeking top-tier compiler engineers who want an exciting and engaging...  ...work or research experience in performance analysis, compiler...  ...existing vacancy.  NVIDIA uses AI tools in its recruiting processes... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    12 hours ago
  •  ...shaping the future of cutting-edge compiler technologies at Intel. As a Senior Compiler Engineer, you will play a critical role...  ...you will enable transformative performance and efficiency gains, empowering groundbreaking applications in AI and high-performance computing.... 
    Senior
    Performance
    Internship

    Intel Corporation

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

     ...are now looking for a Senior Machine Learning Applications and Compiler Engineer! NVIDIA is seeking engineers...  ...for our LPX inference and compiler stack. You...  ...develop, and maintain high-performance runtime and compiler components...  ...with large-scale AI distributed inference... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $184k - $287.5k

     ...Senior Software Engineer For Compiler Team NVIDIA's GPUs are at the core of modern AI infrastructure, from training large-scale models to running inference in production. That position depends on software...  ...execution stack, targeting high-performance kernel generation for... 
    Senior
    Performance
    Work experience placement

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

    We are looking for a Senior DL Algorithms Engineer for LLM/Omni model optimizations...  ...who are mindful of performance analysis and optimization...  ...technology company that leads the AI revolution. What you will...  ...) on NVIDIA’s accelerated inference SW stack. Contribute new... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make...  ...platforms for functionality and performance Develop components of...  .... Background in compiler development...  ...vacancy. NVIDIA uses AI tools in its recruiting... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  •  ...computing experiences-from AI and data centers, to...  ...enhancing GPU kernel performance, accelerating deep...  ...SOTA LLM and Multimodal inference at scale across multi-...  ...optimize cutting-edge compiler technologies and drive...  ...PERSON: Skilled engineer with strong technical... 
    Senior
    Performance

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

     ...Artificial Intelligence, High Performance Computing and...  ...motivated Deep Learning engineer to bring advanced...  ...communication technologies into AI stacks, including...  ...up to 100K GPUs to inference down at microsecond latency...  ...models. Improve AI compilers to hide communications... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $224k - $356.5k

     ...into the unlimited potential of AI to define the next era of...  ...at the forefront of AI and high-performance computing. As a Senior / Principal Deep Learning Engineer — Model Evaluation & AI Systems...  ...Work alongside model training, inference, and product divisions to provide... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $140k - $224.25k

     ...into the unlimited potential of AI to define the next era of...  ...building the next generation of compiler technologies to accelerate...  ...workloads. We are looking for an engineer to implement compiler...  ...guarantee functional quality and performance as models, compiler stacks, and... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...the unlimited potential of AI to define the next era of computing...  ...for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software...  ...the backbone of NVIDIA’s inference engine, spanning across data...  ...must deliver leading inference performance, fast build time, reduced... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Compiler Engineer, AI Inference Performance. Be the first to apply!