Senior AI Inference Engineer for AIConfigurator (Dynamo)

NVIDIA

NVIDIA Corporation is looking for a Senior Inference Engineer to advance AIConfigurator, enhancing model serving and performance for large-scale LLM inference. This role entails developing production-quality APIs and integrating complex deployment configurations on NVIDIA GPU platforms. The ideal candidate will have over 10 years of software engineering experience, solid Python and Rust skills, and a robust understanding of GPU computing. Additionally, NVIDIA offers competitive salaries, equity, and benefits with a commitment to an inclusive work environment. #J-18808-Ljbffr NVIDIA Corporation

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the Senior AI Inference Engineer for AIConfigurator (Dynamo) in Santa Clara, CA vacancy

Senior GPU AI Inference Engineer - Triton & Dynamo
A leading technology company is seeking a Senior System Software Engineer to develop GPU-accelerated AI inference serving software. The ideal candidate will have over 5 years of experience with deep learning software, strong skills in Rust and C++, and a collaborative approach...
Senior
NVIDIA Corporation
Santa Clara, CA
3 days ago
Senior Inference Engineer, AIConfigurator for Dynamo
NVIDIA is recruiting a Senior Inference Engineer to advance AIConfigurator ( a system that automatically discovers high-... ...architectures. The team partners closely with Dynamo, TensorRT-LLM, vLLM, SGLang,... ...developer‑facing tools. Agentic AI solutions to solve sophisticated...
Senior
NVIDIA Corporation
Santa Clara, CA
3 days ago
Senior AI Inference Kernel Engineer
$184k - $287.5k
...NVIDIA Gruppe in Santa Clara is seeking an AI Systems Engineer to innovate and develop cutting-edge technologies in the AI inference software stack. Candidates should hold a Master's degree and possess over 6 years of experience in ML/DL systems development. The role involves...
Senior
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior AI Inference Engineer - High-Performance LLM Serving
$152k - $241.5k
...NVIDIA Gruppe is seeking a Senior Software Engineer – AI Inference in Santa Clara, California. This role involves enhancing open-source LLM serving optimizations and implementing high-performance runtime capabilities. Candidates should have 5+ years of experience in building...
Senior
NVIDIA Gruppe
Santa Clara, CA
3 days ago
Senior AI Inference Performance Engineer (GPU/Cluster)
$152k - $241.5k
NVIDIA Gruppe is seeking a talented individual to optimize and benchmark GenAI inference using the latest acceleration technologies. The role involves driving industry benchmark results and architecting distributed inference systems. Required qualifications include a relevant...
Senior
NVIDIA Gruppe
Santa Clara, CA
2 days ago
Senior AI Kernel & Inference Engineer
A leading technology company is seeking a Senior AI Software Engineer to join their team in Santa Clara, California. In this role, you will innovate and develop groundbreaking AI systems software for inference applications including deep learning framework optimizations...
Senior
NVIDIA Corporation
Santa Clara, CA
4 days ago
Senior AI Systems Engineer: Inference Kernels & Runtimes
$184k - $287.5k
NVIDIA Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries, code generators, and kernel technologies for NVIDIA's architecture, emphasizing high-impact...
Senior
NVIDIA Gruppe
Santa Clara, CA
2 days ago
Senior System Software Engineer - Dynamo-Triton Inference Server
...We are looking for a Senior System Software Engineer to work on Dynamo-Triton Inference Server. NVIDIA is hiring software engineers for its GPU-accelerated deep learning... ...the world are using GPUs to power a revolution in AI, enabling breakthroughs in problems from image...
Senior
NVIDIA Gruppe
Santa Clara, CA
3 days ago
Senior AI Inference Systems Engineer: GPU-Optimized, Cloud
$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...
Senior
NVIDIA Gruppe
Santa Clara, CA
2 days ago
Senior AI Inference Compiler Engineer Equity Eligible
$152k - $241.5k
...NVIDIA Gruppe is seeking an AI & Deep Learning Compiler Engineer for its Deep Learning & AI Compiler team in Santa Clara, California. This role involves analyzing and optimizing deep learning networks, as well as developing compiler algorithms to enhance performance on...
Senior
NVIDIA Gruppe
Santa Clara, CA
3 days ago
Senior AI Infra Engineer - Large-Model Inference
$156k - $387.6k
...Ellis Technologies, Inc. is seeking an AI Infra Engineer to develop and optimize next-generation inference systems for large-scale traffic. The ideal candidate will have a strong background in high-performance computing and should be able to work with large-model architectures...
Senior
Ellis Technologies, Inc.
San Jose, CA
10 hours ago
Senior AI Co-Design Engineer - Memory-Driven Inference
$189k - $301k
...Conductor in San Jose, CA is seeking a seasoned engineer to lead co-design efforts for optimizing AI model inference performance. The role requires a deep understanding of AI infrastructure, covering everything from model definition to serving. The ideal candidate should...
Senior
Conductor
San Jose, CA
4 days ago
Senior AI & DL Kernel Engineer for Inference & GPUs Remote
$184k - $287.5k
A leading technology company is seeking a Senior Software Engineer for AI and DL Kernel Libraries in Santa Clara, CA. The role involves designing and optimizing kernels for high-impact AI workloads and collaborating with engineers on innovative solutions. Candidates should...
Senior
Remote job
NVIDIA Corporation
Santa Clara, CA
3 days ago
Senior AI Inference Architect (Dynamo) Equity Eligible
$320k
...NVIDIA Gruppe is seeking a Distinguished Engineer to join the Dynamo engineering team in Santa Clara, California. The successful candidate will provide... ...drive product direction while working on state-of-the-art AI inferencing technologies. With a competitive salary range...
Senior
NVIDIA Gruppe
Santa Clara, CA
3 days ago
Senior AI Systems Engineer — SGLang & Inference on GPUs
A leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal... ...dynamic work environment with a focus on innovative solutions and advancing AI technologies. #J-18808-Ljbffr Advanced Micro Devices
Senior
Advanced Micro Devices
Santa Clara, CA
1 day ago
Principal AI Inference Engineer Open-Source & GPU-Focused
$272k - $431.25k
...NVIDIA Gruppe is looking for a Principal Software Engineer to advance open-source AI inference. This hands-on role emphasizes running high-performance inference on NVIDIA platforms and involves collaboration across various teams. Key responsibilities include optimizing...
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior Software Engineer - AI Inference
$152k - $241.5k
NVIDIA is the platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer - AI Inference to advance open‑source LLM serving by... ...contributions to vLLM, SGLang, PyTorch, Triton, NCCL, Dynamo or adjacent serving/runtime projects....
Senior
NVIDIA Gruppe
Santa Clara, CA
2 days ago
Senior AI Infra Engineer - Large Model Inference Systems (Multimodal/LLM/VLM)
$212.8k
...the Team We are dedicated to building the inference infrastructure for ultra-large-scale... ...language models, and frontier multimodal AI systems. Our mission is to provide a robust... ...or above in Computer Science, Software Engineering, Artificial Intelligence, Mathematics,...
Senior
Temporary work
Local area
Tik Tok
San Jose, CA
21 hours ago
Senior Python AI Engineer - LLMs, VectorDB & Guardrails
...A technology firm located in California is seeking candidates with experience in AI and ML algorithm development, particularly in LLM Inference and Similarity Search. Applicants should have strong communication skills and the ability to work independently. Familiarity...
Senior
ETHEREUM TECHNOLOGIES LLC
Sunnyvale, CA
3 days ago
AI Inference Engineer
...Role: AI Inference Engineer Location: San Jose, CA Duration: 6 to 12 Months Overview: We are seeking a highly skilled AI Inference... ...-node, multi-GPU clusters Distributed Serving Platform (Dynamo) Contribute to distributed serving architecture...
Triune Infomatics Inc
San Jose, CA
8 days ago
Principal AI Inference Systems Engineer
...generation computing experiences-from AI and data centers, to PCs, gaming and... ...THE ROLE: AMD is looking for a Senior Staff AI Infra Engineer who is passionate about improving the... ...Optimize and accelerate LLM training and inference on AMD GPUs, improving kernel,...
Advanced Micro Devices , Inc.
Santa Clara, CA
3 days ago
Senior AI Runtime Engineer: Distributed Training & Scale
...A forward-thinking AI infrastructure company is seeking a Staff AI Runtime Engineer to lead the design and optimization of their AI compute platform. In this leadership role, you'll enhance AI training and inference capabilities. Successful candidates will have over 8...
Senior
FlexAI
Santa Clara, CA
3 days ago
AI Inference Performance Engineer - New College Grad 2026
$124k - $195.5k
## AI Inference Performance Engineer - New College Grad 2026Applylocations: US, CA, Santa Claratime type: Full timeposted on: Posted Yesterdayjob requisition id: JR2014441We optimize and benchmark GenAI inference on NVIDIA's latest accelerators, defining the industry’s...
NVIDIA Corporation
Santa Clara, CA
1 day ago
AI Inference Performance Engineer — Scale LLMs & GPU Clusters
$124k - $195.5k
NVIDIA Corporation is seeking an AI Inference Performance Engineer - New College Grad 2026 in Santa Clara. This role involves optimizing AI inference benchmarks using NVIDIA’s accelerators and working with various teams on performance enhancements. Applicants should have...
NVIDIA Corporation
Santa Clara, CA
1 day ago
Senior AI Compiler Engineer - Applied Research
$152k - $241.5k
...NVIDIA's GPUs are at the core of modern AI infrastructure, from training large-scale models to running inference in production. That position depends on software as much as hardware, and compiler engineering is a big part of what makes it work. We are looking for an outstanding...
Senior
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior High Performance AI Engineer
$184k - $287.5k
...tapping into the unlimited potential of AI to define the next era of computing.... .... We are looking for outstanding Senior High Performance AI Engineer to build groundbreaking multi-agent... ...frameworks, distributed training, and inference/serving—and with model/agent teams....
Senior
2100 NVIDIA USA
Santa Clara, CA
4 days ago
Senior Lead AI Engineer: Inference & Platform Optimization
...Capital One is hiring a Sr. Lead AI Engineer in San Jose, CA to innovate AI systems that enhance customer interactions. This role requires leading cross-functional teams to develop and support AI-powered solutions that are robust and scalable. The ideal candidate has a...
Senior
Information Technology Senior Management Forum
San Jose, CA
21 hours ago
Senior Software Engineer, Deep Learning Inference - TensorRT
$152k - $241.5k
...Senior Software Engineer – Deep Learning Inference What you’ll be doing: Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance Develop components of TensorRT, NVIDIA’s SDK for high-performance deep learning...
Senior
NVIDIA Gruppe
Santa Clara, CA
3 days ago
Senior Software & AI Engineer
$110k - $190k
...Role Overview We are hiring a Senior Software & AI Engineer to build production-grade AI systems, with a strong emphasis on deep learning... ...Experience with MLOps and AI infrastructure (training pipelines, inference optimization, monitoring) Experience working with...
Senior
Covalent
Sunnyvale, CA
3 days ago
Senior Software Development Engineer - SGLang and Inference Stack
...next-generation computing experiences-from AI and data centers, to PCs, gaming and... ...RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node... ...THE PERSON: Skilled engineer with strong technical and analytical expertise...
Senior
Advanced Micro Devices , Inc.
Santa Clara, CA
21 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior AI Inference Engineer for AIConfigurator (Dynamo). Be the first to apply!