Senior AI Inference Engineer for AIConfigurator (Dynamo)
NVIDIA
NVIDIA Corporation is looking for a Senior Inference Engineer to advance AIConfigurator, enhancing model serving and performance for large-scale LLM inference. This role entails developing production-quality APIs and integrating complex deployment configurations on NVIDIA GPU platforms. The ideal candidate will have over 10 years of software engineering experience, solid Python and Rust skills, and a robust understanding of GPU computing. Additionally, NVIDIA offers competitive salaries, equity, and benefits with a commitment to an inclusive work environment. #J-18808-Ljbffr NVIDIA Corporation
- A leading technology company is seeking a Senior System Software Engineer to develop GPU-accelerated AI inference serving software. The ideal candidate will have over 5 years of experience with deep learning software, strong skills in Rust and C++, and a collaborative approach...Senior
- NVIDIA is recruiting a Senior Inference Engineer to advance AIConfigurator ( a system that automatically discovers high-... ...architectures. The team partners closely with Dynamo, TensorRT-LLM, vLLM, SGLang,... ...developer‑facing tools. Agentic AI solutions to solve sophisticated...Senior
$184k - $287.5k
...NVIDIA Gruppe in Santa Clara is seeking an AI Systems Engineer to innovate and develop cutting-edge technologies in the AI inference software stack. Candidates should hold a Master's degree and possess over 6 years of experience in ML/DL systems development. The role involves...Senior$152k - $241.5k
...NVIDIA Gruppe is seeking a Senior Software Engineer – AI Inference in Santa Clara, California. This role involves enhancing open-source LLM serving optimizations and implementing high-performance runtime capabilities. Candidates should have 5+ years of experience in building...Senior$152k - $241.5k
NVIDIA Gruppe is seeking a talented individual to optimize and benchmark GenAI inference using the latest acceleration technologies. The role involves driving industry benchmark results and architecting distributed inference systems. Required qualifications include a relevant...Senior- A leading technology company is seeking a Senior AI Software Engineer to join their team in Santa Clara, California. In this role, you will innovate and develop groundbreaking AI systems software for inference applications including deep learning framework optimizations...Senior
$184k - $287.5k
NVIDIA Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries, code generators, and kernel technologies for NVIDIA's architecture, emphasizing high-impact...Senior- ...We are looking for a Senior System Software Engineer to work on Dynamo-Triton Inference Server. NVIDIA is hiring software engineers for its GPU-accelerated deep learning... ...the world are using GPUs to power a revolution in AI, enabling breakthroughs in problems from image...Senior
$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...Senior$152k - $241.5k
...NVIDIA Gruppe is seeking an AI & Deep Learning Compiler Engineer for its Deep Learning & AI Compiler team in Santa Clara, California. This role involves analyzing and optimizing deep learning networks, as well as developing compiler algorithms to enhance performance on...Senior$156k - $387.6k
...Ellis Technologies, Inc. is seeking an AI Infra Engineer to develop and optimize next-generation inference systems for large-scale traffic. The ideal candidate will have a strong background in high-performance computing and should be able to work with large-model architectures...Senior$189k - $301k
...Conductor in San Jose, CA is seeking a seasoned engineer to lead co-design efforts for optimizing AI model inference performance. The role requires a deep understanding of AI infrastructure, covering everything from model definition to serving. The ideal candidate should...Senior$184k - $287.5k
A leading technology company is seeking a Senior Software Engineer for AI and DL Kernel Libraries in Santa Clara, CA. The role involves designing and optimizing kernels for high-impact AI workloads and collaborating with engineers on innovative solutions. Candidates should...SeniorRemote job$320k
...NVIDIA Gruppe is seeking a Distinguished Engineer to join the Dynamo engineering team in Santa Clara, California. The successful candidate will provide... ...drive product direction while working on state-of-the-art AI inferencing technologies. With a competitive salary range...Senior- A leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal... ...dynamic work environment with a focus on innovative solutions and advancing AI technologies. #J-18808-Ljbffr Advanced Micro DevicesSenior
$272k - $431.25k
...NVIDIA Gruppe is looking for a Principal Software Engineer to advance open-source AI inference. This hands-on role emphasizes running high-performance inference on NVIDIA platforms and involves collaboration across various teams. Key responsibilities include optimizing...$152k - $241.5k
NVIDIA is the platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer - AI Inference to advance open‑source LLM serving by... ...contributions to vLLM, SGLang, PyTorch, Triton, NCCL, Dynamo or adjacent serving/runtime projects....Senior$212.8k
...the Team We are dedicated to building the inference infrastructure for ultra-large-scale... ...language models, and frontier multimodal AI systems. Our mission is to provide a robust... ...or above in Computer Science, Software Engineering, Artificial Intelligence, Mathematics,...SeniorTemporary workLocal area- ...A technology firm located in California is seeking candidates with experience in AI and ML algorithm development, particularly in LLM Inference and Similarity Search. Applicants should have strong communication skills and the ability to work independently. Familiarity...Senior
- ...Role: AI Inference Engineer Location: San Jose, CA Duration: 6 to 12 Months Overview: We are seeking a highly skilled AI Inference... ...-node, multi-GPU clusters Distributed Serving Platform (Dynamo) Contribute to distributed serving architecture...
- ...generation computing experiences-from AI and data centers, to PCs, gaming and... ...THE ROLE: AMD is looking for a Senior Staff AI Infra Engineer who is passionate about improving the... ...Optimize and accelerate LLM training and inference on AMD GPUs, improving kernel,...
- ...A forward-thinking AI infrastructure company is seeking a Staff AI Runtime Engineer to lead the design and optimization of their AI compute platform. In this leadership role, you'll enhance AI training and inference capabilities. Successful candidates will have over 8...Senior
$124k - $195.5k
## AI Inference Performance Engineer - New College Grad 2026Applylocations: US, CA, Santa Claratime type: Full timeposted on: Posted Yesterdayjob requisition id: JR2014441We optimize and benchmark GenAI inference on NVIDIA's latest accelerators, defining the industry’s...$124k - $195.5k
NVIDIA Corporation is seeking an AI Inference Performance Engineer - New College Grad 2026 in Santa Clara. This role involves optimizing AI inference benchmarks using NVIDIA’s accelerators and working with various teams on performance enhancements. Applicants should have...$152k - $241.5k
...NVIDIA's GPUs are at the core of modern AI infrastructure, from training large-scale models to running inference in production. That position depends on software as much as hardware, and compiler engineering is a big part of what makes it work. We are looking for an outstanding...Senior$184k - $287.5k
...tapping into the unlimited potential of AI to define the next era of computing.... .... We are looking for outstanding Senior High Performance AI Engineer to build groundbreaking multi-agent... ...frameworks, distributed training, and inference/serving—and with model/agent teams....Senior- ...Capital One is hiring a Sr. Lead AI Engineer in San Jose, CA to innovate AI systems that enhance customer interactions. This role requires leading cross-functional teams to develop and support AI-powered solutions that are robust and scalable. The ideal candidate has a...Senior
$152k - $241.5k
...Senior Software Engineer – Deep Learning Inference What you’ll be doing: Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance Develop components of TensorRT, NVIDIA’s SDK for high-performance deep learning...Senior$110k - $190k
...Role Overview We are hiring a Senior Software & AI Engineer to build production-grade AI systems, with a strong emphasis on deep learning... ...Experience with MLOps and AI infrastructure (training pipelines, inference optimization, monitoring) Experience working with...Senior- ...next-generation computing experiences-from AI and data centers, to PCs, gaming and... ...RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node... ...THE PERSON: Skilled engineer with strong technical and analytical expertise...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior AI Inference Engineer for AIConfigurator (Dynamo). Be the first to apply!
- ai engineer remote Santa Clara, CA
- ai prompt engineer Santa Clara, CA
- senior ai engineer Santa Clara, CA
- machine learning ai engineer Santa Clara, CA
- ai engineer Santa Clara, CA
- ai developer Santa Clara, CA
- ai ml engineer Santa Clara, CA
- senior automation controls engineer Santa Clara, CA
- senior accounts payable Santa Clara, CA
- senior brand designer Santa Clara, CA

