Senior Dynamo Architect: Scalable GPU AI Inference

$272k - $431.25k

NVIDIA Gruppe

NVIDIA Gruppe is seeking experienced engineers for its Dynamo platform, focusing on scalable AI systems. You will develop the Kubernetes deployment, optimize GPU resource management, and work on intelligent routing and KV-cache management. Applicants should have 15+ years in systems programming, expertise in Rust and C++, and a strong understanding of distributed systems. The position offers a base salary from 272,000 to 431,250 USD and eligibility for equity and benefits.

Vacancy posted 4 days ago

Similar jobs that could be interesting for you

Based on the Senior Dynamo Architect: Scalable GPU AI Inference in Santa Clara, CA vacancy

Senior AI Inference Architect (Dynamo) — Equity Eligible
$320k
NVIDIA Gruppe is seeking a Distinguished Engineer to join the Dynamo engineering team in Santa Clara, California. The successful candidate... ...and drive product direction while working on state-of-the-art AI inferencing technologies. With a competitive salary range of $320...
Senior
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior HPC Architect: Scalable GPU Compute & AI Platforms
NVIDIA Corporation is seeking a Senior HPC Architect to enhance GPU compute clusters. This role involves designing solutions for operationalizing NVIDIA products and collaborating closely with engineering teams. Ideal candidates should have over 8 years of experience in...
Senior
NVIDIA Corporation
Santa Clara, CA
4 days ago
Senior Performance Testing Architect
$184k - $287.5k
...a pivotal role in crafting the future of GPU technology. At NVIDIA, you will work with... ...improvements, optimizing along the axes of scalability/modularity, performance, area, yield,... ...for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA...
Senior
Work experience placement
Night shift
NVIDIA
Santa Clara, CA
2 days ago
Principal AI Performance Architect for Scalable GPU Training
Advanced Micro Devices is looking for a Principal Engineer in Santa Clara, CA to lead AI infrastructure development, define GPU architecture specifications, and drive performance gains in ML systems. The role involves leading innovative techniques, collaborating with stakeholders...
Suggested
Advanced Micro Devices
Santa Clara, CA
2 days ago
Senior System Software Engineer - Dynamo-Triton Inference Server
$152k - $241.5k
...We are looking for a Senior System Software Engineer to work on Dynamo-Triton Inference Server. NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. Academic... ...using GPUs to power a revolution in AI, enabling breakthroughs in problems...
Senior
NVIDIA
Santa Clara, CA
4 days ago
Senior AI/HPC GPU Cluster Architect (Equity)
NVIDIA Gruppe in Santa Clara is seeking a technical leader for the GPU AI/HPC Infrastructure team. You will design and implement cutting-edge GPU compute clusters, focusing on deep learning and high-performance computing. The ideal candidate will have at least 5 years...
Senior
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior GPU Cluster Architect for AI/HPC Deployments
$184k - $356.5k
NVIDIA Gruppe is seeking an experienced engineer to lead GPU cluster design and support for AI and HPC deployments in Santa Clara, California. The ideal candidate will have over 8 years of experience with large-scale GPU infrastructure and a strong ability to communicate...
Senior
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior CI Architect — GPU Inference & Open-Source Infra
...improve CI reliability for their open-source LLM inference engine. The role requires 3+ years' experience in CI/CD, knowledge of Linux and GPU computing, as well as strong skills in Bash... ...’s passionate about building world-class AI infrastructure, ensuring fast and secure...
Senior
RadixArk
Palo Alto, CA
1 day ago
Senior GPU & DL Architect — Lead Next‑Gen AI Hardware
NVIDIA Gruppe is looking for a Senior GPU & Deep Learning Architect to join its GPU Architecture group in California. In this role, you will lead efforts to design hardware for deep learning and advance parallel computation across projects. The ideal candidate will hold...
Senior
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior System Software Engineer — GPU AI Inference (Triton)
NVIDIA Gruppe is seeking a Senior System Software Engineer in Santa Clara, California, to develop world-class GPU-accelerated AI inference serving software. This role involves contributing to feature development and optimizing software for deployment in production environments...
Senior
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior AI Inference Systems Engineer: GPU-Optimized, Cloud
$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...
Senior
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior AI Inference Engineer — GPU DL, Equity Eligible
$184k - $356.5k
NVIDIA Corporation is seeking a Senior Deep Learning Software Engineer specializing in Inference to join their growing team in Santa Clara, CA. The role involves optimizing GPU-accelerated software for advanced AI applications, including developing high-performance deep...
Senior
NVIDIA Corporation
Santa Clara, CA
1 day ago
Principal/Senior Robotics AI Architect
...computing experiences-from AI and data centers, to... ...seeking a Robotics AI Architect to define and scale next... ...enable broad ecosystem scalability. KEY... ...co-design across CPU, GPU, and accelerators... ...understanding of: AI inference runtimes and deployment...
Senior
Advanced Micro Devices, Inc.
San Jose, CA
4 days ago
Senior Staff Software Engineer - High Performance GPU Inference Systems
$248.71k - $292.6k
About Groq Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™,... ...Software Engineer - High Performance GPU Inference Systems Mission Push the limits... ...Systems Engineering : Design and implement scalable, low-latency runtime systems that...
Senior
Groq
Palo Alto, CA
4 days ago
Senior DL Inference & Kernel Architect
$224k - $356.5k
NVIDIA Gruppe is seeking a Senior Deep Learning Software Engineer in Santa Clara to design and build an automated inference and deployment solution. You will focus on defining a scalable DL architecture that integrates with frameworks like PyTorch and JAX. Ideal candidates...
Senior
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior FPGA Architect for AI Inference & Secure Boot
d-Matrix inc. in Santa Clara, CA is seeking a skilled individual for FPGA design and verification for AI solutions. The role involves collaborating with teams to meet project specifications and implementing robust hardware and software modules. The ideal candidate has...
Senior
d-Matrix inc.
Santa Clara, CA
4 days ago
Senior GPU Architect, Deep Learning
Overview We are now looking for a Senior GPU & Deep Learning Architect to join the NVIDIA GPU Architecture group. As a senior architect, you will lead... ...learning architectures targeting both training and inference workloads. Advance the state of parallel computation. Stay...
Senior
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior Deep Learning Communication Architect
...unlimited potential of AI to define the next... ...era in which our GPU acts as the brains... ...Communication Architect. We scale the DNN... ...models and training/inference frameworks to... ...the performance and scalability of deep learning systems... ...servers like Dynamo and Triton. Proficiency...
Senior
Work experience placement
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior Principal AI Interconnect Architect
...Job Description An AI Interconnect Architect defines and engineers high-... ...communication systems for AI Inference infrastructure which include... ...bandwidth, power efficiency, scalability, and optimized transport... ...Architecture: Familiarity with GPU/accelerator clusters and...
Senior
Temporary work
Remote work
Flexible hours
Shift work
Sandisk
Milpitas, CA
7 days ago
Senior AI Inference Performance Engineer (GPU/Cluster)
$152k - $241.5k
...Gruppe is seeking a talented individual to optimize and benchmark GenAI inference using the latest acceleration technologies. The role involves driving industry benchmark results and architecting distributed inference systems. Required qualifications include a relevant...
Senior
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Sr. Principal AI Interconnect Architect
...Job Summary The AI Interconnect Architect designs and engineers high-speed... ...systems for AI inference infrastructure, including servers... ...bandwidth, power efficiency, scalability, and optimized transport protocols... ...architecture, including GPU/accelerator clusters and...
Senior
Compunnel
Milpitas, CA
3 days ago
Principal Software Engineer - Dynamo
$272k - $431.25k
Overview NVIDIA Dynamo is an innovative, open-source... ...focused on efficient, scalable inference for large language and... ...models in distributed GPU environments. By... ...achieves high-performance AI inference for demanding... ...Disaggregated Serving: Architect and optimize the...
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Principal AI Inference Architect - LLM Serving
NVIDIA Corporation in Santa Clara seeks a Principal Software Engineer - AI Inference to advance open-source LLM serving. This hands-on role focuses on optimizing inference engines like vLLM and SGLang for NVIDIA GPUs, requiring deep technical skill and collaboration across...
NVIDIA Corporation
Santa Clara, CA
3 days ago
Senior Software Engineer - AI Inference
$152k - $241.5k
...platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer – AI Inference to advance open‑source LLM... ...optimization work. Improve multi‑GPU inference performance and... ..., PyTorch, Triton, NCCL, Dynamo or adjacent serving/runtime...
Senior
Remote work
NVIDIA
Santa Clara, CA
4 days ago
Senior Architect, GPU Profiling System
$184k - $287.5k
...NVIDIA’s GPU Architecture Group is looking for architects to contribute to the design of our proprietary profiler subsystem, the apparatus embedded in every... ...impact at a fast-paced company that is spearheading the AI revolution. Join our technically diverse team of GPU...
Senior
NVIDIA
Santa Clara, CA
3 days ago
Senior Staff Software Development Engineer - GPU/AI/ML
...experiences-from AI and data centers,... ...: As a Senior Staff Software Developer... ...: Architect and Drive the AI Software... ...the lowest-level GPU kernels to large-... ...complex, scalable systems using modern... ...MoE) architectures, inference optimizations (e.g...
Senior
Advanced Micro Devices, Inc.
Santa Clara, CA
23 hours ago
Senior architecture architect
$184k - $287.5k
...We are now looking for a Senior GPU Architect! The NVIDIA GPU Architecture group is looking for world class architects and software developers... .... This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to...
Senior
NVIDIA
Santa Clara, CA
4 days ago
Senior architecture architect
$208k - $327.75k
...Product Architect NVIDIA is the engine of modern Artificial Intelligence... ...Infrastructure, and Agentic AI - the biggest technology... ..., including performance, scalability, interoperability, and datacenter... ...& RAG-based workflows, inference at scale, large scale training...
Senior
NVIDIA
Santa Clara, CA
1 day ago
Senior Software Development Engineer - LLM Inference Framework
...computing experiences-from AI and data centers, to... ...THE ROLE: As a senior member of the LLM inference framework team, you... ...performance, scalability, and reliability, enabling... ...systems, and GPU runtime and kernel backends... ...& Runtime ~ Architect and optimize...
Senior
Advanced Micro Devices, Inc.
Santa Clara, CA
2 days ago
Senior Deep Learning Performance Architect
$184k - $287.5k
...now looking for a Senior Deep Learning Performance Architect! NVIDIA is seeking... ...architectures that accelerate AI and high-... ...science fiction. GPU Deep Learning has provided... ..., efficiency, and scalability of production AI... ..., especially LLM inference/training in real-world...
Senior
NVIDIA
Santa Clara, CA
2 days ago
