Senior Dynamo Architect: Scalable GPU AI Inference
$272k - $431.25kNVIDIA Gruppe
NVIDIA Gruppe is seeking experienced engineers for its Dynamo platform, focusing on scalable AI systems. You will develop the Kubernetes deployment, optimize GPU resource management, and work on intelligent routing and KV-cache management. Applicants should have 15+ years in systems programming, expertise in Rust and C++, and a strong understanding of distributed systems. The position offers a base salary from 272,000 to 431,250 USD and eligibility for equity and benefits. #J-18808-Ljbffr NVIDIA Gruppe
$320k
NVIDIA Gruppe is seeking a Distinguished Engineer to join the Dynamo engineering team in Santa Clara, California. The successful candidate... ...and drive product direction while working on state-of-the-art AI inferencing technologies. With a competitive salary range of $320...Senior- NVIDIA Corporation is seeking a Senior HPC Architect to enhance GPU compute clusters. This role involves designing solutions for operationalizing NVIDIA products and collaborating closely with engineering teams. Ideal candidates should have over 8 years of experience in...Senior
$184k - $287.5k
...a pivotal role in crafting the future of GPU technology. At NVIDIA, you will work with... ...improvements, optimizing along the axes of scalability/modularity, performance, area, yield,... ...for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA...SeniorWork experience placementNight shift- Advanced Micro Devices is looking for a Principal Engineer in Santa Clara, CA to lead AI infrastructure development, define GPU architecture specifications, and drive performance gains in ML systems. The role involves leading innovative techniques, collaborating with stakeholders...Suggested
$152k - $241.5k
...We are looking for a Senior System Software Engineer to work on Dynamo-Triton Inference Server ( . NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. Academic... ...using GPUs to power a revolution in AI, enabling breakthroughs in problems...Senior- NVIDIA Gruppe in Santa Clara is seeking a technical leader for the GPU AI/HPC Infrastructure team. You will design and implement cutting-edge GPU compute clusters, focusing on deep learning and high-performance computing. The ideal candidate will have at least 5+ years...Senior
$184k - $356.5k
NVIDIA Gruppe is seeking an experienced engineer to lead GPU cluster design and support for AI and HPC deployments in Santa Clara, California. The ideal candidate will have over 8 years of experience with large-scale GPU infrastructure and a strong ability to communicate...Senior- ...improve CI reliability for their open-source LLM inference engine. The role requires 3+ years' experience in CI/CD, knowledge of Linux and GPU computing, as well as strong skills in Bash... ...’s passionate about building world-class AI infrastructure, ensuring fast and secure...Senior
- NVIDIA Gruppe is looking for a Senior GPU & Deep Learning Architect to join its GPU Architecture group in California. In this role, you will lead efforts to design hardware for deep learning and advance parallel computation across projects. The ideal candidate will hold...Senior
- NVIDIA Gruppe is seeking a Senior System Software Engineer in Santa Clara, California, to develop world-class GPU-accelerated AI inference serving software. This role involves contributing to feature development and optimizing software for deployment in production environments...Senior
$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...Senior$184k - $356.5k
NVIDIA Corporation is seeking a Senior Deep Learning Software Engineer specializing in Inference to join their growing team in Santa Clara, CA. The role involves optimizing GPU-accelerated software for advanced AI applications, including developing high-performance deep...Senior- ...computing experiences-from AI and data centers, to... ...seeking a Robotics AI Architect to define and scale next... ...enable broad ecosystem scalability. KEY... ...co-design across CPU, GPU, and accelerators... ...understanding of: AI inference runtimes and deployment...Senior
$248.71k - $292.6k
About Groq Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™,... ...Software Engineer - High Performance GPU Inference Systems Mission Push the limits... ...Systems Engineering : Design and implement scalable, low-latency runtime systems that...Senior$224k - $356.5k
NVIDIA Gruppe is seeking a Senior Deep Learning Software Engineer in Santa Clara to design and build an automated inference and deployment solution. You will focus on defining a scalable DL architecture that integrates with frameworks like PyTorch and JAX. Ideal candidates...Senior- d-Matrix inc. in Santa Clara, CA is seeking a skilled individual for FPGA design and verification for AI solutions. The role involves collaborating with teams to meet project specifications and implementing robust hardware and software modules. The ideal candidate has...Senior
- Overview We are now looking for a Senior GPU & Deep Learning Architect to join the NVIDIA GPU Architecture group. As a senior architect, you will lead... ...learning architectures targeting both training and inference workloads. Advance the state of parallel computation. Stay...Senior
- ...unlimited potential of AI to define the next... ...era in which our GPU acts as the brains... ...Communication Architect. We scale the DNN... ...models and training/inference frameworks to... ...the performance and scalability of deep learning systems... ...servers like Dynamo and Triton. Proficiency...SeniorWork experience placement
- ...Job Description An AI Interconnect Architect defines and engineers high-... ...communication systems for AI Inference infrastructure which include... ...bandwidth, power efficiency, scalability, and optimized transport... ...Architecture: Familiarity with GPU/accelerator clusters and...SeniorTemporary workRemote workFlexible hoursShift work
$152k - $241.5k
...Gruppe is seeking a talented individual to optimize and benchmark GenAI inference using the latest acceleration technologies. The role involves driving industry benchmark results and architecting distributed inference systems. Required qualifications include a relevant...Senior- ...Job Summary T he AI Interconnect Architect designs and engineers high-speed... ...systems for AI inference infrastructure, including servers... ...bandwidth, power efficiency, scalability, and optimized transport protocols... ...architecture, including GPU/accelerator clusters and...Senior
$272k - $431.25k
Overview NVIDIA Dynamo is an innovative, open-source... ...focused on efficient, scalable inference for large language and... ...models in distributed GPU environments. By... ...achieves high-performance AI inference for demanding... ...Disaggregated Serving: Architect and optimize the...- NVIDIA Corporation in Santa Clara seeks a Principal Software Engineer - AI Inference to advance open-source LLM serving. This hands-on role focuses on optimizing inference engines like vLLM and SGLang for NVIDIA GPUs, requiring deep technical skill and collaboration across...
$152k - $241.5k
...platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer – AI Inference to advance open‑source LLM... ...optimization work. Improve multi‑GPU inference performance and... ..., PyTorch, Triton, NCCL, Dynamo or adjacent serving/runtime...SeniorRemote work$184k - $287.5k
...NVIDIA’s GPU Architecture Group is looking for architects to contribute to the design of our proprietary profiler subsystem, the apparatus embedded in every... ...impact at a fast-paced company that is spearheading the AI revolution. Join our technically diverse team of GPU...Senior- ...experiences-from AI and data centers,... ...: As a Senior Staff Software Developer... ...: Architect and Drive the AI Software... ...the lowest-level GPU kernels to large-... ...complex, scalable systems using modern... ...MoE) architectures, inference optimizations (e.g...Senior
$184k - $287.5k
...We are now looking for a Senior GPU Architect! The NVIDIA GPU Architecture group is looking for world class architects and software developers... .... This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to...Senior$208k - $327.75k
...Product Architect NVIDIA is the engine of modern Artificial Intelligence... ...Infrastructure, and Agentic AI - the biggest technology... ..., including performance, scalability, interoperability, and datacenter... ...& RAG-based workflows, inference at scale, large scale training...Senior- ...computing experiences-from AI and data centers, to... ...THE ROLE: As a senior member of the LLM inference framework team, you... ...performance, scalability, and reliability, enabling... ...systems, and GPU runtime and kernel backends... ...& Runtime ~ Architect and optimize...Senior
$184k - $287.5k
...now looking for a Senior Deep Learning Performance Architect! NVIDIA is seeking... ...architectures that accelerate AI and high-... ...science fiction. GPU Deep Learning has provided... ..., efficiency, and scalability of production AI... ..., especially LLM inference/training in real-world...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Dynamo Architect: Scalable GPU AI Inference. Be the first to apply!
- senior game producer Santa Clara, CA
- senior manager process engineering Santa Clara, CA
- senior manufacturing engineer Santa Clara, CA
- senior manager clinical operations Santa Clara, CA
- senior optical engineer Santa Clara, CA
- senior lead project manager Santa Clara, CA
- senior manager quality engineering Santa Clara, CA
- senior device engineer Santa Clara, CA
- senior full stack developer Santa Clara, CA
- senior planner Santa Clara, CA


