Senior Dynamo Architect: Scalable GPU AI Inference

$272k - $431.25k

NVIDIA Gruppe

NVIDIA Gruppe is seeking experienced engineers for its Dynamo platform, focusing on scalable AI systems. You will develop the Kubernetes deployment, optimize GPU resource management, and work on intelligent routing and KV-cache management. Applicants should have 15+ years in systems programming, expertise in Rust and C++, and a strong understanding of distributed systems. The position offers a base salary from 272,000 to 431,250 USD and eligibility for equity and benefits.

Vacancy posted 4 days ago
Similar jobs that could be interesting for you
Based on the Senior Dynamo Architect: Scalable GPU AI Inference in Santa Clara, CA vacancy
  • $320k

    NVIDIA Gruppe is seeking a Distinguished Engineer to join the Dynamo engineering team in Santa Clara, California. The successful candidate...  ...and drive product direction while working on state-of-the-art AI inferencing technologies. With a competitive salary range of $320... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • NVIDIA Corporation is seeking a Senior HPC Architect to enhance GPU compute clusters. This role involves designing solutions for operationalizing NVIDIA products and collaborating closely with engineering teams. Ideal candidates should have over 8 years of experience in... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

     ...a pivotal role in crafting the future of GPU technology. At NVIDIA, you will work with...  ...improvements, optimizing along the axes of scalability/modularity, performance, area, yield,...  ...for an existing vacancy.  NVIDIA uses AI tools in its recruiting processes. NVIDIA... 
    Senior
    Work experience placement
    Night shift

    NVIDIA

    Santa Clara, CA
    2 days ago
  • Advanced Micro Devices is looking for a Principal Engineer in Santa Clara, CA to lead AI infrastructure development, define GPU architecture specifications, and drive performance gains in ML systems. The role involves leading innovative techniques, collaborating with stakeholders... 
    Suggested

    Advanced Micro Devices

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

     ...We are looking for a Senior System Software Engineer to work on Dynamo-Triton Inference Server. NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. Academic...  ...using GPUs to power a revolution in AI, enabling breakthroughs in problems... 
    Senior

    NVIDIA

    Santa Clara, CA
    4 days ago
  • NVIDIA Gruppe in Santa Clara is seeking a technical leader for the GPU AI/HPC Infrastructure team. You will design and implement cutting-edge GPU compute clusters, focusing on deep learning and high-performance computing. The ideal candidate will have at least 5+ years... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $184k - $356.5k

    NVIDIA Gruppe is seeking an experienced engineer to lead GPU cluster design and support for AI and HPC deployments in Santa Clara, California. The ideal candidate will have over 8 years of experience with large-scale GPU infrastructure and a strong ability to communicate... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...improve CI reliability for their open-source LLM inference engine. The role requires 3+ years' experience in CI/CD, knowledge of Linux and GPU computing, as well as strong skills in Bash...  ...’s passionate about building world-class AI infrastructure, ensuring fast and secure... 
    Senior

    RadixArk

    Palo Alto, CA
    1 day ago
  • NVIDIA Gruppe is looking for a Senior GPU & Deep Learning Architect to join its GPU Architecture group in California. In this role, you will lead efforts to design hardware for deep learning and advance parallel computation across projects. The ideal candidate will hold... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • NVIDIA Gruppe is seeking a Senior System Software Engineer in Santa Clara, California, to develop world-class GPU-accelerated AI inference serving software. This role involves contributing to feature development and optimizing software for deployment in production environments... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $184k - $356.5k

    NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $184k - $356.5k

    NVIDIA Corporation is seeking a Senior Deep Learning Software Engineer specializing in Inference to join their growing team in Santa Clara, CA. The role involves optimizing GPU-accelerated software for advanced AI applications, including developing high-performance deep... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  •  ...computing experiences-from AI and data centers, to...  ...seeking a Robotics AI Architect to define and scale next...  ...enable broad ecosystem scalability. KEY...  ...co-design across CPU, GPU, and accelerators...  ...understanding of: AI inference runtimes and deployment... 
    Senior

    Advanced Micro Devices, Inc.

    San Jose, CA
    4 days ago
  • $248.71k - $292.6k

    About Groq Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™,...  ...Software Engineer - High Performance GPU Inference Systems Mission Push the limits...  ...Systems Engineering : Design and implement scalable, low-latency runtime systems that... 
    Senior

    Groq

    Palo Alto, CA
    4 days ago
  • $224k - $356.5k

    NVIDIA Gruppe is seeking a Senior Deep Learning Software Engineer in Santa Clara to design and build an automated inference and deployment solution. You will focus on defining a scalable DL architecture that integrates with frameworks like PyTorch and JAX. Ideal candidates... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • d-Matrix inc. in Santa Clara, CA is seeking a skilled individual for FPGA design and verification for AI solutions. The role involves collaborating with teams to meet project specifications and implementing robust hardware and software modules. The ideal candidate has... 
    Senior

    d-Matrix inc.

    Santa Clara, CA
    4 days ago
  • Overview We are now looking for a Senior GPU & Deep Learning Architect to join the NVIDIA GPU Architecture group. As a senior architect, you will lead...  ...learning architectures targeting both training and inference workloads. Advance the state of parallel computation. Stay... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...unlimited potential of AI to define the next...  ...era in which our GPU acts as the brains...  ...Communication Architect. We scale the DNN...  ...models and training/inference frameworks to...  ...the performance and scalability of deep learning systems...  ...servers like Dynamo and Triton. Proficiency... 
    Senior
    Work experience placement

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...Job Description An AI Interconnect Architect defines and engineers high-...  ...communication systems for AI Inference infrastructure which include...  ...bandwidth, power efficiency, scalability, and optimized transport...  ...Architecture: Familiarity with GPU/accelerator clusters and... 
    Senior
    Temporary work
    Remote work
    Flexible hours
    Shift work

    Sandisk

    Milpitas, CA
    7 days ago
  • $152k - $241.5k

     ...Gruppe is seeking a talented individual to optimize and benchmark GenAI inference using the latest acceleration technologies. The role involves driving industry benchmark results and architecting distributed inference systems. Required qualifications include a relevant... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...Job Summary The AI Interconnect Architect designs and engineers high-speed...  ...systems for AI inference infrastructure, including servers...  ...bandwidth, power efficiency, scalability, and optimized transport protocols...  ...architecture, including GPU/accelerator clusters and... 
    Senior

    Compunnel

    Milpitas, CA
    3 days ago
  • $272k - $431.25k

    Overview NVIDIA Dynamo is an innovative, open-source...  ...focused on efficient, scalable inference for large language and...  ...models in distributed GPU environments. By...  ...achieves high-performance AI inference for demanding...  ...Disaggregated Serving: Architect and optimize the... 

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • NVIDIA Corporation in Santa Clara seeks a Principal Software Engineer - AI Inference to advance open-source LLM serving. This hands-on role focuses on optimizing inference engines like vLLM and SGLang for NVIDIA GPUs, requiring deep technical skill and collaboration across... 

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer – AI Inference to advance open‑source LLM...  ...optimization work. Improve multi‑GPU inference performance and...  ..., PyTorch, Triton, NCCL, Dynamo or adjacent serving/runtime... 
    Senior
    Remote work

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

     ...NVIDIA’s GPU Architecture Group is looking for architects to contribute to the design of our proprietary profiler subsystem, the apparatus embedded in every...  ...impact at a fast-paced company that is spearheading the AI revolution. Join our technically diverse team of GPU... 
    Senior

    NVIDIA

    Santa Clara, CA
    3 days ago
  •  ...experiences-from AI and data centers,...  ...: As a Senior Staff Software Developer...  ...: Architect and Drive the AI Software...  ...the lowest-level GPU kernels to large-...  ...complex, scalable systems using modern...  ...MoE) architectures, inference optimizations (e.g... 
    Senior

    Advanced Micro Devices, Inc.

    Santa Clara, CA
    23 hours ago
  • $184k - $287.5k

     ...We are now looking for a Senior GPU Architect! The NVIDIA GPU Architecture group is looking for world class architects and software developers...  .... This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to... 
    Senior

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $208k - $327.75k

     ...Product Architect NVIDIA is the engine of modern Artificial Intelligence...  ...Infrastructure, and Agentic AI - the biggest technology...  ..., including performance, scalability, interoperability, and datacenter...  ...& RAG-based workflows, inference at scale, large scale training... 
    Senior

    NVIDIA

    Santa Clara, CA
    1 day ago
  •  ...computing experiences-from AI and data centers, to...  ...THE ROLE: As a senior member of the LLM inference framework team, you...  ...performance, scalability, and reliability, enabling...  ...systems, and GPU runtime and kernel backends...  ...& Runtime ~ Architect and optimize... 
    Senior

    Advanced Micro Devices, Inc.

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...now looking for a Senior Deep Learning Performance Architect! NVIDIA is seeking...  ...architectures that accelerate AI and high-...  ...science fiction. GPU Deep Learning has provided...  ..., efficiency, and scalability of production AI...  ..., especially LLM inference/training in real-world... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
