Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Dynamo Architect: Scalable GPU AI Inference

$272k - $431.25k

NVIDIA Gruppe

NVIDIA Gruppe is seeking experienced engineers for its Dynamo platform, focusing on scalable AI systems. You will develop the Kubernetes deployment, optimize GPU resource management, and work on intelligent routing and KV-cache management. Applicants should have 15+ years in systems programming, expertise in Rust and C++, and a strong understanding of distributed systems. The position offers a base salary from 272,000 to 431,250 USD and eligibility for equity and benefits. #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Senior Dynamo Architect: Scalable GPU AI Inference in Santa Clara, CA vacancy
  • $320k

     ...NVIDIA Gruppe is seeking a Distinguished Engineer to join the Dynamo engineering team in Santa Clara, California. The successful candidate...  ...and drive product direction while working on state-of-the-art AI inferencing technologies. With a competitive salary range of $320... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...NVIDIA Corporation is seeking a Senior HPC Architect to enhance GPU compute clusters. This role involves designing solutions for operationalizing NVIDIA products and collaborating closely with engineering teams. Ideal candidates should have over 8 years of experience in... 
    Senior

    NVIDIA

    Santa Clara, CA
    5 hours ago
  • A leading technology company is seeking a Senior System Software Engineer to develop GPU-accelerated AI inference serving software. The ideal candidate will have over 5 years of experience with deep learning software, strong skills in Rust and C++, and a collaborative approach... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • $184k - $287.5k

     ...a pivotal role in crafting the future of GPU technology. At NVIDIA, you will work with...  ...improvements, optimizing along the axes of scalability/modularity, performance, area, yield,...  ...for an existing vacancy.  NVIDIA uses AI tools in its recruiting processes. NVIDIA... 
    Senior
    Work experience placement
    Night shift

    NVIDIA

    Santa Clara, CA
    2 days ago
  • Advanced Micro Devices is looking for a Principal Engineer in Santa Clara, CA to lead AI infrastructure development, define GPU architecture specifications, and drive performance gains in ML systems. The role involves leading innovative techniques, collaborating with stakeholders... 
    Suggested

    Advanced Micro Devices

    Santa Clara, CA
    1 day ago
  •  ...NVIDIA Gruppe in Santa Clara is seeking a technical leader for the GPU AI/HPC Infrastructure team. You will design and implement cutting-edge GPU compute clusters, focusing on deep learning and high-performance computing. The ideal candidate will have at least 5+ years... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $152k - $241.5k

     ...We are looking for a Senior System Software Engineer...  ...software engineers for its GPU-accelerated deep...  ...power a revolution in AI, enabling breakthroughs...  ...a highly-performant AI inference platform to make design...  ...Inference Server and NVIDIA Dynamo stacks to establish a unified... 
    Senior

    NVIDIA

    Santa Clara, CA
    5 hours ago
  • $184k - $356.5k

    NVIDIA Gruppe is seeking an experienced engineer to lead GPU cluster design and support for AI and HPC deployments in Santa Clara, California. The ideal candidate will have over 8 years of experience with large-scale GPU infrastructure and a strong ability to communicate... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...improve CI reliability for their open-source LLM inference engine. The role requires 3+ years' experience in CI/CD, knowledge of Linux and GPU computing, as well as strong skills in Bash...  ...’s passionate about building world-class AI infrastructure, ensuring fast and secure... 
    Senior

    RadixArk

    Palo Alto, CA
    1 day ago
  •  ...NVIDIA Gruppe is seeking a Senior System Software Engineer in Santa Clara, California, to develop world-class GPU-accelerated AI inference serving software. This role involves contributing to feature development and optimizing software for deployment in production environments... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 hours ago
  • NVIDIA Gruppe is looking for a Senior GPU & Deep Learning Architect to join its GPU Architecture group in California. In this role, you will lead efforts to design hardware for deep learning and advance parallel computation across projects. The ideal candidate will hold... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $184k - $356.5k

     ...NVIDIA Gruppe is looking for a Senior Software Engineer specializing in Deep Learning Inference in Santa Clara, California. You will design and optimize GPU-accelerated software critical for advanced AI applications, contributing to libraries like vLLM and SGLang. Ideal... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    5 hours ago
  • $184k - $356.5k

    NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...computing experiences—from AI and data centers, to...  ...seeking a Robotics AI Architect to define and scale next...  ...enable broad ecosystem scalability. KEY RESPONSIBILITIES...  ...co‑design across CPU, GPU, and accelerators Lighthouse...  ...understanding of: AI inference runtimes and deployment... 
    Senior

    Advanced Micro Devices , Inc.

    San Jose, CA
    5 hours ago
  •  ...A leading technology company is looking for a GPU Software Architecture Engineer to lead server-side ML acceleration and multi-node distribution initiatives. This role involves architecting next-generation distributed ML infrastructure and optimizing for maximum hardware... 
    Senior

    Apple

    Cupertino, CA
    5 hours ago
  •  ...Overview We are now looking for a Senior GPU & Deep Learning Architect to join the NVIDIA GPU Architecture group. As a senior architect, you will...  ...deep learning architectures targeting both training and inference workloads. Advance the state of parallel computation. Stay... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    5 hours ago
  • $248.71k - $292.6k

    About Groq Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™,...  ...Software Engineer - High Performance GPU Inference Systems Mission Push the limits...  ...Systems Engineering : Design and implement scalable, low-latency runtime systems that... 
    Senior

    I did my part and supported the Regular Toilet

    Palo Alto, CA
    4 days ago
  • $224k - $356.5k

    NVIDIA Gruppe is seeking a Senior Deep Learning Software Engineer in Santa Clara to design and build an automated inference and deployment solution. You will focus on defining a scalable DL architecture that integrates with frameworks like PyTorch and JAX. Ideal candidates... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...unlimited potential of AI to define the next...  ...era in which our GPU acts as the brains...  ...Communication Architect. We scale the DNN...  ...models and training/inference frameworks to...  ...the performance and scalability of deep learning systems...  ...servers like Dynamo and Triton. Proficiency... 
    Senior
    Work experience placement

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $272k - $431.25k

     ...Overview NVIDIA Dynamo is an innovative, open-source...  ...focused on efficient, scalable inference for large language and...  ...models in distributed GPU environments. By...  ...achieves high-performance AI inference for demanding...  ...Disaggregated Serving: Architect and optimize the... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 hours ago
  • $152k - $241.5k

     ...Gruppe is seeking a talented individual to optimize and benchmark GenAI inference using the latest acceleration technologies. The role involves driving industry benchmark results and architecting distributed inference systems. Required qualifications include a relevant... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...Job Summary T he AI Interconnect Architect designs and engineers high-speed...  ...systems for AI inference infrastructure, including servers...  ...bandwidth, power efficiency, scalability, and optimized transport protocols...  ...architecture, including GPU/accelerator clusters and... 
    Senior

    Compunnel

    Milpitas, CA
    3 days ago
  • NVIDIA Corporation in Santa Clara seeks a Principal Software Engineer - AI Inference to advance open-source LLM serving. This hands-on role focuses on optimizing inference engines like vLLM and SGLang for NVIDIA GPUs, requiring deep technical skill and collaboration across... 

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $272k - $431.25k

     ...NVIDIA Gruppe is looking for a Principal Software Engineer to advance open-source AI inference. This hands-on role emphasizes running high-performance inference on NVIDIA platforms and involves collaboration across various teams. Key responsibilities include optimizing... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 hours ago
  • $125k - $175k

     ...Covalent is seeking a Senior Software & AI Test Engineer in Sunnyvale, CA to design a scalable quality framework for software and AI systems. You will lead initiatives in test infrastructure, automated testing, and AI model evaluation while ensuring high-quality releases... 
    Senior

    Covalent

    Sunnyvale, CA
    5 hours ago
  • $261.8k - $379.1k

     ...Adobe is seeking a seasoned software engineer with over 14 years of experience to design and maintain AI/ML systems. Candidates should have a strong background in Machine Learning and Deep Learning, especially within production environments, alongside proficiency in Python... 
    Senior

    Adobe

    San Jose, CA
    1 day ago
  • $320k

     ...NVIDIA Gruppe in Santa Clara, California, is seeking a senior architect to define NVLink Fusion architecture and collaborate with major customers...  ...of experience in system architecture and design, focusing on scalable and performant server systems. This role involves... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...A leading technology company is seeking a Principal Systems Solutions Architect to design and develop scalable AI solutions. This role involves engaging with customers, creating reference designs, and collaborating with various teams. The ideal candidate should have extensive... 
    Senior

    Qualcomm

    Santa Clara, CA
    1 day ago
  • $184k - $287.5k

     ...NVIDIA Corporation is hiring a Senior Deep Learning Performance Architect in Santa Clara, CA. This role involves developing cutting-edge architectures to...  ...various teams, the ideal candidate will have experience in GPU architecture and deep learning systems. Competitive... 
    Senior

    NVIDIA

    Santa Clara, CA
    5 hours ago
  •  ...NVIDIA Gruppe in Santa Clara is looking for a Senior HPC Architect to support the deployment of large-scale GPU compute clusters. You will provide engineering solutions for GPU computing products, ensuring technical relationships with teams and assisting in creative solutions... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    5 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Dynamo Architect: Scalable GPU AI Inference. Be the first to apply!