Senior Dynamo Architect: Scalable GPU AI Inference
$272k - $431.25kNVIDIA Gruppe
NVIDIA Gruppe is seeking experienced engineers for its Dynamo platform, focusing on scalable AI systems. You will develop the Kubernetes deployment, optimize GPU resource management, and work on intelligent routing and KV-cache management. Applicants should have 15+ years in systems programming, expertise in Rust and C++, and a strong understanding of distributed systems. The position offers a base salary from 272,000 to 431,250 USD and eligibility for equity and benefits. #J-18808-Ljbffr
$320k
...NVIDIA Gruppe is seeking a Distinguished Engineer to join the Dynamo engineering team in Santa Clara, California. The successful candidate... ...and drive product direction while working on state-of-the-art AI inferencing technologies. With a competitive salary range of $320...Senior- ...NVIDIA Corporation is seeking a Senior HPC Architect to enhance GPU compute clusters. This role involves designing solutions for operationalizing NVIDIA products and collaborating closely with engineering teams. Ideal candidates should have over 8 years of experience in...Senior
- A leading technology company is seeking a Senior System Software Engineer to develop GPU-accelerated AI inference serving software. The ideal candidate will have over 5 years of experience with deep learning software, strong skills in Rust and C++, and a collaborative approach...Senior
$184k - $287.5k
...a pivotal role in crafting the future of GPU technology. At NVIDIA, you will work with... ...improvements, optimizing along the axes of scalability/modularity, performance, area, yield,... ...for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA...SeniorWork experience placementNight shift- Advanced Micro Devices is looking for a Principal Engineer in Santa Clara, CA to lead AI infrastructure development, define GPU architecture specifications, and drive performance gains in ML systems. The role involves leading innovative techniques, collaborating with stakeholders...Suggested
- ...NVIDIA Gruppe in Santa Clara is seeking a technical leader for the GPU AI/HPC Infrastructure team. You will design and implement cutting-edge GPU compute clusters, focusing on deep learning and high-performance computing. The ideal candidate will have at least 5+ years...Senior
$152k - $241.5k
...We are looking for a Senior System Software Engineer... ...software engineers for its GPU-accelerated deep... ...power a revolution in AI, enabling breakthroughs... ...a highly-performant AI inference platform to make design... ...Inference Server and NVIDIA Dynamo stacks to establish a unified...Senior$184k - $356.5k
NVIDIA Gruppe is seeking an experienced engineer to lead GPU cluster design and support for AI and HPC deployments in Santa Clara, California. The ideal candidate will have over 8 years of experience with large-scale GPU infrastructure and a strong ability to communicate...Senior- ...improve CI reliability for their open-source LLM inference engine. The role requires 3+ years' experience in CI/CD, knowledge of Linux and GPU computing, as well as strong skills in Bash... ...’s passionate about building world-class AI infrastructure, ensuring fast and secure...Senior
- ...NVIDIA Gruppe is seeking a Senior System Software Engineer in Santa Clara, California, to develop world-class GPU-accelerated AI inference serving software. This role involves contributing to feature development and optimizing software for deployment in production environments...Senior
- NVIDIA Gruppe is looking for a Senior GPU & Deep Learning Architect to join its GPU Architecture group in California. In this role, you will lead efforts to design hardware for deep learning and advance parallel computation across projects. The ideal candidate will hold...Senior
$184k - $356.5k
...NVIDIA Gruppe is looking for a Senior Software Engineer specializing in Deep Learning Inference in Santa Clara, California. You will design and optimize GPU-accelerated software critical for advanced AI applications, contributing to libraries like vLLM and SGLang. Ideal...Senior$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...Senior- ...computing experiences—from AI and data centers, to... ...seeking a Robotics AI Architect to define and scale next... ...enable broad ecosystem scalability. KEY RESPONSIBILITIES... ...co‑design across CPU, GPU, and accelerators Lighthouse... ...understanding of: AI inference runtimes and deployment...Senior
- ...A leading technology company is looking for a GPU Software Architecture Engineer to lead server-side ML acceleration and multi-node distribution initiatives. This role involves architecting next-generation distributed ML infrastructure and optimizing for maximum hardware...Senior
- ...Overview We are now looking for a Senior GPU & Deep Learning Architect to join the NVIDIA GPU Architecture group. As a senior architect, you will... ...deep learning architectures targeting both training and inference workloads. Advance the state of parallel computation. Stay...Senior
$248.71k - $292.6k
About Groq Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™,... ...Software Engineer - High Performance GPU Inference Systems Mission Push the limits... ...Systems Engineering : Design and implement scalable, low-latency runtime systems that...Senior$224k - $356.5k
NVIDIA Gruppe is seeking a Senior Deep Learning Software Engineer in Santa Clara to design and build an automated inference and deployment solution. You will focus on defining a scalable DL architecture that integrates with frameworks like PyTorch and JAX. Ideal candidates...Senior- ...unlimited potential of AI to define the next... ...era in which our GPU acts as the brains... ...Communication Architect. We scale the DNN... ...models and training/inference frameworks to... ...the performance and scalability of deep learning systems... ...servers like Dynamo and Triton. Proficiency...SeniorWork experience placement
$272k - $431.25k
...Overview NVIDIA Dynamo is an innovative, open-source... ...focused on efficient, scalable inference for large language and... ...models in distributed GPU environments. By... ...achieves high-performance AI inference for demanding... ...Disaggregated Serving: Architect and optimize the...$152k - $241.5k
...Gruppe is seeking a talented individual to optimize and benchmark GenAI inference using the latest acceleration technologies. The role involves driving industry benchmark results and architecting distributed inference systems. Required qualifications include a relevant...Senior- ...Job Summary T he AI Interconnect Architect designs and engineers high-speed... ...systems for AI inference infrastructure, including servers... ...bandwidth, power efficiency, scalability, and optimized transport protocols... ...architecture, including GPU/accelerator clusters and...Senior
- NVIDIA Corporation in Santa Clara seeks a Principal Software Engineer - AI Inference to advance open-source LLM serving. This hands-on role focuses on optimizing inference engines like vLLM and SGLang for NVIDIA GPUs, requiring deep technical skill and collaboration across...
$272k - $431.25k
...NVIDIA Gruppe is looking for a Principal Software Engineer to advance open-source AI inference. This hands-on role emphasizes running high-performance inference on NVIDIA platforms and involves collaboration across various teams. Key responsibilities include optimizing...$125k - $175k
...Covalent is seeking a Senior Software & AI Test Engineer in Sunnyvale, CA to design a scalable quality framework for software and AI systems. You will lead initiatives in test infrastructure, automated testing, and AI model evaluation while ensuring high-quality releases...Senior$261.8k - $379.1k
...Adobe is seeking a seasoned software engineer with over 14 years of experience to design and maintain AI/ML systems. Candidates should have a strong background in Machine Learning and Deep Learning, especially within production environments, alongside proficiency in Python...Senior$320k
...NVIDIA Gruppe in Santa Clara, California, is seeking a senior architect to define NVLink Fusion architecture and collaborate with major customers... ...of experience in system architecture and design, focusing on scalable and performant server systems. This role involves...Senior- ...A leading technology company is seeking a Principal Systems Solutions Architect to design and develop scalable AI solutions. This role involves engaging with customers, creating reference designs, and collaborating with various teams. The ideal candidate should have extensive...Senior
$184k - $287.5k
...NVIDIA Corporation is hiring a Senior Deep Learning Performance Architect in Santa Clara, CA. This role involves developing cutting-edge architectures to... ...various teams, the ideal candidate will have experience in GPU architecture and deep learning systems. Competitive...Senior- ...NVIDIA Gruppe in Santa Clara is looking for a Senior HPC Architect to support the deployment of large-scale GPU compute clusters. You will provide engineering solutions for GPU computing products, ensuring technical relationships with teams and assisting in creative solutions...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Dynamo Architect: Scalable GPU AI Inference. Be the first to apply!
- senior cost analyst Santa Clara, CA
- senior computer engineer Santa Clara, CA
- senior development engineer Santa Clara, CA
- senior manager quality engineering Santa Clara, CA
- senior software test automation engineer Santa Clara, CA
- senior design technologist Santa Clara, CA
- senior design verification engineer Santa Clara, CA
- senior director quality Santa Clara, CA
- senior director of development Santa Clara, CA
- sr project engineer Santa Clara, CA

