Senior GPU Fleet & DGX Cloud Automation Architect

$320k

NVIDIA

A leading tech company is seeking a seasoned individual to spearhead DGX Cloud strategy, focusing on GPU lifecycle and operational health. The ideal candidate will have over 15 years in technical roles, with significant experience in cloud infrastructure and leadership. Responsibilities include defining technical strategies, collaborating with stakeholders, and managing full software and system lifecycles. If you're passionate about technology and innovation, we want to hear from you! The salary range is $320,000 to $488,750, with equity eligibility.

Vacancy posted 3 days ago
Similar jobs that could be interesting for you
Based on the Senior GPU Fleet & DGX Cloud Automation Architect in Santa Clara, CA vacancy
  • $184k - $287.5k

    Senior Software Engineer, DGX Cloud Production Engineering. Locations: US, CA, Santa...  ...and operating large-scale GPU infrastructure for AI...  ...Engineers to help build the automation, tooling, and operational...  ...GitOps, Terraform, ArgoCD, or fleet automation. Experience... 
    Senior
    Fleet

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $176k - $333.5k

     ...Santa Clara is seeking experienced EngOps and Platform Engineers to develop and maintain extensive GPU clusters. The role requires extensive hands-on experience with automation tools and a robust understanding of computer networks. The ideal candidate has a BS or MS in a... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $320k

    Responsibilities Lead the development of the DGX Cloud strategy for GPU fleet lifecycle, health, observability, utilization monitoring, and remediation...  ...+ years in technical roles with focus on operations and automation for cloud infrastructure, platforms, and applications. 5... 
    Fleet

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $179k - $218k

     ...Senior Staff Data Center Operations Engineer, GPU Hardware Architecture Crusoe is on a mission...  ...center construction, and cloud services. If you want...  ...methodologies to analyze fleet-wide telemetry (power...  ...Technical Sparing Architecture: Architect the site-level sparing... 
    Senior
    Fleet
    Temporary work

    Crusoe

    Sunnyvale, CA
    10 days ago
  • $184k - $287.5k

    Senior Software Engineer, DGX Cloud AI Infrastructure. Locations: US, CA, Santa...  ...workloads across NVIDIA GPU platforms at the largest...  ...repeatable benchmark suites, automation, acceptance criteria, and...  ...Proven track record of architecting, debugging, and scaling... 
    Senior
    Remote work

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $224k - $356.5k

     ...computing. An era in which our GPU acts as the brains of...  ...on the world. As part of the DGX Cloud organization, the Attestation...  ...platforms. In this role, you will architect and operate a global, cloud-native...  ..., horizontal scalability, automated rollouts, and robust... 
    Senior
    Remote work

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $225k - $275k

     ...Senior Staff Network Deployment Engineer Crusoe Cloud is seeking a Senior Staff Network Deployment...  ...infrastructure across our global fleet. As we rapidly scale...  ...compute (HPC) and GPU-based AI...  ...set the standards, and architect the automation platform that brings new... 
    Senior
    Fleet
    Temporary work
    Remote work

    Crusoe

    Sunnyvale, CA
    3 days ago
  • Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the infrastructure that...  ...efficiency and availability of AI systems. As a senior DGX Cloud AI Infrastructure software...  ...Computing, and Visualization. The GPU, our invention, serves as the visual cortex... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $153k - $242k

     ...Senior Systems Engineer, OS Automation CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers...  ...pipelines for our massive fleet of GPU-accelerated servers...  ...CI/CD & Auto-Remediation: Architect AI agents that ingest and analyze... 
    Senior
    Fleet
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Local area
    Remote work
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    9 days ago
  • $184k - $287.5k

    The DGX Cloud organization at NVIDIA brings together cutting-edge hardware...  ...looking for an outstanding Senior Systems Software Engineer...  ...experience across the stack - from GPU operator and device plugins...  ...to develop innovative, automated tests that simulate real user... 
    Senior
    Full time
    Worldwide

    NVIDIA

    Santa Clara, CA
    2 days ago
  • NVIDIA Corporation is seeking skilled software engineers to enhance GPU management tools. Responsibilities encompass developing scalable Go programs for Kubernetes and integrating GPU management with state-of-the-art technologies. Ideal candidates will possess extensive... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  •  ...heterogeneous compute (CPU/GPU/DSP/NPU) with robust...  ...architecture, and cloud integration, we enable...  ...the technical owner and architect for Qualcomm’s core localization...  ...across on‑device and fleet deployments. Core...  ...CI/CD practices, test automation, and release readiness... 
    Senior
    Fleet
    Local area
    Work from home

    Qualcomm

    Santa Clara, CA
    3 days ago
  • NVIDIA Gruppe is seeking a Senior Software Engineer for GPU Cloud Infrastructure in Santa Clara, California. The role focuses on designing a highly scalable cloud platform and engaging with Kubernetes and KubeVirt communities to drive cloud solutions. The ideal candidate... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • NVIDIA Gruppe is seeking highly motivated EngOps and Platform Engineers to develop automated tools for managing large GPU clusters. This position requires strong expertise in high-performance computing and deep learning. The ideal applicants have a BS or MS in a relevant... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $193k - $291k

     ...Driver™, to enable everything from commercial fleets to personally owned vehicles. A key...  ...development, root cause analysis, workflow automation to supercharge our velocity. You will...  ...architecture experience Experience with GPU programming or NVIDIA Orin Platform... 
    Senior
    Fleet

    Nuro

    Mountain View, CA
    1 day ago
  • $165k - $242k

     ...Senior Software Engineer, Data Center Infrastructure Tooling CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave...  ...gives network engineers, fleet engineers, and operations,...  ...drive planning, coordination, automation, of some of the most... 
    Senior
    Fleet

    CoreWeave

    Sunnyvale, CA
    4 days ago
  •  ...to build an AI Data Center AIOps platform. The ideal candidate will have a strong background in Kubernetes and automation, ensuring the reliability of GPU fleet management. Key responsibilities include monitoring platform health, owning infrastructure deployments, and... 
    Senior
    Fleet

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $180k - $240k

     ...and hardware powering the fleet, facilitating effortless...  ...role We are seeking a Senior Cloud Infrastructure Engineer to architect and manage the large-...  ...platform, ensuring that multi-GPU clusters, distributed training frameworks, and automated workflows are scalable,... 
    Senior
    Fleet
    Odd job
    Work at office

    Gatik AI

    Mountain View, CA
    2 days ago
  • NVIDIA Gruppe in Santa Clara is seeking a Senior Validation Engineer for the DGX Server Product Engineering Team. In this role, you will work closely with HW/SW engineers to develop automated test plans for leading GPU computing products. Responsibilities include system... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  •  ...strong coding skills in JavaScript or Python, a background in automation frameworks, and a keen understanding of consumer electronics. Your...  ...developer experiences and ensuring stability across our device fleet, leveraging real-time metrics to drive insights and... 
    Senior
    Fleet
    Flexible hours

    Netflix, Inc.

    Los Gatos, CA
    2 days ago
  • $224k - $356.5k

    NVIDIA Corporation is seeking a Senior Systems Software Engineer for AI Stack and Performance to lead optimizations on DGX Station, a workstation-class AI computer. You will profile...  ...will need a strong background in AI/ML, GPU architecture, and deep learning frameworks... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $176k - $276k

     ...delivery and deployment and open source cloud enabling technologies like Kubernetes and...  ...ensures that our internal and external facing GPU cloud services run maximum reliability...  ...on eliminating manual work through automation, performance tuning and growing efficiency... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $182k - $242k

     ...CoreWeave is The Essential Cloud for AI™. Built for...  ...more at About the role Senior Engineers are area owners...  .... You'll partner with fleet, product, and hardware teams to evolve our GPU performance testing platform...  ...performance tests and automation workflows to expand... 
    Senior
    Fleet
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    24 days ago
  • $272k - $431.25k

     ...Gruppe is seeking a Principal Software Engineer to shape the technical direction of our GPU infrastructure in Santa Clara, California. You will define the technical strategy for DGX Cloud cluster operations and lead the design and implementation of critical systems. The... 

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $256k - $414k

     ...is the global leader in cloud gaming, dedicated to...  ...We are looking for a Senior Manager to lead the design...  ...performance networking for GPU‑based cloud...  ...specialized team of network architects focused on high‑performance...  ...observability frameworks to automate provisioning, scaling,... 
    Senior
    Local area

    NVIDIA

    Santa Clara, CA
    3 days ago
  •  ...Overview: Job Title: Senior DevOps Engineer / Platform Engineer Job Location: Santa Clara, CA Job Type: Long Term Contract...  ...design, build, and scale infrastructure for a site-builder/network automation platform. This role focuses on CI/CD pipelines, Kubernetes... 
    Senior
    Long term contract

    Stellar IT Group

    Santa Clara, CA
    4 days ago
  • $298k - $368k

     ...Senior Staff Machine Learning Engineer, Depot Automation Waymo is an autonomous driving technology company with the mission to be the world's most trusted...  ...efficiency for Waymo's rapidly expanding autonomous fleet. You will lead efforts to generalize complex depot... 
    Senior
    Fleet
    Full time
    Remote work

    Waymo

    Mountain View, CA
    2 days ago
  • $200k - $322k

    NVIDIA’s DGX Cloud is redefining how organizations deploy and scale AI infrastructure. We’re looking for a Senior Technical Program Manager to drive storage‑related initiatives across development, operations, and cloud deployment. This is a high‑impact role interfacing... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • HerculesAI in Campbell, California is looking for a skilled professional to manage GPU compute provisioning and enhance security architectures across cloud platforms. You will design Infrastructure as Code foundations and implement Zero Trust principles, ensuring the security... 
    Senior

    HerculesAI

    Campbell, CA
    3 days ago
  • $136k - $218.5k

    NVIDIA is seeking outstanding Senior Design Verification Engineers with a specialty in tools and automation to drive efficiency and collaboration among our High Speed IO...  ...of industry techniques. Collaborate with architects, designers, verification engineers, as well as... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
