Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Software Engineer, DGX Cloud Production Engineering

$184k - $287.5k

NVIDIA

  • # Senior Software Engineer, DGX Cloud Production EngineeringApplylocations: US, CA, Santa Clara: US, Remotetime type: Full timeposted on: Posted Yesterdayjob requisition id: JR2019319NVIDIA DGX Cloud is building and operating large-scale GPU infrastructure for AI research and production workloads. We are looking for Senior Software Engineers to help build the automation, tooling, and operational systems that make GPU clusters reliable, scalable, and safe to run. This role is part of a production engineering team focused on Kubernetes-based infrastructure, GPU cluster operations, reliability, automation, GitOps, and Day 2 operability across DGX Cloud environments.**What you’ll be doing:*** Build and operate automation for large-scale GPU clusters across NVIDIA Cloud Partners (NCP) and on-prem environments.* Develop tools and services for provisioning, validation, upgrades, monitoring, repair, and cluster lifecycle operations.* Improve Day 0 / Day 1 / Day 2 workflows for cluster bringup, handoff, and production operations.* Reduce manual production touches through APIs, GitOps, automation, and agent-assisted workflows.* Participate in on-call, incident response, debugging, and durable follow-up work.* Partner with platform, storage, networking, security, and workload teams to make infrastructure production-ready.**What we need to see:*** 8+ years of experience building or operating production infrastructure.* Strong programming skills in Python, Go, or similar.* Experience with Linux, Kubernetes, containers, cloud infrastructure, or infrastructure automation.* Ability to troubleshoot distributed systems in production.* Clear communication and ability to work across teams.* BS/MS in Computer Science or equivalent experience.**Ways to stand out from the crowd:*** Experience with GPU infrastructure, Kubernetes operators, GitOps, Terraform, ArgoCD, or fleet automation.* Experience with SLOs, on-call, incident response, observability, and reliability practices.* Exposure to BMaaS, VMaaS, managed Kubernetes, or multi-cloud infrastructure.NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. We have some of the most forward-thinking and hard-working people on the planet working for us. If you're creative, hard-working and self-motivated, we want to hear from you!Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.You will also be eligible for equity and benefits.Applications for this job will be accepted at least until June 8, 2026.This posting is for an existing vacancy.NVIDIA uses AI tools in its recruiting processes.NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
  • J-18808-Ljbffr NVIDIA Corporation

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Senior Software Engineer, DGX Cloud Production Engineering in Santa Clara, CA vacancy
  • $184k - $287.5k

    Overview NVIDIA DGX Cloud is building and operating large-scale GPU infrastructure for AI research and production workloads. We are looking for Senior Software Engineers to help build the automation, tooling, and operational systems that make GPU clusters reliable, scalable... 
    Senior
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $184k - $356.5k

    NVIDIA Corporation is seeking a Senior Software Engineer for DGX Cloud Production Engineering in Santa Clara, CA. You will play a critical role in building and operating large-scale GPU infrastructure for AI workloads, focusing on automation, tooling, and operational systems... 
    Senior
    Software

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $356.5k

    NVIDIA Gruppe is seeking an experienced AI infrastructure software engineer to join its DGX Cloud AI Efficiency Team in Santa Clara, California. This role focuses on developing the infrastructure for optimizing AI workloads and ensuring high availability and efficiency... 
    Senior
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $272k - $431.25k

    NVIDIA DGX Cloud is scaling GPU infrastructure across internal, partner...  ...We are looking for Principal Software Engineers to help shape the technical direction for production engineering, Kubernetes-based...  ...GPU clusters. This role is for senior technical leaders who can define... 
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

    ## Senior Software Engineer, DGX Cloud AI InfrastructureApplylocations: US, CA, Santa Clara: US, TX, Austin: US, OR, Remote: US, WA, Remote: US, WA...  ...-attribution capabilities that keep large clusters productive. This is a hands-on senior individual-contributor role... 
    Senior
    Software
    Remote work

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $224k - $356.5k

     ...on the world. As part of the DGX Cloud organization, the...  ...security, silicon, and cloud engineering teams to turn embedded hardware...  ...security, silicon, platform, and software teams to deliver end-to-end...  ...REST APIs and microservices in production. Experience with cloud-... 
    Senior
    Software
    Remote work

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $152k - $241.5k

    ## Senior AI Infrastructure Engineer - DGX CloudApplylocations: US, CA, Santa Clara: US, CA...  ...Engineer to join our DGX Cloud group. This engineering...  ...and maintain large-scale production systems with high efficiency...  ...using a combination of software and systems engineering practices... 
    Senior
    Software

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $184k - $287.5k

    The DGX Cloud organization at NVIDIA brings together cutting-edge hardware and software innovation to deliver industry-leading accelerated computing...  ...We're a team of innovative engineers dedicated to solving some of...  ...looking for an outstanding Senior Systems Software Engineer... 
    Senior
    Software
    Full time
    Worldwide

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $168k - $264.5k

    Senior Network Engineer - Cloud Network Infrastructure NVIDIA is seeking an experienced Senior Network Engineer to develop and manage a robust cloud network infrastructure that supports NVIDIA's software development workflows and tools. The role focuses on designing, implementing... 
    Senior
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $320k

     ...leading tech company is seeking a seasoned individual to spearhead DGX Cloud strategy, focusing on GPU lifecycle and operational health....  ..., collaborating with stakeholders, and managing full software and system lifecycles. If you're passionate about technology and... 
    Senior
    Software

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

    NVIDIA Corporation in Santa Clara is seeking a Sr. Software Engineer to architect a simulation platform for next-generation DGX products. The role involves enhancing simulator components and collaborating with global teams on performance improvements and bug fixes. The... 
    Senior
    Software

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to...  ...an AI infrastructure software engineer to join our team. You'll be...  ...availability of AI systems. As a senior DGX Cloud AI Infrastructure...  ...Enhance infrastructure and products underpinning NVIDIA's AI platforms... 
    Senior
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  •  ...for application microservices deployed in both on-prem and on Cloud. Setup test tools to validate environment, application and solutions...  ...science or equivalent with 1+ years hands on professional software development experience with a variety of different testing... 
    Senior
    Software

    Rootshell Enterprise Technologies

    Santa Clara, CA
    4 days ago
  • $112k - $137k

    A leading cybersecurity company in Santa Clara seeks an experienced Software Testing Engineer to design and validate cloud security products. The ideal candidate holds a Bachelor's in Computer Science and has over 10 years of experience in software testing, particularly... 
    Senior
    Software
    Work experience placement

    Fortinet, Inc.

    Santa Clara, CA
    4 days ago
  • $200k - $322k

    Senior Technical Program Manager - DGX Cloud Infra Security page is loaded## Senior Technical...  ...infrastructure, platform, and product teams. This role ensures...  ...roadmaps and the software development lifecycle. It...  ...Security, Compliance, SRE, and Engineering to continually advance... 
    Senior
    Software

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • $200k - $322k

    NVIDIA’s DGX Cloud is redefining how organizations deploy and scale...  .... We’re looking for a Senior Technical Program Manager to...  ...impact role interfacing with engineering, product, operations, finance, and...  ...management of large‑scale software or infrastructure projects.... 
    Senior
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $159k - $231k

    Senior Data Center Operations Engineer, Google Cloud Sunnyvale, CA, USA Qualifications Bachelor’s degree in Electrical...  ...logical, mechanical, electrical, software, thermal, etc. Ability to read...  ...this role, you will support new product engineering within Google’s hardware... 
    Senior
    Software
    Full time
    Work at office
    Worldwide

    Google Inc.

    Sunnyvale, CA
    2 days ago
  • $184k - $287.5k

    Senior Software Engineer, Cloud-Native Stack - CSP Engagements page is loaded Senior Software Engineer, Cloud...  ...cloud-native stack for datacenter products like GB200. In this role, You will define...  ...Jobs (5) Senior Software Engineer, DGX Cloud Lepton Marketplace locations 2... 
    Senior
    Software
    Full time

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • $168k - $258.75k

    Senior Technical Program Manager, DGX Cloud Software Products and Services page is loaded## Senior Technical Program Manager, DGX Cloud Software Products and Serviceslocations...  ...by data and research. You will work closely with engineering, SRE, operations, and researchers to develop... 
    Senior
    Software

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  •  ...the world. The NVIDIA Cloud Accelerator team develops...  ...a Technical Marketing Engineer passionate about AI Infrastructure...  ...Data Center Management software. You will help our...  ...Marketing Engineering, Product, Engineering, and Field...  ...Base Command Manager, DGX Cloud, Run:ai, GPU... 
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $224k - $356.5k

    NVIDIA Corporation is seeking a Senior Systems Software Engineer for AI Stack and Performance to lead optimizations on DGX Station, a workstation-class AI computer. You will profile workloads and collaborate with teams to ensure best-in-class performance for AI applications... 
    Senior
    Software

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $320k

    Director, Site Reliability and Software Engineering - DGX Cloud page is loaded## Director, Site Reliability and Software Engineering - DGX Cloudlocations...  ...distributed NVIDIA GPU cloud clusters and contribute to product strategy. You will be the leader for all aspects of... 
    Software

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • NVIDIA Gruppe in Santa Clara is seeking a Sr. Software Engineer to develop and enhance simulation platforms for their DGX Server systems. The role involves working with cross-functional teams to optimize performance and build effective software solutions. Ideal candidates... 
    Senior
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • Fortinet, Inc. is hiring for a software engineering role based in Santa Clara, California. The position requires strong programming skills, with an emphasis on Python and extensive experience with AWS or Azure. You will contribute to developing and maintaining GenAI/ML... 
    Senior
    Software

    Fortinet, Inc.

    Santa Clara, CA
    5 days ago
  • $180k - $270k

    A leading data storage company is seeking a Senior Software Engineer in Santa Clara, CA, focusing on building reliable CI/CD pipelines and enhancing developer productivity. The role requires expertise in distributed systems, CI/CD management, and a strong understanding... 
    Senior
    Software

    Pure Storage, Inc.

    Santa Clara, CA
    5 days ago
  • NVIDIA Gruppe is seeking experienced Senior Software Engineers to join their production engineering team in Santa Clara, California. The role involves building automation and operational systems for GPU clusters, with a focus on Kubernetes and reliability practices. The... 
    Senior
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $272k - $431.25k

     ...Principal Security Data Engineer, Infrastructure Security Engineering - DGX CloudApplylocations: US,...  ...id: JR2019415NVIDIA DGX Cloud is the AI supercomputing...  ...building, and operating production data pipelines, lakes, or...  ...-Grade Coding: A strong software engineering background... 
    Software
    Remote work

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • A leading technology company is seeking a Senior Software Engineer to optimize compute infrastructure through novel AI strategies. This role...  ...with bonuses and benefits. Join a dynamic team driving critical services with AI across production. #J-18808-Ljbffr Google Inc.
    Senior
    Software

    Google Inc.

    Sunnyvale, CA
    5 days ago
  • CrowdStrike, Inc. is seeking a Cloud Software Engineer to join the Falcon Complete AI Engineering Team in Sunnyvale, California. In this role, you will design, build, and deploy distributed cloud ecosystems using technologies such as Golang and Python. The ideal candidate... 
    Senior
    Software

    CrowdStrike, Inc.

    Sunnyvale, CA
    2 days ago
  • $135k - $170k

     ...Financial, Inc. is seeking a motivated Senior DevOps Engineer to join our IT team in Sunnyvale,...  ...DevOps principles and have experience with cloud platforms like AWS and Azure. Your...  ...efficient deployment and operation of our software applications. The position offers a... 
    Senior
    Software

    Valid8 Financial, Inc.

    Sunnyvale, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Software Engineer, DGX Cloud Production Engineering. Be the first to apply!