Principal Software Engineer - DGX Cloud

$272k - $431.25k

NVIDIA

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people.

We are looking for a Principal Software Engineer to join our DGX Cloud team and build the foundational systems that drive NVIDIA’s high-performance GPU infrastructure. You will play a meaningful role in crafting scalable automation solutions, integrating diverse systems, and enabling seamless workflows across global cloud operations. As a Principal Engineer in DGX Cloud, you will be at the pinnacle of technical leadership. You will directly craft the platform that fuels the future of AI and cloud computing.

What you'll be doing:

  • Lead the build and development of next-generation APIs, state management, and workflow orchestration systems that automate fleet lifecycle operations at massive scale.

  • Drive technical alignment across dependent systems and partner teams to ensure cohesive integration, clear interfaces, and reliable end-to-end workflows, with a strong focus on delivery.

  • Act as a force-multiplier by coaching, mentoring, and encouraging senior engineers, elevating the technical standards and guidelines across the organization.

  • Maintain an incredible focus on the customer experience and product requirements, translating deep technical insight into high-impact business solutions.

  • Partner with executive and engineering leadership to codify critical business processes into self-measuring, scalable, and operationally consistent platforms, drastically reducing manual toil.

  • Direct the integration strategy for key technologies, including common AI schedulers (e.g., Kubernetes, Slurm) and innovative observability systems (e.g., Prometheus, OpenTelemetry, Grafana).

What We Need To See:

  • 16+ years of progressive industry experience

  • Master's or Bachelor's degree, or equivalent experience defining and shipping complex distributed systems.

  • Deep, hands-on expertise in establishing, operating, and scaling services in a fast-paced, high-reliability environment.

  • Ability to thrive in ambiguous, fast-paced environments by rapidly testing ideas, iterating toward working solutions, and then hardening the winners into reliable, scalable systems.

  • Outstanding proficiency in modern systems programming languages such as Go, Java, or Python.

  • Proven track record of defining, owning, and evolving the architecture of high-scale distributed systems, including advanced patterns for APIs, control planes, and data pipelines.

  • Deep understanding of global cloud infrastructure (AWS, GCP, Azure) and container ecosystems (Docker, Kubernetes).

  • Demonstrated ability to drive technical strategy and influence outcomes across organizational boundaries.

  • Outstanding ability to communicate complex technical concepts, drive organizational consensus, and mentor high-performing engineers.

Ways to Stand Out from the Crowd:

  • A history of successfully leading the development and adoption of organization-wide workflow orchestration systems for petabyte-scale infrastructure.

  • Experience in a Principal/Staff+ capacity, delivering measurable improvements in operational efficiency, reliability, and security across a large engineering org.

  • Deep familiarity with the operational and deployment aspects of the NVIDIA AI/ML software stack (CUDA, cuDNN, containerization).

  • Patent contributions or a strong publication record in areas related to distributed systems, cloud computing, or infrastructure automation.

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 431,250 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 3, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Vacancy posted 1 day ago