Principal Software Engineer - DGX Cloud
$272k - $431.25kNVIDIA
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people.
We are looking for a Principal Software Engineer to join our DGX Cloud team and build the foundational systems that drive NVIDIA’s high-performance GPU infrastructure. You will play a meaningful role in crafting scalable automation solutions, integrating diverse systems, and enabling seamless workflows across global cloud operations. As a Principal Engineer in DGX Cloud, you will be at the pinnacle of technical leadership. You will directly craft the platform that fuels the future of AI and cloud computing.
What you'll be doing:
Lead the build and development of next-generation APIs, state management, and workflow orchestration systems that automate fleet lifecycle operations at a massive scale.
Drive technical alignment across dependent systems and partner teams to ensure cohesive integration, clear interfaces, and reliable end-to-end workflows, with a strong focus on delivery.
Act as a force-multiplier by coaching, mentoring, and encouraging senior engineers, elevating the technical standards and guidelines across the organization.
Maintain an incredible focus on the customer experience and product requirements, translating deep technical insight into high-impact business solutions.
Partner with executive and engineering leadership to codify critical business processes into self-measuring, scalable, and operationally consistent platforms, drastically reducing manual toil.
Direct the integration strategy for key technologies, including common AI schedulers (e.g., Kubernetes, Slurm) and innovative observability systems (e.g., Prometheus, OpenTelemetry, Grafana).
What We Need To See:
16+ years of progressive industry experience
Master's or Bachelor's degree, or equivalent experience defining and shipping complex distributed systems.
Deep, hands-on expertise in establishing, operating, and scaling services in a fast paced, high-reliability environment.
Thrive in ambiguous, fast paced environments by rapidly testing ideas, iterating toward working solutions, and then hardening the winners into reliable, scalable systems.
Outstanding proficiency in modern systems programming languages such as Go, Java, or Python.
Proven track record of defining, owning, and evolving the architecture of high-scale distributed systems, including advanced patterns for APIs, control planes, and data pipelines.
Deep understanding of global cloud infrastructure (AWS, GCP, Azure) and container ecosystems (Docker, Kubernetes).
Demonstrated ability to drive technical strategy and influence outcomes across organizational boundaries.
Outstanding ability to communicate complex technical concepts, drive organizational consensus, and mentor high-performing engineers.
Ways to Stand Out from the Crowd:
A history of successfully leading the development and adoption of organization-wide workflow orchestration systems for petabyte-scale infrastructure.
Experience in a Principal/Staff+ capacity, delivering measurable improvements in operational efficiency, reliability, and security across a large engineering org.
Deep familiarity with the operational and deployment aspects of the NVIDIA AI/ML software stack (CUDA, cuDNN, containerization).
Patent contributions or a strong publication record in areas related to distributed systems, cloud computing, or infrastructure automation.
Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 431,250 USD.
You will also be eligible for equity and benefits ( .
Applications for this job will be accepted at least until May 3, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
$272k - $431.25k
...NVIDIA DGX Cloud is scaling GPU infrastructure across internal, partner, and cloud environments. We are looking for Principal Software Engineers to help shape the technical direction for production engineering, Kubernetes-based operations, automation, and reliability across...Suggested$224k - $356.5k
...impact on the world. As part of the DGX Cloud organization, the Attestation Services... ...directly with security, silicon, and cloud engineering teams to turn embedded hardware trust... ...with security, silicon, platform, and software teams to deliver end-to-end trust from...SuggestedRemote work$184k - $287.5k
...Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the infrastructure that powers our innovative AI research. This... ...to foster innovation. We are seeking an AI infrastructure software engineer to join our team. You'll be instrumental in designing, building...Suggested$320k
...leading tech company is seeking a seasoned individual to spearhead DGX Cloud strategy, focusing on GPU lifecycle and operational health.... ..., collaborating with stakeholders, and managing full software and system lifecycles. If you're passionate about technology and...Suggested$272k - $431.25k
Joining NVIDIA's DGX Cloud Team means contributing to the infrastructure that powers our innovative AI research. This... ...to champion innovation.We are seeking a distributed software engineer to join our team! As a Principal Engineer, you'll be instrumental in developing and...Suggested$184k - $287.5k
...NVIDIA DGX Cloud is building and operating large-scale GPU infrastructure for AI research and production workloads. We are looking for Senior Software Engineers to help build the automation, tooling, and operational systems that make GPU clusters reliable, scalable, and...Remote work$136k - $224.25k
## Senior Network Reliability Engineer - DGX CloudApplylocations: US, CA, Santa Clara: US, Remotetime... ...Engineer to support and maintain our cloud and datacenter network infrastructures.... ...serves the needs across the whole software stack for NVIDIA, from Graphics Drivers...Remote workShift work$147k - $237.5k
...great outcomes. Job Summary Join our Cloud Network and AI Security team and... ...technologies, various hypervisors, system software, and networking. Qualifications Required... ...tools. ~10 or more years of related engineering experience. ~ Strong expertise in...Full timeWork at officeLocal area- NVIDIA Corporation is seeking a Senior Software Engineer to join its DGX Cloud Production Engineering team in Santa Clara, CA. This role focuses on building automation and operational systems for large-scale GPU clusters, ensuring reliability and scalability. The ideal...
$272k - $431.25k
NVIDIA Corporation is looking for a Principal Software Engineer for DGX Cloud Production Engineering to define technical strategies and lead efforts in large-scale GPU operations. The successful candidate will have over 15 years of experience in distributed systems, with...Remote job$147k - $237.5k
...Job Summary Your Career Help build what is next. Our Cloud Management Platform is a public cloud delivered management... ...the Palo Alto Networks network security portfolio. Principal Software Engineers are: Design and develop high-volume, low-latency applications...Full timeWork at office$210k - $295k
...possible, with the ultimate goal of enabling human life on Mars. PRINCIPAL SOFTWARE ENGINEER (PLATFORM TEAM) The Platform Team builds the foundational... ...as secure gateways and proxies that integrate with any cloud compute provider and multiple frontier model providers....Permanent employmentTemporary work- ...Principal Engineer (Sr Manager-equivalent) At Palo Alto Networks®, we're united by a shared mission... ...Career At Palo Alto Networks, Secure Cloud and AI infrastructure is the foundation... ...velocity, elevate our standards for software quality, and unlock new business opportunities...Full timeWork at office3 days per week
$147k - $237.5k
...Platform team is expanding, and we're looking for an experienced Software Engineer to join our team. This team is responsible for building... ...implementation ~ Working knowledge of at least one of the major cloud platforms (eg GCP, AWS, or Azure), preferably GCP ~...Full timeWork at office$272k - $431.25k
...scaling for HPC and generative AI workload. Scale out is inherent to the design of this massive superchip. We are looking for expert engineers to come and help design rack level solutions for next generation scaling AI supercomputing platforms. Join us at the forefront...$272k - $431.25k
...Principal Engineer, Security Foundations For Autonomous Agents NVIDIA has been transforming computer graphics, PC gaming, and accelerated... ...internal and external data sources. You'll partner closely with Cloud, AI/ML & Generative AI workforce, internal platform teams...$143k - $286k
...What you'll do... Role Overview: We are seeking a Principal Software Engineer to lead the design and development of enterprise-scale Marketplace... ...this are data platforms, enterprise architecture, DevOps, cloud computing, and infrastructure. All of these products and...Full timeTemporary workPart time$143k - $286k
...Position Summary... What you'll do... As a Principal Engineer in Walmart's Fraud and Risk platform, you will define and drive the architecture... .... That's what we do at Walmart Global Tech. We're a team of software engineers, data scientists, cybersecurity expert's and...Full timeTemporary workPart time$126k - $204.5k
...Career Palo Alto Network's Next-Gen Firewall Cloud Security team is looking for a Sr AI Automation/Test Engineer with experience in Public and Private Cloud Security... ...position. You will be part of a world-class software QA engineering team that works on various ground...Full timeWork at office$320k
...NVIDIA DGX systems are the foundation of the world’s most advanced AI infrastructure—purpose-built servers, workstations... ..., NVLink, NVIDIA Networking, and a fully optimized AI software stack. We are seeking an engineering leader responsible for end-to-end delivery of every...$272k - $431.25k
...designs. From single node HGX/DGX systems all the way up to... ...rapidly growing enterprise and cloud provider businesses. Each bringing... ...optimized NVIDIA AI and HPC software stack. We’re searching for a... .... Mentor architects and engineering teams to grow them into future...Shift work$147k - $237.5k
Palo Alto Networks, Inc. is seeking a Principal Software Engineer to develop a scalable cloud management platform overseeing next-generation security solutions. Ideal candidates will have over 8 years of experience in enterprise applications and technical leadership, with...$147k - $237.5k
Palo Alto Networks, Inc. is seeking a Principal Software Engineer in Santa Clara, California, to design and implement Threat Intelligence Services. The role involves working on the cloud-native malware detection platform, WildFire. Candidates should have extensive knowledge...- Palo Alto Networks, Inc. is seeking a Senior Staff Engineer to contribute to their innovative cloud security product, Data Loss Prevention (DLP). This role... ...attendance 3 days a week. Candidates should have extensive software engineering experience, particularly with Core Java...Work at office3 days per week
$320k
Director, Site Reliability and Software Engineering - DGX Cloud page is loaded## Director, Site Reliability and Software Engineering - DGX Cloudlocations: US, CA, Santa Clara: US, Remotetime type: Full timeposted on: Posted Todayjob requisition id: JR2017420NVIDIA's invention...$147k - $237.5k
Palo Alto Networks, Inc. seeks a Principal Software Engineer to join the Cortex Xpanse team in Santa Clara, California. This role focuses on building scalable backend services and APIs while working on the Attack Surface Management platform. Candidates should have 7+ years...$248k - $391k
...diverse and supportive environment, where NVIDIANs are inspired to excel and make a profound global impact. We're hiring a Principal Software Engineer to own the engineering efforts across NVIDIA enterprise systems. You'll partner with IT leadership to transform reactive...$384k
NVIDIA is seeking a Senior Director, System Software Engineering, to lead strategy and execution for capacity management in DGX Cloud, building the capacity foundation for NVIDIA's internal AI research clusters. This leader will shape the roadmap for scalable system software...Full time$168k - $264.5k
NVIDIA is looking for a Senior Network Engineer to develop a cloud network infrastructure. The goal is to craft a reliable, scalable and efficient network to support NVIDIA software development workflows and tools, including CI/CD pipelines, compute resource management...$168k - $264.5k
NVIDIA Corporation is seeking a Senior Network Engineer to develop a cloud network infrastructure that supports software development workflows. This role involves designing, implementing, and troubleshooting network stacks, with a focus on automation. Key qualifications...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Principal Software Engineer - DGX Cloud. Be the first to apply!
- senior principal software engineer Santa Clara, CA
- principal software engineer Santa Clara, CA
- aws cloud infrastructure engineer Santa Clara, CA
- remote cloud architect Santa Clara, CA
- senior cloud engineer Santa Clara, CA
- cloud architect Santa Clara, CA
- cloud engineering manager Santa Clara, CA
- cloud engineer remote Santa Clara, CA
- principal cloud engineer Santa Clara, CA
- senior principal cloud computing engineer Santa Clara, CA

