Senior Technical Program Manager, DGX Cloud Software Products and Services

$168k - $258.75k

NVIDIA

NVIDIA's DGX Cloud (DGXC) powers AI for strategic research and product workloads. The company seeks an expert Technical Program Manager (IC5) to lead strategic programs emphasizing resilience, reliability, and goodput. This role requires collaboration across multiple teams. It involves driving improvements in resilience, service stability, and operational scale. The TPM also guides architectural decisions related to resilience reference architecture. The TPM leads programs spanning DGXC infrastructure, Resilience Tools, and core platform services to deliver fault-tolerant, high-availability training and inference environments at scale.

We are looking for a TPM who is analytical, technically skilled, and comfortable working with cloud infrastructure, software, operations, and environments driven by data and research. You will work closely with engineering, SRE, operations, and researchers to develop scalable resilience strategies, improve operational performance, and assist in building open, modular software components and reference stacks for DGX Cloud at scale.

What You'll Be Doing:

  • Lead cross-functional programs that improve resilience, reliability, operational scale, and fleet-wide goodput across DGX Cloud.

  • Partner across infrastructure, platform, site reliability, operational, and tenant teams to identify systemic risks, resolve cross-stack dependencies, and improve end-to-end service stability.

  • Drive the definition and adoption of resilience reference stacks, operational standards, and scalable guidelines that strengthen service readiness and recovery.

  • Partner with engineering teams and researchers to support the development and delivery of open, modular software components for resilience, facilitating reusable and extensible capabilities across the platform.

  • Build and scale resilience tooling and operational mechanisms that improve observability, failure detection and attribution, root cause analysis, recovery orchestration, and operational readiness.

  • Define, measure, and improve goodput, using data-driven insights to increase usable fleet capacity, workload efficiency, and customer outcomes at scale.

  • Establish clear metrics, dashboards, and operating cadences to track program health, reliability posture, operational maturity, and performance.

What we need to see:

  • MS EE or CS degree, or equivalent experience.

  • 8+ years of experience in program management of large-scale software or infrastructure projects.

  • Proven track record of leading complex cross-functional programs in cloud, infrastructure, distributed systems, or platform environments.

  • Strong analytical skills with the ability to assess issues across infrastructure, software, and operational layers.

  • Excellent organizational skills and ability to use project management tools (e.g. Jira, Aha!, Confluence) and distributed version control systems (e.g. Git).

  • Solid understanding of reliability engineering, resilience development, and service performance metrics, including goodput, efficiency, and utilization.

  • Experience working alongside engineering, SRE, operations, and technical collaborators to advance projects in ambiguous, high-complexity environments.

  • Outstanding communication and presentation skills for diverse technical and non-technical audiences, along with strong problem-solving and conflict-management skills.

Ways To Stand Out From The Crowd:

  • Background in computer science, machine learning, deep learning, open-source software, GPU technology, AI infrastructure, or large-scale compute platforms.

  • Experience with large-scale AI training environments (e.g., distributed training frameworks, checkpointing, NCCL, Slurm or other schedulers).

  • Prior experience managing customer workflows on large-scale distributed computing, and working with AI researchers or directly training and evaluating AI models.

  • Proven ability to harness AI-enabled workflows and tools to improve program management efficiency, decision-making, execution visibility, and operational efficiency.

Widely considered to be one of the technology world's most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family.

#LI-Hybrid

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 168,000 USD - 258,750 USD for Level 4, and 200,000 USD - 322,000 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 8, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Vacancy posted 2 days ago
