Manager, Next-Gen AI Cluster Validation

$224k - $356.5k

NVIDIA

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

We are looking for an outstanding technical leader to help develop the next generation of NVIDIA AI supercomputing systems. This leader will play a crucial role in the development and validation of AI computing systems at scale. They will lead all aspects of a group responsible for early deployments of new NVIDIA compute and networking technologies, as well as working closely with external partners to plan at-scale deployments. Be a key player to enable the most exciting supercomputing systems and contribute to the latest breakthroughs in artificial intelligence and GPU computing. Collaborate with top experts in the field to invent new supercomputing architectures for machine learning and HPC. Work in a fast-paced, remote-friendly environment with teammates in many different locations around the world.

What you'll be doing:

  • Lead a team developing next generation system designs and integrating new compute, networking, storage, and software systems

  • Build and support a platform for software development, systems automation, and performance engineering

  • Develop tooling and documentation to support the development of large-scale supercomputing systems for AI and HPC both inside and outside NVIDIA

  • Work closely with teams throughout the company on the cluster architecture, at-scale bringup, and integration of new technologies and products

  • Collaborate closely with partners and customers to support deployment and validation of clusters based on NVIDIA reference architectures

What we need to see:

  • BS (Masters or PhD preferred) in Applied Science or Engineering (or equivalent experience)

  • 8+ years of overall experience in the high-performance computing or machine learning fields, including 3+ years of technical leadership experience

  • Proven ability to lead high-performing engineering teams, especially across distributed groups with diverse expertise

  • Proficiency in software development and system automation with languages and tools such as Go, Python, or Ansible

  • Creative problem-solver with excellent teamwork and collaboration skills

  • Ability to work as part of a large, diverse team in a remote-friendly environment

Ways to stand out from the crowd:

  • Experience leading teams building HPC compute and storage systems in a research environment at large scale

  • Well-developed knowledge of deep learning applications, including multi-GPU and multi-node training and inference workloads

  • Expertise with high-performance datacenter networking such as InfiniBand and RoCE

  • Expertise with open-source monitoring technologies such as Prometheus and Grafana

  • Proven track record of growing and managing a team that encourages idea sharing, empowers team members, and provides opportunities for professional growth

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 1, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Vacancy posted 2 days ago