Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Solutions Architect - AI Factory Deployment

$184k - $287.5k

NVIDIA

We are seeking an ambitious Senior Solutions Architect - AI Factory Deployment to join our NVIDIA Infrastructure Specialists team in Santa Clara! This role is uniquely positioned to develop, deploy, and validate AI factories end to end. You will focus on running and debugging AI/LLM workloads and benchmarks on Linux-based GPU clusters, using NCCL and collectives like AllReduce and AllToAll to improve performance and scalability.

As part of our world-class team, you will bring to bear observability and automation to improve benchmarks and validation. You will serve as the expert when workloads or benchmarks do not perform flawlessly. You will collaborate across NVIDIA to ensure AI factories are prepared for customers, validating hardware and software for modern AI deployments.

What You Will be Doing:
  • Set up, adjust, and verify AI factory environments across multi-GPU and multi-node Linux clusters.
  • Ensure configurations align with guidelines for NCCL, collectives, and distributed training frameworks.
  • Own the execution of key AI/LLM benchmarks, including setup, orchestration, result collection, and analysis.
  • Investigate and resolve issues when training jobs or benchmarks fail, hang, or underperform.
  • Build and improve observability for AI factories (metrics, logs, traces, dashboards) to understand workload behavior and system health.
  • Develop automation (Python, Shell) for running benchmarks, collecting results, and performing regression checks
  • Examine communication patterns and NCCL usage for AI/LLM workloads, concentrating on collectives such as AllReduce and AllToAll.
  • Recommend changes to job configuration, parallelism strategies, and cluster settings to improve throughput, latency, and scaling efficiency.
  • Work closely with hardware, software, networking, datacenter, and product teams to prepare AI factories for customer use.
  • Contribute to documentation, guidelines, and readiness collateral that support internal collaborators and customer-facing teams.
What We Need to See:
  • Bachelor's degree or equivalent experience in Computer Science, Mathematics, Engineering, Physics, or related field.
  • More than 6+ years of experience managing Linux-based systems in HPC, distributed systems, or extensive AI/ML settings.
  • Hands-on experience running AI/ML workloads on multi-GPU and/or multi-node clusters, with practical knowledge of NCCL.
  • Solid grasp of collective communication patterns, particularly AllReduce and AllToAll, and how they are applied in contemporary ML/LLM training.
  • Familiarity with LLM training and/or inference workflows using frameworks such as PyTorch or TensorFlow.
  • Proficiency with Python and Shell/Bash for scripting, automation, and tooling.
  • Experience with benchmarking (crafting, executing, and interpreting performance benchmarks).
  • Comfortable working with observability data (metrics, logs, dashboards) to troubleshoot and optimize complex distributed workloads.
  • Strong communication skills and the ability to work effectively with cross-functional teams.
Ways to Stand Out From the Crowd:
  • Experience with AI factory or large-scale AI infrastructure build, deployment, or operations.
  • Background in HPC performance engineering, SRE, or systems performance analysis for GPU-accelerated environments.
  • Familiarity with observability stacks (e.g., metrics/monitoring, logging, tracing systems) used for large distributed systems.
  • Experience building automation and CI-style pipelines for running and validating benchmarks at scale.
  • Demonstrated desire to use AI to solve practical problems, improve workflows, and guide data-driven decisions.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 3, 2026.

This posting is for an existing vacancy.


NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Senior Solutions Architect - AI Factory Deployment in United States vacancy
  • $224k - $356.5k

     ...enabling the next generation of AI development? We are looking for a top-tier Solution Architect to join the growing NVIDIA AI...  ...Help customers with their AI factory journey, including workflow pipelines...  ...with focus on hybrid deployments between cloud and on-prem Deliver... 
    Senior
    Work experience placement
    Remote work

    NVIDIA

    Durham, NC
    4 days ago
  •  ...States and allied partners. We build AI systems that determine how...  ...just how they're executed. From factory to foxhole, we operate at the layer...  ...Role We are looking for a Senior Solutions Architect to lead the design, deployment, and operation of agentic AI platforms... 
    Senior
    Work at office
    Local area

    Gallatin AI, Inc

    Austin, TX
    4 days ago
  •  ...opportunities at dnb.com/careers. The Senior Principal Solutions Architect function leads one or more...  ...integration patterns, approaches and deployment models. • Industry specific domain...  ...We may use artificial intelligence (AI) tools to support parts of the hiring... 
    Senior
    Worldwide

    Dun & Bradstreet

    Florham Park, NJ
    21 hours ago
  •  ...A company is looking for a Senior Solution Architect, Applied AI. Key Responsibilities Architect and oversee the deployment of high-throughput, low-latency LLM inference pipelines Mentor and lead a small team of developers, conducting code reviews and sprint planning... 
    Senior
    Remote work

    Virtual Vocations Inc

    United States
    5 hours ago
  •  ...relentlessly innovate to deliver solutions that enable today’s needs and...  ...workloads and usages of the AI Inference market covering...  ...to the Edge. The AI Solutions Architect must be able to translate usage...  ...optimizations required for deployment of scalable AI/ML solutions.... 
    Senior
    Temporary work
    Work experience placement
    Remote work
    Flexible hours
    Shift work

    SanDisk

    Milpitas, CA
    4 days ago
  • A respected technology company is seeking a Senior SaaS Solutions Architect to lead the deployment of intelligent support solutions. This role requires 7+ years...  ...client IT teams, and ensure the successful launch of AI-driven projects. Ideal for a candidate with exceptional... 
    Senior
    Remote work
    Flexible hours

    Zingtree

    Washington DC
    1 day ago
  •  ...Invesco Senior Principal Solutions Architect As one of the world's leading independent global investment...  ...architectures that power automation, AI, and advanced analytics. If you thrive...  ...in responsible and ethical AI deployment by partnering with Legal, Privacy, Cybersecurity... 
    Senior
    Full time
    Work experience placement
    Work at office

    Telepathy Inc

    Houston, TX
    21 hours ago
  • $184k - $356.5k

     ...NVIDIA Corporation is seeking a Senior Solution Architect for AI Infrastructure to support the U.S. Federal Government in digital transformations...  ...customers. Responsibilities include guiding GPU infrastructure deployments and conducting technical workshops. Candidates must have... 
    Senior
    Remote work

    NVIDIA

    Mission, KS
    1 day ago
  •  ...Overview Senior ML/AI Solution Architect - U.S. (Remote) | 12+ Months Global Enterprise Partners is currently looking for an experienced Senior...  ...ideal candidate will be responsible for the architecture, deployment, and adoption of AI/ML driven forecasting solutions... 
    Senior
    Contract work
    Immediate start
    Remote work

    Global Enterprise Partners

    Austin, TX
    21 hours ago
  •  ...About the job Senior Solution Architect - AI / GPU Cloud Role Overview GMI Cloud is seeking a Senior Solution Architect - AI / GPU Cloud...  ...customer AI/ML/HPC workloads, scaling requirements, and deployment models Architect end-to-end GPU cloud solutions... 
    Senior

    Glint Tech Solutions LLC

    Mountain View, CA
    2 days ago
  •  ...Senior Solution Architect, AI Position Summary As a Senior Solutions Architect, AI, you will design and lead technology-driven...  ...conversion on complex deals. Time-to-value: Faster deployment and measurable cycle-time reduction for AI-enabled... 
    Senior

    Lionbridge

    United States
    3 days ago
  •  ...NVIDIA is seeking outstanding Networking Solutions Architects (SA) to help design and deploy large-scale AI Factories across Canada. In this role, you will collaborate with customers to build end-to-end infrastructure. You will become a trusted technical advisor working... 
    Remote work

    NVIDIA

    United States
    4 hours ago
  •  ...Senior Solution Architect Everforth ECS is seeking a Senior Solution Architect to work in the National...  ...the U.S. Department of War's (DoW) AI-First strategy introduced in early 202...  ...warfighting data, aiming to accelerate the deployment of artificial intelligence (AI) on the... 
    Senior
    Contract work
    For contractors

    ECS

    Fairfax, VA
    2 days ago
  • $122.95k - $210.77k

     ...considered for this role. System Solution Architect - Drive Scalable Innovation...  ...industries. The Xcelerator Deployment organization plays a crucial...  ...automation, simulation, or AI/ML ~10+ years of...  ...more resource-efficient factories, resilient supply chains, and... 
    Work at office
    Local area
    Immediate start
    Relocation

    Siemens

    Chicago, IL
    2 days ago
  •  ...About the job Senior Solutions Architect - West Coast (US) Our client is a fast-growing technology...  ...building modern infrastructure for AI-driven applications. To support their...  ...organization and help customers design, deploy, and scale next-generation search and... 
    Senior
    Contract work
    Remote work
    Flexible hours

    TalentCloud Recruitment Group

    San Francisco, CA
    4 days ago
  • $94.5k - $132.1k

     ...Senior Solution Architect (SSA) At CDW, we make it happen, together. Trust, connection, and commitment...  ..., technology health checks, deployment services, and advisory services. Collaborate...  ...CDW is committed to being an AI-fluent organization. We're looking for... 
    Senior
    Local area
    Remote work

    CDW

    United States
    2 days ago
  • $130k

     ...pioneering a new era in clean energy with factory-fabricated microreactors designed to...  ...TX, we're rapidly growing as we work to deploy the world's first fleet of advanced microreactors...  ...About the role We're hiring an AI Solutions Engineer to work directly with internal... 
    Shift work

    Aalo Atomics

    Austin, TX
    21 hours ago
  • $224k - $356.5k

     ...Join NVIDIA as a Solution Architect on the Infrastructure Specialists team. Help redefine deep...  ...the world's largest and fastest AI Factories and supercomputers. We are seeking a...  ...candidate who can lead the planning and deployment of large scale AI data centers, focusing... 
    Worldwide

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...contributing member of the OEM AI Factory SA team. Our work...  ...various facets of AI Factories deployments. Applicants should be familiar...  ...Lenovo and others) to use NVIDIA solutions integrated in their...  ...Collaborating with solution architects, engineering or product teams... 
    Work experience placement
    Work at office

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $215k

     ...Senior Solutions Architect (Agentic AI / Generative AI) Location: Houston, TX or Dallas, TX (On-site) Compensation: Up to $215,000 base + bonus...  ...-led environments who have experience building and deploying real-world AI solutions at scale -not just conceptualising... 
    Senior
    Relocation
    Visa sponsorship

    Harnham

    Dallas, TX
    4 days ago
  •  ...AWS, Azure, and Google, our solutions focus on business outcomes...  ...embedded cyber resiliency and AI to protect today and enable...  ...of companies. As a Senior Solutions Architect at RapidScale, you will serve...  ...pipelines, IaC, and modern deployment practices is a plus. Compensation... 
    Senior

    Cox Communications

    Jacksonville, FL
    3 days ago
  •  ...Position: Microsoft Senior Solution Architect Company: Chicago-Based Automation Professional...  ...lifecycle design, configuration, and deployment of intelligent automation solutions across...  ...Cloud, Azure, Power Platform, and AI-enhanced agent orchestration. This role... 
    Senior
    Full time
    Contract work
    Remote work

    CoSourcing Partners

    Chicago, IL
    1 day ago
  • $184k - $287.5k

     ...looking for an experienced infrastructure Solutions Architect. Do you want to be part of a team that brings Artificial Intelligence (AI) hardware and software technologies to production...  ...and Hyperscalers to develop, build and deploy compute and networking solutions based on... 
    Senior

    NVIDIA

    Austin, TX
    4 days ago
  •  ...Amazing Happen at CDW. The Microsoft Senior Solution Architect is a critical member of CDW’s Digital...  ...architectures for assessments, deployments, and advisory services that align with...  ...differentials CDW is committed to being an AI-fluent organization We’re looking... 
    Senior
    Local area
    Remote work

    CDW

    United States
    4 days ago
  • $76.2k - $187.74k

     ...Solution Architect - Battery Manufacturing (NA) Choosing Capgemini...  ...are considering A Senior Battery Manufacturing...  .... • Strategize MES deployment and operations...  ...activities, including Factory Acceptance Testing and...  ...leading capabilities in AI, generative AI, cloud... 
    Permanent employment
    Full time
    Contract work
    Local area
    Remote work
    Worldwide

    Capgemini

    Southfield, MI
    1 day ago
  •  ...problems. NVIDIA is looking for Senior Cloud Infrastructure/DevOps Solutions Architect to join its NVIDIA Infrastructure...  ...many of the largest and fastest AI/HPC systems in the world! We are...  ...pipelines Develop tooling to automate deployment and management of large-scale... 
    Senior
    Remote work

    NVIDIA

    United States
    2 days ago
  • $224k - $356.5k

     ...The Financial Services Solution Architect team is looking for an extraordinary person to join an...  ...accelerate High-Performance Computing and AI workloads across various use cases. We’...  ...across the team ~ Skilled in deploying ML/DL models at scale on public cloud computing... 
    Senior
    Remote work

    NVIDIA

    United States
    1 day ago
  • $184k - $287.5k

     ...'s revolutionizing the field of AI with data center scale solutions? We are seeking a highly technical Senior AI compute Engineer to serve as a forward-deployed technical liaison between NVIDIA...  ...deployment and optimization of AI factories that enable customers and... 
    Senior
    Remote work

    NVIDIA

    United States
    2 days ago
  •  ...Senior Solution Architect, Professional Services At Anaplan, we are a team of innovators focused on...  ...business decision-making through our leading AI-infused scenario planning and analysis...  ...and/or lead UAT testing and deployment Your Qualifications: ~ A 4-year... 
    Senior

    Anaplan

    United States
    21 hours ago
  • $180k - $300k

     ...stakes customer projects—the ones where deployment success has material business impact...  ...LOOKING FOR You understand generative AI deeply enough to debug customer...  ...where the sale depends on whether you can architect a solution on the spot. You get energized by translating... 
    Senior
    Remote work
    Worldwide
    2 days per week

    Black Forest Labs

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Solutions Architect - AI Factory Deployment. Be the first to apply!