Senior Solutions Architect - AI Factory Deployment
$184k - $287.5kNVIDIA
We are seeking an ambitious Senior Solutions Architect - AI Factory Deployment to join our NVIDIA Infrastructure Specialists team in Santa Clara! This role is uniquely positioned to develop, deploy, and validate AI factories end to end. You will focus on running and debugging AI/LLM workloads and benchmarks on Linux-based GPU clusters, using NCCL and collectives like AllReduce and AllToAll to improve performance and scalability.
As part of our world-class team, you will bring to bear observability and automation to improve benchmarks and validation. You will serve as the expert when workloads or benchmarks do not perform flawlessly. You will collaborate across NVIDIA to ensure AI factories are prepared for customers, validating hardware and software for modern AI deployments. What You Will be Doing:- Set up, adjust, and verify AI factory environments across multi-GPU and multi-node Linux clusters.
- Ensure configurations align with guidelines for NCCL, collectives, and distributed training frameworks.
- Own the execution of key AI/LLM benchmarks, including setup, orchestration, result collection, and analysis.
- Investigate and resolve issues when training jobs or benchmarks fail, hang, or underperform.
- Build and improve observability for AI factories (metrics, logs, traces, dashboards) to understand workload behavior and system health.
- Develop automation (Python, Shell) for running benchmarks, collecting results, and performing regression checks
- Examine communication patterns and NCCL usage for AI/LLM workloads, concentrating on collectives such as AllReduce and AllToAll.
- Recommend changes to job configuration, parallelism strategies, and cluster settings to improve throughput, latency, and scaling efficiency.
- Work closely with hardware, software, networking, datacenter, and product teams to prepare AI factories for customer use.
- Contribute to documentation, guidelines, and readiness collateral that support internal collaborators and customer-facing teams.
- Bachelor's degree or equivalent experience in Computer Science, Mathematics, Engineering, Physics, or related field.
- More than 6+ years of experience managing Linux-based systems in HPC, distributed systems, or extensive AI/ML settings.
- Hands-on experience running AI/ML workloads on multi-GPU and/or multi-node clusters, with practical knowledge of NCCL.
- Solid grasp of collective communication patterns, particularly AllReduce and AllToAll, and how they are applied in contemporary ML/LLM training.
- Familiarity with LLM training and/or inference workflows using frameworks such as PyTorch or TensorFlow.
- Proficiency with Python and Shell/Bash for scripting, automation, and tooling.
- Experience with benchmarking (crafting, executing, and interpreting performance benchmarks).
- Comfortable working with observability data (metrics, logs, dashboards) to troubleshoot and optimize complex distributed workloads.
- Strong communication skills and the ability to work effectively with cross-functional teams.
- Experience with AI factory or large-scale AI infrastructure build, deployment, or operations.
- Background in HPC performance engineering, SRE, or systems performance analysis for GPU-accelerated environments.
- Familiarity with observability stacks (e.g., metrics/monitoring, logging, tracing systems) used for large distributed systems.
- Experience building automation and CI-style pipelines for running and validating benchmarks at scale.
- Demonstrated desire to use AI to solve practical problems, improve workflows, and guide data-driven decisions.
NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
$224k - $356.5k
...enabling the next generation of AI development? We are looking for a top-tier Solution Architect to join the growing NVIDIA AI... ...Help customers with their AI factory journey, including workflow pipelines... ...with focus on hybrid deployments between cloud and on-prem Deliver...SeniorWork experience placementRemote work- ...States and allied partners. We build AI systems that determine how... ...just how they're executed. From factory to foxhole, we operate at the layer... ...Role We are looking for a Senior Solutions Architect to lead the design, deployment, and operation of agentic AI platforms...SeniorWork at officeLocal area
- ...opportunities at dnb.com/careers. The Senior Principal Solutions Architect function leads one or more... ...integration patterns, approaches and deployment models. • Industry specific domain... ...We may use artificial intelligence (AI) tools to support parts of the hiring...SeniorWorldwide
- ...A company is looking for a Senior Solution Architect, Applied AI. Key Responsibilities Architect and oversee the deployment of high-throughput, low-latency LLM inference pipelines Mentor and lead a small team of developers, conducting code reviews and sprint planning...SeniorRemote work
- ...relentlessly innovate to deliver solutions that enable today’s needs and... ...workloads and usages of the AI Inference market covering... ...to the Edge. The AI Solutions Architect must be able to translate usage... ...optimizations required for deployment of scalable AI/ML solutions....SeniorTemporary workWork experience placementRemote workFlexible hoursShift work
- A respected technology company is seeking a Senior SaaS Solutions Architect to lead the deployment of intelligent support solutions. This role requires 7+ years... ...client IT teams, and ensure the successful launch of AI-driven projects. Ideal for a candidate with exceptional...SeniorRemote workFlexible hours
- ...Invesco Senior Principal Solutions Architect As one of the world's leading independent global investment... ...architectures that power automation, AI, and advanced analytics. If you thrive... ...in responsible and ethical AI deployment by partnering with Legal, Privacy, Cybersecurity...SeniorFull timeWork experience placementWork at office
$184k - $356.5k
...NVIDIA Corporation is seeking a Senior Solution Architect for AI Infrastructure to support the U.S. Federal Government in digital transformations... ...customers. Responsibilities include guiding GPU infrastructure deployments and conducting technical workshops. Candidates must have...SeniorRemote work- ...Overview Senior ML/AI Solution Architect - U.S. (Remote) | 12+ Months Global Enterprise Partners is currently looking for an experienced Senior... ...ideal candidate will be responsible for the architecture, deployment, and adoption of AI/ML driven forecasting solutions...SeniorContract workImmediate startRemote work
- ...About the job Senior Solution Architect - AI / GPU Cloud Role Overview GMI Cloud is seeking a Senior Solution Architect - AI / GPU Cloud... ...customer AI/ML/HPC workloads, scaling requirements, and deployment models Architect end-to-end GPU cloud solutions...Senior
- ...Senior Solution Architect, AI Position Summary As a Senior Solutions Architect, AI, you will design and lead technology-driven... ...conversion on complex deals. Time-to-value: Faster deployment and measurable cycle-time reduction for AI-enabled...Senior
- ...NVIDIA is seeking outstanding Networking Solutions Architects (SA) to help design and deploy large-scale AI Factories across Canada. In this role, you will collaborate with customers to build end-to-end infrastructure. You will become a trusted technical advisor working...Remote work
- ...Senior Solution Architect Everforth ECS is seeking a Senior Solution Architect to work in the National... ...the U.S. Department of War's (DoW) AI-First strategy introduced in early 202... ...warfighting data, aiming to accelerate the deployment of artificial intelligence (AI) on the...SeniorContract workFor contractors
$122.95k - $210.77k
...considered for this role. System Solution Architect - Drive Scalable Innovation... ...industries. The Xcelerator Deployment organization plays a crucial... ...automation, simulation, or AI/ML ~10+ years of... ...more resource-efficient factories, resilient supply chains, and...Work at officeLocal areaImmediate startRelocation- ...About the job Senior Solutions Architect - West Coast (US) Our client is a fast-growing technology... ...building modern infrastructure for AI-driven applications. To support their... ...organization and help customers design, deploy, and scale next-generation search and...SeniorContract workRemote workFlexible hours
$94.5k - $132.1k
...Senior Solution Architect (SSA) At CDW, we make it happen, together. Trust, connection, and commitment... ..., technology health checks, deployment services, and advisory services. Collaborate... ...CDW is committed to being an AI-fluent organization. We're looking for...SeniorLocal areaRemote work$130k
...pioneering a new era in clean energy with factory-fabricated microreactors designed to... ...TX, we're rapidly growing as we work to deploy the world's first fleet of advanced microreactors... ...About the role We're hiring an AI Solutions Engineer to work directly with internal...Shift work$224k - $356.5k
...Join NVIDIA as a Solution Architect on the Infrastructure Specialists team. Help redefine deep... ...the world's largest and fastest AI Factories and supercomputers. We are seeking a... ...candidate who can lead the planning and deployment of large scale AI data centers, focusing...Worldwide$152k - $241.5k
...contributing member of the OEM AI Factory SA team. Our work... ...various facets of AI Factories deployments. Applicants should be familiar... ...Lenovo and others) to use NVIDIA solutions integrated in their... ...Collaborating with solution architects, engineering or product teams...Work experience placementWork at office$215k
...Senior Solutions Architect (Agentic AI / Generative AI) Location: Houston, TX or Dallas, TX (On-site) Compensation: Up to $215,000 base + bonus... ...-led environments who have experience building and deploying real-world AI solutions at scale -not just conceptualising...SeniorRelocationVisa sponsorship- ...AWS, Azure, and Google, our solutions focus on business outcomes... ...embedded cyber resiliency and AI to protect today and enable... ...of companies. As a Senior Solutions Architect at RapidScale, you will serve... ...pipelines, IaC, and modern deployment practices is a plus. Compensation...Senior
- ...Position: Microsoft Senior Solution Architect Company: Chicago-Based Automation Professional... ...lifecycle design, configuration, and deployment of intelligent automation solutions across... ...Cloud, Azure, Power Platform, and AI-enhanced agent orchestration. This role...SeniorFull timeContract workRemote work
$184k - $287.5k
...looking for an experienced infrastructure Solutions Architect. Do you want to be part of a team that brings Artificial Intelligence (AI) hardware and software technologies to production... ...and Hyperscalers to develop, build and deploy compute and networking solutions based on...Senior- ...Amazing Happen at CDW. The Microsoft Senior Solution Architect is a critical member of CDW’s Digital... ...architectures for assessments, deployments, and advisory services that align with... ...differentials CDW is committed to being an AI-fluent organization We’re looking...SeniorLocal areaRemote work
$76.2k - $187.74k
...Solution Architect - Battery Manufacturing (NA) Choosing Capgemini... ...are considering A Senior Battery Manufacturing... .... • Strategize MES deployment and operations... ...activities, including Factory Acceptance Testing and... ...leading capabilities in AI, generative AI, cloud...Permanent employmentFull timeContract workLocal areaRemote workWorldwide- ...problems. NVIDIA is looking for Senior Cloud Infrastructure/DevOps Solutions Architect to join its NVIDIA Infrastructure... ...many of the largest and fastest AI/HPC systems in the world! We are... ...pipelines Develop tooling to automate deployment and management of large-scale...SeniorRemote work
$224k - $356.5k
...The Financial Services Solution Architect team is looking for an extraordinary person to join an... ...accelerate High-Performance Computing and AI workloads across various use cases. We’... ...across the team ~ Skilled in deploying ML/DL models at scale on public cloud computing...SeniorRemote work$184k - $287.5k
...'s revolutionizing the field of AI with data center scale solutions? We are seeking a highly technical Senior AI compute Engineer to serve as a forward-deployed technical liaison between NVIDIA... ...deployment and optimization of AI factories that enable customers and...SeniorRemote work- ...Senior Solution Architect, Professional Services At Anaplan, we are a team of innovators focused on... ...business decision-making through our leading AI-infused scenario planning and analysis... ...and/or lead UAT testing and deployment Your Qualifications: ~ A 4-year...Senior
$180k - $300k
...stakes customer projects—the ones where deployment success has material business impact... ...LOOKING FOR You understand generative AI deeply enough to debug customer... ...where the sale depends on whether you can architect a solution on the spot. You get energized by translating...SeniorRemote workWorldwide2 days per week
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Solutions Architect - AI Factory Deployment. Be the first to apply!
- digital solutions manager United States
- junior solutions architect United States
- solutions architect United States
- entry level aws solution architect United States
- salesforce solution architect United States
- solution delivery manager United States
- cloud solutions architect United States
- business solutions manager United States
- sap solution architect United States
- senior solutions architect United States

