Principal Solutions Architect

ePlus

San Ramon, CA

We are seeking an elite Solutions Architect to lead the end-to-end design, sizing, and deployment of NVIDIA AI Factory-aligned infrastructure. In this highly technical, customer-facing role, you will translate complex AI and machine learning workload requirements into fully engineered infrastructure solutions spanning colocation facilities, GPU compute, high-performance networking, parallel storage, and the complete NVIDIA AI software stack.

You will serve as a trusted technical advisor to enterprise and hyperscale customers, partnering with sales, product, and engineering teams to win and deliver transformational AI infrastructure programs. Your expertise will directly shape how organizations build and operate production AI Factories capable of training frontier models, running large-scale inference fleets, and accelerating data science pipelines at scale.

Solution Design & Architecture
  • Lead discovery workshops to capture AI/ML workload requirements, including model training scale, inference SLAs, data pipeline throughput, and multi-tenancy needs.
  • Architect full-stack AI Factory solutions aligned to NVIDIA reference architectures, integrating colocation, GPU compute, networking, storage, and software layers.
  • Develop detailed Bills of Materials (BOMs), rack elevation diagrams, network topology drawings, and power/cooling budgets for customer proposals.
  • Define GPU cluster architectures using NVIDIA DGX, HGX, and MGX systems with B200, B300, and GB300 Blackwell SXM and NVLink-Switch configurations.
  • Design RTX PRO 6000 Blackwell Server Edition deployments for inference-optimized and enterprise AI workloads.
  • Conduct workload sizing and TCO/ROI modeling to validate infrastructure dimensioning for training, fine-tuning, and inference at scale.
Colocation & Facility Planning
  • Specify colocation requirements including critical power load (MW-scale), UPS and generator configurations, and PUE targets.
  • Design high-density GPU deployments utilizing air-cooled, direct liquid cooling (DLC), and rear-door heat exchanger configurations.
  • Define meet-me room (MMR) and cross-connect requirements; specify carrier-neutral telecom diversity strategies.
  • Engage colocation providers and data center operators to validate capacity availability and negotiate technical SLAs.
  • Coordinate with facilities and MEP engineers to validate power infrastructure from utility feed through PDU to rack level.
GPU Compute Infrastructure
  • Architect multi-node GPU clusters optimized for large language model (LLM) pre-training, fine-tuning, and reinforcement learning from human feedback (RLHF).
  • Size and configure DGX SuperPOD, HGX H/B-series, and MGX modular systems based on model parameter count, dataset size, and iteration timelines.
  • Define server firmware, BIOS, BMC, and DGXOS baselines for production GPU infrastructure.
  • Establish GPU health monitoring, RAS (Reliability, Availability, Serviceability) policies, and lifecycle management procedures.
High-Performance Networking
  • Design backend GPU fabric networks using NVIDIA Quantum InfiniBand (NDR 400Gb/s and HDR 200Gb/s) for distributed training traffic.
  • Architect Spectrum-X Ethernet-based AI networking solutions for inference clusters requiring high-bandwidth, low-latency connectivity.
  • Specify ConnectX-8/7 HCA deployments and configure RDMA over Converged Ethernet (RoCEv2) or InfiniBand transport for NCCL collective operations.
  • Integrate BlueField-3 DPUs for GPU-accelerated network functions, storage offload, zero-trust security isolation, and bare-metal provisioning.
  • Design leaf-spine and fat-tree topologies for non-blocking bisection bandwidth in GPU training clusters.
  • Define Quality of Service (QoS) policies separating storage, compute fabric, and management plane traffic.
Parallel Storage Architecture
  • Design high-performance parallel file system solutions using VAST Data, Hammerspace, and Pure Storage FlashBlade//E for AI training and checkpoint storage.
  • Size storage capacity, IOPS, and throughput based on dataset characteristics, checkpoint frequency, and concurrent reader/writer counts.
  • Architect multi-tier storage hierarchies: hot NVMe flash (VAST/FlashBlade) for active datasets, warm object storage for model archives, and cold tape/cloud for long-term retention.
  • Configure VAST Data Universal Storage for disaggregated storage with NFS, S3, and POSIX access; tune for large sequential read performance.
  • Deploy Hammerspace Global Data Environment for distributed data management and NFS-over-RDMA acceleration across geographically dispersed GPU clusters.
  • Define data pipeline architectures ingesting from cloud object stores (S3, GCS, ABS) to local flash for GPU-local data loading without I/O bottlenecks.
AI Software Stack & Orchestration
  • Deploy and configure NVIDIA AI Enterprise (NVAIE) software stack including NVIDIA GPU Operator, NIM microservices, and RAPIDS accelerated data science libraries.
  • Architect inference serving infrastructure using NVIDIA NIM (NVIDIA Inference Microservices) for optimized LLM and vision model deployment with autoscaling.
  • Implement NVIDIA Dynamo for distributed inference and disaggregated serving of large-scale generative AI models.
  • Configure and optimize CUDA toolkit, cuDNN, NCCL communication libraries, and custom kernel environments for training workloads.
  • Deploy Base Command Manager and DGXOS for cluster lifecycle management, node provisioning, health dashboards, and job scheduling integration.
  • Integrate NVIDIA Mission Control for AI Factory operations, observability, and multi-cluster fleet management.
  • Design and deploy Kubernetes-based AI platforms using NVIDIA GPU Operator, integrating with Run:ai for dynamic GPU resource scheduling and multi-tenant workload isolation.
  • Configure SLURM workload manager for traditional HPC-style job scheduling on bare-metal GPU clusters, including preemption policies, fair-share scheduling, and burst-to-cloud integration.
  • Establish MLOps toolchain integrations with popular frameworks (PyTorch, JAX, TensorFlow) and experiment tracking platforms (MLflow, Weights & Biases).
Customer Engagement & Delivery
  • Serve as primary technical point of contact throughout the pre-sales and delivery lifecycle, from initial discovery through post-deployment optimization.
  • Produce and present architecture design documents, technical proposals, and executive-level briefings to CTO/CIO and VP-level stakeholders.
  • Lead proof-of-concept (POC) and pilot deployments, including benchmark design, execution, and results analysis.
  • Collaborate with procurement, logistics, and deployment teams to ensure on-time delivery of complex infrastructure programs.
  • Provide post-deployment hypercare support, performance tuning, and capacity planning advisory services.
  • Contribute to internal knowledge bases, solution playbooks, and reference architectures for repeatable AI Factory deployments.
Technology Stack

Candidates must demonstrate deep, hands-on expertise across the following technology domains:

GPU Compute - DGX B200 / B300, DGX H100 / H200, HGX B200 / B300, HGX H100 / H200, MGX platforms, GB300 NVL72 / GB200 NVL72, RTX PRO 6000 Blackwell Server Edition, NVLink Switch System, NVLink-C2C

Networking - NVIDIA Quantum InfiniBand (NDR 400G, HDR 200G), Spectrum-X Ethernet, ConnectX-8 / ConnectX-7 HCAs, BlueField-3 DPU, SHARP in-network computing, UFM Fabric Manager, RDMA / RoCEv2 / InfiniBand

Storage - VAST Data Universal Storage (NFS/S3/POSIX), Hammerspace Global Data Environment, Pure Storage FlashBlade//E (Evergreen//One), NFS-over-RDMA, parallel file systems (Lustre, GPFS, WEKA), S3-compatible object storage

AI Software - NVIDIA AI Enterprise (NVAIE), NIM Microservices, RAPIDS (cuDF, cuML, cuGraph), NVIDIA Dynamo, CUDA Toolkit, cuDNN, NCCL, TensorRT, Triton Inference Server

Cluster Mgmt - Base Command Manager, DGXOS, NVIDIA Mission Control, DGX Cloud, UFM, IPMI / Redfish BMC management

Orchestration - Kubernetes (K8

Vacancy posted 1 day ago