Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Infrastructure Architect - AI & Data Center

Trilyon, Inc.

Title: Infrastructure Architect (AI & Data Center)

Location: San Jose, CA, onsite Work

Duration: 6+ Months

Job Description:

Role Overview


We are looking for a Principal Infrastructure Architect to join our IT PMO organization to take responsibility and lead the design, orchestration, and lifecycle management of our next-generation GPU Farm and AI Factory environments. This role is unique in its breadth, requiring a deep understanding of high-performance AI compute stacks alongside the disciplined management of physical data center assets and their long-term operational health. You will bridge the gap between R&D engineering requirements and the physical realities of global data center operations.

Key Responsibilities

1. AI & GPU Infrastructure Design (GPU Farm / AI Factory)

  • Lead the architectural design and refinement of the Nutanix GPU-as-a-Service (GPUaaS) platform, ensuring a seamless experience for internal R&D, QA, and Sales teams.
  • Provide technical leadership in some of the key initiatives such as Nutanix Validated Designs (NVD) for the AI Factory, incorporating NVIDIA MGX/HGX architectures and high-density Cisco nodes (e.g., UCS 845A).
  • Architect the Management Cluster control plane (NKP, Prism Central, NuDeploy) to ensure it is decoupled from GPU compute nodes for maximum efficiency.
  • Implement policy-driven placement of workloads across on-prem and cloud-burst environments.
2. Data Center Asset & Lifecycle Management
  • Design solution for a centralized Data Center Asset Inventory system, ensuring real-time visibility into all hardware assets, including CPUs, GPUs, Virtual Machines, and networking.
  • Develop a comprehensive Hardware Lifecycle Management strategy, including procurement forecasting, "rack and stack" operationalization, and decommissioning of legacy systems (G3/G4/G5).
  • Lead "Tiger Team" initiatives to navigate supply chain constraints, ensuring critical release milestones are not delayed by hardware shortages.
  • Enforce strict Security Standards for Data Center HW Provisioning.
  • Implement network segmentation for all the critical applications.
  • Ensure all infrastructure meets SOC 2 and ISO 27001 compliance objectives while maintaining low-latency performance.
3. Special Projects
  • Provide required architecture and designs during the project intake process. Review, guide the teams for right architecture for all demands before they become approved projects.
  • Partner with security team and provide guidelines for upcoming projects.
  • Involve and lead projects as an architect on special projects.
Required Qualifications
  • Bachelor's degree in Information Technology, Business, or a related field
  • 5+ years of experience in Data Center projects in an enterprise environment
  • Knowledge of Cisco, Dell, HPE, Supermicro hardware.
  • Hardware Expertise: Deep knowledge of Cisco HW, NVIDIA GPU architectures (H100, B200, RTX 6000 Pro) and high-speed interconnects (RoCE v2, InfiniBand).
  • Infrastructure Mastery: Extensive knowledge and experience with Data Center infrastructure.
  • Management Tools: Proficiency with asset management and automation tools (Netbox, ServiceNow, Terraform, or OpenTofu).
  • Lifecycle Mgmt & Capacity Planning: Experience in Data Center lifecycle mgmt, DC HW capacity planning, decommissioning, defragmentation, building complex financial showback models for shared infrastructure.
  • AI/ML Ops: Proven expertise in Kubernetes (NKP preferred) and NVIDIA AI Enterprise stacks (GPU Operator, DCGM, Triton, vLLM).
Preferred Qualifications
  • Experience managing (as an architect) massive-scale data center environments (1,000+ nodes).
  • Knowledge of Nutanix Cloud Infrastructure (NCI), AHV, and Prism Central
  • Strong background in MLOps and automated pipeline integration (Kubeflow/MLflow).



Mayank Prakash

Recruitment Lead


P:


View phone number on click.appcast.io


E:


View email address on click.appcast.io


Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Infrastructure Architect - AI & Data Center in San Jose, CA vacancy
  • NVIDIA Corporation is seeking a Senior Network Solution Architect for their AI Fabrics team in Santa Clara, CA. This role involves partnering with customers on large data center GPU and networking deployments. Candidates should have a strong background in network engineering... 
    Suggested
    Remote work

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  •  ...Distinguished Technologist to lead architecture and solution design for AI/ML and data center networking opportunities. This role demands over 20 years of technical experience in network infrastructure and requires strong problem-solving capabilities. The ideal candidate... 
    Suggested

    Hewlett Packard Enterprise Development LP

    San Jose, CA
    2 days ago
  • $184k - $287.5k

     ...an experienced Network Solutions Architect Engineer to help bring our next-generation AI networking platforms into production at customer data centers. Do you want to be part of a team...  ...up of server, network, and cluster infrastructure in customer data centers. Demonstrate... 
    Suggested
    Remote work

    NVIDIA

    Santa Clara, CA
    12 days ago
  • A Silicon Valley-based AI infrastructure company is seeking a highly motivated Network Simulation Engineer to lead the simulation of AI communication workloads across diverse data center network topologies. The ideal candidate will have a strong background in network simulation... 
    Suggested

    Eridu AI

    Saratoga, CA
    4 days ago
  •  ...that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture...  ...advance your career. THE ROLE We are seeking a SoC Micro Architect to define and drive architecture for Adaptive SoC and FPGA... 
    Suggested

    Advanced Micro Devices , Inc.

    San Jose, CA
    1 day ago
  • A leading automotive company in Sunnyvale is seeking a Principal AI/ML Engineer to guide their ML Infra team. This leadership role focuses on scaling infrastructure for machine learning, mentorship, and driving large-scale initiatives. Ideal candidates will have over 10... 

    General Motors

    Sunnyvale, CA
    1 day ago
  • $188.3k - $269.28k

    A leading precision timing company is seeking a Networking System Architect to focus on datacenter, AI, and 5G applications. In this senior role, you will foster technical relationships with customers, lead architectural discussions, and influence strategies in cutting... 

    SiTime

    Santa Clara, CA
    4 days ago
  •  ...Simulation Engineer to lead the simulation and analysis of AI communication workloads across various data center network topologies. The ideal candidate will have...  ..., engaging with a world-class team and impacting the future of AI infrastructure. #J-18808-Ljbffr Eridu Corporation

    Eridu Corporation

    Saratoga, CA
    4 days ago
  • A leading technology company is seeking a Systems Architect to design and advance data center infrastructure solutions. The role involves collaboration with cross-functional teams and driving technical investigations focused on network topology and electrical systems.... 

    Apple Inc.

    Cupertino, CA
    2 days ago
  • $133k - $247k

     ...Infrastructure Architect At Cadence, we hire and develop leaders and innovators who want to make an impact on the world of technology....  ...infrastructure integration plans for acquired entities, including data center consolidation, network integration, and systems alignment.... 

    Cadence Inc

    San Jose, CA
    1 day ago
  •  ...Network Architect Remote – USA / Onsite – Santa Clara. This is...  ...interconnects for GPU-accelerated data centers and compute clusters....  ...connects and fabric for HPC, AI, and GPU computing clusters....  ...or other languages used in infrastructure automation. SME in networking... 
    Remote work

    Omni Inclusive

    Santa Clara, CA
    1 day ago
  •  ...Network Cluster Architect - Data Center Infrastructure Work Locations (2) Submit Resume The Data Center Hardware Engineering team is responsible for...  ...compute solutions that power Apple's services and AI/ML workloads. We are seeking experienced Systems Architects... 

    Apple

    Cupertino, CA
    16 hours ago
  • $124k - $195.5k

     ...in high performance networking infrastructure for many years. The next unit...  ...looking for you - a Networking Architect, to develop the next generation of network for AI. What you’ll be doing:...  ...next generation of accelerated data centers. It spans over various layers... 

    NVIDIA

    Santa Clara, CA
    1 day ago
  •  ...Infrastructure Architect Location: Milpitas, CA The Company: FireEye is the intelligence-led security company. Working as a seamless,...  ...infrastructure (FireEye globally) design, systems analysis, security, data center operation, compute, storage, network & voice communication... 
    Temporary work
    Work at office
    Flexible hours
    Night shift

    Netpace

    Milpitas, CA
    1 day ago
  • $119.4k - $139.7k

    The CCIE Network Architect is responsible for overseeing the delivery and integrity...  ...the physical and virtual infrastructure within the data center environment in San Jose, CA. The architect...  ...with cross-functional teams to support AI, machine learning, and cloud workloads... 
    Permanent employment
    Temporary work
    Work experience placement
    Local area
    Shift work

    Randstad

    San Jose, CA
    4 days ago
  • $184k - $287.5k

     ...looking for a brilliant Software & Systems Architect to join the NIC Software/Firmware...  ...Be part of a team crafting the upcoming data-center DPU generation with hardware and software...  ...wide range of topics, including Generative AI (inference and training), storage, cyber... 

    NVIDIA

    Santa Clara, CA
    2 days ago
  • A leading staffing company is seeking an experienced Infrastructure Engineer/Solutions Architect for a 3+ month contract based onsite in Milpitas, CA....  ...chance to work in a dynamic environment supporting global data center operations. #J-18808-Ljbffr ManpowerGroup Global, Inc... 
    Contract work

    ManpowerGroup Global, Inc.

    Milpitas, CA
    3 days ago
  • $210k - $265k

    A cutting-edge AI hardware startup is seeking a Performance Modeling Engineer to steer the development of architectural and performance modeling infrastructure. This role focuses on creating high-level performance models that directly influence ASIC designs. Applicants... 

    Eridu AI

    Saratoga, CA
    4 days ago
  • A leading AI hardware company in Sunnyvale is seeking a Network Architect to design robust interconnect architectures for AI clusters. The role demands extensive experience with large scale network designs, troubleshooting distributed systems, and project management. Candidates... 

    Cerebras

    Sunnyvale, CA
    3 days ago
  • $124k - $195.5k

    A leading semiconductor company is seeking a Technical Marketing Manager specializing in Data Center Infrastructure to join their team in Santa Clara, CA. The role involves developing data-driven marketing strategies, managing data center projects, and collaborating with... 

    NVIDIA Corporation

    Santa Clara, CA
    16 hours ago
  • Infrastructure Engineer/Solutions Architect ONSITE IN MILPITAS, CA 3+ month contract Seeking a highly skilled Senior Systems Administrator / Infrastructure...  ...and Rubrik Orchestrator. This role supports global data center engineering, storage systems, backup and DR... 
    Contract work

    ManpowerGroup Global, Inc.

    Milpitas, CA
    3 days ago
  • $272k - $431.25k

     ...into the unlimited potential of AI to define the next era of...  ...and computing! As a Principal Architect on our powerful team in Santa...  ...technical vision for our modern infrastructure while working with...  ...reconciliation pipelines, ensuring data fidelity and compliance across... 

    NVIDIA

    Santa Clara, CA
    4 days ago
  • Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI...  ...hyperscale cloud inference services. About The Role As a Network Architect on the Cluster Architecture Team, you will work closely with the... 

    Cerebras

    Sunnyvale, CA
    3 days ago
  • $100k - $120k

     ...Administrator in Santa Clara to manage and support the company’s network infrastructure across North America. The role requires planning and implementing network architecture, managing data centers, and ensuring secure operations. Candidates should have a Bachelor's... 

    Foxconn Industrial Internet - FII

    Santa Clara, CA
    3 days ago
  •  ...with expertise in Cisco Routing and Switching, and large-scale data center network design. The ideal candidate should have a valid CCIE certification and experience with core networks and cloud infrastructure. Responsibilities include configuring and managing large-scale... 

    E*Pro Inc

    San Jose, CA
    4 days ago
  •  ...software. Due to their expansion, a need for an experienced cloud architect has developed. Qualified candidates for this critical opening...  ...Azure An advanced degree in a technical discipline Knowledge of AI The role is partially remote however, we do need applicants to... 
    Interim role
    Local area
    Remote work

    BluZinc

    San Jose, CA
    2 days ago
  • $100k - $140k

    FII is looking for a Network Administrator in San Jose, CA, to oversee network architecture, data center management, and network optimization initiatives. The ideal candidate will have 5-8 years of experience in network technology, a BA/BS in a relevant field, and strong... 
    Full time

    FII

    San Jose, CA
    2 days ago
  •  ...departments to provide timely support and solutions to network-related problems. Key Responsibilities: Travel to various data center locations across the U.S. to provide technical support and resolve operational issues. Diagnose and troubleshoot network issues... 
    Remote work

    Asiacom Americas Inc.

    San Jose, CA
    16 hours ago
  •  ...integral to the successful delivery of high-profile and complex infrastructure projects for our Fortune 100 hyperscale customers. Working...  ...Lead a variety of projects for our customers across data center, enterprise, and retail space, ensuring all scope is completed... 
    Local area

    DMS

    San Jose, CA
    4 days ago
  • $75 - $85 per hour

     ...Insight Global is looking for a Data Center Network Architect to join one of our largest tech clients in the Bay Area. This person will: •...  ...Compute Farm of builders, packagers, testers, and core infrastructure. • Ensure availability targets are consistently met and... 

    Insight Global

    Santa Clara, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Infrastructure Architect - AI & Data Center. Be the first to apply!