Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Infrastructure Architect - AI & Data Center

Trilyon, Inc.

Title: Infrastructure Architect (AI & Data Center)

Location: San Jose, CA, onsite Work

Duration: 6+ Months

Job Description:

Role Overview


We are looking for a Principal Infrastructure Architect to join our IT PMO organization to take responsibility and lead the design, orchestration, and lifecycle management of our next-generation GPU Farm and AI Factory environments. This role is unique in its breadth, requiring a deep understanding of high-performance AI compute stacks alongside the disciplined management of physical data center assets and their long-term operational health. You will bridge the gap between R&D engineering requirements and the physical realities of global data center operations.

Key Responsibilities

1. AI & GPU Infrastructure Design (GPU Farm / AI Factory)

  • Lead the architectural design and refinement of the Nutanix GPU-as-a-Service (GPUaaS) platform, ensuring a seamless experience for internal R&D, QA, and Sales teams.
  • Provide technical leadership in some of the key initiatives such as Nutanix Validated Designs (NVD) for the AI Factory, incorporating NVIDIA MGX/HGX architectures and high-density Cisco nodes (e.g., UCS 845A).
  • Architect the Management Cluster control plane (NKP, Prism Central, NuDeploy) to ensure it is decoupled from GPU compute nodes for maximum efficiency.
  • Implement policy-driven placement of workloads across on-prem and cloud-burst environments.
2. Data Center Asset & Lifecycle Management
  • Design solution for a centralized Data Center Asset Inventory system, ensuring real-time visibility into all hardware assets, including CPUs, GPUs, Virtual Machines, and networking.
  • Develop a comprehensive Hardware Lifecycle Management strategy, including procurement forecasting, "rack and stack" operationalization, and decommissioning of legacy systems (G3/G4/G5).
  • Lead "Tiger Team" initiatives to navigate supply chain constraints, ensuring critical release milestones are not delayed by hardware shortages.
  • Enforce strict Security Standards for Data Center HW Provisioning.
  • Implement network segmentation for all the critical applications.
  • Ensure all infrastructure meets SOC 2 and ISO 27001 compliance objectives while maintaining low-latency performance.
3. Special Projects
  • Provide required architecture and designs during the project intake process. Review, guide the teams for right architecture for all demands before they become approved projects.
  • Partner with security team and provide guidelines for upcoming projects.
  • Involve and lead projects as an architect on special projects.
Required Qualifications
  • Bachelor's degree in Information Technology, Business, or a related field
  • 5+ years of experience in Data Center projects in an enterprise environment
  • Knowledge of Cisco, Dell, HPE, Supermicro hardware.
  • Hardware Expertise: Deep knowledge of Cisco HW, NVIDIA GPU architectures (H100, B200, RTX 6000 Pro) and high-speed interconnects (RoCE v2, InfiniBand).
  • Infrastructure Mastery: Extensive knowledge and experience with Data Center infrastructure.
  • Management Tools: Proficiency with asset management and automation tools (Netbox, ServiceNow, Terraform, or OpenTofu).
  • Lifecycle Mgmt & Capacity Planning: Experience in Data Center lifecycle mgmt, DC HW capacity planning, decommissioning, defragmentation, building complex financial showback models for shared infrastructure.
  • AI/ML Ops: Proven expertise in Kubernetes (NKP preferred) and NVIDIA AI Enterprise stacks (GPU Operator, DCGM, Triton, vLLM).
Preferred Qualifications
  • Experience managing (as an architect) massive-scale data center environments (1,000+ nodes).
  • Knowledge of Nutanix Cloud Infrastructure (NCI), AHV, and Prism Central
  • Strong background in MLOps and automated pipeline integration (Kubeflow/MLflow).



Mayank Prakash

Recruitment Lead


P:


View phone number on click.appcast.io


E:


View email address on click.appcast.io


Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Infrastructure Architect - AI & Data Center in San Jose, CA vacancy
  • $124k - $195.5k

    NVIDIA Gruppe in Santa Clara is seeking a Networking Architect to spearhead the development of innovative networking solutions for AI-driven data centers. You will engage in R&D, collaborating with internal teams and partners to initiate groundbreaking projects. The ideal... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • Advanced Micro Devices is seeking a System Architecture Fellow - AI & Data Center Networking to define and drive system architecture for next-generation platforms. The ideal candidate will possess deep expertise in hardware architecture and have a proven track record in... 
    Suggested

    Advanced Micro Devices

    Santa Clara, CA
    4 days ago
  • NVIDIA Gruppe seeks a hands-on Solutions Architect in Santa Clara, CA, to design scalable...  ...experience in Solution Architecture or Infrastructure Engineering. You will work alongside product...  ...and engineering teams, focusing on AI technologies. Candidates should be familiar... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • NVIDIA Gruppe in Santa Clara is seeking a Principal AI/ML Engineer to lead the development of automated network platforms crucial...  ...experience in network engineering and a solid background in AI/ML infrastructure. Join us in redefining the future of computing! #J-18808-... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • A Silicon Valley-based AI infrastructure company is seeking a highly motivated Network Simulation Engineer to lead the simulation of AI communication workloads across diverse data center network topologies. The ideal candidate will have a strong background in network simulation... 
    Suggested

    Eridu AI

    Saratoga, CA
    2 days ago
  • $184k - $287.5k

    NVIDIA Gruppe in Santa Clara, California, is seeking a skilled Networking Solutions Architect to lead the design and implementation of cutting-edge data center networks. Your role will involve working closely with clients to develop tailored solutions using your extensive... 

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

    NVIDIA Gruppe is seeking an experienced Solutions Architect in Santa Clara to support accelerated computing networking solutions for AI/ML and HPC. You will develop and demonstrate solutions with major tech companies while addressing customer needs and performance issues... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  •  ...Simulation Engineer to lead the simulation and analysis of AI communication workloads across various data center network topologies. The ideal candidate will have...  ..., engaging with a world-class team and impacting the future of AI infrastructure. #J-18808-Ljbffr Eridu Corporation

    Eridu Corporation

    Saratoga, CA
    2 days ago
  • $164.47k - $269.1k

     ...focused on defining the future of high-performance networking silicon. Our team architects next-generation networking solutions that enable hyperscale data centers, cloud infrastructure, and AI workloads to achieve unprecedented performance and efficiency. We specialize... 
    Local area
    Immediate start
    Shift work

    Intel

    San Jose, CA
    4 days ago
  • $203.2k - $286.87k

     ...Description: As a Network Platform Architect, you will be at the...  ...secure and efficient infrastructure to meet the evolving needs of...  ...customer/OEM expectations, and data center operational needs. Your expertise...  ...technologies, analytics, AI, data centers, and more,... 
    Live in
    Local area
    Immediate start
    Shift work

    Intel

    Santa Clara, CA
    6 days ago
  •  ...Infrastructure Architect Location: Milpitas, CA The Company: FireEye is the intelligence-led security company. Working as a seamless,...  ...infrastructure (FireEye globally) design, systems analysis, security, data center operation, compute, storage, network & voice communication... 
    Temporary work
    Work at office
    Flexible hours
    Night shift

    Netpace

    Milpitas, CA
    4 days ago
  • $265k - $310k

     ...Principal Network Architect Crusoe is on a mission to accelerate the abundance of...  ...intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we...  ...experts across energy, manufacturing, data center construction, and cloud services.... 
    Temporary work

    G2 Venture Partners

    Sunnyvale, CA
    1 day ago
  • $133k - $247k

     ...Infrastructure Architect At Cadence, we hire and develop leaders and innovators who want to make an impact on the world of technology....  ...infrastructure integration plans for acquired entities, including data center consolidation, network integration, and systems alignment.... 

    Cadence Inc

    San Jose, CA
    14 days ago
  •  ...# Network Architect Location - Santa Clara. This is...  ...interconnects for GPU-accelerated data centers and compute clusters....  ...connects and fabric for HPC, AI, and GPU computing clusters....  ...or other languages used in infrastructure automation. SME in networking... 

    Tranzeal

    Santa Clara, CA
    5 days ago
  • A leading staffing company is seeking an experienced Infrastructure Engineer/Solutions Architect for a 3+ month contract based onsite in Milpitas, CA....  ...chance to work in a dynamic environment supporting global data center operations. #J-18808-Ljbffr ManpowerGroup Global, Inc... 
    Contract work

    ManpowerGroup Global, Inc.

    Milpitas, CA
    1 day ago
  • $124k - $195.5k

     ...in high performance networking infrastructure for many years. The next unit...  ...looking for you - a Networking Architect, to develop the next generation of network for AI. What you’ll be doing: This...  ...next generation of accelerated data centers. It spans over various layers... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • NVIDIA Gruppe in Santa Clara is seeking a Software Architect to define system and software architecture for cutting-edge AI networks. The ideal candidate should have 8+ years of experience in software architecture and strong networking expertise. This role includes collaboration... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $210k - $265k

    A cutting-edge AI hardware startup is seeking a Performance Modeling Engineer to steer the development of architectural and performance modeling infrastructure. This role focuses on creating high-level performance models that directly influence ASIC designs. Applicants... 

    Eridu AI

    Saratoga, CA
    2 days ago
  • NVIDIA Gruppe is seeking a Principal Software Engineer to lead the transformation of AI networking systems. Your deep expertise will guide strategic deployments and influence NVIDIA's networking technologies. This role involves engaging with customers and internal teams... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • A leading AI hardware company in Sunnyvale is seeking a Network Architect to design robust interconnect architectures for AI clusters. The role demands extensive experience with large scale network designs, troubleshooting distributed systems, and project management. Candidates... 

    Cerebras

    Sunnyvale, CA
    1 day ago
  •  ...Infrastructure Engineer/Solutions Architect Onsite in Milpitas, CA 3+ month contract Seeking a highly skilled Senior Systems Administrator /...  ...Zerto and Rubrik Orchestrator. This role supports global data center engineering, storage systems, backup and DR... 
    Contract work

    Experis

    Milpitas, CA
    3 days ago
  •  ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel...  .... About The Role As a Network Architect on the Cluster Architecture Team, you will...  ...or its third-party tools process personal data. For more details, click here to review our... 

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    2 days ago
  •  ...Greetings from Rootshell Inc, Role: Network Architect Location: Sunnyvale, CA Duration: Long...  ...network software applications used in the management of data center network infrastructure. Skills: • Experience with Switching, Routing,... 
    Night shift

    Rootshell Enterprise Technologies

    Santa Clara, CA
    5 days ago
  • $100k - $120k

     ...Administrator in Santa Clara to manage and support the company’s network infrastructure across North America. The role requires planning and implementing network architecture, managing data centers, and ensuring secure operations. Candidates should have a Bachelor's... 

    Foxconn Industrial Internet - FII

    Santa Clara, CA
    1 day ago
  • $140k - $197k

    Arista Networks is looking for a Senior Systems Engineer in Santa Clara, California. This role requires extensive customer-facing technical sales experience, collaborating with Account teams to provide pre-sales engineering support. Ideal candidates will have 7+ years in...

    Arista Networks

    Santa Clara, CA
    3 days ago
  •  ...departments to provide timely support and solutions to network-related problems. Key Responsibilities: Travel to various data center locations across the U.S. to provide technical support and resolve operational issues. Diagnose and troubleshoot network... 
    Full time
    Remote work

    Asiacom Americas

    Santa Clara, CA
    5 days ago
  •  ...Senior Networking Technician to support high-profile and complex infrastructure projects for Fortune 100 hyperscale customers. This role is...  ...exceptional execution across enterprise, retail, and data center environments. This position requires extensive travel, strong... 
    Local area

    HireSparks AV Recruiting

    San Jose, CA
    2 days ago
  •  ...Technical Staff-Network Architect From applied research to advanced engineering, the Engineering...  ...of next-generation large-scale AI Infrastructure to include acceleratedcompute, AI Fabric...  ..., network-interfacecards (NIC), Data Processing Units (DPU) and AI Fabrics-... 

    Dell Careers

    Santa Clara, CA
    3 days ago
  • $184k - $356.5k

    NVIDIA Gruppe, based in Santa Clara, is seeking a highly experienced Sr. Solutions Architect specializing in embedded software engineering to support innovative networking technologies. This role offers significant agency and the opportunity to work closely with both customers... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $224k - $431.25k

    NVIDIA Gruppe in Santa Clara is seeking experienced networking engineers to join the Solutions Architecture team. This role focuses on integrating cutting-edge NVIDIA networking products and requires both technical proficiency and strong customer-facing skills. The ideal...

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Infrastructure Architect - AI & Data Center. Be the first to apply!