Infrastructure Architect - AI & Data Center
Trilyon, Inc.
Title: Infrastructure Architect (AI & Data Center) Location: San Jose, CA, onsite Work Duration: 6+ Months Job Description: Role Overview
We are looking for a Principal Infrastructure Architect to join our IT PMO organization to take responsibility and lead the design, orchestration, and lifecycle management of our next-generation GPU Farm and AI Factory environments. This role is unique in its breadth, requiring a deep understanding of high-performance AI compute stacks alongside the disciplined management of physical data center assets and their long-term operational health. You will bridge the gap between R&D engineering requirements and the physical realities of global data center operations. Key Responsibilities 1. AI & GPU Infrastructure Design (GPU Farm / AI Factory)
Mayank Prakash
Recruitment Lead
P:
View phone number on click.appcast.io
E:
View email address on click.appcast.io
We are looking for a Principal Infrastructure Architect to join our IT PMO organization to take responsibility and lead the design, orchestration, and lifecycle management of our next-generation GPU Farm and AI Factory environments. This role is unique in its breadth, requiring a deep understanding of high-performance AI compute stacks alongside the disciplined management of physical data center assets and their long-term operational health. You will bridge the gap between R&D engineering requirements and the physical realities of global data center operations. Key Responsibilities 1. AI & GPU Infrastructure Design (GPU Farm / AI Factory)
- Lead the architectural design and refinement of the Nutanix GPU-as-a-Service (GPUaaS) platform, ensuring a seamless experience for internal R&D, QA, and Sales teams.
- Provide technical leadership in some of the key initiatives such as Nutanix Validated Designs (NVD) for the AI Factory, incorporating NVIDIA MGX/HGX architectures and high-density Cisco nodes (e.g., UCS 845A).
- Architect the Management Cluster control plane (NKP, Prism Central, NuDeploy) to ensure it is decoupled from GPU compute nodes for maximum efficiency.
- Implement policy-driven placement of workloads across on-prem and cloud-burst environments.
- Design solution for a centralized Data Center Asset Inventory system, ensuring real-time visibility into all hardware assets, including CPUs, GPUs, Virtual Machines, and networking.
- Develop a comprehensive Hardware Lifecycle Management strategy, including procurement forecasting, "rack and stack" operationalization, and decommissioning of legacy systems (G3/G4/G5).
- Lead "Tiger Team" initiatives to navigate supply chain constraints, ensuring critical release milestones are not delayed by hardware shortages.
- Enforce strict Security Standards for Data Center HW Provisioning.
- Implement network segmentation for all the critical applications.
- Ensure all infrastructure meets SOC 2 and ISO 27001 compliance objectives while maintaining low-latency performance.
- Provide required architecture and designs during the project intake process. Review, guide the teams for right architecture for all demands before they become approved projects.
- Partner with security team and provide guidelines for upcoming projects.
- Involve and lead projects as an architect on special projects.
- Bachelor's degree in Information Technology, Business, or a related field
- 5+ years of experience in Data Center projects in an enterprise environment
- Knowledge of Cisco, Dell, HPE, Supermicro hardware.
- Hardware Expertise: Deep knowledge of Cisco HW, NVIDIA GPU architectures (H100, B200, RTX 6000 Pro) and high-speed interconnects (RoCE v2, InfiniBand).
- Infrastructure Mastery: Extensive knowledge and experience with Data Center infrastructure.
- Management Tools: Proficiency with asset management and automation tools (Netbox, ServiceNow, Terraform, or OpenTofu).
- Lifecycle Mgmt & Capacity Planning: Experience in Data Center lifecycle mgmt, DC HW capacity planning, decommissioning, defragmentation, building complex financial showback models for shared infrastructure.
- AI/ML Ops: Proven expertise in Kubernetes (NKP preferred) and NVIDIA AI Enterprise stacks (GPU Operator, DCGM, Triton, vLLM).
- Experience managing (as an architect) massive-scale data center environments (1,000+ nodes).
- Knowledge of Nutanix Cloud Infrastructure (NCI), AHV, and Prism Central
- Strong background in MLOps and automated pipeline integration (Kubeflow/MLflow).
Mayank Prakash
Recruitment Lead
P:
View phone number on click.appcast.io
E:
View email address on click.appcast.io
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Infrastructure Architect - AI & Data Center in San Jose, CA vacancy
- NVIDIA Corporation is seeking a Senior Network Solution Architect for their AI Fabrics team in Santa Clara, CA. This role involves partnering with customers on large data center GPU and networking deployments. Candidates should have a strong background in network engineering...SuggestedRemote work
- ...Distinguished Technologist to lead architecture and solution design for AI/ML and data center networking opportunities. This role demands over 20 years of technical experience in network infrastructure and requires strong problem-solving capabilities. The ideal candidate...Suggested
$184k - $287.5k
...an experienced Network Solutions Architect Engineer to help bring our next-generation AI networking platforms into production at customer data centers. Do you want to be part of a team... ...up of server, network, and cluster infrastructure in customer data centers. Demonstrate...SuggestedRemote work- A Silicon Valley-based AI infrastructure company is seeking a highly motivated Network Simulation Engineer to lead the simulation of AI communication workloads across diverse data center network topologies. The ideal candidate will have a strong background in network simulation...Suggested
- ...that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture... ...advance your career. THE ROLE We are seeking a SoC Micro Architect to define and drive architecture for Adaptive SoC and FPGA...Suggested
- A leading automotive company in Sunnyvale is seeking a Principal AI/ML Engineer to guide their ML Infra team. This leadership role focuses on scaling infrastructure for machine learning, mentorship, and driving large-scale initiatives. Ideal candidates will have over 10...
$188.3k - $269.28k
A leading precision timing company is seeking a Networking System Architect to focus on datacenter, AI, and 5G applications. In this senior role, you will foster technical relationships with customers, lead architectural discussions, and influence strategies in cutting...- ...Simulation Engineer to lead the simulation and analysis of AI communication workloads across various data center network topologies. The ideal candidate will have... ..., engaging with a world-class team and impacting the future of AI infrastructure. #J-18808-Ljbffr Eridu Corporation
- A leading technology company is seeking a Systems Architect to design and advance data center infrastructure solutions. The role involves collaboration with cross-functional teams and driving technical investigations focused on network topology and electrical systems....
$133k - $247k
...Infrastructure Architect At Cadence, we hire and develop leaders and innovators who want to make an impact on the world of technology.... ...infrastructure integration plans for acquired entities, including data center consolidation, network integration, and systems alignment....- ...Network Architect Remote – USA / Onsite – Santa Clara. This is... ...interconnects for GPU-accelerated data centers and compute clusters.... ...connects and fabric for HPC, AI, and GPU computing clusters.... ...or other languages used in infrastructure automation. SME in networking...Remote work
- ...Network Cluster Architect - Data Center Infrastructure Work Locations (2) Submit Resume The Data Center Hardware Engineering team is responsible for... ...compute solutions that power Apple's services and AI/ML workloads. We are seeking experienced Systems Architects...
$124k - $195.5k
...in high performance networking infrastructure for many years. The next unit... ...looking for you - a Networking Architect, to develop the next generation of network for AI. What you’ll be doing:... ...next generation of accelerated data centers. It spans over various layers...- ...Infrastructure Architect Location: Milpitas, CA The Company: FireEye is the intelligence-led security company. Working as a seamless,... ...infrastructure (FireEye globally) design, systems analysis, security, data center operation, compute, storage, network & voice communication...Temporary workWork at officeFlexible hoursNight shift
$119.4k - $139.7k
The CCIE Network Architect is responsible for overseeing the delivery and integrity... ...the physical and virtual infrastructure within the data center environment in San Jose, CA. The architect... ...with cross-functional teams to support AI, machine learning, and cloud workloads...Permanent employmentTemporary workWork experience placementLocal areaShift work$184k - $287.5k
...looking for a brilliant Software & Systems Architect to join the NIC Software/Firmware... ...Be part of a team crafting the upcoming data-center DPU generation with hardware and software... ...wide range of topics, including Generative AI (inference and training), storage, cyber...- A leading staffing company is seeking an experienced Infrastructure Engineer/Solutions Architect for a 3+ month contract based onsite in Milpitas, CA.... ...chance to work in a dynamic environment supporting global data center operations. #J-18808-Ljbffr ManpowerGroup Global, Inc...Contract work
$210k - $265k
A cutting-edge AI hardware startup is seeking a Performance Modeling Engineer to steer the development of architectural and performance modeling infrastructure. This role focuses on creating high-level performance models that directly influence ASIC designs. Applicants...- A leading AI hardware company in Sunnyvale is seeking a Network Architect to design robust interconnect architectures for AI clusters. The role demands extensive experience with large scale network designs, troubleshooting distributed systems, and project management. Candidates...
$124k - $195.5k
A leading semiconductor company is seeking a Technical Marketing Manager specializing in Data Center Infrastructure to join their team in Santa Clara, CA. The role involves developing data-driven marketing strategies, managing data center projects, and collaborating with...- Infrastructure Engineer/Solutions Architect ONSITE IN MILPITAS, CA 3+ month contract Seeking a highly skilled Senior Systems Administrator / Infrastructure... ...and Rubrik Orchestrator. This role supports global data center engineering, storage systems, backup and DR...Contract work
$272k - $431.25k
...into the unlimited potential of AI to define the next era of... ...and computing! As a Principal Architect on our powerful team in Santa... ...technical vision for our modern infrastructure while working with... ...reconciliation pipelines, ensuring data fidelity and compliance across...- Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI... ...hyperscale cloud inference services. About The Role As a Network Architect on the Cluster Architecture Team, you will work closely with the...
$100k - $120k
...Administrator in Santa Clara to manage and support the company’s network infrastructure across North America. The role requires planning and implementing network architecture, managing data centers, and ensuring secure operations. Candidates should have a Bachelor's...- ...with expertise in Cisco Routing and Switching, and large-scale data center network design. The ideal candidate should have a valid CCIE certification and experience with core networks and cloud infrastructure. Responsibilities include configuring and managing large-scale...
- ...software. Due to their expansion, a need for an experienced cloud architect has developed. Qualified candidates for this critical opening... ...Azure An advanced degree in a technical discipline Knowledge of AI The role is partially remote however, we do need applicants to...Interim roleLocal areaRemote work
$100k - $140k
FII is looking for a Network Administrator in San Jose, CA, to oversee network architecture, data center management, and network optimization initiatives. The ideal candidate will have 5-8 years of experience in network technology, a BA/BS in a relevant field, and strong...Full time- ...departments to provide timely support and solutions to network-related problems. Key Responsibilities: Travel to various data center locations across the U.S. to provide technical support and resolve operational issues. Diagnose and troubleshoot network issues...Remote work
- ...integral to the successful delivery of high-profile and complex infrastructure projects for our Fortune 100 hyperscale customers. Working... ...Lead a variety of projects for our customers across data center, enterprise, and retail space, ensuring all scope is completed...Local area
$75 - $85 per hour
...Insight Global is looking for a Data Center Network Architect to join one of our largest tech clients in the Bay Area. This person will: •... ...Compute Farm of builders, packagers, testers, and core infrastructure. • Ensure availability targets are consistently met and...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Infrastructure Architect - AI & Data Center. Be the first to apply!
Related searches
- infrastructure architect San Jose, CA
- network architect San Jose, CA
- clinical data San Jose, CA
- master data coordinator San Jose, CA
- clinical data coordinator remote San Jose, CA
- data intern San Jose, CA
- data cabling installation San Jose, CA
- data collection researcher San Jose, CA
- data technician San Jose, CA
- data mining San Jose, CA

