Infrastructure Architect - AI & Data Center
Trilyon, Inc.
Title: Infrastructure Architect (AI & Data Center) Location: San Jose, CA, onsite Work Duration: 6+ Months Job Description: Role Overview
We are looking for a Principal Infrastructure Architect to join our IT PMO organization to take responsibility and lead the design, orchestration, and lifecycle management of our next-generation GPU Farm and AI Factory environments. This role is unique in its breadth, requiring a deep understanding of high-performance AI compute stacks alongside the disciplined management of physical data center assets and their long-term operational health. You will bridge the gap between R&D engineering requirements and the physical realities of global data center operations. Key Responsibilities 1. AI & GPU Infrastructure Design (GPU Farm / AI Factory)
Mayank Prakash
Recruitment Lead
P:
View phone number on click.appcast.io
E:
View email address on click.appcast.io
We are looking for a Principal Infrastructure Architect to join our IT PMO organization to take responsibility and lead the design, orchestration, and lifecycle management of our next-generation GPU Farm and AI Factory environments. This role is unique in its breadth, requiring a deep understanding of high-performance AI compute stacks alongside the disciplined management of physical data center assets and their long-term operational health. You will bridge the gap between R&D engineering requirements and the physical realities of global data center operations. Key Responsibilities 1. AI & GPU Infrastructure Design (GPU Farm / AI Factory)
- Lead the architectural design and refinement of the Nutanix GPU-as-a-Service (GPUaaS) platform, ensuring a seamless experience for internal R&D, QA, and Sales teams.
- Provide technical leadership in some of the key initiatives such as Nutanix Validated Designs (NVD) for the AI Factory, incorporating NVIDIA MGX/HGX architectures and high-density Cisco nodes (e.g., UCS 845A).
- Architect the Management Cluster control plane (NKP, Prism Central, NuDeploy) to ensure it is decoupled from GPU compute nodes for maximum efficiency.
- Implement policy-driven placement of workloads across on-prem and cloud-burst environments.
- Design solution for a centralized Data Center Asset Inventory system, ensuring real-time visibility into all hardware assets, including CPUs, GPUs, Virtual Machines, and networking.
- Develop a comprehensive Hardware Lifecycle Management strategy, including procurement forecasting, "rack and stack" operationalization, and decommissioning of legacy systems (G3/G4/G5).
- Lead "Tiger Team" initiatives to navigate supply chain constraints, ensuring critical release milestones are not delayed by hardware shortages.
- Enforce strict Security Standards for Data Center HW Provisioning.
- Implement network segmentation for all the critical applications.
- Ensure all infrastructure meets SOC 2 and ISO 27001 compliance objectives while maintaining low-latency performance.
- Provide required architecture and designs during the project intake process. Review, guide the teams for right architecture for all demands before they become approved projects.
- Partner with security team and provide guidelines for upcoming projects.
- Involve and lead projects as an architect on special projects.
- Bachelor's degree in Information Technology, Business, or a related field
- 5+ years of experience in Data Center projects in an enterprise environment
- Knowledge of Cisco, Dell, HPE, Supermicro hardware.
- Hardware Expertise: Deep knowledge of Cisco HW, NVIDIA GPU architectures (H100, B200, RTX 6000 Pro) and high-speed interconnects (RoCE v2, InfiniBand).
- Infrastructure Mastery: Extensive knowledge and experience with Data Center infrastructure.
- Management Tools: Proficiency with asset management and automation tools (Netbox, ServiceNow, Terraform, or OpenTofu).
- Lifecycle Mgmt & Capacity Planning: Experience in Data Center lifecycle mgmt, DC HW capacity planning, decommissioning, defragmentation, building complex financial showback models for shared infrastructure.
- AI/ML Ops: Proven expertise in Kubernetes (NKP preferred) and NVIDIA AI Enterprise stacks (GPU Operator, DCGM, Triton, vLLM).
- Experience managing (as an architect) massive-scale data center environments (1,000+ nodes).
- Knowledge of Nutanix Cloud Infrastructure (NCI), AHV, and Prism Central
- Strong background in MLOps and automated pipeline integration (Kubeflow/MLflow).
Mayank Prakash
Recruitment Lead
P:
View phone number on click.appcast.io
E:
View email address on click.appcast.io
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Infrastructure Architect - AI & Data Center in San Jose, CA vacancy
$124k - $195.5k
NVIDIA Gruppe in Santa Clara is seeking a Networking Architect to spearhead the development of innovative networking solutions for AI-driven data centers. You will engage in R&D, collaborating with internal teams and partners to initiate groundbreaking projects. The ideal...Suggested- Advanced Micro Devices is seeking a System Architecture Fellow - AI & Data Center Networking to define and drive system architecture for next-generation platforms. The ideal candidate will possess deep expertise in hardware architecture and have a proven track record in...Suggested
- NVIDIA Gruppe seeks a hands-on Solutions Architect in Santa Clara, CA, to design scalable... ...experience in Solution Architecture or Infrastructure Engineering. You will work alongside product... ...and engineering teams, focusing on AI technologies. Candidates should be familiar...Suggested
- NVIDIA Gruppe in Santa Clara is seeking a Principal AI/ML Engineer to lead the development of automated network platforms crucial... ...experience in network engineering and a solid background in AI/ML infrastructure. Join us in redefining the future of computing! #J-18808-...Suggested
- A Silicon Valley-based AI infrastructure company is seeking a highly motivated Network Simulation Engineer to lead the simulation of AI communication workloads across diverse data center network topologies. The ideal candidate will have a strong background in network simulation...Suggested
$184k - $287.5k
NVIDIA Gruppe in Santa Clara, California, is seeking a skilled Networking Solutions Architect to lead the design and implementation of cutting-edge data center networks. Your role will involve working closely with clients to develop tailored solutions using your extensive...$152k - $241.5k
NVIDIA Gruppe is seeking an experienced Solutions Architect in Santa Clara to support accelerated computing networking solutions for AI/ML and HPC. You will develop and demonstrate solutions with major tech companies while addressing customer needs and performance issues...- ...Simulation Engineer to lead the simulation and analysis of AI communication workloads across various data center network topologies. The ideal candidate will have... ..., engaging with a world-class team and impacting the future of AI infrastructure. #J-18808-Ljbffr Eridu Corporation
$164.47k - $269.1k
...focused on defining the future of high-performance networking silicon. Our team architects next-generation networking solutions that enable hyperscale data centers, cloud infrastructure, and AI workloads to achieve unprecedented performance and efficiency. We specialize...Local areaImmediate startShift work$203.2k - $286.87k
...Description: As a Network Platform Architect, you will be at the... ...secure and efficient infrastructure to meet the evolving needs of... ...customer/OEM expectations, and data center operational needs. Your expertise... ...technologies, analytics, AI, data centers, and more,...Live inLocal areaImmediate startShift work- ...Infrastructure Architect Location: Milpitas, CA The Company: FireEye is the intelligence-led security company. Working as a seamless,... ...infrastructure (FireEye globally) design, systems analysis, security, data center operation, compute, storage, network & voice communication...Temporary workWork at officeFlexible hoursNight shift
$265k - $310k
...Principal Network Architect Crusoe is on a mission to accelerate the abundance of... ...intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we... ...experts across energy, manufacturing, data center construction, and cloud services....Temporary work$133k - $247k
...Infrastructure Architect At Cadence, we hire and develop leaders and innovators who want to make an impact on the world of technology.... ...infrastructure integration plans for acquired entities, including data center consolidation, network integration, and systems alignment....- ...# Network Architect Location - Santa Clara. This is... ...interconnects for GPU-accelerated data centers and compute clusters.... ...connects and fabric for HPC, AI, and GPU computing clusters.... ...or other languages used in infrastructure automation. SME in networking...
- A leading staffing company is seeking an experienced Infrastructure Engineer/Solutions Architect for a 3+ month contract based onsite in Milpitas, CA.... ...chance to work in a dynamic environment supporting global data center operations. #J-18808-Ljbffr ManpowerGroup Global, Inc...Contract work
$124k - $195.5k
...in high performance networking infrastructure for many years. The next unit... ...looking for you - a Networking Architect, to develop the next generation of network for AI. What you’ll be doing: This... ...next generation of accelerated data centers. It spans over various layers...- NVIDIA Gruppe in Santa Clara is seeking a Software Architect to define system and software architecture for cutting-edge AI networks. The ideal candidate should have 8+ years of experience in software architecture and strong networking expertise. This role includes collaboration...
$210k - $265k
A cutting-edge AI hardware startup is seeking a Performance Modeling Engineer to steer the development of architectural and performance modeling infrastructure. This role focuses on creating high-level performance models that directly influence ASIC designs. Applicants...- NVIDIA Gruppe is seeking a Principal Software Engineer to lead the transformation of AI networking systems. Your deep expertise will guide strategic deployments and influence NVIDIA's networking technologies. This role involves engaging with customers and internal teams...
- A leading AI hardware company in Sunnyvale is seeking a Network Architect to design robust interconnect architectures for AI clusters. The role demands extensive experience with large scale network designs, troubleshooting distributed systems, and project management. Candidates...
- ...Infrastructure Engineer/Solutions Architect Onsite in Milpitas, CA 3+ month contract Seeking a highly skilled Senior Systems Administrator /... ...Zerto and Rubrik Orchestrator. This role supports global data center engineering, storage systems, backup and DR...Contract work
- ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel... .... About The Role As a Network Architect on the Cluster Architecture Team, you will... ...or its third-party tools process personal data. For more details, click here to review our...
- ...Greetings from Rootshell Inc, Role: Network Architect Location: Sunnyvale, CA Duration: Long... ...network software applications used in the management of data center network infrastructure. Skills: • Experience with Switching, Routing,...Night shift
$100k - $120k
...Administrator in Santa Clara to manage and support the company’s network infrastructure across North America. The role requires planning and implementing network architecture, managing data centers, and ensuring secure operations. Candidates should have a Bachelor's...$140k - $197k
Arista Networks is looking for a Senior Systems Engineer in Santa Clara, California. This role requires extensive customer-facing technical sales experience, collaborating with Account teams to provide pre-sales engineering support. Ideal candidates will have 7+ years in...- ...departments to provide timely support and solutions to network-related problems. Key Responsibilities: Travel to various data center locations across the U.S. to provide technical support and resolve operational issues. Diagnose and troubleshoot network...Full timeRemote work
- ...Senior Networking Technician to support high-profile and complex infrastructure projects for Fortune 100 hyperscale customers. This role is... ...exceptional execution across enterprise, retail, and data center environments. This position requires extensive travel, strong...Local area
- ...Technical Staff-Network Architect From applied research to advanced engineering, the Engineering... ...of next-generation large-scale AI Infrastructure to include acceleratedcompute, AI Fabric... ..., network-interfacecards (NIC), Data Processing Units (DPU) and AI Fabrics-...
$184k - $356.5k
NVIDIA Gruppe, based in Santa Clara, is seeking a highly experienced Sr. Solutions Architect specializing in embedded software engineering to support innovative networking technologies. This role offers significant agency and the opportunity to work closely with both customers...$224k - $431.25k
NVIDIA Gruppe in Santa Clara is seeking experienced networking engineers to join the Solutions Architecture team. This role focuses on integrating cutting-edge NVIDIA networking products and requires both technical proficiency and strong customer-facing skills. The ideal...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Infrastructure Architect - AI & Data Center. Be the first to apply!

