Head of AI Infrastructure
Confidential
Head of AI Infrastructure
About the Company
Early-stage hyperscale innovator building a next-generation neocloud for large-scale AI workloads.
Industry
Information Technology and Services
Type
Privately Held, VC-backed
About the Role
The Company is seeking a Head of AI Infrastructure to take on a pivotal role in the design, deployment, and operation of a next-generation, global, security-first GPU cloud platform. The successful candidate will be responsible for creating and evolving an elastic GPU cloud fabric that can scale from hundreds to thousands of accelerators while ensuring low-latency performance for AI training and inference. This role demands a technical leader with a strong background in cloud infrastructure, platform engineering, or systems architecture, and a proven track record in operating large GPU clusters. Key responsibilities include defining compute, storage, and high-speed network blueprints, owning Kubernetes-based scheduling, and guiding enterprise customers through technical engagements. The Head of AI Infrastructure will also be instrumental in building and mentoring a distributed team of infrastructure architects and site-reliability engineers. Applicants for the Head of AI Infrastructure position at the company should have at least 10 years' of experience in a relevant field, with a focus on PCIe or NVLink topologies, high-performance networking, and distributed storage for AI workloads. Deep production experience with Kubernetes or similar schedulers in GPU environments is essential, as is a proven track record in customer-facing technical roles. The ideal candidate will be comfortable in a fast-paced, venture-backed environment and have a passion for solving problems at a multi-petaflop scale. Bonus points are awarded for hands-on experience with liquid cooling, hybrid or multi-cloud deployments, and large-scale model training frameworks. The role offers the opportunity to make an immediate impact, work with cutting-edge technology, and be part of a global, remote-first culture.
Travel Percent
Less than 10%
Functions
- Engineering
- ...Head of Infrastructure Engineering About the Company Pioneering cloud infrastructure company Industry Information Technology and Services... ...lead the design, deployment, and operations of cutting-edge AI and HPC infrastructure. This pivotal role involves driving...Suggested
$257.4k
...Responsibilities As the head of Infrastructure, you will own the vision, execution, and operational excellence for the infrastructure powering... ...critical, revenue‑generating platforms and its supporting data and AI/ML platforms. You will lead multiple teams spanning platform...SuggestedTemporary work- ...Get AI-powered advice on this job and more exclusive features. Direct message the job poster from IntelliPro We are seeking... ...assignments within the department and manage all aspects of the IT infrastructure, systems, applications, and user support. The ideal candidate...SuggestedFull timeWork experience placementWork at office
- ...Title: Infrastructure Program Manager Duration: 12 months + Location: Sunnyvale, CA Type: Hybrid (3 days on site 2 days off site)... ...administration and project management. Experience leveraging AI-powered tools for developing dashboards and small-scale tooling...SuggestedRemote workFlexible hours
- ...Fortinet is looking for an enthusiastic and talented Infrastructure Engineering Leader to join our cloud infrastructure team to work with software... ...team members support each other, share knowledge, and leverage AI to solve complex technical challenges. Our inclusive and...Suggested
$100k - $150k
The Institute of Foundation Models in Sunnyvale, California, is seeking a motivated IT Specialist to build and maintain IT infrastructure. The role includes ensuring network security, configuring systems, and providing technical support. Ideal candidates should have a...Visa sponsorship- A dynamic technology company in Santa Clara is seeking an experienced professional to manage lab infrastructure and deployments across multiple engineering teams. The ideal candidate has over 5 years of experience in lab administration or IT infrastructure management,...
$168k - $310.5k
NVIDIA Gruppe is seeking a Senior Verification Infrastructure Engineer to join the SoC verification team in Santa Clara, California. In this... ...correctness and performance of NVIDIA’s cutting-edge SoCs used in AI Datacenters, self-driving cars, and robotics. The ideal...- ...perform exceptionally well in challenging environments. RUCKUS Networks leverages advanced technologies like Artificial Intelligence (AI) and Machine Learning (ML) to enhance network performance and reduce total cost of ownership. How You'll help us connect the world...
$108k - $162k
...Responsibilities We are seeking a highly skilled Sr. Systems & Infrastructure Engineer to join a dynamic, security-first IT team operating... ...cloud operations (CloudOps), Microsoft 365 administration, AI-augmented tooling, and endpoint management through Microsoft Intune...Permanent employment$245k - $325k
...Jose, California is seeking a Director of Software Engineering to lead a high-performing engineering team in delivering cutting-edge AI inference platforms. The role involves overseeing team development, driving key engineering initiatives, and directly contributing to...- ...Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture... ...the deployment, configuration, and validation of network infrastructure using Python, including topology provisioning, fabric bring-up,...
$137k - $156k
...leveraging proprietary in-house tools. * Establish expertise in HPC/AI applications and benchmarks, delivering impactful training... ...software and hardware upgrades to sustain exceptional HPC infrastructure performance. * Document and analyze test plans, reports, logs...Work at officeWorldwide- ...with InfiniBand and Ethernet experience for configuring and managing the high-performance computing (HPC) / artificial intelligence (AI) datacenter environment. Must have: Hands-on experience with InfiniBand and Ethernet, including VXLAN and EVPN architectures....
$200k - $400k
...A dedicated research lab is seeking a Network Engineer to design and optimize low-latency, high-bandwidth networking solutions for AI supercomputing clusters. You will work on cutting-edge technologies in collaboration with world-class researchers. The ideal candidate...$200k - $400k
...mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge... ...management, and ensure robust, secure deployment pipelines through Infrastructure‑as‑Code (IaC) best practices. Integration & Collaboration:...Visa sponsorship$2,000 per month
...About Etched Etched is building the world's first AI inference system purpose-built for transformers - delivering over 10x... ...investors and staffed by leading engineers, Etched is redefining the infrastructure layer for the fastest growing industry in history. Job...Work at officeRelocation package$150k - $275k
...A leading AI infrastructure company based in San Jose is seeking a highly skilled Supercomputing Engineer specialized in networking. This role involves developing high-performance networking solutions and optimizing software communication across inference nodes. Candidates...Relocation package$140k - $160k
...the Senior Network Engineer (R50298) role at Cadence Get AI-powered advice on this job and more exclusive features. This... ...monitoring. ~ Proven ability to manage mission-critical infrastructure projects. ~ Degree in Computer Science. ~ Expertise in at least...Full timeInternshipRemote workNight shift$144k - $153.6k
.../ MS) Join to apply for the Network Engineer Graduate (Physical Network Infra) - 2026 Start (BS/ MS) role at ByteDance Get AI-powered advice on this job and more exclusive features. Responsibilities Design, build, operate and optimize ByteDance's global...Temporary workLocal area- ...Join Lambda, The Superintelligence Cloud Lambda, the superintelligence cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mission is to make compute...Work at officeLocal areaWork from homeFlexible hours
- ...Since 2009, we have helped institutions modernize and secure their infrastructure through resilient networking, wireless, security, and cloud... ...solutions include enterprise networking, physical security, UCaaS, AI-enabled communications, and Push-to-Talk, enabling reliable and...Full timeLocal area
- ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver...
$139k - $204k
...Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform... ...startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate...Temporary workCasual workWork at officeRemote workFlexible hours$105.5k - $213.5k
...skilled and motivated Senior Network Engineer to join our IT Network Infrastructure team. This role is responsible for leading the implementation... ...have deep expertise in HPE-Juniper wireless technologies, Mist AI-driven networking, and a strong foundation in network...Work experience placementWork at office2 days per week- ...sufficiency of information for identifying root causes and remediations. Validation of model outputs : Assess the accuracy and practicality of AI-generated diagnostic steps, recommendations, and remediation actions. Provide clear, concise summaries when disagreeing with model...Contract workRemote work
- ...leading tech consulting firm is seeking a Remote Network Engineer to collaborate with data scientists. This role involves evaluating AI-generated recommendations for network troubleshooting, requiring extensive knowledge of enterprise networking and various diagnostic...Remote work
$141.91k - $200.34k
...Solutions Group (NSG) focused on enabling next generation programmable Infrastructure Processing Units (IPUs) with our lead customers as part of the... ...familiarity with data center workloads, RDMA, collectives, and AI benchmarking. Understanding of secure boot flows and trusted...Local areaImmediate startShift work$118k - $170k
...perform exceptionally well in challenging environments. RUCKUS Networks leverages advanced technologies like Artificial Intelligence (AI) and Machine Learning (ML) to enhance network performance and reduce total cost of ownership. The Embedded Software Engineering...$184k - $356.5k
NVIDIA Gruppe is seeking an Engineering Manager to lead a team solving AI's infrastructure problems with systems-level software. You will guide engineers in building distributed AI systems, balancing project delivery with innovative research. The ideal candidate has over...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Head of AI Infrastructure. Be the first to apply!
- director of infrastructure San Jose, CA
- head of infrastructure San Jose, CA
- infrastructure manager San Jose, CA
- infrastructure engineering manager San Jose, CA
- information technology infrastructure manager
- director of infrastructure
- infrastructure supervisor
- head of infrastructure
- senior infrastructure manager
- infrastructure manager


