Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Head of AI Infrastructure

Confidential

Head of AI Infrastructure

About the Company

Early-stage hyperscale innovator building a next-generation neocloud for large-scale AI workloads.

Industry
Information Technology and Services

Type
Privately Held, VC-backed

About the Role

The Company is seeking a Head of AI Infrastructure to take on a pivotal role in the design, deployment, and operation of a next-generation, global, security-first GPU cloud platform. The successful candidate will be responsible for creating and evolving an elastic GPU cloud fabric that can scale from hundreds to thousands of accelerators while ensuring low-latency performance for AI training and inference. This role demands a technical leader with a strong background in cloud infrastructure, platform engineering, or systems architecture, and a proven track record in operating large GPU clusters. Key responsibilities include defining compute, storage, and high-speed network blueprints, owning Kubernetes-based scheduling, and guiding enterprise customers through technical engagements. The Head of AI Infrastructure will also be instrumental in building and mentoring a distributed team of infrastructure architects and site-reliability engineers. Applicants for the Head of AI Infrastructure position at the company should have at least 10 years' of experience in a relevant field, with a focus on PCIe or NVLink topologies, high-performance networking, and distributed storage for AI workloads. Deep production experience with Kubernetes or similar schedulers in GPU environments is essential, as is a proven track record in customer-facing technical roles. The ideal candidate will be comfortable in a fast-paced, venture-backed environment and have a passion for solving problems at a multi-petaflop scale. Bonus points are awarded for hands-on experience with liquid cooling, hybrid or multi-cloud deployments, and large-scale model training frameworks. The role offers the opportunity to make an immediate impact, work with cutting-edge technology, and be part of a global, remote-first culture.

Travel Percent
Less than 10%

Functions

  • Engineering
Confidential
Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Head of AI Infrastructure in San Jose, CA vacancy
  •  ...Head of Infrastructure Engineering About the Company Pioneering cloud infrastructure company Industry Information Technology and Services...  ...lead the design, deployment, and operations of cutting-edge AI and HPC infrastructure. This pivotal role involves driving... 
    Suggested

    Confidential

    San Jose, CA
    1 day ago
  • $257.4k

     ...Responsibilities As the head of Infrastructure, you will own the vision, execution, and operational excellence for the infrastructure powering...  ...critical, revenue‑generating platforms and its supporting data and AI/ML platforms. You will lead multiple teams spanning platform... 
    Suggested
    Temporary work

    TikTok USDS Joint Venture

    San Jose, CA
    2 days ago
  •  ...Get AI-powered advice on this job and more exclusive features. Direct message the job poster from IntelliPro We are seeking...  ...assignments within the department and manage all aspects of the IT infrastructure, systems, applications, and user support. The ideal candidate... 
    Suggested
    Full time
    Work experience placement
    Work at office

    Intellipro, Inc.

    San Jose, CA
    1 day ago
  •  ...Title: Infrastructure Program Manager Duration: 12 months + Location: Sunnyvale, CA Type: Hybrid (3 days on site 2 days off site)...  ...administration and project management. Experience leveraging AI-powered tools for developing dashboards and small-scale tooling... 
    Suggested
    Remote work
    Flexible hours

    Systems Integration Solutions

    Sunnyvale, CA
    2 days ago
  •  ...Fortinet is looking for an enthusiastic and talented Infrastructure Engineering Leader to join our cloud infrastructure team to work with software...  ...team members support each other, share knowledge, and leverage AI to solve complex technical challenges. Our inclusive and... 
    Suggested

    Fortinet

    Sunnyvale, CA
    7 days ago
  • $100k - $150k

    The Institute of Foundation Models in Sunnyvale, California, is seeking a motivated IT Specialist to build and maintain IT infrastructure. The role includes ensuring network security, configuring systems, and providing technical support. Ideal candidates should have a... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    2 days ago
  • A dynamic technology company in Santa Clara is seeking an experienced professional to manage lab infrastructure and deployments across multiple engineering teams. The ideal candidate has over 5 years of experience in lab administration or IT infrastructure management,... 

    Nexthop Systems Inc

    Santa Clara, CA
    18 hours ago
  • $168k - $310.5k

    NVIDIA Gruppe is seeking a Senior Verification Infrastructure Engineer to join the SoC verification team in Santa Clara, California. In this...  ...correctness and performance of NVIDIA’s cutting-edge SoCs used in AI Datacenters, self-driving cars, and robotics. The ideal... 

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...perform exceptionally well in challenging environments. RUCKUS Networks leverages advanced technologies like Artificial Intelligence (AI) and Machine Learning (ML) to enhance network performance and reduce total cost of ownership. How You'll help us connect the world... 

    Vistance Networks, Inc.

    Sunnyvale, CA
    2 days ago
  • $108k - $162k

     ...Responsibilities We are seeking a highly skilled Sr. Systems & Infrastructure Engineer to join a dynamic, security-first IT team operating...  ...cloud operations (CloudOps), Microsoft 365 administration, AI-augmented tooling, and endpoint management through Microsoft Intune... 
    Permanent employment

    Onto

    Milpitas, CA
    4 days ago
  • $245k - $325k

     ...Jose, California is seeking a Director of Software Engineering to lead a high-performing engineering team in delivering cutting-edge AI inference platforms. The role involves overseeing team development, driving key engineering initiatives, and directly contributing to... 

    SambaNova

    San Jose, CA
    2 days ago
  •  ...Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture...  ...the deployment, configuration, and validation of network infrastructure using Python, including topology provisioning, fabric bring-up,... 

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    2 days ago
  • $137k - $156k

     ...leveraging proprietary in-house tools. * Establish expertise in HPC/AI applications and benchmarks, delivering impactful training...  ...software and hardware upgrades to sustain exceptional HPC infrastructure performance. * Document and analyze test plans, reports, logs... 
    Work at office
    Worldwide

    Supermicro

    San Jose, CA
    4 days ago
  •  ...with InfiniBand and Ethernet experience for configuring and managing the high-performance computing (HPC) / artificial intelligence (AI) datacenter environment. Must have: Hands-on experience with InfiniBand and Ethernet, including VXLAN and EVPN architectures.... 

    Tranzeal

    Santa Clara, CA
    11 days ago
  • $200k - $400k

     ...A dedicated research lab is seeking a Network Engineer to design and optimize low-latency, high-bandwidth networking solutions for AI supercomputing clusters. You will work on cutting-edge technologies in collaboration with world-class researchers. The ideal candidate... 

    Institute of Foundation Models

    Sunnyvale, CA
    2 days ago
  • $200k - $400k

     ...mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge...  ...management, and ensure robust, secure deployment pipelines through Infrastructure‑as‑Code (IaC) best practices. Integration & Collaboration:... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    2 days ago
  • $2,000 per month

     ...About Etched Etched is building the world's first AI inference system purpose-built for transformers - delivering over 10x...  ...investors and staffed by leading engineers, Etched is redefining the infrastructure layer for the fastest growing industry in history. Job... 
    Work at office
    Relocation package

    ETCHED LLC

    San Jose, CA
    18 hours ago
  • $150k - $275k

     ...A leading AI infrastructure company based in San Jose is seeking a highly skilled Supercomputing Engineer specialized in networking. This role involves developing high-performance networking solutions and optimizing software communication across inference nodes. Candidates... 
    Relocation package

    ETCHED LLC

    San Jose, CA
    2 days ago
  • $140k - $160k

     ...the Senior Network Engineer (R50298) role at Cadence Get AI-powered advice on this job and more exclusive features. This...  ...monitoring. ~ Proven ability to manage mission-critical infrastructure projects. ~ Degree in Computer Science. ~ Expertise in at least... 
    Full time
    Internship
    Remote work
    Night shift

    Cadence Inc

    San Jose, CA
    2 days ago
  • $144k - $153.6k

     .../ MS) Join to apply for the Network Engineer Graduate (Physical Network Infra) - 2026 Start (BS/ MS) role at ByteDance Get AI-powered advice on this job and more exclusive features. Responsibilities Design, build, operate and optimize ByteDance's global... 
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    2 days ago
  •  ...Join Lambda, The Superintelligence Cloud Lambda, the superintelligence cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mission is to make compute... 
    Work at office
    Local area
    Work from home
    Flexible hours

    Lambda Corporation

    San Jose, CA
    2 days ago
  •  ...Since 2009, we have helped institutions modernize and secure their infrastructure through resilient networking, wireless, security, and cloud...  ...solutions include enterprise networking, physical security, UCaaS, AI-enabled communications, and Push-to-Talk, enabling reliable and... 
    Full time
    Local area

    IT Management Corp. dba 101 VOICE

    Santa Clara, CA
    2 days ago
  •  ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver... 

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    4 days ago
  • $139k - $204k

     ...Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform...  ...startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate... 
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    18 hours ago
  • $105.5k - $213.5k

     ...skilled and motivated Senior Network Engineer to join our IT Network Infrastructure team. This role is responsible for leading the implementation...  ...have deep expertise in HPE-Juniper wireless technologies, Mist AI-driven networking, and a strong foundation in network... 
    Work experience placement
    Work at office
    2 days per week

    Hewlett Packard Enterprise

    San Jose, CA
    1 day ago
  •  ...sufficiency of information for identifying root causes and remediations. Validation of model outputs : Assess the accuracy and practicality of AI-generated diagnostic steps, recommendations, and remediation actions. Provide clear, concise summaries when disagreeing with model... 
    Contract work
    Remote work

    Kaygen

    San Jose, CA
    13 days ago
  •  ...leading tech consulting firm is seeking a Remote Network Engineer to collaborate with data scientists. This role involves evaluating AI-generated recommendations for network troubleshooting, requiring extensive knowledge of enterprise networking and various diagnostic... 
    Remote work

    Kaygen

    San Jose, CA
    18 hours ago
  • $141.91k - $200.34k

     ...Solutions Group (NSG) focused on enabling next generation programmable Infrastructure Processing Units (IPUs) with our lead customers as part of the...  ...familiarity with data center workloads, RDMA, collectives, and AI benchmarking. Understanding of secure boot flows and trusted... 
    Local area
    Immediate start
    Shift work

    Intel

    Santa Clara, CA
    1 day ago
  • $118k - $170k

     ...perform exceptionally well in challenging environments. RUCKUS Networks leverages advanced technologies like Artificial Intelligence (AI) and Machine Learning (ML) to enhance network performance and reduce total cost of ownership. The Embedded Software Engineering... 

    RUCKUS Networks

    Sunnyvale, CA
    2 days ago
  • $184k - $356.5k

    NVIDIA Gruppe is seeking an Engineering Manager to lead a team solving AI's infrastructure problems with systems-level software. You will guide engineers in building distributed AI systems, balancing project delivery with innovative research. The ideal candidate has over... 

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Head of AI Infrastructure. Be the first to apply!