Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Infrastructure Engineer

vCluster Labs

As vCluster’s AI Infrastructure Specialist , you will work directly with customers at the earliest and most critical stage of their journey: from bare metal GPU nodes through to a production-ready deployment. This is not a traditional professional services role; you operate pre-sale as part of a proof of value engagement scoped to reach production. You will be one of the first team members a neocloud or AI Factory engages with at a technical depth, and the playbooks you develop will scale the motion for the next hire and customer. vCluster is gaining rapid traction with GPU AI Clouds and enterprises building AI Factories: organizations that need to offer Kubernetes as a managed service on bare metal GPU infrastructure, and need to do it fast. This role exists to make that happen. As an AI Infrastructure Engineer, your role will include: Lead Technical Deployments: Drive end-to-end technical deployments for GPU neocloud and AI Factory customers, from initial bare metal configuration to a validated vCluster environment. Infrastructure Optimization: Configure and troubleshoot bare metal GPU node infrastructure, including CNI configuration, GPU Operator setup, distributed storage backends, and RDMA/InfiniBand. Validation: Deploy and validate Kubernetes and vCluster to provide GPU-powered managed K8s. Knowledge Transfer: Work alongside customer teams to build self-sufficiency, ensuring they can operate and grow the platform independently. Scaling through Documentation: Document reusable playbooks and deployment architectures so your learnings become the next customer's head start. Feedback Loop: Collaborate with Engineering and Product to surface recurring infrastructure challenges, acting as a direct feedback loop from the field into the roadmap. Strategic Partnering: Join Sales in the pre-sales process where deep infrastructure work is required to achieve a meaningful proof of value. This role could be a fit for you if you bring: Production K8s Mastery: 5+ years of experience deploying and operating Kubernetes in production, ideally on bare metal or in high-complexity environments. GPU Fluency: Practical knowledge of NVIDIA GPU Operators, CUDA tooling, and systems‑level configuration for GPU nodes. Networking Fundamentals: Deep understanding of CNI plugins, overlay networks, load balancing, and connectivity diagnosis in layered environments. Storage Expertise: Experience with persistent volume configuration, CSI drivers, and distributed systems like Ceph, Rook, Weka, or Longhorn. Operational Agility: Comfort operating in ambiguous, fast-moving environments where you are often writing the playbook in real time. Modern Tech Mindset: You thrive in environments that reject legacy tech and prefer a modern stack where you can solve a variety of problems from pipelines to internal services. Bonus points for: Automation Skills: Experience writing automation scripts with Bash, Python, or Go. Kubernetes Depth: Relevant certifications such as CKA (Certified Kubernetes Administrator) or experience writing Kubernetes Operators. AI/ML Familiarity: Experience with inference serving, GPU scheduling, and the tooling around LLM deployment. Documentation: Experience building AI Automation in documentation to contribute to a shared knowledge base. Benefits We offer the following benefits: Competitive Salary : We offer a competitive compensation package, including equity. Platinum-Level Insurance : Health, dental, vision, and life Insurance, including plans for you and eligible dependents (benefits vary depending on country). Flexible Working Schedule : You have a doctor’s appointment or need to head to the supermarket to get groceries at 2pm? We won’t have an issue with that. To us, results matter more than clocking in and out at the same time every day. Workplace Flexibility : We’re very flexible about where you work. We know things can change in life and we’re happy to adjust the work environment for you along the way. #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the AI Infrastructure Engineer in San Francisco, CA vacancy
  •  ...AI Infrastructure SpecialistAs vCluster's AI Infrastructure Specialist, you will work directly with customers at the earliest and most critical...  ...next customer's head start.Feedback Loop: Collaborate with Engineering and Product to surface recurring infrastructure challenges,... 
    Suggested
    Remote work
    Flexible hours

    vCluster

    San Francisco, CA
    1 day ago
  • $190k - $270k

     ...AI Chopping Block, Inc. in San Francisco is seeking an AI Infrastructure Engineer to maintain user-facing services and production systems. The role involves building and managing infrastructure with tools like Ansible and Kubernetes, ensuring reliability and scalability... 
    Suggested

    AI Chopping Block, Inc.

    San Francisco, CA
    23 hours ago
  •  ...A leading AI research firm in San Francisco seeks a Staff Infrastructure Engineer to identify and resolve infrastructure bottlenecks and design large-scale systems for AI training. The ideal candidate has over 3 years of experience in infrastructure engineering and strong... 
    Suggested

    Menlo Ventures

    San Francisco, CA
    1 day ago
  •  ...AI Chopping Block, Inc. seeks a Senior Software Engineer for their Agentic Infrastructure team in San Francisco. This role involves architecting and building AI systems that enable autonomous planning and execution across the platform. Ideal candidates have 4-7 years of... 
    Suggested
    Remote work
    Flexible hours

    AI Chopping Block, Inc.

    San Francisco, CA
    1 day ago
  •  ...A California-based technology company is seeking a Software Engineer for Frontier AI Infrastructure to create secure, scalable backend systems and collaborate with government agencies. The ideal candidate must have an active secret clearance and a strong background in... 
    Suggested

    Scale AI

    San Francisco, CA
    23 hours ago
  • $180k - $250k

     ...AI Engineer Location: San Francisco Onsite Policy: 5 days a week Comp & Ben : $180k - $250k base + 0.3% - 0.8% equity + visa sponsorship...  ...around existing models. This company is building the infrastructure and agentic systems that automate real-world commercial real... 
    Visa sponsorship
    Relocation package

    Trades Workforce Solutions

    San Francisco, CA
    1 day ago
  • $190k - $270k

     ...AI Infrastructure Engineer As an AI Infrastructure Engineer at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering... 
    Full time
    Work experience placement

    Together AI

    San Francisco, CA
    4 days ago
  •  ...An innovative AI company is seeking a Software Engineer to develop infrastructure that supports AI training and inference workflows. This role requires strong object-oriented programming skills and a solid foundation in data structures and algorithms. The ideal candidate... 

    SpreeAI

    San Francisco, CA
    1 day ago
  •  ...Handshake is seeking a Senior Software Engineer for its Agentic Infrastructure team in San Francisco. You will build the backbone for AI agents, designing key systems that ensure functionality and safety across Handshake's platform. The ideal candidate has 4-7 years of... 
    Remote work
    Flexible hours

    Handshake

    San Francisco, CA
    1 day ago
  •  ...Corp, based in San Francisco, is looking for a Core Engineer to design and operate foundational infrastructure for their autonomous agent platform. This role emphasizes...  ...and systems thinking, crucial for ensuring AI reliability. Qualified candidates will have a strong... 

    Rox Data Corp

    San Francisco, CA
    23 hours ago
  • $190k - $270k

     ...AI Chopping Block, Inc. is hiring an AI Infrastructure Engineer in San Francisco, California. This full-time role involves ensuring smooth operation of user-facing services and production systems, alongside building and running infrastructure with Ansible, Terraform, and... 
    Full time

    AI Chopping Block, Inc.

    San Francisco, CA
    23 hours ago
  •  ...A well-funded AI infrastructure startup in San Francisco is seeking a Founding Engineer to design and scale distributed backend systems integral to training advanced AI agents. Ideal candidates will have experience in ML pipelines, systems thinking, and a strong foundation... 

    Jack & Jill/External ATS

    San Francisco, CA
    23 hours ago
  •  ...An innovative AI lab is seeking an experienced engineer to manage and optimize large-scale training infrastructure. You will build core systems that support researchers, focusing on distributed training, performance optimization, and data pipelines. Ideal candidates should... 

    Cognition Corp

    San Francisco, CA
    1 day ago
  •  ...Roboflow in San Francisco is seeking a versatile Infrastructure Engineer to enhance our core infrastructure and scale our cloud operations. You will engage with cutting-edge AI technologies and collaborate with product, operations, and security teams. The role demands... 

    Roboflow

    San Francisco, CA
    1 day ago
  •  ...AI Infrastructure Engineer Spellbrush, the world's leading generative AI studio behind nijijourney, is looking for an AI Infrastructure Engineer to join us in building out end-to-end ML infrastructure to run our models on all platforms. What You'll Do Design... 
    Work experience placement
    Work at office
    Visa sponsorship

    Spellbrush

    San Francisco, CA
    3 days ago
  • $190k - $270k

     ...About the Role As an AI Infrastructure Engineer at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles, operational... 
    Full time
    Work experience placement

    AI Chopping Block, Inc.

    San Francisco, CA
    23 hours ago
  • $190k - $270k

     ...AI Chopping Block, Inc. is looking for an AI Infrastructure Engineer to manage user-facing services and production systems in San Francisco. The role entails responsibilities like building infrastructure with Ansible and Kubernetes, monitoring systems, and improving product... 
    Full time

    AI Chopping Block, Inc.

    San Francisco, CA
    1 day ago
  • An innovative AI infrastructure startup is seeking a Sales Engineer to lead technical discovery and drive successful evaluations with clients. The ideal candidate will have significant experience in customer-facing technical roles focused on AI and machine learning infrastructure... 
    Remote work

    Andromeda

    San Francisco, CA
    1 day ago
  • $230k - $360k

     ...Lead Infrastructure and Reliability Engineer (Systems & Scale) A new class of intelligence is emerging, systems that understand and generate the world...  ...define how reliability works for a new generation of AI infrastructure. The decisions you make here will influence... 
    Immediate start

    Luma AI

    San Francisco, CA
    1 day ago
  •  ...Xterraai is looking for an AI Research Engineer to enhance its geospatial intelligence systems. This role involves building agent infrastructure, developing evaluation frameworks, and designing data systems while collaborating with researchers and geoscientists. The ideal... 

    Xterraai

    San Francisco, CA
    23 hours ago
  • $200k - $240k

     ...Labs provides blockchain analytics and AI solutions to help law enforcement and national...  ..., more secure world for all. The AI Engineering Team is chartered with enabling next-...  ...build robust pipelines, high-performance infrastructure, and operational tooling that allow AI systems... 
    Remote work
    Worldwide

    TRM Labs

    San Francisco, CA
    1 day ago
  •  ...About Brain Co. Brain Co. is an applied AI startup co-founded by Jared Kushner and Elad Gil, and backed by leading Silicon...  ...millions of people. About the Role: As our Security Engineer, Infrastructure, you'll secure the platform layer end-to-end including cloud... 
    Worldwide

    Brainco

    San Francisco, CA
    4 days ago
  • A leading AI fashion-tech company is seeking a Software Engineer Intern to focus on building infrastructure for AI systems. This role involves designing scalable models, developing APIs, and optimizing for performance and reliability. An ideal candidate will have a strong... 
    Internship
    Immediate start

    SpreeAI

    San Francisco, CA
    1 day ago
  • MintMCP, located in San Francisco, is looking for a versatile builder to own end-to-end features on our AI infrastructure. This role involves backend work and some frontend tasks, where you'll leverage AI tools to enhance productivity. The ideal candidate thrives in a horizontal... 

    MintMCP

    San Francisco, CA
    4 days ago
  •  ...Granica, based in San Francisco, is seeking an expert in distributed systems to enhance their data infrastructure. This role involves architecting a global metadata substrate, developing intelligent data layouts, and implementing algorithms for efficient data representation... 
    Flexible hours

    Granica

    San Francisco, CA
    23 hours ago
  • $50 - $70 per hour

     ...Mercor is looking for a full-time Network Engineer in San Francisco to work with AI systems. You will manage network data, analyze behaviors, and create scripts for data processing. Ideal candidates should have experience in network engineering and programming skills in... 
    Hourly pay
    Full time
    Contract work

    Mercor Inc

    San Francisco, CA
    5 days ago
  •  ...A leading AI research organization seeks a talented individual to drive the development of AI-powered software engineering tools. You will shape Codex, enhancing its reliability and creating secure, observable systems across various platforms. The ideal candidate has... 

    OpenAI

    San Francisco, CA
    1 day ago
  •  ...technology firm in San Francisco is seeking an experienced Infrastructure Engineer to own and evolve its cloud infrastructure. You will work with...  ...edge tools like Kubernetes and Terraform while implementing AI to enhance operational efficiency. The ideal candidate has... 

    Sight Machine

    San Francisco, CA
    23 hours ago
  •  ...About the Company Virtue AI is at the forefront of AI security. As enterprises increasingly...  .... Are you a high‑performing, motivated engineer ready to make a significant impact in...  ...? Virtue AI is seeking a talented AI Infrastructure Engineer (MLOps) to join us. We are a... 

    Virtue AI

    San Francisco, CA
    1 day ago
  •  ...A fast-growing AI startup is seeking a Senior Infrastructure Engineer in San Francisco. In this role, you will architect and scale distributed systems that handle AI-driven phone conversations for major brands. You will contribute to optimizing ML infrastructure and integrating... 

    Open Select

    San Francisco, CA
    23 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Infrastructure Engineer. Be the first to apply!