Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

HPC Operations Engineer — AI Cloud Infra (On-site 4d/wk)

Lambda Inc.

Lambda Inc. is seeking an experienced HPC Engineer to join our team in San Francisco. In this role, you will be responsible for deploying and configuring large-scale HPC clusters for AI workloads, troubleshooting issues, and mentoring junior engineers. The ideal candidate will have 5+ years of experience, a strong understanding of HPC/AI architecture, and a collaborative spirit. Join us at Lambda to help build the future of AI cloud infrastructure. #J-18808-Ljbffr Lambda Inc.

Vacancy posted 11 hours ago
Similar jobs that could be interesting for youBased on the HPC Operations Engineer — AI Cloud Infra (On-site 4d/wk) in San Francisco, CA vacancy
  • A tech company focused on AI is seeking a Site Reliability Engineer to ensure the reliability and performance of its GPU marketplace. This role involves maintaining service level objectives, managing capacity, and implementing secure systems. The ideal candidate has strong... 
    Website

    Hyperbolic Labs

    San Francisco, CA
    1 day ago
  •  ...The Superintelligence Cloud, is a leader in AI cloud infrastructure...  ...currently Tuesday. Engineering at Lambda is responsible...  ...large-scale HPC clusters for AI workloads...  ...install and configure operating systems, firmware, software...  ...deployment teams on-site Provide clear and... 
    Website
    Work experience placement
    Work at office
    Local area
    Remote work
    Work from home
    Flexible hours

    Lambda Inc.

    San Francisco, CA
    11 hours ago
  • $202.5k - $247.5k

     ...ngrok is an all‑in‑one cloud networking platform...  ...or running AI workloads in production...  ...device fleets, and site‑to‑site connectivity...  ...your time. About the Infra Platform Team The Infra...  ...builds the systems ngrok engineers rely on to build, deploy, and operate ngrok itself. We... 
    Website
    Permanent employment
    Full time
    Work at office
    Local area
    Remote work
    Home office
    Flexible hours

    jobr.pro

    San Francisco, CA
    4 days ago
  • Neura Market is seeking an HPC Engineer to build and configure large-scale HPC clusters for AI workloads. This role requires working 4 days a week onsite in San Francisco/Bellevue, where you will collaborate closely with teams to troubleshoot and improve systems. The ideal... 
    Suggested

    Neura Market

    San Francisco, CA
    4 days ago
  • $250k - $400k

     ...Founding Engineer Opportunity Location: San...  ...Type: Full-time, on-site We are seeing a...  ...to give inboxes to AI agents and be the sole...  ...strong backend and infra instincts to help...  ...authenticate, and operate in the real world....  ...Experience with cloud infrastructure, distributed... 
    Website
    Full time
    Work experience placement

    AgentMail (YC S25)

    San Francisco, CA
    11 hours ago
  • Senior Software Engineer, Infrastructure & Agents About Reacher...  ...'re making a big bet on AI agents and what they can...  ...main buckets of work: Infra Re-architect our jobs...  ...architecture GCP/cloud infrastructure (GKE/KEDA...  ...still ahead Location On-site in San Francisco. This role... 
    Website
    Flexible hours

    Reacher

    San Francisco, CA
    4 days ago
  •  ...Senior Platform Engineer (Cloud Platform) San Francisco, CA...  ...Amplitude is the leading AI analytics platform,...  ...living our values. We operate from a place of humility...  ...assisted development—think infra primitives that are...  ...engineering, DevOps, or Site Reliability Engineering... 
    Website
    Shift work

    Amplitude

    San Francisco, CA
    11 days ago
  • $110k - $120k

     ...partner for middle market companies, operating through three business lines: Audax...  ...POSITION SUMMARY: The IT Operations Engineer serves as the sole on-site IT resource for Audax Group's San...  .... ~ Familiarity with enterprise AI tools (e.g., Microsoft Copilot, ChatGPT... 
    Website
    Contract work
    Work at office
    Local area
    Remote work
    Relocation
    Night shift

    Audax Group

    San Francisco, CA
    1 day ago
  • $160k - $300k

    Hebbia, Inc. in San Francisco is seeking a Site Reliability Engineer to own critical production systems. You will be responsible for designing, building, and improving these systems while writing production-quality code. The ideal candidate has over 5 years of software... 
    Website

    Hebbia, Inc.

    San Francisco, CA
    3 days ago
  • $140k - $150k

     ...As a Client Platform Engineer, you will connect IT and Engineering for AI tools and automation at...  ...company. At Nextdoor, we operate in an AI-first environment...  ...such as trainings, off-sites, volunteer days, and...  ...patterns across Google Cloud and AWS, including the ability... 
    Website
    Work at office
    Local area
    Work from home

    Nextdoor

    San Francisco, CA
    1 day ago
  • $150k - $300k

     ...Guillermo Rauch (Vercel CEO) The Role We are looking for a AI Cloud Infra Engineer to join our infrastructure team. This role will be...  ...know how to design for reliability and scale with minimal operational overhead. You learn new technologies rapidly because you’re... 

    Algora Public Benefit Corporation

    San Francisco, CA
    11 hours ago
  •  ...Head of Corporate Engineering, you will be responsible...  ...engineering and operations globally. You will...  ...and optimizing cloud infrastructure,...  ...on-call support, Infra as Code, observability...  ...-as-code, SRE (Site Reliability Engineering...  ...is the data and AI company. More than... 
    Website
    Work experience placement
    Remote work
    Worldwide

    Menlo Ventures

    San Francisco, CA
    4 days ago
  • $179k - $218k

     ...Senior Staff Data Center Operations Engineer, GPU Hardware Architecture Crusoe...  ...only vertically integrated AI infrastructure company built...  ...data center construction, and cloud services. If you want to...  ...generation facilities. For Site Operations: You are the "... 
    Website
    Temporary work

    Crusoe

    San Francisco, CA
    9 days ago
  • Phonely in San Francisco is seeking an experienced DevOps Engineer to join our engineering team and help build reliable cloud infrastructure for voice AI systems. This role is fully on-site and essential to our fast-paced business environment. The ideal candidate will have... 
    Website

    Phonely

    San Francisco, CA
    1 day ago
  •  ...in San Francisco is seeking senior platform engineers to build efficient infrastructure that supports both traditional and AI workloads. The ideal candidate has 3+ years...  ...core architectures and mentor peers to improve operational resilience. This position requires in-office... 
    Work at office
    3 days per week

    TruckSmarter

    San Francisco, CA
    4 days ago
  • $200k - $275k

     ...Senior Backend Engineer (Infra/Platform/SRE) Title of Role: Senior Backend Engineer (Infra...  ...services industry with innovative AI creative tools tailored for artists and...  ...infrastructure utilizing Kubernetes and cloud platforms such as AWS, GCP, and Azure.... 
    Work at office

    Recruiting from Scratch

    San Francisco, CA
    3 days ago
  • $230k - $325k

     ...the Team The Codex Cloud Apps team builds cloud-...  ..., deployed, and operated. We own end-user experiences such as ChatGPT Sites, Code Review, and future...  ...increasingly complex work to AI. Our team sits at...  ...intersection of product, engineering, design, and research.... 
    Website

    OpenAI

    San Francisco, CA
    4 days ago
  •  ...ultimately become the perception engine for a company’s physical...  ...perimeter visibility, autonomous operations management, and “digital twinning...  ...approaching world of physical AI and robotics. We are a small,...  ...for long days, remote work sites, and hard, physical work. Desired... 
    Website
    Local area
    Remote work

    Specter

    San Francisco, CA
    4 days ago
  • A fast-growing AI company in San Francisco is seeking a Senior/Staff Infrastructure Engineer to build and operate cloud infrastructure. This full-time, hybrid role focuses on GCP, Kubernetes, and infrastructure-as-code. You will be responsible for securing deployments... 
    Full time

    Motion Recruitment Partners LLC

    San Francisco, CA
    2 days ago
  • $156.86k - $191.72k

     ...Berkeley National Laboratory is hiring an HPC Scientific Support Engineer for the NERSC division, the U.S....  ...project work. Monitor emerging HPC and AI trends and identify opportunities for...  ...Work modality: Work may be performed on‑site, hybrid, or full‑time telework. The... 
    Website
    Full time
    Work at office
    Remote work

    Lawrence Berkeley National Laboratory

    San Francisco, CA
    11 hours ago
  •  ...Us Conversion is the AI-native marketing automation...  ...San Francisco and includes engineers, designers, and operators from Airbnb, Palantir, Pinterest...  ...our customers. The best infra work here is measured in...  ...years of experience building cloud infrastructure, dev tooling... 
    Website

    Conversion Services

    San Francisco, CA
    1 day ago
  • Crusoe Energy Systems LLC is seeking a Senior Engineering Manager to lead a talented team in revolutionizing our cloud infrastructure. You will drive the Insights & Actions...  ...projects. Join us in building the future of AI infrastructure! #J-18808-Ljbffr Crusoe Energy Systems... 

    Crusoe Energy Systems LLC

    San Francisco, CA
    2 days ago
  • Crusoe is seeking a Senior Engineering Manager to lead a team focused on enhancing cloud infrastructure. The role involves building systems that convert raw infrastructure...  .... Join Crusoe and contribute to shaping the future of AI infrastructure. #J-18808-Ljbffr Epoch Biodesign
    Full time

    Epoch Biodesign

    San Francisco, CA
    1 day ago
  • $127k - $225k

     ...Waabi, founded by AI visionary Raquel Urtasun, is...  ...visit: As a Software Engineer on our Labelling and Data...  ...- Understanding of cloud job orchestration, monitoring...  ...Experience working with infra as code (Terraform, CloudFormation...  ...social events both on-site, off-site & virtually.... 
    Website
    Full time
    Work at office
    Work from home
    Flexible hours

    Waabi

    San Francisco, CA
    4 days ago
  •  ...startup that is building the AI backbone for the next generation...  ...are hiring a Backend Software Engineer (ML Infrastructure) to help...  ...distributed training pipelines, cloud-native infrastructure, and internal...  ...). - Excited to work on-site in San Francisco with a fast-... 
    Website

    Rockstar

    San Francisco, CA
    3 days ago
  • $148.5k - $223.9k

     ...Senior Member of Technical Staff (SMTS) - Site Reliability Engineer (Cloud Automation) Location: New York, NY...  ...Salesforce Salesforce is the #1 AI CRM, where humans with agents drive...  ...Platform Engineering team builds and operates the highly available, active-active... 
    Website
    Work experience placement
    Shift work

    Salesforce

    San Francisco, CA
    3 days ago
  • $120k - $160k

     ...capabilities to the world’s biggest AI Labs at industry-defining...  ...Technical Writer to join our Operations team. In this role, you will...  ..., vendor specifications, engineering drawings, and product data sheets...  ..., and archiving — within the site DMS; maintain revision... 
    Website
    Work at office
    Local area

    Fluidstack

    San Francisco, CA
    4 days ago
  •  ...Bedrock, we’re moving AI out of the lab and into...  ...improving safety on job sites. Backed by $350M in funding...  ...and world‑class engineers to solve physical‑world...  ...Engineering, Commercial, and Operations. You will partner...  ...Do Manage execution of cloud platform, fleet technology... 
    Website
    Work at office
    Flexible hours

    Bedrock Robotics Inc.

    San Francisco, CA
    1 day ago
  •  ...leading 3D generative AI company on a mission...  ...seeking a Software Engineer Intern to join our Data Infra team and help...  ...agent‑driven workflows Cloud infrastructure on AWS...  ...Work On Design and operate CI/CD pipelines covering...  ...— this role is on‑site / hybrid. Our... 
    Website
    Internship
    Work at office
    Remote work
    Flexible hours
    1 day per week

    Meshy LLC.

    San Francisco, CA
    3 days ago
  •  ...Platform/Infrastructure engineer to help shape how...  ...environment setup, operations, policy checks, on-...  ...platform work with AI Respond to...  ...spent in a platform, infra, or SRE role ~ Kubernetes...  ...Have Google Cloud Platform experience...  ...in-office or on-site at least four days... 
    Website
    Permanent employment
    Work at office
    Local area
    Flexible hours

    Promise Co.

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to HPC Operations Engineer — AI Cloud Infra (On-site 4d/wk). Be the first to apply!