AI Infrastructure Engineering (Cloud, DevOps)

Virtue AI

Location: San Francisco, CA (Onsite | Remote)

About Virtue AI

Virtue AI sets the standard for advanced AI security platforms. Built on decades of foundational and award-winning research in AI security, its AI-native architecture unifies automated red-teaming, real-time multimodal guardrails, and systematic governance for enterprise apps and agents. Deploy in minutes, across any environment, to keep your AI protected and compliant. We are a well-funded, early-stage startup founded by industry veterans, and we're looking for passionate builders to join our core team.

What You'll Do

As an AI Infrastructure Engineer, you will own the reliability, scaling, automation, and operational discipline of Virtue AI's production AI systems, with a focus on deployment and model-serving performance.

You will:
  • Design and maintain deployment workflows for Virtue AI on major cloud providers (e.g., AWS and GCP).
  • Own IaC (Terraform / Pulumi) for repeatable, auditable customer deployments.
  • Package our services into secure, customer-ready deployment units (Docker, Helm, Marketplace images).
  • Design, build, and maintain product CI/CD pipelines using GitHub Actions.
  • Serve and optimize the LLM inference pipeline, including the necessary inference APIs, routers, and auto-scaling.
  • Design production-grade system observability (metrics, logs, alerts, dashboards) using tools like Datadog, Grafana, and Prometheus.
  • Implement secure networking (VPCs, IAM, service accounts, private endpoints, firewalling).
  • Collaborate with product developers to align infrastructure and inference behavior with product requirements.
Required Qualifications
  • Bachelor's degree or higher in CS, CE, EE, or a related field.
  • Strong experience deploying production systems on major cloud platforms, e.g., AWS and/or GCP.
  • Deep hands-on experience with Docker, containerized workloads, and Kubernetes (EKS, GKE, or equivalent).
  • Strong experience serving LLMs and embedding models in production.
  • Strong hands-on experience with CI/CD (GitHub Actions required) and repository management (monorepos, release branches, tagging, rollbacks).
Preferred Qualifications
  • Experience with SGLang, vLLM, or similar inference frameworks.
  • Strong understanding of GPU behavior (memory limits, batching, fragmentation, utilization) and experience with GPU-level optimization.
  • Experience with model-level inference optimization (quantization, KV-cache optimization, speculative decoding, or batching strategies) and inference kernels.
  • Startup experience: you move fast, take ownership, and fix things properly.
Why Join Virtue AI
  • Competitive salary + equity
  • High ownership - You define how production runs
  • Real impact - Your work directly affects customers and revenue
  • Hard problems - Distributed systems, GPUs, scale, security
  • Strong technical peers - Engineers who ship and debug, not just design

Location: San Francisco, CA (Onsite | Remote)

What You'll Do

As a DevOps Engineer, you will own the reliability, automation, and operational discipline of Virtue AI's production systems. When something breaks, you fix it. When it doesn't scale, you redesign it.

You will:
  • Design, build, and maintain CI/CD pipelines using GitHub Actions
  • Own repo structure, branching strategy, release workflows, and versioning
  • Build and operate Kubernetes infrastructure on GKE
  • Package, deploy, and optimize services using Docker
  • Design production-grade system observability
    • Metrics, logs, alerts, dashboards
    • Datadog, Grafana, Prometheus
  • Monitor and improve service reliability, latency, and uptime
  • Debug real production issues across infra, networking, containers, and code
  • Partner with backend, ML, and platform engineers to remove operational bottlenecks
What Makes You a Great Fit

You don't just "set up pipelines." You understand why systems fail, and you design so they don't fail the same way twice.

Required Qualifications
  • Bachelor's degree or equivalent practical experience
  • Strong hands-on experience with:
    • CI/CD (GitHub Actions required)
    • Repository management (monorepos, release branches, tagging, rollbacks)
  • Deep experience with:
    • Kubernetes
    • Docker
  • Experience designing and operating observability systems
    • Datadog and/or Grafana in production
  • Strong understanding of system design
    • Availability, scalability, fault isolation
  • Proven ability to solve real production problems, not just configure tools
  • Comfortable working directly on production systems
Preferred Qualifications
  • Experience operating ML / LLM inference systems
  • Experience with GPU workloads and resource scheduling
  • Experience supporting enterprise customers with SLAs
  • Familiarity with infrastructure-as-code (Terraform / Pulumi)
  • Startup experience: you move fast, take ownership, and clean up after yourself
Vacancy posted 2 days ago