AI Infrastructure Engineering (Cloud, DevOps)
Virtue AI
Location: San Francisco, CA (Onsite | Remote)
About Virtue AI Virtue AI sets the standard for advanced AI security platforms. Built on decades of foundational and award-winning research in AI security, its AI-native architecture unifies automated red-teaming, real-time multimodal guardrails, and systematic governance for enterprise apps and agents. Deploy in minutes-across any environment-to keep your AI protected and compliant. We are a well-funded, early-stage startup founded by industry veterans, and we're looking for passionate builders to join our core team. What You'll Do As an AI infra Engineer, you will own the reliability, scaling, automation, and operational discipline of Virtue AI's AI production systems, focusing on deployment and model serving performance. You will:- Design and maintain deployment workflows for Virtue AI on major cloud providers (e.g., AWS and GCP )
- Own IaC (Terraform / Pulumi) for repeatable, auditable customer deployments.
- Package our services into secure, customer-ready deployment units (Docker, Helm, Marketplace images).
- Design, build, and maintain product CI/CD pipelines using GitHub Actions.
- Serve and optimize the LLM inference pipeline; build necessary inference APIs and routers; auto-scaling
- Design production-grade system observability (Metrics, logs, alerts, dashboards) using tools like Datadog, Grafana, and Prometheus .
- Implement secure networking (VPCs, IAM, service accounts, private endpoints, firewalling).
- Collaborate with product developers to align infrastructure and inference behavior with product requirements.
- Bachelor's degree or higher in CS, CE, EE, or related field.
- Strong experience deploying production systems on major cloud platforms, e.g., AWS and/or GCP .
- Deep hands-on experience with Docker and containerized workloads, Kubernetes (EKS, GKE, or equivalent).
- Strong experience serving LLMs and embedding models in production.
- Strong hands-on experience with CI/CD (GitHub Actions required) and repository management (monorepos, release branches, tagging, rollbacks).
- Experience with SGLang, vLLM, or similar inference frameworks .
- Strong understanding of GPU behavior (memory limits, batching, fragmentation, utilization) and experience with GPU-level optimization
- Experience with model-level inference optimization (Quantization, KV-cache optimization, Speculative decoding or batching strategies) and inference kernels
- Startup experience: you move fast, take ownership, and fix things properly.
- Competitive salary + equity
- High ownership - You define how production runs
- Real impact - Your work directly affects customers and revenue
- Hard problems - Distributed systems, GPUs, scale, security
- Strong technical peers - Engineers who ship and debug, not just designLocation: San Francisco, CA (Onsite | Remote)
- Design, build, and maintain CI/CD pipelines using GitHub Actions
- Own repo structure, branching strategy, release workflows, and versioning
- Build and operate Kubernetes infrastructure on GKE
- Package, deploy, and optimize services using Docker
- Design production-grade system observability
- Metrics, logs, alerts, dashboards
- Datadog, Grafana, Prometheus
- Monitor and improve service reliability, latency, and uptime
- Debug real production issues across infra, networking, containers, and code
- Partner with backend, ML, and platform engineers to remove operational bottlenecks
- Bachelor's degree or equivalent practical experience
- Strong hands-on experience with:
- CI/CD (GitHub Actions required)
- Repository management (monorepos, release branches, tagging, rollbacks)
- Deep experience with:
- Kubernetes
- Docker
- Experience designing and operating observability systems
- Datadog and/or Grafana in production
- Strong understanding of system design
- Availability, scalability, fault isolation
- Proven ability to solve real production problems, not just configure tools
- Comfortable working directly on production systems
- Experience operating ML / LLM inference systems
- Experience with GPU workloads and resource scheduling
- Experience supporting enterprise customers with SLAs
- Familiarity with infrastructure-as-code (Terraform / Pulumi)
- Startup experience: you move fast, take ownership, and clean up after yourself
- Competitive salary + equity
- High ownership - You define how production runs
- Real impact - Your work directly affects customers and revenue
- Hard problems - Distributed systems, GPUs, scale, security
- Strong technical peers - Engineers who ship and debug, not just design
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the AI Infrastructure Engineering (Cloud, DevOps) in San Francisco, CA vacancy
- About the Company Virtue AI is at the forefront of AI security. As enterprises increasingly... .... Are you a high‑performing, motivated engineer ready to make a significant impact in... ...? Virtue AI is seeking a talented AI Infrastructure Engineer (MLOps) to join us. We are a...Devops
- ...Benefits: 401(k) AI Infrastructure Engineer / MLOps San Francisco Bay Area, CA (100% Onsite)... ...machine learning platforms running in cloud native environments. Responsibilities... ...Knowledge of infrastructure-as-code and DevOps best practices Experience with monitoring...Devops
- ...AI Infra Engineer We are looking for an AI Infra engineer to join our growing team. We work... ...PyTorch, and primarily on AWS. As an AI Infrastructure Engineer, you will be partnering... ...management Previous roles in SRE, DevOps, or Platform Engineering with focus on...Devops
- Braintrust Data, Inc. is seeking a Cloud Infrastructure Engineer to build reliable infrastructure and enhance our DevOps processes. The role involves maintaining Terraform modules... ...time off, and competitive salary with an AI stipend. #J-18808-Ljbffr Braintrust Data, Inc...DevopsFlexible hours
- A fast-growing AI startup is seeking a Senior Infrastructure Engineer in San Francisco. In this role, you will architect and scale distributed systems that handle... .... Ideal candidates have 3-6 years of experience in cloud infrastructure and deep knowledge of Kubernetes, AWS...Suggested
$200k - $300k
Senior Engineering Manager page is loaded## Senior Engineering... ...products rooted in AI, automation, and... ...lead our team focused on cloud governance, IAM, secrets... ...environment infrastructure; etc.* Partner with Corp... ...understanding of modern DevOps methodologies: Infrastructure...DevopsWork at officeRemote work- ...Job Title- Senior Cloud Infrastructure Engineer / DevOps Engineer Reporting Type- San Francisco, CA 94105 Work Timing- Monday to Friday, 9am to 5pm Duration: 07 months Job Type: Onsite Summary We are seeking an experienced...DevopsMonday to Friday
$150k - $220k
...Senior Cloud DevSecOps Infrastructure Engineer Title of Role: Senior Cloud DevSecOps Infrastructure Engineer... ...Funding: Venture-Backed — Healthcare, AI, Security, Enterprise Office Type... ...~6+ years of experience in DevOps or Infrastructure Engineering, with...DevopsWork at office$99.6k - $234.6k
...Principal Software Engineer Join Oracle's Health Data... ...the next generation of cloud-native platforms,... ...automation frameworks, and AI-powered operational tooling... ...across Oracle Cloud Infrastructure and multi-cloud environments... ..., Docker CI/CD and DevOps platforms Prometheus...DevopsTemporary workFlexible hours$216k - $270k
...As a Software Engineer on the Machine Learning Infrastructure team, you will build the "Operating System" for our large... ...raw compute into breakthrough AI. You will: Architect and scale... ...specialized hardware. ~ Familiarity with cloud infrastructure (AWS, GCP) and...Full time- ...Brain Co. is an applied AI startup co-founded by Jared Kushner... ...- and Terraform-based infrastructure across customer environments.... ...leaders on architecture and mentor engineers. You Might Be a Great Fit... ...Know Kubernetes, Terraform, cloud networking, orchestration, and...
$210k - $260k
Staff AI Systems Engineer — Agentic Platforms The role of the software engineer... ...real workflows, operate infrastructure, and improve over time. The... ...enterprises, automating DevOps and SecOps workflows with real... ...environments, or in the cloud, with enterprise‑grade security...DevopsRemote workShift work- Harrison Clarke is seeking a Founding Engineer in San Francisco for a pioneering role in an early-stage AI and cloud infrastructure startup. This position offers a unique opportunity to shape the technical direction and build core systems from scratch. Ideal candidates...
$216k - $270k
...As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable... ..., Kubernetes). ~ Familiarity with cloud infrastructure (AWS, GCP) and... ..., our mission is to develop reliable AI systems for the world's most important...Full time- ...platform for evaluating and deploying AI systems. Our mission is to help enterprises... .... About the role We’re looking for a Cloud Infrastructure Engineer to help us build reliable, scalable... ...credentials 5+ years of experience in DevOps, SRE, or Infrastructure Engineering...DevopsFlexible hours
- A cutting-edge AI infrastructure company is seeking an Infrastructure Product Engineer to design core platform components and build robust APIs. Candidates should have over 5 years of experience in infrastructure or backend engineering, with strong skills in Kubernetes...Remote job
- A cutting-edge AI infrastructure firm is seeking a Software Engineer specializing in Infrastructure to design core platform components. You will develop APIs and services, translate customer needs into actionable requirements, and enhance system performance. The ideal candidate...Remote job
- Electric Capital is seeking a skilled Software Engineer to build and scale the core infrastructure for Bluenote’s AI platform in life sciences. This role involves designing... ..., proficiency in Python, and familiarity with cloud platforms. The position offers a hybrid work...
- Build Technologies in San Francisco is seeking a hands-on AI Engineer to develop the infrastructure and systems critical for their agentic AI platform. The ideal candidate has strong systems engineering skills, is fluent in Python, and possesses backend systems experience...
- ...Market in San Francisco is seeking a Software Engineer to design, build, and maintain core systems for its AI platform. The role emphasizes reliability, scalability... ..., and performance, requiring expertise in cloud infrastructure and CI/CD pipelines. The ideal candidate will...Flexible hours
$150k - $300k
Prime Intellect is seeking a skilled developer to work on both infrastructure and platform development focusing on distributed AI workloads. This hybrid role involves designing orchestration systems in Go and Rust, alongside building web interfaces and REST APIs in Python...Visa sponsorship$144k - $174k
...Cloud Infrastructure Engineer Mountain View, CA About Glean: Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry's most advanced... ...Support Engineering, Professional Services, DevOps Engineering ~ Must have experience in...DevopsWork experience placementWork at officeHome officeFlexible hoursShift work- A leading tech company is seeking an Infrastructure Engineer to build and scale its core platform powering AI systems. The role involves designing Kubernetes and Terraform-based infrastructures, defining standards for security and performance, and ensuring reliability....
- Principal Engineer, AI Platform & Infrastructure About the Role SPREEAI is building the future of AI-powered commerce... ..., PyTorch, Kubernetes, Docker, cloud infrastructure, and GPU-based... ...Role Matters This is not a traditional DevOps role. This is the infrastructure backbone...Devops
- ...Job Description Job Description Infrastructure/DevOps Engineer - San Francisco, CA - Onsite (6 days a week) We are looking for an Infrastructure... ...workflows. Tech stack Python, CI/CD, AWS, Google Cloud, Terraform, Helm Seniority 1 - 4 years of experience...DevopsWork experience placementWork at officeNight shift
- ...Job Title: AI Software Engineer (Mid-Level) Location: San Francisco, CA... ...maintain AI pipelines and infrastructure to support model training,... ...learning. Familiarity with cloud platforms such as AWS,... ...deployment. Knowledge of DevOps practices and tools like Docker...Devops
$156.86k - $191.72k
...System Infrastructure / Platform Engineer The National Energy Research Scientific Computing Center (NERSC... ...deployments in a high-performance computing, cloud computing, or hyper-scale environment... ...as strace, lsof, ebpf, or gdb) ~ DevOps tools (such as Gitlab or Jira) and...DevopsFull timeRemote workFlexible hours$152k - $222k
Cloud Platforms and Infrastructure Engineer, TPU/GPU Google San Francisco, CA, USA; Sunnyvale, CA, USA Qualifications... ...infrastructure provisioning, DevOps, continuous integration, or delivery... .../key management. Experience running AI/ML training and inference workloads...Devops- A pioneering AI firm in San Francisco seeks its first full-time engineer. This role emphasizes ownership of product and infrastructure, requiring an individual who can make architectural decisions and ship features rapidly. Candidates should have a builder mindset, fluency...DevopsFull time
- ...steady supply of both. Job Description This is a Senior Oracle DevOps Engineer / Infrastructure Systems Engineer position that centers on architecting, implementing, and managing the Oracle Cloud infrastructure and applications. You'll play a crucial role in...DevopsContract work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Infrastructure Engineering (Cloud, DevOps). Be the first to apply!
Related searches
- ai engineer remote San Francisco, CA
- ai prompt engineer San Francisco, CA
- senior ai engineer San Francisco, CA
- machine learning ai engineer San Francisco, CA
- ai engineer San Francisco, CA
- ai developer San Francisco, CA
- ai ml engineer San Francisco, CA
- ai research engineer San Francisco, CA
- entry level infrastructure engineer San Francisco, CA
- infrastructure automation engineer San Francisco, CA



