Staff DevOps Engineer

Way2B1

Job Description

This is a hands-on Staff DevOps Engineer role responsible for designing, operating, and evolving a highly available, multi-tenant platform on AWS. You will work closely with software engineering to deploy, operate, and scale production systems while driving improvements in reliability, automation, and performance.

This role requires strong ownership of infrastructure and production systems. You will also provide technical leadership and mentorship to other DevOps engineers.

You will also help introduce and operationalize AI/LLM capabilities within the platform.

AI / LLM Systems (Emerging Area)

  • Experience operating or integrating LLM/AI services in production environments, including tracing and evaluation (OpenTelemetry, LangSmith, LangFuse or equivalent)
  • Experience managing performance, cost, and reliability of LLM workloads (latency, token usage, rate limiting, fallbacks)
  • Experience using AI/agentic developer tools (e.g., Claude Code, Cursor or similar) to accelerate DevOps workflows and improve engineering efficiency

What You'll Be Responsible For

  • Design, build, and operate scalable, highly available infrastructure in AWS
  • Own and evolve infrastructure as code (Terraform) across all environments
  • Operate and optimize Aurora PostgreSQL (replication, failover, performance tuning)
  • Operate ECS (Fargate), ECR, and containerized services
  • Operate Kafka-based event streaming systems
  • Manage Auto Scaling Groups and EC2-based workloads
  • Design and maintain CI/CD pipelines (Buildkite)
  • Build automation to eliminate manual operational work
  • Manage and secure secrets and access (Vault, AWS Secrets Manager, IAM)
  • Partner with engineering teams to improve system reliability and performance
  • Provide technical leadership and mentorship to DevOps engineers
  • Drive cost optimization across AWS infrastructure
  • Operate systems behind Cloudflare (WAF, CDN, traffic management)

Production Reliability & Incident Ownership

  • Own production incident response end-to-end (triage, mitigation, coordination)
  • Lead high-severity outage response under pressure
  • Drive root cause analysis (RCA) and enforce follow-ups
  • Continuously improve system resilience and recovery mechanisms

Observability & System Insight

  • Design and operate end-to-end observability (metrics, logs, tracing)
  • Build high-signal monitoring, alerting, and dashboards
  • Define and enforce SLIs/SLOs and alerting standards
  • Reduce alert fatigue and improve signal-to-noise ratio

What You Bring

  • Deep experience operating production systems on AWS (ECS/Fargate, EC2, networking, IAM)
  • Expert-level Terraform experience managing infrastructure at scale
  • Strong experience with containerized applications and distributed systems (e.g., Kafka)
  • Experience operating multi-tenant, highly available systems
  • Proven ownership of production on-call and resolving critical incidents
  • Strong systems fundamentals (Linux, networking, debugging)
  • Strong scripting ability (Bash, Python or equivalent)
  • Experience designing and operating CI/CD systems
  • Strong understanding of security best practices (IAM, secrets management)

Nice to Have

  • Experience operating multi-region or globally distributed systems
  • Experience working with Cloudflare at scale
  • Experience optimizing high-throughput or event-driven systems
Vacancy posted 14 days ago