Staff Devops Engineer
Way2B1
Job Description
Job Description
This is a hands-on Staff DevOps Engineer role responsible for designing, operating, and evolving a highly available, multi-tenant platform on AWS. You will work closely with software engineering to deploy, operate, and scale production systems while driving improvements in reliability, automation, and performance.
This role requires strong ownership of infrastructure and production systems. You will also provide technical leadership and mentorship to other DevOps engineers.
You will also help introduce and operationalize AI/LLM capabilities within the platform.
AI / LLM Systems (Emerging Area)
- Experience operating or integrating LLM/AI services in production environments, including tracing and evaluation
- (OpenTelemetry, LangSmith, LangFuse or equivalent)
- Experience managing performance, cost, and reliability of LLM workloads (latency, token usage, rate limiting, fallbacks)
- Experience using AI/agentic developer tools (e.g., Claude Code, Cursor or similar) to accelerate DevOps, workflows and improve engineering efficiency
What You'll Be Responsible For
- Design, build, and operate scalable, highly available infrastructure in AWS
- Own and evolve infrastructure as code (Terraform) across all environments
- Operate and optimize Aurora PostgreSQL (replication, failover, performance tuning)
- Operate ECS (Fargate), ECR, and containerized services
- Operate Kafka-based event streaming systems
- Manage Auto Scaling Groups and EC2-based workloads
- Design and maintain CI/CD pipelines (Buildkite)
- Build automation to eliminate manual operational work
- Manage and secure secrets and access (Vault, AWS Secrets Manager, IAM)
- Partner with engineering teams to improve system reliability and performance
- Provide technical leadership and mentorship to DevOps engineers
- Drive cost optimization across AWS infrastructure
- Operate systems behind Cloudflare (WAF, CDN, traffic management)
Production Reliability & Incident Ownership
- Own production incident response end-to-end (triage, mitigation, coordination)
- Lead high-severity outage response under pressure
- Drive root cause analysis (RCA) and enforce follow-ups
- Continuously improve system resilience and recovery mechanisms
Observability & System Insight
- Design and operate end-to-end observability (metrics, logs, tracing)
- Build high-signal monitoring, alerting, and dashboards
- Define and enforce SLIs/SLOs and alerting standards
- Reduce alert fatigue and improve signal-to-noise ratio
What You Bring
- Deep experience operating production systems on AWS (ECS/Fargate, EC2, networking, IAM)
- Expert-level Terraform experience managing infrastructure at scale
- Strong experience with containerized applications and distributed systems (e.g., Kafka)
- Experience operating multi-tenant, highly available systems
- Proven ownership of production on-call and resolving critical incidents
- Strong systems fundamentals (Linux, networking, debugging)
- Strong scripting ability (Bash, Python or equivalent)
- Experience designing and operating CI/CD systems
- Strong understanding of security best practices (IAM, secrets management)
Nice to Have
- Experience operating multi-region or globally distributed systems
- Experience working with Cloudflare at scale
- Experience optimizing high-throughput or event-driven systems
- A leading AI technology company is seeking a DevOps Engineer to own the infrastructure layer of their AI employee platform. This staff-level role involves building and maintaining deployment processes, overseeing Kubernetes infrastructure, and ensuring system reliability...SuggestedRemote job
- ...a step change in what an AI employee can do. The engineering problems are hard and the surface area is enormous... ...work reliably is the job. You’ll own the full DevOps and infrastructure layer at Artisan. This is a staff‑level role where you’ll set the foundation, define...SuggestedLocal areaImmediate startRemote work
$221k - $277k
Apply for the Staff DevOps Engineer role at Crunchyroll. About Crunchyroll Founded by fans, Crunchyroll delivers the art and culture of anime to a passionate community. We serve over 100 million anime and manga fans in more than 200 countries and territories, powering streaming...SuggestedWork at officeFlexible hours3 days per week- A leading streaming platform is looking for a Staff DevOps Engineer in San Francisco, CA, to automate and scale systems supporting their streaming services. The role involves leading projects on infrastructure automation, best-practice adoption, and collaboration across...Suggested
- ...while maintaining consistency, security, and performance across BI tools, spreadsheets, and embedded applications. As a Staff DevOps Engineer at Cube, you will set the technical direction for the infrastructure that runs Cube Cloud and the agentic analytics...SuggestedRemote work
$245k - $295k
...of a high‑performing team that believes in each other, come build with us at Crusoe. About the Role We are seeking a Sr. Staff Engineer, Platform R&D to embed senior research and development capacity directly into Crusoe's Managed Platform Services (MAPS) team. In...Temporary work$180k - $280k
...How you’ll make an impact As a Senior/Staff Engineer at sunbeam, you’ll provide technical leadership across the product while remaining deeply hands-on. You’ll partner closely with the CTO to shape system architecture, guide execution, and raise the quality bar...Work at officeRelocation package$215k - $265k
...performance environments” - Marc Hamilton, VP, Solutions Architecture & Engineering | NVIDIA DDN is the global leader in AI and multi-cloud data... ...AI and data storage. Job Description We are seeking a Sr Staff Software Engineer for the ongoing development of an S3...Local areaRemote workWorldwide- Plato is looking for a Member of Technical Staff to develop infrastructure for training AI agents in San Francisco. In this role, you will build and operate systems that enhance reliability and scalability for complex AI experiments. You'll work closely with researchers...
- ...We are seeking a highly skilled Senior AWS DevOps Engineer with strong expertise in Terraform and AWS cloud services to design, implement, and manage scalable, secure, and automated cloud infrastructure. The ideal candidate will have hands-on experience with Infrastructure...Remote work
$215k - $265k
Overview We are seeking a Senior Staff Engineer - LustreFS with 15+ years of experience in distributed storage, Linux kernel and large-scale... ...degradation. Partner with principal engineers, support, QE, DevOps and release teams to improve product quality, test depth and release...Local area- A staffing and technology solutions company based in San Francisco is seeking a qualified candidate with expertise in Kubernetes and Containerization technologies. The ideal applicant will have at least 6 years of experience, including 3-4 years with OpenShift and 3+ years...
$145k - $185k
...Description We are seeking a highly skilled and technically strong Staff Engineer in Test to lead system level quality engineering efforts for... ...closely with architects, developers, release engineering, DevOps, and customer engineering to drive quality‑first design decisions...Local areaRemote work- ...The Role Abridge’s services and engineering teams are in hyperscale mode. We are looking for experienced Staff Platform Engineers to join our team and help scale our cloud infrastructure, developer platform, and operational maturity in kind. You’ll work on a centralized...Hourly payFull timeLocal areaRemote workFlexible hours
- A technology startup in the San Francisco Bay Area is seeking a DevOps Engineer to build and maintain scalable cloud infrastructure. The ideal candidate will have 4-10+ years of experience and strong skills in Docker, Kubernetes, and CI/CD systems. This role offers competitive...
$90k - $125k
...Platform Engineer | Cloud Security, AI, Start-Up | LATAM (Remote) Brio Digital is partnered with one of the fastest growing and innovative... ...Minimum 3–5 years’ experience as a Platform, Cloud or DevOps Engineer. Previous experience as a Software Engineer is highly...Remote work$160k - $300k
...Databricks, GM, and Character, our mission is to revolutionize how engineering decisions are made, turning complexity into clarity for the... ...-defining company together. About the Role As a Senior / Staff Infrastructure Engineer at Apiphany, you’ll design, build, and...Work at officeVisa sponsorshipFlexible hours$248k - $282k
The Staff CI/CD Engineer is pivotal in transforming our CI/CD landscape to enhance developer efficiency across the organization. You will architect... ...abreast of the latest trends and technologies in CI/CD and DevOps, recommending and implementing improvements to our systems....Work at officeLocal areaWork from homeWorldwide- Icehouseventures is seeking a skilled DevOps Engineer in San Francisco to join their Infrastructure team. This role focuses on building and maintaining Docker-based environments, managing infrastructure using Pulumi, and optimizing CI/CD workflows. Ideal candidates will...
- A healthcare AI company is seeking a Sr. Infrastructure Engineer to enhance and scale their systems. Responsibilities include managing infrastructure with Terraform and Kubernetes, creating monitoring solutions, and troubleshooting. Ideal candidates will have 5+ years...Remote job
- ...role involves extensive work with cutting-edge technologies and various deployment strategies to enhance the efficiency of internal engineering teams. The ideal candidate has experience with Pulumi, Terraform, and deploying applications using GitOps methodologies. A...
$190k - $215k
Gridware Technologies Inc. in San Francisco is looking for a DevOps Engineer to manage AWS infrastructure, scale monitoring solutions, and collaborate with teams to enhance grid reliability. The ideal candidate has over 5 years of experience, particularly with Kubernetes...- Lemon.io is seeking a Senior DevOps Engineer for remote work. This unique position offers flexibility and autonomy, emphasizing asynchronous communication. Candidates should have at least 4 years of DevOps experience and expertise in Azure DevOps or AWS/GCP. Strong self...Remote job
$170k - $190k
...days for team or company events. Identity & Corporate Security Engineering @ Ironclad In this role, you’ll own security‑critical identity... ...engineering for macOS and Windows SW Eng/Dev engineering and DevOps proficiency: Python and/or Go, Terraform, GAM scripting,...Full timeContract workWork at office$200k - $300k
...collaboration that creates great hospitality experiences, is what powers Engineering teams to build great products. As we expand from national... ...," - Robert Linder, CFO of Loop AI Customer, Lazy Dog Staff Engineer / Tech Lead / Senior Software Engineer / Founding...$198k - $233k
...Staff FEA Engineer Hybrid - San Diego, California; Hybrid - San Francisco, California Our mission at Oura is to empower every person to own their inner potential. Our award-winning products help our global community gain a deeper knowledge of their readiness, activity...Work at officeLocal areaRemote workFlexible hours- EITACIES Inc. is seeking an experienced engineer to take ownership of complex platform challenges and build innovative automation solutions. You will manage Kafka and NoSQL platforms, design scalable architectures, and integrate AI capabilities into operational tools....
- ...Staff Engineer Emergent builds autonomous coding agents that replace traditional software development by generating, testing, and deploying production applications directly from plain-language intent. Our systems run in production at global scale and are used to build...Flexible hours
$200k - $280k
...Senior/Staff Engineer Location: San Francisco, CA or New York, NY Compensation: $200,000 – $280,000 + 0.4% – 0.8% Equity Type: Full-Time Visa Sponsorship: H-1B, O-1, OPT Priority: High (Hiring Multiple) About the Company Client is building global...Full timeH1bVisa sponsorship- ...the world's top educational institutions Work together with engineers, scientists, operators, and more from Palantir, Meta, Scale AI,... ...complex data at the largest scale. About the Role As a Staff Forward Deployed Engineer, you'll define and drive the technical...Full timeWork at officeRemote workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Devops Engineer. Be the first to apply!
- assistant civil engineer San Francisco, CA
- engineering aide San Francisco, CA
- assistant mechanical engineer San Francisco, CA
- assistant engineering manager San Francisco, CA
- project engineer assistant project manager San Francisco, CA
- senior staff systems engineer San Francisco, CA
- staff automation engineer San Francisco, CA
- staff design engineer San Francisco, CA
- staff security engineer San Francisco, CA
- staff engineer San Francisco, CA


