CloudDevs: Senior Site Reliability Engineer (SRE)

Breakout Tools

CloudDevs works with fast-moving, venture-backed startups across the US. We’re building a pool of world-class Site Reliability Engineers for current roles and for upcoming opportunities. You will either be placed directly into one of our partner startups or added to our vetted SRE network for future projects.

This role is ideal for engineers who care about reliability, metrics, performance, and building simple, scalable systems. If you enjoy designing for scale and improving how teams ship software, you’ll fit right in.

Key Responsibilities
  • Work as a hands‑on engineer focused on system reliability, performance, and observability.
  • Define and track SLIs, SLOs, and error budgets.
  • Optimize monitoring cost and signal quality across metrics, logs, and traces.
  • Improve deployment safety, canary rollouts, and UAT pipelines.
  • Build tools for automated and local performance testing, and track benchmarks over time.
  • Lead resilience work like failover drills, chaos tests, and redundancy checks.
  • Partner with engineering teams to improve scaling patterns and architecture as the product grows.
  • Support incident response processes and help reduce operational noise.
  • Write clean, maintainable code in Go, Python, or Node.js.
  • Contribute to CI/CD improvements and automation efforts.
  • Collaborate with engineers across teams to raise reliability standards.

Requirements
  • 5+ years in SRE, DevOps, or Platform Engineering roles.
  • Strong experience with cloud infrastructure (AWS preferred), Terraform, and Kubernetes.
  • Deep knowledge of observability tools like Datadog, Prometheus, or OpenTelemetry.
  • Strong debugging skills across services, networking, and data layers.
  • Hands‑on experience designing and monitoring SLIs/SLOs.
  • Experience with CI/CD tools such as GitHub Actions, Jenkins, or ArgoCD.
  • Ability to write production‑grade code in Go, Python, or Node.js.
  • Comfort working independently in fast‑paced environments.

Nice to Have
  • Experience tuning observability costs and optimizing data ingestion.
  • Exposure to chaos engineering and progressive deployments.
  • Background with high‑throughput or latency‑sensitive systems.
  • Experience running AWS at scale (EKS, Lambda, DynamoDB, S3).
  • Experience in regulated industries such as fintech or payments, or in SOC 2 environments.
  • Experience with performance‑testing pipelines or load‑testing automation.
  • Experience handling systems processing tens of millions of API calls.

Open Pool for SREs

Even if you don’t meet every requirement or aren’t a fit for the current role, strong SREs with real production experience are welcome to join our talent pool. We regularly place engineers with different strengths across reliability, DevOps, platform, observability, backend, and infrastructure engineering.

Vacancy posted 2 days ago