CloudDevs: Senior Site Reliability Engineer (SRE)
Breakout Tools
CloudDevs works with fast-moving, venture-backed startups across the US. We’re building a pool of world-class Site Reliability Engineers for current roles and for upcoming opportunities. You will either be placed directly into one of our partner startups or added to our vetted SRE network for future projects.
This role is ideal for engineers who care about reliability, metrics, performance, and building simple, scalable systems. If you enjoy designing for scale and improving how teams ship software, you’ll fit right in.
Key Responsibilities
- Work as a hands‑on engineer focused on system reliability, performance, and observability.
- Define and track SLIs, SLOs, and error budgets.
- Optimize monitoring cost and signal quality across metrics, logs, and traces.
- Improve deployment safety, canary rollouts, and UAT pipelines.
- Build tools for automated and local performance testing and track benchmarks.
- Lead resilience work like failover drills, chaos tests, and redundancy checks.
- Partner with engineering teams to improve scaling patterns and architecture as the product grows.
- Support incident response processes and help reduce operational noise.
- Write clean, maintainable code in Go, Python, or Node.js.
- Contribute to CI/CD improvements and automation efforts.
- Collaborate with engineers across teams to raise reliability standards.
Requirements
- 5+ years in SRE, DevOps, or Platform Engineering roles.
- Strong experience with cloud infrastructure (AWS preferred), Terraform, and Kubernetes.
- Deep knowledge of observability tools like DataDog, Prometheus, or OpenTelemetry.
- Strong debugging skills across services, networking, and data layers.
- Hands‑on experience designing and monitoring SLIs/SLOs.
- Experience with CI/CD tools such as GitHub Actions, Jenkins, or ArgoCD.
- Ability to write production‑grade code in Go, Python, or Node.js.
- Comfort working independently in fast‑paced environments.
Nice to Have
- Experience tuning observability costs and optimizing data ingestion.
- Exposure to chaos engineering and progressive deployments.
- Background with high‑throughput or latency‑sensitive systems.
- AWS at scale (EKS, Lambda, DynamoDB, S3).
- Experience in regulated industries like fintech, payments, or SOC2 environments.
- Performance testing pipelines or load‑testing automation.
- Experience handling systems processing tens of millions of API calls.
Open Pool for SREs
Even if you don’t meet every requirement or aren’t a fit for the current role, strong SREs with real production experience are welcome to join our talent pool. We regularly place engineers with different strengths across reliability, DevOps, platform, observability, backend, and infrastructure engineering.
#J-18808-Ljbffr- A leading language learning platform is seeking an experienced SRE Engineer to ensure the reliability and resilience of their infrastructure. Responsibilities include leading incident response, improving observability, and collaborating with various teams to enhance platform...Senior
$300k
...experimentation, full-scale model training, or inference. As a Platform Engineer/Senior Site Reliability Engineer, you’ll own the reliability, performance, and... .... Skills / Must Have: ~7+ years of experience in SRE, DevOps, or Infrastructure Engineering roles supporting...SeniorPermanent employment- ...that possible. We’re a team of doctors, engineers, designers, researchers, and creatives building... ...end-to-end. Improve operational reliability: Identify recurring issues and reliability... .... What we’re looking for 3–6+ years in SRE, DevOps, Platform, or operations-heavy engineering...SeniorWork at officeWorldwide
$210k - $240k
...Join to apply for the Senior Site Reliability Engineer role at Alembic Technologies This range is provided by Alembic Technologies. Your actual... ...re looking for an experienced Site Reliability Engineer (SRE) to help us scale our platform with reliability, observability...SeniorFull time$140k - $220k
...About the Job You’ll own reliability and operational excellence for Pylon's production systems... ...'ll build tooling that makes the entire engineering team more effective, establish on-call... ...not a pure ops role. At Pylon, we believe SRE work should be a maximum of 50%...Senior- ...acquisition, and Connor was a machine learning research engineer at Scale AI. The rest of our team comes from... ...redefining go-to-market with state-of-the-art AI. As a Senior SRE, you'll tackle the scaling and reliability challenges that come with adding terabytes of data...Senior
- ...values and enthusiasm for building a great culture and product, you will find a home at Fieldguide. About the Role As a Senior Site Reliability Engineer (SRE) at Fieldguide, you will be responsible for ensuring the reliability, scalability, and observability of our...SeniorRemote workWork from homeFlexible hours
- ...The TeamPlatform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational functions that... ...alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper...SeniorWork at officeLocal areaRemote workWorldwideFlexible hours
- ...For more information, please read ourSenior Site Reliability Engineer page is loaded## Senior Site Reliability Engineerlocations: US - San Francisco Bay Areatime... ...Engineering Function Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software...SeniorImmediate startRemote workWorldwide
$140k - $205k
...Senior Technology Site Reliability Engineer Cooley is seeking a Senior Site Reliability Engineer to join the Infrastructure & Development Operationsteam... ...: The Senior Technology Site Reliability Engineer("SRE") is responsible for ensuring the reliability, scalability...SeniorFull timeTemporary workWork at officeFlexible hoursWeekend work- # Senior Site Reliability EngineerHybrid - San Francisco**Our Mission & Values:** At Drata, we help companies earn and keep... ...stories, and career news.**Job Summary:**Drata's SRE team operates as both a central engineering function and an embedded reliability practice....SeniorWork at officeImmediate startWorldwideMonday to FridayFlexible hours
$165k - $225k
...growing and changing Stellar ecosystem. SDF is looking for a Senior Site Reliability Engineer to help build and operate the foundation that powers our... ...of working in cloud-based systems operations, as a SRE or DevOps engineer. First-hand experience with configuration...SeniorTemporary workWork at officeLocal areaWorldwideFlexible hours$50 per hour
...system technologies. You Will Thrive In This Role If: 5+ years of professional SRE experience 5+ years of experience contributing to architecture and design (architecture, design patterns, reliability and scaling) of new and current systems Bachelor's Degree in Computer...SeniorTemporary workWork experience placement$166.9k - $225.9k
...Job Summary Drata's SRE team operates as both a central engineering function and an embedded reliability practice. You'll be part of a close-knit SRE team where you grow your... ...What you’ll bring 6+ years of experience in Site Reliability Engineering, Cloud Engineering, or...SeniorFlexible hours$220k - $235k
...are seeking a strategic, high-output Staff/Senior Staff SRE to define the future of our cloud platform and champion engineering excellence across Ironclad. In this role,... ...leadership and strategic direction for the Site Reliability Engineering team and our broader Cloud...SeniorFull timeWork at office- Fieldguide is seeking a Senior Site Reliability Engineer to ensure the reliability and scalability of our production systems in San Francisco, CA. The... ...Candidates should have at least 5 years of experience in SRE or related fields, proficiency in operating distributed cloud...SeniorRemote jobFlexible hours
- ...An innovative R&D company in San Francisco is seeking a Site Reliability Engineer to join its Platform Engineering team. This position focuses on ensuring the reliability and performance of an AI-powered code review platform. The ideal candidate will have 6-8 years of...Senior
- ...Senior Site Reliability Engineer - AI Infrastructure Location: Global Remote / San Francisco • Full-Time About Andromeda Andromeda Cluster was founded... ...and engineering. The Role This is not a generalist SRE role. You will design, operate, and debug large‑scale GPU...SeniorFull timeRemote work
$227.2k - $324.5k
...About the Role: Site Reliability Engineering (SRE) at Tubi is not a traditional operations team. We are a software engineering organization that... .... We are seeking an experienced and visionary Senior SRE Manager to lead and grow our newly built Site Reliability...SeniorFull timeContract workTemporary workLocal areaFlexible hours$210.6k - $305.1k
...~ Lead, inspire, and develop a talented SRE team, fostering a culture of innovation,... ...~ You have led a distributed team of 5+ engineers, can demonstrate strong technical vision... ...insurance. Please see the Cisco careers site to discover more benefits and perks. Employees...SeniorFull timeTemporary workLocal areaFlexible hours$181k - $263k
...evolving compliance and privacy requirements.The Global SRE team is responsible for owning and supporting deployments... ...first line operational support. We are looking for a Senior Staff Site Reliability Engineer who will set the technical direction for reliability engineering...SeniorWork from homeFlexible hoursNight shift- What you’ll do As a Senior Site Reliability Engineer, you’ll work closely with product teams in Spend to deliver and maintain scalable, reliable cloud... ...not mandatory. Minimum qualifications 6+ years in an SRE, DevOps, or infrastructure-focused engineering role. Bachelor...Senior
$127k - $249k
...We are looking for an experienced Senior or Staff Engineer for our SRE, InfraSec team, to guide the security of our cloud-based infrastructure. As a Staff SRE, you will be very hands‑on technically while also mentoring a small team of SREs. The InfraSec team collaborates...SeniorLocal areaRemote workWorldwideFlexible hours$60 per hour
Senior Site Reliability Engineer (Copy) Seattle Hybrid (Hybrid location). Full-time. About Us Supio is a trusted AI platform purpose-built for law firms... ...zones. You’re a Great Fit If You Have 3-6+ years in SRE, DevOps, or infrastructure roles with production ownership...SeniorFull timeWork at officeFlexible hours- ...Airwallex- is seeking a Senior Site Reliability Engineer in San Francisco, California, to work with product teams to build and maintain robust cloud... ...performance of services. The ideal candidate has over 6 years of SRE or DevOps experience, holds a Bachelor's degree in...Senior
$163k - $203k
GoTo Meeting is looking for a Senior Site Reliability Engineer in San Francisco. You will be responsible for the reliability, scalability, and security of Prosper’s Cloud Platform portfolio. This role requires expertise in Kubernetes, cloud platforms (preferably GCP),...Senior- ...Senior Infrastructure Engineer – Bland As a Senior Infrastructure Engineer at Bland, responsibilities include... ...processing with strict latency and reliability requirements; building and supporting... ...in global deployments. Work with Site Reliability Engineering to establish...SeniorTemporary work
$15 per hour
Summary The Wikimedia Foundation is looking for a Senior Site Reliability Engineer to support and develop the platform serving the world’s favorite... ...around the globe. Wikimedia’s Site Reliability Engineering (SRE) team is principally responsible for ensuring our global...SeniorPermanent employmentFor contractorsRemote work- Drata is seeking a Senior Site Reliability Engineer in San Francisco. In this role, you will engage in reliability architecture for product teams, lead... ...The ideal candidate has at least 6 years of experience in SRE or Cloud Engineering, expertise in Terraform and Datadog,...Senior
$232k - $319k
...scale the service with great people and reliable, cost-effective, and efficient infrastructure... ...org and various initiatives across SRE & Infrastructure organization. Lead the... ...partnership with architects and product engineering Build a world-class observability...SeniorPermanent employmentLocal areaWorldwideFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to CloudDevs: Senior Site Reliability Engineer (SRE). Be the first to apply!
- site reliability engineer remote San Francisco, CA
- site reliability engineer sre San Francisco, CA
- site reliability engineer San Francisco, CA
- senior cost analyst San Francisco, CA
- senior computer engineer San Francisco, CA
- senior electrical estimator San Francisco, CA
- senior program specialist San Francisco, CA
- senior manager quality engineering San Francisco, CA
- senior software test automation engineer San Francisco, CA
- senior design technologist San Francisco, CA

