Site Reliability Engineer - Scale & Observability
gamma.app
A dynamic tech firm located in San Francisco is seeking a Site Reliability Engineer to enhance operational health across their production systems. This high-impact role demands expertise in AWS and strong programming skills. You will manage production systems' reliability and lead incident response efforts to prevent issues, all while contributing to the scalability and efficiency of their services. Ideal candidates will have 5+ years of relevant experience and a passion for leveraging technology to drive outcomes. #J-18808-Ljbffr gamma.app
$175k - $250k
...did my part and supported the Regular Toilet is seeking a Site Reliability Engineer to enhance the reliability and performance of our systems at... ...environment. Join us to help ensure our platform runs reliably at scale. #J-18808-Ljbffr I did my part and supported the Regular...SuggestedRemote jobFlexible hours- ...is currently Tuesday. Engineering at Lambda is responsible for building and scaling our cloud offering. Our... ...Do Deploy and operate observability platforms for logging,... ...and improve product reliability. Lead members of other... ...years of experience in Site Reliability Engineering...SuggestedWork at officeLocal areaWork from home
- A leading AI research company based in San Francisco is seeking experienced reliability engineers to scale their infrastructure and ensure system performance and reliability. This role involves collaborating with diverse teams to develop resilient systems and enhance operations...Suggested
- Fieldguide is seeking a Senior Site Reliability Engineer to ensure the reliability and scalability of our production systems in San Francisco... ...teams to define reliability standards and build robust observability practices. Candidates should have at least 5 years of experience...SuggestedRemote jobFlexible hours
$147k - $202k
...Overview: We are seeking a highly technical Staff Observability Site Reliability Engineer with a specialty in Splunk to own and evolve our Splunk... ...: Eliminate "toil" by automating the deployment and scaling of observability agents and collectors. Required Skills...SuggestedPermanent employmentWork at officeLocal areaWorldwideFlexible hours$230k - $310k
A tech company is seeking an experienced Site Reliability Engineer to ensure the reliability and performance of its production systems across AWS infrastructure. You will build observability tools, lead incident responses, and collaborate on architectural improvements....$177.19k - $364.8k
Pinterest is seeking a Staff Software Engineer to join the Observability team. This role involves designing and building observability solutions while collaborating with various teams. Ideal candidates will have over 7 years of experience in distributed systems, a Bachelor...Work at office- ...in San Francisco seeks infrastructure engineers to enhance the tooling and systems... ...include building GPU orchestration, scaling cloud batchjob systems, and designing... ...infrastructure and a strong focus on reliability and observability. This position is in-person, and international...Visa sponsorship
- A leading AI research company in San Francisco is seeking a Software Engineer to enhance infrastructure supporting cutting-edge AI systems. The role involves designing reliable systems and optimizing performance for millions of users. Ideal candidates possess experience...
$175k - $250k
...base of SaaS companies. About the Site Reliability Engineering Team The Site Reliability Engineering... ...remains fast, reliable, and resilient at scale. We build the systems and practices... ...modes Care deeply about uptime, observability, and performance, placing...Remote work$60 per hour
Senior Site Reliability Engineer (Copy) Seattle Hybrid (Hybrid location). Full-time. About Us Supio... ...bringing total funding to $91 M. We’re scaling rapidly and looking for exceptional... ...coordination. Build safe, repeatable, and observable workflows. GitHub Operations: Manage...Full timeWork at officeFlexible hours- # Senior Site Reliability EngineerHybrid - San Francisco**Our Mission &... ...operates as both a central engineering function and an embedded reliability... ...native stack to help Drata scale reliably for a rapidly... ...artifacts - SLO templates, observability checklists, alerting...Work at officeImmediate startWorldwideMonday to FridayFlexible hours
$166.9k - $225.9k
...operates as both a central engineering function and an embedded reliability practice. You'll be part... ...stack to help Drata scale reliably for a rapidly growing... ...—SLO templates, observability checklists, alerting standards... ...years of experience in Site Reliability Engineering,...Flexible hours- ...role We're looking for a world-class Site Reliability Engineer to ensure the reliability, performance... ...systems that power agentic AI at scale. Your mission: keep our ultra-low-latency... ...our reliability posture end-to-end—observability, performance tuning, incident ops, infrastructure...
- ...Fieldguide. About the Role As a Senior Site Reliability Engineer (SRE) at Fieldguide, you will be... ...ensuring the reliability, scalability, and observability of our production systems. You will... ..., highly available, and capable of scaling with rapid growth. You’ll work closely...Remote workWork from homeFlexible hours
- ...the economics of data integration at scale. And now Airbyte is building the frontier... ...: You'll be the infrastructure and reliability engineer on the Data Replication team - a full-... ...infrastructure. Maintain and enhance observability, alerting, and anomaly detection with...Local area
- ...users while enabling our engineering teams to ship fast.... ...tooling that improves reliability and partnering with engineering... ...systems that are observable, resilient, and easy... ...help shape how Gamma scales to serve its next 100... ...ll bring 5+ years in Site Reliability Engineering...Work at officeWork from home
- ...’re hiring an SRE to join our engineering team at Plenful and take ownership of the reliability and performance of the systems... ...will influence how we build, scale and operate our platform as we... ...What you’ll do Reliability, Observability and Performance: Maintain and...Work at officeRemote workFlexible hours2 days per week
- ...Connor was a machine learning research engineer at Scale AI. The rest of our team comes from... ...Senior SRE, you'll tackle the scaling and reliability challenges that come with adding... ...services, and building the automation and observability that keep Unify fast and reliable at...
- ...computing. About the Role We're seeking a Site Reliability Engineer to ensure Hyperbolic's GPU... ...affordable, accessible AI compute at scale. Who You Are Expert in site reliability... ...automated rollback mechanisms Proficient in observability tools and practices including metrics...
$151.5k - $252.5k
...enable the acceleration of safe AI at scale. As the market leader in both data... ...are looking for an experienced Senior Site Reliability Engineer to join the Veeam Data Cloud (VDC) engineering... ..., the Serverless Framework, etc.) Observability (Azure Monitor, AppInsights, Elastic...Base plus commissionLocal areaWorldwide- ...significantly outperforms individual engineers. We combine language models... ...are seeking an experienced Site Reliability Engineer to join our... ...to deploy, monitor, and scale our services reliably. As... ...monitoring, alerting, and observability solutions using Datadog and...
- What you’ll do As a Senior Site Reliability Engineer, you’ll work closely with product teams in Spend... ...readiness. Lead incident response, observability, and automation across critical systems... ...Able to lead SRE strategy for large‑scale, cross‑functional projects. Strong...
- ...The TeamPlatform Engineering is the department within SRE that is responsible... ...internal service mesh), and observability and alerting systems.The... ...that ensure cluster reliability and security (e.g., CoreDNS,... ...Gatekeeper). As our infrastructure scales to support new use cases and...Work at officeLocal areaRemote workWorldwideFlexible hours
$140k - $205k
...Senior Technology Site Reliability Engineer Cooley is seeking a Senior Site Reliability Engineer... ...and maintain automated, resilient, and observable systems that support high... ...using Terraform Automate deployment, scaling, and recovery processes to reduce manual...Full timeTemporary workWork at officeFlexible hoursWeekend work- ...onboard services and teams to the reliability tenets. Establish and... ...development teams to build resilient, observable, fault‑tolerant, recoverable... .... 6+ years of experience in Site Reliability Engineering, managing infrastructure and services at scale. History of end‑to‑end...
$210.6k - $305.1k
...helping customers deploy at scale while also delivering AI-powered... ...Security, Collaboration, and Observability portfolios Your Impact... ...led a distributed team of 5+ engineers, can demonstrate strong technical... ...Please see the Cisco careers site to discover more benefits and...Full timeTemporary workLocal areaFlexible hours$150k
...Role We are seeking an experienced Site Reliability Engineer (SRE) with a strong focus on... ...implement alerting and dashboards using observability tooling (e.g., CloudWatch, Datadog, Grafana... ...and vulnerability remediation at scale, including OS-level patching (Amazon...$227.2k - $324.5k
...About the Role: Site Reliability Engineering (SRE) at Tubi is not a traditional operations team... ...challenges of building and running large-scale, distributed systems. Our mission is... ...strategy and vision for Tubi's observability, and automation platforms. Partner with...Full timeContract workTemporary workLocal areaFlexible hours- ...growing, early-stage startup to identify a top-tier Site Reliability Engineer who will play a critical role in scaling and strengthening a high-performance platform... ...Deep understanding of system performance, observability, and debugging techniques Experience identifying...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Site Reliability Engineer - Scale & Observability. Be the first to apply!
- site reliability engineer San Francisco, CA
- site reliability engineer sre San Francisco, CA
- site reliability engineer remote San Francisco, CA
- on site coordinator San Francisco, CA
- website content developer San Francisco, CA
- site recruiter San Francisco, CA
- site safety San Francisco, CA
- site services specialist San Francisco, CA
- on-site clinical research associate (traveling/remote) San Francisco, CA
- IT site lead San Francisco, CA


