Senior Site Reliability Engineer
OutSystems, Inc.
Hybrid onsite in Menlo Park, CA. Responsibilities Lead and onboard services and teams to the reliability tenets. Establish and maintain Service Level Objectives (SLOs) and Service Level Agreements (SLAs). Design and implement scalable, reliable, and secure infrastructure, ensuring cloud‑native best practices. Collaborate with software development teams to build resilient, observable, fault‑tolerant, recoverable, and scalable systems. Implement monitoring, alerting, logging, and tracing solutions to detect and respond to incidents. Lead incident response efforts, ensuring rapid resolution and minimal downtime, and conduct root‑cause analysis (RCA) and post‑mortems. Automate operational tasks, focusing on fast incident detection and recovery. Program in Python, using Gen AI tooling to accelerate automation and tool development. Foster a culture of continuous improvement and knowledge sharing. Communicate effectively with stakeholders, providing updates on system reliability and performance. Participate in on‑call rotation to provide 24/7 support for production systems. Performance Indicators SLA and Service Level Objectives (SLO) compliance; SLO coverage and detection ratio; Mean time to acknowledge (MTTA); Mean time to resolve (MTTR). Qualifications Bachelor's or Master’s degree in Computer Science or equivalent. 6+ years of experience in Site Reliability Engineering, managing infrastructure and services at scale. History of end‑to‑end project delivery. Experience managing Hadoop and Kubernetes infrastructure or equivalent. Advanced knowledge of Linux, networking, and containers. Proficiency in at least one high‑level programming language (Python, Go, etc.). Strong troubleshooting and debugging skills. Fluency in English with excellent communication skills. Experience with prompt engineering, AI‑native IDEs, or AI assistants such as Cursor, GitHub Copilot, or Claude. Technical Skills Establishment, monitoring, and improvement of SLOs, SLIs, and SLAs aligned with business needs. Containerization technologies and orchestration platforms—mainly Kubernetes and EKS (CKA, CKAD, CKS certifications are valued). Automation and Infrastructure as Code (IaC) tools, such as AWS CloudFormation, Terraform, Puppet, Chef, Spacelift, etc. Python, Go, Bash/Shell scripting, or other automation languages. Familiarity with AWS services like EC2, RDS, ELB, CloudFront, Lambda, etc. Monitoring and troubleshooting complex distributed systems using Grafana, ELK stack, Prometheus, or similar. Designing resilient and fault‑tolerant systems; debugging complex distributed systems. Soft Skills Effective communication (oral and written) in English, with empathy for stakeholders. Collaboration and proactive presentation of ideas to leadership. Humbleness—admitting mistakes, mitigating impact, and learning from errors. Accountability—owning problems and driving them to resolution. Negotiation skills—defusing conflicts and leading toward mutual agreement. Process orientation—following defined processes while challenging inefficiencies. Problem‑solving and critical thinking—breaking problems into smaller parts and analyzing objectively. EEO Statement As an equal opportunity employer, all qualified applicants receive equal consideration regardless of race, origin, religion, sex, sexual orientation, gender identity, disability, veteran status, or any other protected status. #J-18808-Ljbffr OutSystems, Inc.
$181.69k - $213.75k
...Senior Site Reliability Engineer San Francisco, California; Santa Clara, California; Seattle, WA The Company You'll Join Carta connects founders, investors, and limited partners through world-class software, purpose-built for everyone in venture capital, private...SeniorFull timeWork at office$195k - $240k
...Senior Site Reliability Engineer San Francisco (Hybrid) At You.com, we are building the AI Search Infrastructure that powers modern AI systems. Our goal is to create the trusted knowledge layer that agents, applications, and enterprises rely on to retrieve real-...SeniorFull timeImmediate startRemote workWork from homeFlexible hours$127k - $249k
...The Team Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational... ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper)....SeniorWork at officeLocal areaRemote workWorldwideFlexible hours$159.2k - $301.6k
...running Graphs on the cloud. In this reliability-focused role, you will own the availability... .... You'll partner with the backend engineers building these APIs to make sure the system... ...Science. ~5-10 years of experience in site reliability engineering, infrastructure,...SeniorTemporary workLocal areaWorldwide$166.9k - $225.9k
...Summary: Drata's SRE team operates as both a central engineering function and an embedded reliability practice. You'll be part of a close-knit SRE team... ...What you'll bring: ~6+ years of experience in Site Reliability Engineering, Cloud Engineering, or building...SeniorWork at officeImmediate startWorldwideMonday to FridayFlexible hours$220k - $235k
...Staff/Senior Staff Site Reliability Engineer Ironclad is the leading AI contracting platform that transforms agreements into assets. Contracts move faster, insights surface instantly, and agents push work forward, all with you in control. Whether you're buying or selling...SeniorFull timeContract workWork at office$181k - $263k
...and supporting deployments of global products, and providing first line operational support. We are looking for a Senior Staff Site Reliability Engineer who will set the technical direction for reliability engineering across LiveRamp's global infrastructure. This is a...SeniorWork from homeFlexible hoursNight shift- US Corp. is seeking a Lead Site Reliability Engineer to spearhead our mission of delivering highly available and performant systems. With an average of over 12 years of industry experience, the successful candidate will bridge the gap between software development and systems...Senior
- OutSystems, Inc. is looking for a Site Reliability Engineer to join their team in San Francisco, CA. The ideal candidate will lead the onboarding of services and teams to reliability tenets while establishing SLOs and SLAs. Proficiency in Python and experience with Kubernetes...SeniorFlexible hours
- ...co‑founders with PhDs in AI, Math, and Computer Science — is poised to redefine computing. About the Role We're seeking a Site Reliability Engineer to ensure Hyperbolic's GPU marketplace and AI infrastructure operate with exceptional reliability, performance, and...Senior
$300k
...thousands of H100s, H200s, and B200s, ready for experimentation, full-scale model training, or inference. As a Platform Engineer/Senior Site Reliability Engineer, you’ll own the reliability, performance, and automation of this GPU-powered infrastructure, ensuring...Senior- What you’ll do As a Senior Site Reliability Engineer, you’ll work closely with product teams in Spend to deliver and maintain scalable, reliable cloud infrastructure in support of key product initiatives. Aligned to the roadmap, you’ll lead on infrastructure design and...Senior
- ...work from home day is currently Tuesday. Engineering at Lambda is responsible for building... ...observability adoptable and improve product reliability. Lead members of other engineering teams... ...in Go Have 5+ years of experience in Site Reliability Engineering practices Possess...SeniorWork at officeLocal areaWork from home
$140k - $220k
About the Job You’ll own reliability and operational excellence for Pylon's production systems. This means designing and implementing... ...scale as we grow. You'll build tooling that makes the entire engineering team more effective, establish on-call rotations and runbooks...Senior- ...about this role, we encourage you to apply. The Role As a Senior Platform Engineer, you are a champion for DevOps and SRE culture and... ...goals are met. What You Will Be Doing Improving production reliability and system resilience within an SRE scoped team Championing...SeniorFlexible hours
- We are seeking a Sr. Site Reliability Engineer to join our team and run critical infrastructure for our blockchain and web applications. You’ll learn to deploy and maintain a fleet of RPC and validator nodes for multiple blockchain networks. You’ll also provide guidance...SeniorRemote job
- ...acquisition, and Connor was a machine learning research engineer at Scale AI. The rest of our team comes from... ...redefining go-to-market with state-of-the-art AI. As a Senior SRE, you'll tackle the scaling and reliability challenges that come with adding terabytes of data...Senior
- For more information, please read ourSenior Site Reliability Engineer page is loaded## Senior Site Reliability Engineerlocations: US - San Francisco Bay Areatime type: Full timeposted on: Posted Yesterdayjob requisition id: R1478**There are NO limits to your career: come...SeniorImmediate startRemote workWorldwide
$50 per hour
...years of professional SRE experience 5+ years of experience contributing to architecture and design (architecture, design patterns, reliability and scaling) of new and current systems Bachelor's Degree in Computer Science or related field, or 8+ years relevant work...SeniorTemporary workWork experience placement$60 per hour
Senior Site Reliability Engineer (Copy) Seattle Hybrid (Hybrid location). Full-time. About Us Supio is a trusted AI platform purpose-built for law firms, reshaping how data drives impactful outcomes. Our innovative approach blends technology with deep legal expertise,...SeniorFull timeWork at officeFlexible hours$166.9k - $225.9k
Job Summary Drata's SRE team operates as both a central engineering function and an embedded reliability practice. You'll be part of a close-knit SRE team... ...organization. What you’ll bring 6+ years of experience in Site Reliability Engineering, Cloud Engineering, or...SeniorFlexible hours$117k - $209.33k
Position Overview Want to help make a better world? As a Senior Site Reliability Engineer at Autodesk, you will build and operate reliable, secure, and scalable cloud services for Autodesk GovCloud products. This foundational role helps establish the operating model, reliability...Senior$165k - $241.4k
...efficient, functional and very effective. We’re looking for talented engineers with a software or operations background, experienced in... ...closely with our application development teams to ensure the reliability, performance and security of our infrastructure....SeniorFull timeTemporary workWork at officeFlexible hours1 day per week- CloudDevs: Senior Web site Reliability Engineer (SRE) CloudDevs works with fast-moving, venture-backed startups throughout the US. We’re constructing a pool of world-class Web site Reliability Engineers for present roles and for upcoming alternatives. You’ll both be positioned...Senior
- Senior Site Reliability Engineer - AI Infrastructure Location: Global Remote / San Francisco • Full-Time About Andromeda Andromeda Cluster was founded by Nat Friedman and Daniel Gross to give early‑stage startups access to the kind of scaled AI infrastructure once reserved...SeniorFull timeRemote work
- Drata is seeking a Senior Site Reliability Engineer in San Francisco. In this role, you will engage in reliability architecture for product teams, lead production readiness reviews, and build automation around monitoring and alerting. The ideal candidate has at least 6...Senior
- Anyscale is seeking a Senior Site Reliability Engineer to join our Infrastructure team in San Francisco, California. The ideal candidate will enhance distributed AI application development and work on open-source Ray integration. We need engineers with strong experience...Senior
$127k - $249k
Senior / Staff Engineer - SRE, InfraSec We are looking for an experienced Senior or Staff Engineer for our SRE, InfraSec team to guide the security of our cloud‑based infrastructure. You will be highly hands‑on technically while also mentoring a small team of SREs. The...SeniorLocal areaRemote work- ...getting here.) About the Role We're building infrastructure that has to perform under real-world scale, reliability, and security demands — and we're looking for an engineer who wants to own the foundation it runs on. This isn't a traditional "keep the lights on" role. You...Senior
- ...by Andreessen Horowitz, NEA, and Addition with $250+ million raised to date. About the role Anyscale is looking for a Senior Site Reliability Engineer to join the Infrastructure team. Anyscale aims to provide the next generation of tools and infrastructure to make...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Site Reliability Engineer. Be the first to apply!
- site reliability engineer remote San Francisco, CA
- site reliability engineer sre San Francisco, CA
- site reliability engineer San Francisco, CA
- senior cloud service delivery manager San Francisco, CA
- senior business analyst contract San Francisco, CA
- senior product design engineer San Francisco, CA
- senior game producer San Francisco, CA
- senior software manager San Francisco, CA
- senior manager business analytics San Francisco, CA
- senior marketing account manager San Francisco, CA

