Site Reliability Engineer
$175k - $250k
About WorkOS
WorkOS builds tools and services for developers to help them implement authentication, identity, authorization, and overall enterprise readiness. The company is fully distributed across North American time zones and supports a fast-growing customer base of SaaS companies.

About the Site Reliability Engineering Team
The Site Reliability Engineering (SRE) team ensures the WorkOS platform remains fast, reliable, and resilient at scale. We build the systems and practices that keep everything running smoothly, handling hundreds of millions of requests and continuously improving service performance.

Who we’re looking for
We seek engineers who are excited to improve the reliability of complex systems and enjoy digging into how things work. As an early member of the SRE team, you’ll help shape our approach to reliability at scale and collaborate closely across the company. We’re looking for people who:
- Bring a generalist mindset and are comfortable working across infrastructure layers, from compute and networking to storage, databases, and app runtime environments
- Are curious and proactive, with a strong desire to understand systems end-to-end and uncover hidden failure modes
- Care deeply about uptime, observability, and performance, treating reliability as a product feature
- Think through architectural trade-offs with reliability, simplicity, and maintainability in mind
- Take initiative, work independently, and follow through, from identifying reliability risks to driving improvements
- Collaborate well with engineers across disciplines and support teams through production readiness, incident response, and post-mortem reviews

Responsibilities
- Design and evolve the systems, tooling, and processes that improve the reliability and performance of WorkOS
- Collaborate with product and infrastructure teams to ensure services are production-ready, observable, and resilient to failure
- Define and measure SLIs/SLOs to guide reliability improvements
- Write and optimize backend systems in TypeScript with a focus on performance, maintainability, and graceful degradation
- Improve our incident response process, lead post-mortems, and drive follow-through on reliability risks
- Develop internal tools and automations that make it easier to operate and scale our systems
- Participate in the on-call rotation: responding to, resolving, and learning from production incidents
- Contribute to design and architecture discussions with a focus on operability and long-term sustainability
- Document systems, share learnings, and help grow a reliability-minded engineering culture

Qualifications
- Experience operating and scaling production systems in cloud environments (we use AWS)
- Familiarity with service reliability concepts: monitoring, alerting, incident response, and root cause analysis
- Comfort working across infrastructure layers (e.g., compute, networking, storage, observability tooling)
- Strong debugging and systems thinking skills, able to follow problems across services and layers
- Ability to work independently, take ownership, and drive projects from problem discovery through resolution

Nice to Have
- Familiarity with Kubernetes or similar orchestration systems
- Exposure to observability stacks (e.g., Prometheus, Grafana, Datadog, OpenTelemetry)
- Exposure to TypeScript or interest in working in a TypeScript-based codebase

The annual US base salary falls within the range of $175,000 to $250,000. This range does not encompass the full spectrum of benefits such as equity, health insurance, vacation time, and paid parental leave. Final compensation will be determined based on experience, skills, and qualifications.

Benefits
- Competitive pay
- Substantial equity grants
- Health care coverage (medical, dental, and vision) for you and your family
- 401(k) matching
- Wellness and fitness allowances
- Paid time off, paid holidays, and unlimited sick leave
- Autonomy and flexibility with remote work

Benefits for those outside the US are available upon inquiry.
Equal Opportunity Employer
WorkOS is an equal opportunity employer, committed to diversity and inclusiveness. We will consider all qualified applicants without regard to race, color, nationality, gender, gender identity or expression, sexual orientation, religion, disability or age.
- ...shape the future of healthcare, we’d love to meet you. About the role: We’re hiring an SRE to join our engineering team at Plenful and take ownership of the reliability and performance of the systems that power our product. You’ll work across our distributed workflow...
  Suggested, Work at office, Remote work, Flexible hours, 2 days per week
$150k
...Job Description. About The Role: We are seeking an experienced Site Reliability Engineer (SRE) with a strong focus on DevSecOps to join our growing engineering team. In this role, you will oversee and maintain the reliability, security posture, and...
Suggested
$163k - $203k
...will be a senior technical contributor on the SRE team, responsible for the reliability, scalability, and security of Prosper’s Cloud Platform portfolio. This is as much a platform engineering role as it is an SRE role: you will maintain the applications that run on our...
Suggested, Work experience placement, Work at office, Local area, Remote work, Flexible hours, 2 days per week
- ...Responsibilities: Lead and onboard services and teams to the reliability tenets. Establish and maintain Service Level Objectives (... ...Science or equivalent. 6+ years of experience in Site Reliability Engineering, managing infrastructure and services at scale. History of...
  Suggested
- US Corp. is seeking a Lead Site Reliability Engineer to spearhead our mission of delivering highly available and performant systems. With an average of over 12 years of industry experience, the successful candidate will bridge the gap between software development and systems...Suggested
- What you’ll do As a Senior Site Reliability Engineer, you’ll work closely with product teams in Spend to deliver and maintain scalable, reliable cloud infrastructure in support of key product initiatives. Aligned to the roadmap, you’ll lead on infrastructure design and...
- Senior Site Reliability Engineer. Locations: US - San Francisco Bay Area. Time type: Full time. Posted on: Posted Yesterday. Job requisition ID: R1478. There are NO limits to your career: come...
  Immediate start, Remote work, Worldwide
$166.9k - $225.9k
Job Summary: Drata's SRE team operates as both a central engineering function and an embedded reliability practice. You'll be part of a close-knit SRE team... ...organization. What you’ll bring: 6+ years of experience in Site Reliability Engineering, Cloud Engineering, or...
Flexible hours
- ...manifesto. About the Role: We're looking for an Infrastructure Engineer to take the lead on scaling our operational resilience as we... ...This is a high-impact, high-trust role where you’ll shape how reliability is done - reducing incident load, building internal tooling, and...
  Worldwide, Shift work
- ...work from home day is currently Tuesday. Engineering at Lambda is responsible for building... ...observability adoptable and improve product reliability. Lead members of other engineering teams... ...in Go. Have 5+ years of experience in Site Reliability Engineering practices. Possess...
  Work at office, Local area, Work from home
- ...in the design of information and operational support systems. Required Skills/Qualifications: BS/MS degree in Computer Science, Engineering, or a related subject. Equivalent experience accepted. Proven working experience in installing, configuring, and troubleshooting...
  Work experience placement, Start working today, Remote work, Flexible hours
$140k - $220k
About the Job: You’ll own reliability and operational excellence for Pylon's production systems. This means designing and implementing... ...scale as we grow. You'll build tooling that makes the entire engineering team more effective, establish on-call rotations and runbooks...
- A dynamic tech firm located in San Francisco is seeking a Site Reliability Engineer to enhance operational health across their production systems. This high-impact role demands expertise in AWS and strong programming skills. You will manage production systems' reliability...
- ...advanced algorithms that significantly outperforms individual engineers. We combine language models with human ingenuity to push the... ...and quality. The Role We are seeking an experienced Site Reliability Engineer to join our Platform Engineering team in the Bay Area...
- The role We're looking for a world-class Site Reliability Engineer to ensure the reliability, performance, and scalability of our AI infrastructure platform. You’ll be building and operating the core systems that power agentic AI at scale. Your mission: keep our ultra-...
$163k - $203k
...will be a senior technical contributor on the SRE team, responsible for the reliability, scalability, and security of Prosper’s Cloud Platform portfolio. This is as much a platform engineering role as it is an SRE role: you will maintain the applications that run on our...
Work experience placement, Work at office, Local area, Remote work, Flexible hours, 2 days per week
- ...alongside clinicians to make that possible. We’re a team of doctors, engineers, designers, researchers, and creatives building tools that... ...for leading incidents end-to-end. Improve operational reliability: Identify recurring issues and reliability risks, and drive fixes...
  Work at office, Worldwide
- ...millions of daily users while enabling our engineering teams to ship fast. You'll own the... ...building automation and tooling that improves reliability and partnering with engineering to... ...services. What you'll bring: 5+ years in Site Reliability Engineering, DevOps, or systems...
  Work at office, Work from home
$125k - $165k
Position: Site Reliability Engineer. Location: San Francisco, CA. Job Id: 434. # of Openings: 1. TELCOR Inc, a leading innovator in laboratory software, is looking for a Site Reliability Engineer to join our TELCOR AI Systems team! Do you have strong experience in cloud...
Temporary work, Work at office, Visa sponsorship, Work visa, Relocation package, Flexible hours
$60 per hour
Senior Site Reliability Engineer (Copy). Seattle Hybrid (Hybrid location). Full-time. About Us: Supio is a trusted AI platform purpose-built for law firms, reshaping how data drives impactful outcomes. Our innovative approach blends technology with deep legal expertise,...
Full time, Work at office, Flexible hours
- Senior Site Reliability Engineer, Hybrid - San Francisco. Our Mission & Values: At Drata, we help companies earn and keep the trust of... ...Job Summary: Drata's SRE team operates as both a central engineering function and an embedded reliability practice. You'll be part...
  Work at office, Immediate start, Worldwide, Monday to Friday, Flexible hours
- ...co-founders with PhDs in AI, Math, and Computer Science — is poised to redefine computing. About the Role We're seeking a Site Reliability Engineer to ensure Hyperbolic's GPU marketplace and AI infrastructure operate with exceptional reliability, performance, and...
$125k - $165k
Position: Site Reliability Engineer. Location: Lincoln, NE, San Francisco, CA, or Remote. Job ID: 434. Openings: 1. Job Summary: The Site Reliability Engineer will help ensure the reliability, scalability, and performance of the systems that power our AI products. This role...
Temporary work, Remote work, Visa sponsorship, Work visa, Flexible hours
$151.5k - $252.5k
...and making a real impact for some of the world’s biggest brands. About The Role: We are looking for an experienced Senior Site Reliability Engineer to join the Veeam Data Cloud (VDC) engineering team. You will be working with a global team to build the world’s next modern...
Base plus commission, Local area, Worldwide
- ...customer acquisition, and Connor was a machine learning research engineer at Scale AI. The rest of our team comes from companies like... ...of-the-art AI. As a Senior SRE, you'll tackle the scaling and reliability challenges that come with adding terabytes of data monthly and...
$227.2k - $324.5k
...About the Role: Site Reliability Engineering (SRE) at Tubi is not a traditional operations team. We are a software engineering organization that applies a developer's mindset and toolkit to the challenges of building and running large-scale, distributed systems....
Full time, Contract work, Temporary work, Local area, Flexible hours
- ...Job Description: Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas... ...evangelize cloud best practices while building a culture of reliability and observability. Engage in and improve the end to end lifecycle...
$50 per hour
...years of professional SRE experience. 5+ years of experience contributing to architecture and design (architecture, design patterns, reliability and scaling) of new and current systems. Bachelor's Degree in Computer Science or related field, or 8+ years relevant work...
Temporary work, Work experience placement
$127k - $249k
The Team: Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational... ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper). As...
Work at office, Local area, Remote work, Worldwide, Flexible hours
$175k - $250k
WorkOS is seeking a Site Reliability Engineer to enhance the reliability and performance of its systems. As a key member of the SRE team, you will handle critical responsibilities like improving incident responses and collaborating...
Remote job, Flexible hours



