Senior Site Reliability Engineer (Upmarket)
Heidi
Who We Are Healthcare needs a better rhythm: one that keeps care continuous and deeply human. Heidi is building an AI Care Partner that works alongside clinicians to make that possible. We’re a team of doctors, engineers, designers, researchers, and creatives building tools that help clinicians stay focused on what matters most: their patients. In just 18 months, Heidi has given back more than 18 million hours to healthcare professionals — supporting 73 million patient visits in 116 countries. Today, more than two million patient visits each week are powered by Heidi worldwide. Backed by nearly $100 million in funding, we’re growing in the US, UK, Canada, and Europe, partnering with leading health systems including the NHS, Beth Israel Lahey Health, and Monash Health. What you’ll do Participate in on-call and incident response: Respond to production incidents, contribute to service restoration, and support clear communication during incidents. Over time, take increasing responsibility for leading incidents end-to-end. Improve operational reliability: Identify recurring issues and reliability risks, and drive fixes through better alerting, automation, system changes, or process improvements. Own parts of the production environment: Operate and improve Kubernetes clusters, cloud infrastructure, and core platform services, with growing ownership as familiarity increases. Strengthen observability: Improve dashboards, alerts, logs, and traces so issues are detected earlier and diagnosed faster, with a strong focus on actionable signals. Reduce operational toil: Automate repetitive tasks, simplify runbooks, and improve tooling to make on-call and day-to-day operations easier and safer. Support safe change: Improve deployments, rollback mechanisms, and operational readiness to reduce the risk of incidents caused by change. Contribute to operational practices: Write and maintain runbooks, participate in blameless post-mortems, and help improve incident response processes over time. Collaborate closely with engineers: Work with product and feature teams to improve production readiness, service ownership, and reliability expectations. What we’re looking for 3–6+ years in SRE, DevOps, Platform, or operations-heavy engineering roles. Experience supporting production systems and participating in on-call rotations. Comfortable debugging live systems under pressure. Experience operating cloud infrastructure (AWS preferred). Working knowledge of Kubernetes and containerised workloads. Infrastructure as Code experience (Terraform or similar). Familiarity with monitoring and alerting tools (Datadog, Prometheus, etc). Scripting or automation experience (Python, Bash, or similar). The way we work 1. Build to Last We design for safety and reliability so clinicians, patients, and our teams can trust what we build every day. 2. Own Your Practice Ideas rise on merit, not title, and everyone shares responsibility for the standards we set together. 3. Move Fast, Stay Steady We move quickly but never at the cost of trust. Progress only matters if people can depend on what we make. 4. Make Others Better Honest feedback, steady support, and shared growth keep our teams improving together. Why you will flourish with us In office to collaborate with like-minded professionals Healthcare, Dental, Vision benefit options 401k with 3% match Personal development budget of $500 per annum Become an owner, with shares (equity) in the company, if Heidi wins, we all win The rare chance to create a global impact as you immerse yourself in one of the leading healthtech startups The opportunity to fast track your startup career! Heidi is dedicated to creating an equitable, inclusive, and supportive work environment that brings people together from diverse backgrounds, experiences, and perspectives. Our strength is in our differences. We're proud to be an equal opportunity employer and welcome all applicants as we're committed to promoting a culture of opportunity for all. #J-18808-Ljbffr Heidi
- US Corp. is seeking a Lead Site Reliability Engineer to spearhead our mission of delivering highly available and performant systems. With an average of over 12 years of industry experience, the successful candidate will bridge the gap between software development and systems...Senior
- OutSystems, Inc. is looking for a Site Reliability Engineer to join their team in San Francisco, CA. The ideal candidate will lead the onboarding of services and teams to reliability tenets while establishing SLOs and SLAs. Proficiency in Python and experience with Kubernetes...SeniorFlexible hours
$300k
...thousands of H100s, H200s, and B200s, ready for experimentation, full-scale model training, or inference. As a Platform Engineer/Senior Site Reliability Engineer, you’ll own the reliability, performance, and automation of this GPU-powered infrastructure, ensuring...Senior- ...co‑founders with PhDs in AI, Math, and Computer Science — is poised to redefine computing. About the Role We're seeking a Site Reliability Engineer to ensure Hyperbolic's GPU marketplace and AI infrastructure operate with exceptional reliability, performance, and...Senior
- ...Responsibilities Lead and onboard services and teams to the reliability tenets. Establish and maintain Service Level Objectives (... ...Science or equivalent. 6+ years of experience in Site Reliability Engineering, managing infrastructure and services at scale. History of...Senior
- What you’ll do As a Senior Site Reliability Engineer, you’ll work closely with product teams in Spend to deliver and maintain scalable, reliable cloud infrastructure in support of key product initiatives. Aligned to the roadmap, you’ll lead on infrastructure design and...Senior
$140k - $220k
About the Job You’ll own reliability and operational excellence for Pylon's production systems. This means designing and implementing... ...scale as we grow. You'll build tooling that makes the entire engineering team more effective, establish on-call rotations and runbooks...Senior- ...work from home day is currently Tuesday. Engineering at Lambda is responsible for building... ...observability adoptable and improve product reliability. Lead members of other engineering teams... ...in Go Have 5+ years of experience in Site Reliability Engineering practices Possess...SeniorWork at officeLocal areaWork from home
- ...acquisition, and Connor was a machine learning research engineer at Scale AI. The rest of our team comes from... ...redefining go-to-market with state-of-the-art AI. As a Senior SRE, you'll tackle the scaling and reliability challenges that come with adding terabytes of data...Senior
- ...about this role, we encourage you to apply. The Role As a Senior Platform Engineer, you are a champion for DevOps and SRE culture and... ...goals are met. What You Will Be Doing Improving production reliability and system resilience within an SRE scoped team Championing...SeniorFlexible hours
- We are seeking a Sr. Site Reliability Engineer to join our team and run critical infrastructure for our blockchain and web applications. You’ll learn to deploy and maintain a fleet of RPC and validator nodes for multiple blockchain networks. You’ll also provide guidance...SeniorRemote job
- For more information, please read ourSenior Site Reliability Engineer page is loaded## Senior Site Reliability Engineerlocations: US - San Francisco Bay Areatime type: Full timeposted on: Posted Yesterdayjob requisition id: R1478**There are NO limits to your career: come...SeniorImmediate startRemote workWorldwide
$60 per hour
Senior Site Reliability Engineer (Copy) Seattle Hybrid (Hybrid location). Full-time. About Us Supio is a trusted AI platform purpose-built for law firms, reshaping how data drives impactful outcomes. Our innovative approach blends technology with deep legal expertise,...SeniorFull timeWork at officeFlexible hours$50 per hour
...years of professional SRE experience 5+ years of experience contributing to architecture and design (architecture, design patterns, reliability and scaling) of new and current systems Bachelor's Degree in Computer Science or related field, or 8+ years relevant work...SeniorTemporary workWork experience placement$127k - $249k
The Team Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational... ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper). As...SeniorWork at officeLocal areaRemote workWorldwideFlexible hours- # Senior Site Reliability EngineerHybrid - San Francisco**Our Mission & Values:** At Drata, we help companies earn and keep the trust of their... ...Job Summary:**Drata's SRE team operates as both a central engineering function and an embedded reliability practice. You'll be...SeniorWork at officeImmediate startWorldwideMonday to FridayFlexible hours
$117k - $209.33k
Position Overview Want to help make a better world? As a Senior Site Reliability Engineer at Autodesk, you will build and operate reliable, secure, and scalable cloud services for Autodesk GovCloud products. This foundational role helps establish the operating model, reliability...Senior$165k - $241.4k
...efficient, functional and very effective. We’re looking for talented engineers with a software or operations background, experienced in... ...closely with our application development teams to ensure the reliability, performance and security of our infrastructure....SeniorFull timeTemporary workWork at officeFlexible hours1 day per week$166.9k - $225.9k
Job Summary Drata's SRE team operates as both a central engineering function and an embedded reliability practice. You'll be part of a close-knit SRE team... ...organization. What you’ll bring 6+ years of experience in Site Reliability Engineering, Cloud Engineering, or...SeniorFlexible hours- CloudDevs: Senior Web site Reliability Engineer (SRE) CloudDevs works with fast-moving, venture-backed startups throughout the US. We’re constructing a pool of world-class Web site Reliability Engineers for present roles and for upcoming alternatives. You’ll both be positioned...Senior
$220k - $235k
...are seeking a strategic, high-output Staff/Senior Staff SRE to define the future of our cloud platform and champion engineering excellence across Ironclad. In this role,... ...leadership and strategic direction for the Site Reliability Engineering team and our broader Cloud...SeniorFull timeWork at office- Senior Site Reliability Engineer - AI Infrastructure Location: Global Remote / San Francisco • Full-Time About Andromeda Andromeda Cluster was founded by Nat Friedman and Daniel Gross to give early‑stage startups access to the kind of scaled AI infrastructure once reserved...SeniorFull timeRemote work
- Drata is seeking a Senior Site Reliability Engineer in San Francisco. In this role, you will engage in reliability architecture for product teams, lead production readiness reviews, and build automation around monitoring and alerting. The ideal candidate has at least 6...Senior
- Anyscale is seeking a Senior Site Reliability Engineer to join our Infrastructure team in San Francisco, California. The ideal candidate will enhance distributed AI application development and work on open-source Ray integration. We need engineers with strong experience...Senior
$181k - $263k
## Senior Staff Site Reliability EngineerApplylocations: San Franciscotime type: Full timeposted on: Posted Yesterdayjob requisition id: JR01220... ...support. We are looking for a Senior Staff Site Reliability Engineer who will set the technical direction for reliability...SeniorWork from homeFlexible hoursNight shift- ...getting here.) About the Role We're building infrastructure that has to perform under real-world scale, reliability, and security demands — and we're looking for an engineer who wants to own the foundation it runs on. This isn't a traditional "keep the lights on" role. You...Senior
- ...by Andreessen Horowitz, NEA, and Addition with $250+ million raised to date. About the role Anyscale is looking for a Senior Site Reliability Engineer to join the Infrastructure team. Anyscale aims to provide the next generation of tools and infrastructure to make...Senior
$127k - $249k
We are looking for an experienced Senior or Staff Engineer for our SRE, InfraSec team, to guide the security of our cloud-based infrastructure. As a Staff SRE, you will be very hands‑on technically while also mentoring a small team of SREs. The InfraSec team collaborates...SeniorLocal areaRemote workFlexible hours$266k - $398k
...Director, Site Reliability Engineering – Infrastructure Platform Okta is The World’s Identity Company. Okta provides secure access, authentication, and automation, placing identity at the core of business security and growth. The Infrastructure Platform and Shared...SeniorPermanent employmentFlexible hours- Airwallex- is seeking a Senior Site Reliability Engineer in San Francisco, California, to work with product teams to build and maintain robust cloud infrastructure. In this role, you will lead critical infrastructure projects, ensuring the reliability and performance of...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Site Reliability Engineer (Upmarket). Be the first to apply!
- site reliability engineer remote San Francisco, CA
- site reliability engineer San Francisco, CA
- site reliability engineer sre San Francisco, CA
- senior data management analyst San Francisco, CA
- senior app developer San Francisco, CA
- senior game producer San Francisco, CA
- senior retail sales associate San Francisco, CA
- senior manager quality engineering San Francisco, CA
- senior software test automation engineer San Francisco, CA
- senior quantitative risk analyst San Francisco, CA

