Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Site Reliability Engineer

Plenful

Plenful is on a mission to transform healthcare operations from the inside out. Fresh off our recent founding round and backed by Notable Capital, Bessemer Venture Partners, TQ Ventures, Susa/Kivu Ventures, and other leading investors, we’re building the category-defining AI workflow automation platform that healthcare teams rely on to operate smarter, faster, and more efficiently. We automate manual tasks across disparate systems to improve compliance posture, streamline manual work, and unlock critical revenue, so teams can deliver better patient care.
Built by healthcare operators for healthcare operators, Plenful is driven by a deep understanding of the challenges facing today’s care teams. We’re passionate about equipping healthcare teams with world-class tools that deliver real, measurable impact, and we’re proud to serve 70+ leading health systems across the country. If you’re excited to help shape the future of healthcare, we’d love to meet you.

About the role We’re hiring an SRE to join our engineering team at Plenful and take ownership of the reliability and performance of the systems that power our product. You’ll work across our distributed workflow engine, serverless pipelines, containerized services and Postgres based data layer. This role reports into engineering leadership and will influence how we build, scale and operate our platform as we continue to grow.

You’ll bring strong technical judgment, calm problem solving during incidents and a practical approach to improving reliability. You’ll collaborate closely with backend, ML and DevOps engineers and help shape a culture where operational excellence is clear, repeatable and shared across the team.

What you’ll do

Reliability, Observability and Performance:

  • Maintain and evolve alerting so engineers receive clear, actionable signals for anomalies, latency regressions and reliability risks.
  • Define observability standards across metrics, logs and tracing with a focus on reliability, performance and customer impact instead of vanity data.
  • Investigate performance bottlenecks across our distributed systems including serverless task execution, containerized services, workflow orchestration and Postgres.
  • Lead incident response, coordinate root cause analysis and ensure reliability improvements are fully implemented and measured.

Infrastructure and Platform Operations:

  • Improve the reliability of our distributed task processing, including autoscaling behavior, execution patterns, retry logic, rate limiting and failure isolation.
  • Support the stability of our serverless pipelines that process high volume workloads across multiple execution layers.
  • Partner with backend and ML teams on designing resilient mechanisms for scheduling, queueing and workflow execution.
  • Maintain efficient and predictable resource usage across compute, networking and storage.

Security, Compliance and Operational Excellence:

  • Support security and compliance work including patching, audit readiness and vulnerability management.
  • Participate in the on‑call rotation and respond to production incidents quickly and calmly with a focus on restoring stable service and clear communication.
  • Contribute to blameless post‑mortems, drive follow through on fixes and ensure learnings are documented for future engineers.

What we’re looking for

  • 5+ years of professional engineering experience in a B2B, SaaS company.
  • Strong experience operating production systems in cloud environments, ideally AWS.
  • Hands‑on experience with serverless compute patterns, containerized services, distributed workflows and Postgres.
  • Solid understanding of observability tooling, performance debugging and system behavior under load.
  • A high ownership mindset, empathy for teammates, straightforward communication and a one‑team attitude.
  • Comfortable working in a fast‑paced startup environment with a bias for action and thoughtful engineering judgment.
  • Comprehensive Benefits Package: Enjoy unlimited PTO, fully covered health insurance (medical, dental, and vision), meal stipend, health & wellness stipend, 401(k) matching, and stock options.
  • Mission‑Driven, World‑Class Team: Join an exceptional group of professionals aligned around a meaningful mission and committed to making an impact.
  • Opportunities for Growth: Strengthen your partnership expertise through collaboration with experienced, high‑performing leaders across the organization.
  • Flexible Work Environment: Employees based in the Bay Area enjoy two days per week in a brand‑new downtown San Francisco office. Employees based in other cities enjoy a fully remote work environment with the ability to travel for collaboration.
#J-18808-Ljbffr

Vacancy posted 7 hours ago
Similar jobs that could be interesting for youBased on the Site Reliability Engineer in San Francisco, CA vacancy
  •  ...manifesto. About the Role We're looking for an Infrastructure Engineer to take the lead on scaling our operational resilience as we grow...  ...This is a high-impact, high-trust role where you’ll shape how reliability is done - reducing incident load, building internal tooling,... 
    Suggested
    Worldwide
    Shift work

    Happy Robot

    San Francisco, CA
    5 days ago
  •  ...work from home day is currently Tuesday. Engineering at Lambda is responsible for building...  ...observability adoptable and improve product reliability. Lead members of other engineering teams...  ...in Go Have 5+ years of experience in Site Reliability Engineering practices Possess... 
    Suggested
    Work at office
    Local area
    Work from home

    Lambda

    San Francisco, CA
    4 days ago
  •  ...For more information, please read ourSenior Site Reliability Engineer page is loaded## Senior Site Reliability Engineerlocations: US - San Francisco Bay Areatime type: Full timeposted on: Posted Yesterdayjob requisition id: R1478**There are NO limits to your career: come... 
    Suggested
    Immediate start
    Remote work
    Worldwide

    OutSystems

    San Francisco, CA
    4 days ago
  •  ...customer acquisition, and Connor was a machine learning research engineer at Scale AI. The rest of our team comes from companies like...  ...of-the-art AI. As a Senior SRE, you'll tackle the scaling and reliability challenges that come with adding terabytes of data monthly and... 
    Suggested

    Unify

    San Francisco, CA
    3 days ago
  • $60 per hour

    Senior Site Reliability Engineer (Copy) Seattle Hybrid (Hybrid location). Full-time. About Us Supio is a trusted AI platform purpose-built for law firms, reshaping how data drives impactful outcomes. Our innovative approach blends technology with deep legal expertise,... 
    Suggested
    Full time
    Work at office
    Flexible hours

    Bonfirevc

    San Francisco, CA
    2 days ago
  • # Senior Site Reliability EngineerHybrid - San Francisco**Our Mission & Values:** At Drata, we help companies earn and keep the trust of...  ...**Job Summary:**Drata's SRE team operates as both a central engineering function and an embedded reliability practice. You'll be part... 
    Work at office
    Immediate start
    Worldwide
    Monday to Friday
    Flexible hours

    Careers at Drata

    San Francisco, CA
    3 days ago
  • US Corp. is seeking a Lead Site Reliability Engineer to spearhead our mission of delivering highly available and performant systems. With an average of over 12 years of industry experience, the successful candidate will bridge the gap between software development and systems... 

    Axiom Pursuits

    San Francisco, CA
    2 days ago
  •  ...Responsibilities Lead and onboard services and teams to the reliability tenets. Establish and maintain Service Level Objectives (...  ...Science or equivalent. 6+ years of experience in Site Reliability Engineering, managing infrastructure and services at scale. History of... 

    OutSystems, Inc.

    San Francisco, CA
    2 days ago
  • $166.9k - $225.9k

    Job Summary Drata's SRE team operates as both a central engineering function and an embedded reliability practice. You'll be part of a close-knit SRE team...  ...organization. What you’ll bring 6+ years of experience in Site Reliability Engineering, Cloud Engineering, or... 
    Flexible hours

    Drata

    San Francisco, CA
    3 days ago
  • What you’ll do As a Senior Site Reliability Engineer, you’ll work closely with product teams in Spend to deliver and maintain scalable, reliable cloud infrastructure in support of key product initiatives. Aligned to the roadmap, you’ll lead on infrastructure design and... 

    Airwallex-

    San Francisco, CA
    1 day ago
  • $125k - $165k

    Position Site Reliability Engineer Location Lincoln, NE, San Francisco, CA, or Remote Job ID 434 Openings 1 Job Summary The Site Reliability Engineer will help ensure the reliability, scalability, and performance of the systems that power our AI products. This role... 
    Temporary work
    Remote work
    Visa sponsorship
    Work visa
    Flexible hours

    TELCOR Inc

    San Francisco, CA
    5 days ago
  • $163k - $203k

     ...will be a senior technical contributor on the SRE team, responsible for the reliability, scalability, and security of Prosper’s Cloud Platform portfolio. This is as much a platform engineering role as it is an SRE role— you will maintain the applications that run on our... 
    Work experience placement
    Work at office
    Remote work
    Flexible hours
    2 days per week

    GoTo Meeting

    San Francisco, CA
    2 days ago
  • $151.5k - $252.5k

     ...and making a real impact for some of the world’s biggest brands. About The Role We are looking for an experienced Senior Site Reliability Engineer to join the Veeam Data Cloud (VDC) engineering team. You will be working with a global team to build the world’s next modern... 
    Base plus commission
    Local area
    Worldwide

    Veeam

    San Francisco, CA
    3 days ago
  •  ...co-founders with PhDs in AI, Math, and Computer Science — is poised to redefine computing. About the Role We're seeking a Site Reliability Engineer to ensure Hyperbolic's GPU marketplace and AI infrastructure operate with exceptional reliability, performance, and... 

    Hyperbolic Labs

    San Francisco, CA
    4 days ago
  • The role We're looking for a world-class Site Reliability Engineer to ensure the reliability, performance, and scalability of our AI infrastructure platform. You’ll be building and operating the core systems that power agentic AI at scale. Your mission: keep our ultra-... 

    Blaxel

    San Francisco, CA
    3 days ago
  •  ...millions of daily users while enabling our engineering teams to ship fast. You'll own the...  ...building automation and tooling that improves reliability and partnering with engineering to...  ...services What you'll bring 5+ years in Site Reliability Engineering, DevOps, or systems... 
    Work at office
    Work from home

    gamma.app

    San Francisco, CA
    5 days ago
  • $125k - $165k

    Position: Site Reliability Engineer Location: San Francisco, CA Job Id: 434 # of Openings: 1 TELCOR Inc, a leading innovator in laboratory software, is looking for a Site Reliability Engineer to join our TELCOR AI Systems team! Do you have strong experience in cloud... 
    Temporary work
    Work at office
    Visa sponsorship
    Work visa
    Relocation package
    Flexible hours

    TELCOR

    San Francisco, CA
    5 days ago
  •  ...alongside clinicians to make that possible. We’re a team of doctors, engineers, designers, researchers, and creatives building tools that...  ...for leading incidents end-to-end. Improve operational reliability: Identify recurring issues and reliability risks, and drive fixes... 
    Work at office
    Worldwide

    Heidi Health Ltd

    San Francisco, CA
    2 days ago
  •  ...in the design of information and operational support systems. Required Skills/Qualifications BS/MS degree in Computer Science, Engineering, or a related subject. Equivalent experience accepted. Proven working experience in installing, configuring, and troubleshooting... 
    Work experience placement
    Start working today
    Remote work
    Flexible hours

    Hamilton Barnes Associates Limited

    San Francisco, CA
    5 days ago
  • A dynamic tech firm located in San Francisco is seeking a Site Reliability Engineer to enhance operational health across their production systems. This high-impact role demands expertise in AWS and strong programming skills. You will manage production systems' reliability... 

    gamma.app

    San Francisco, CA
    5 days ago
  • $140k - $220k

    About the Job You’ll own reliability and operational excellence for Pylon's production systems. This means designing and implementing...  ...scale as we grow. You'll build tooling that makes the entire engineering team more effective, establish on-call rotations and runbooks... 

    Pylon

    San Francisco, CA
    5 days ago
  •  ...advanced algorithms that significantly outperforms individual engineers. We combine language models with human ingenuity to push the...  ...and quality. The Role We are seeking an experienced Site Reliability Engineer to join our Platform Engineering team in the Bay Area... 

    CodeRabbit

    San Francisco, CA
    2 days ago
  •  ...TELCOR Inc is looking for a Site Reliability Engineer to ensure the reliability, scalability, and performance of our AI products' systems. The role involves designing and operating resilient systems in cloud and containerized environments while managing production infrastructure... 
    Remote work

    TELCOR

    San Francisco, CA
    3 days ago
  • $50 per hour

     ...years of professional SRE experience 5+ years of experience contributing to architecture and design (architecture, design patterns, reliability and scaling) of new and current systems Bachelor's Degree in Computer Science or related field, or 8+ years relevant work... 
    Temporary work
    Work experience placement

    Epoch Biodesign

    San Francisco, CA
    4 days ago
  • $227.2k - $324.5k

     ...About the Role: Site Reliability Engineering (SRE) at Tubi is not a traditional operations team. We are a software engineering organization that applies a developer's mindset and toolkit to the challenges of building and running large-scale, distributed systems.... 
    Full time
    Contract work
    Temporary work
    Local area
    Flexible hours

    Tubi

    San Francisco, CA
    1 day ago
  •  ...Site Reliability Engineer - AI Infrastructure Location: Global Remote / San Francisco · Full-Time About Andromeda Andromeda Cluster was founded by Nat Friedman and Daniel Gross to give early-stage startups access to the kind of scaled AI infrastructure once reserved only... 
    Full time
    Remote work

    Andromeda Cluster

    San Francisco, CA
    3 days ago
  • $125k - $195k

     ...improve. We’re building a small team of exceptional, hands-on engineers to make this happen. Mechanical, electrical, hardware,...  ...40 years. About the role We are seeking an Infrastructure & Site Reliability Engineer to design, build, deploy, and manage the on-prem backend... 
    Work at office
    Visa sponsorship
    Night shift

    Atomicsemi

    San Francisco, CA
    4 days ago
  • $325k

     ...Engineering at Ivo Engineers At Ivo Are Inventors. Ivo Was First-to-market With An AI agent that lives in MS Word and edits the...  ...us to hit our SLAs. We’re looking for an Senior or Staff Site level Reliability Engineer as part of the Infrastructure team to: Own uptime... 
    Contract work

    Icehouseventures

    San Francisco, CA
    3 days ago
  • $138k - $179k

     ...write up and follow up tasks to close any gaps identified. We partner with a wide variety of other teams from infrastructure and engineering, to QA and business teams, so strong collaborative instincts and clear communication skills are a key part of our toolset. As well... 
    Flexible hours

    MSCI

    San Francisco, CA
    3 days ago
  • We are seeking a Sr. Site Reliability Engineer to join our team and run critical infrastructure for our blockchain and web applications. You’ll learn to deploy and maintain a fleet of RPC and validator nodes for multiple blockchain networks. You’ll also provide guidance... 
    Remote job

    Blockchain Works

    San Francisco, CA
    9 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer. Be the first to apply!