Senior/Staff Site Reliability Engineer

$175k - $230k

Sage Group plc

About Us

Sage is on a mission to improve care and quality of life for older adults, starting with those residing in senior living facilities. Falls are the leading cause of injury-related death among adults over 65. And yet, fall prevention and emergency response systems for older adults are archaic and ineffective. At Sage we've built a more modern way of understanding when older adults need help, including methods for residents to alert caregivers when in need of help, and corresponding software for caregivers to triage response. Our company mission is to create a product that our client counterparts love, and this role is a key part of that objective.

Sage is a small, tight team of ambitious, multi-disciplinary entrepreneurs. We are a software-enabled, mission-driven company, and are focused only on the problems that are central to achieving that mission. At Sage, we work hard and fast but also know that to build a truly important company, we need to treat our work as a marathon, and not a sprint. The journey matters.

About this Role

Sage provides life-saving functionality that improves the lives of our older population. This role is critical to ensure Sage can live up to its mission to be a 24x7, highly available platform for elder care. As a Site Reliability Engineer, you'll partner with engineering teams across the organization to achieve four 9s of uptime for our platform.

Responsibilities
  • Design and evolve highly reliable system architectures, ensuring high availability, fault tolerance, and scalability across Sage's production infrastructure.
  • Lead complex incident response efforts, coordinating across engineering teams to quickly diagnose and resolve production issues while driving thorough post-incident reviews and long-term reliability improvements.
  • Define and implement organization-wide observability practices, including metrics, logging, tracing, and actionable alerting to ensure strong visibility into system health.
  • Establish and maintain reliability standards, including defining SLIs, SLOs, and error budgets, and partnering with engineering teams to integrate these practices into the software development lifecycle.
  • Drive automation and infrastructure improvements that reduce operational toil and improve the efficiency and reliability of deployments, monitoring, and operational workflows.
  • Partner with engineering teams on system design and architecture reviews, ensuring reliability, scalability, and operational best practices are considered early in the development process.
  • Evolve Sage's cloud infrastructure, including networking, compute, storage, and security practices to support scalable and resilient systems.
  • Operate and improve critical data infrastructure, ensuring high availability, performance, backup strategies, and disaster recovery processes for production databases.
  • Lead capacity planning and auto-scaling efforts, ensuring infrastructure and systems scale effectively as product usage grows.
  • Build internal tooling and platforms that improve the developer experience, simplify debugging, and enable safer and more reliable deployments.
Qualifications
  • 7-12+ years of experience in software engineering, infrastructure engineering, or site reliability engineering, operating large-scale distributed systems in production.
  • Experience operating and supporting edge or device-based systems, including managing connectivity, observability, remote updates, and reliability for distributed hardware deployments such as IoT or field devices.
  • Strong networking fundamentals, including experience debugging distributed system issues across load balancers, DNS, TLS, and VPC networking within platforms like Amazon Virtual Private Cloud or similar cloud networking environments.
  • Experience operating and scaling production databases, including performance tuning, replication, backup/recovery strategies, and high availability for systems such as PostgreSQL, MySQL, or distributed databases.
  • Deep expertise in cloud infrastructure, such as Amazon Web Services or Google Cloud Platform.
  • Strong experience designing and operating highly available systems, including strategies for redundancy, failover, disaster recovery, and capacity planning.
  • Expertise in containerization and orchestration, particularly with Kubernetes and modern container platforms.
  • Advanced observability and monitoring skills, using tools such as Datadog, Prometheus, or Grafana.
  • Strong programming ability in languages commonly used for infrastructure and reliability engineering (e.g., Go, Python, or Java), with experience building internal tooling and automation.
  • Deep knowledge of infrastructure-as-code practices, including tools like Terraform or Pulumi.
  • Proven experience leading reliability initiatives, such as defining SLOs/SLIs, improving incident response processes, and driving post-incident reviews.
  • Ability to influence engineering teams across the organization, guiding best practices for reliability, scalability, and operational excellence.
  • Strong incident management and production debugging skills, with experience coordinating responses to complex outages and improving long-term system resilience.
Preferred Qualifications
  • Experience introducing and scaling SRE practices in early-stage or high-growth organizations, helping transition teams from reactive operations to proactive reliability engineering.
  • Experience designing disaster recovery and business continuity strategies, including multi-region deployments, backup validation, and recovery testing for critical systems.

Benefits and Pay

Our headquarters are located in New York City's Union Square. We believe in cross-team collaboration. We think good ideas can come from anyone, and we've designed our processes to encourage participation from all. While we take our mission seriously, we don't take ourselves too seriously. We like to host offsites, outings, and team meals where we can connect as people, not just as colleagues. We offer office lunch and a fully stocked snack bar. While we have an in-office culture, we allow up to 2 remote days per week.

Our benefits package for employees includes competitive base compensation along with stock options. The expected annual salary range for this role is $175,000-$230,000 USD, depending on your level of expertise, your experience, and your performance in the interview process. We also provide fully paid health and dental insurance coverage for all of our employees, along with other health benefits including vision insurance, membership to premium primary and urgent care, and online medical health providers. We also have a take-as-you-need time-off policy, in addition to 7 paid holidays and a company-wide winter break during the holidays.

EEO Statement

Sage is an equal opportunity employer that is committed to diversity and inclusion in the workplace. We prohibit discrimination and harassment of any kind based on race, color, sex, religion, sexual orientation, national origin, disability, genetic information, pregnancy, or any other protected characteristic as outlined by federal, state, or local laws.

This policy applies to all employment practices within our organization, including hiring, recruiting, promotion, termination, layoff, recall, leave of absence, compensation, benefits, training, and apprenticeship. Sage makes hiring decisions based solely on qualifications, merit, and business needs at the time.
Vacancy posted 4 days ago