Senior/Staff Site Reliability Engineer

$175k - $230k

Sage Group plc

About Us

Sage is on a mission to improve care and quality of life for older adults, starting with those residing in senior living facilities. Falls are the leading cause of injury-related death among adults over 65. And yet, fall prevention and emergency response systems for older adults are archaic and ineffective. At Sage we've built a more modern way of understanding when older adults need help, including methods for residents to alert caregivers when in need of help, and corresponding software for caregivers to triage response. Our company mission is to create a product that our client counterparts love, and this role is a key part of that objective.

Sage is a small, tight team of ambitious, multi-disciplinary entrepreneurs. We are a software-enabled, mission-driven company, and are focused only on the problems that are central to achieving that mission. At Sage, we work hard and fast but also know that to build a truly important company, we need to treat our work as a marathon, and not a sprint. The journey matters.

About this Role

Sage provides life-saving functionality that improves the lives of our older population. This role is critical to ensure Sage can live up to its mission to be a 24x7, highly available platform for elder care. As a Site Reliability Engineer, you'll partner with engineering teams across the organization to achieve four 9s of uptime for our platform.

Responsibilities

Design and evolve highly reliable system architectures , ensuring high availability, fault tolerance, and scalability across Sage's production infrastructure.
Lead complex incident response efforts , coordinating across engineering teams to quickly diagnose and resolve production issues while driving thorough post-incident reviews and long-term reliability improvements.
Define and implement organization-wide observability practices , including metrics, logging, tracing, and actionable alerting to ensure strong visibility into system health.
Establish and maintain reliability standards , including defining SLIs, SLOs, and error budgets, and partnering with engineering teams to integrate these practices into the software development lifecycle.
Drive automation and infrastructure improvements that reduce operational toil and improve the efficiency and reliability of deployments, monitoring, and operational workflows.
Partner with engineering teams on system design and architecture reviews , ensuring reliability, scalability, and operational best practices are considered early in the development process.
Evolve Sage's cloud infrastructure , including networking, compute, storage, and security practices to support scalable and resilient systems.
Operate and improve critical data infrastructure , ensuring high availability, performance, backup strategies, and disaster recovery processes for production databases.
Lead capacity planning and auto-scaling efforts , ensuring infrastructure and systems scale effectively as product usage grows.
Build internal tooling and platforms that improve the developer experience, simplify debugging, and enable safer and more reliable deployments.

Qualifications

7-12+ years of experience in software engineering, infrastructure engineering, or site reliability engineering, operating large-scale distributed systems in production.
Experience operating and supporting edge or device-based systems, including managing connectivity, observability, remote updates, and reliability for distributed hardware deployments such as IoT or field devices.
Strong networking fundamentals, including experience debugging distributed system issues across load balancers, DNS, TLS, and VPC networking within platforms like Amazon Virtual Private Cloud or similar cloud networking environments.
Experience operating and scaling production databases, including performance tuning, replication, backup/recovery strategies, and high availability for systems such as PostgreSQL, MySQL, or distributed databases.
Deep expertise in cloud infrastructure, such as Amazon Web Services or Google Cloud Platform
Strong experience designing and operating highly available systems, including strategies for redundancy, failover, disaster recovery, and capacity planning.
Expertise in containerization and orchestration, particularly with Kubernetes and modern container platforms.
Advanced observability and monitoring skills, using tools such as Datadog, Prometheus or Grafana.
Strong programming ability in languages commonly used for infrastructure and reliability engineering (e.g., Go, Python, or Java), with experience building internal tooling and automation.
Deep knowledge of infrastructure-as-code practices, including tools like Terraform or Pulumi. Proven experience leading reliability initiatives, such as defining SLOs/SLIs, improving incident response processes, and driving post-incident reviews.
Ability to influence engineering teams across the organization, guiding best practices for reliability, scalability, and operational excellence.
Strong incident management and production debugging skills, with experience coordinating responses to complex outages and improving long-term system resilience.

Preferred Qualifications

Experience introducing and scaling SRE practices in early-stage or high-growth organizations, helping transition teams from reactive operations to proactive reliability engineering.
Experience designing disaster recovery and business continuity strategies, including multi-region deployments, backup validation, and recovery testing for critical systems.

Benefits and Pay

Our headquarters are located in New York City's Union Square. We believe in cross team collaboration. We think good ideas can come from anyone, and we've designed our processes to encourage participation from all. While we take our mission seriously, we don't take ourselves too seriously. We like to host offsites, outings, and team meals where we can connect as people, not just as colleagues. We offer office lunch and a fully stocked snack bar. While we are an in office culture, we allow up to 2 remote days per week.

Our benefits package for employees includes competitive base compensation along with stock options. The expected annual salary range for this role is $175,000-$230,000 USD, depending on your level of expertise, your experience, and your performance in the interview process. We also provide fully-paid health and dental insurance coverage for all of our employees, along with other health benefits including vision insurance, membership to premium primary and urgent care, and online medical health providers. We also have a take as you need time off policy, in addition to 7 paid holidays and a company wide winter break during the holidays.

EEO Statement

Sage is an equal opportunity employer that is committed to diversity and inclusion in the workplace. We prohibit discrimination and harassment of any kind based on race, color, sex, religion, sexual orientation, national origin, disability, genetic information, pregnancy, or any other protected characteristic as outlined by federal, state, or local laws.

This policy applies to all employment practices within our organization, including hiring, recruiting, promotion, termination, layoff, recall, leave of absence, compensation, benefits, training, and apprenticeship. Sage makes hiring decisions based solely on qualifications, merit, and business needs at the time.

Apply

Vacancy posted 5 days ago

Similar jobs that could be interesting for youBased on the Senior/Staff Site Reliability Engineer in New York, NY vacancy

Senior Site Reliability Engineer (SRE)
...Senior Site Reliability Engineer (SRE) Our client is a global technology consulting and digital solutions company that enables enterprises across industries to reimagine business models, accelerate innovation, and maximize growth by harnessing digital technologies....
Senior
Local area
E-Solutions
New York, NY
2 days ago
Senior Site Reliability Engineer
$400k
...Salary : Up to $400,000 Total Compensation Senior Site Reliability Engineer We are working with a leading trading technology firm building a high-performance infrastructure engineering team focused on reliability, automation, and large-scale platform resilience. This...
Senior
Hamilton Barnes ?
New York, NY
1 day ago
Senior Site Reliability Engineer
...A leading quantitative trading firm is seeking a Senior Site Reliability Engineer to build and evolve the reliability, observability, and automation capabilities powering a highly performance-sensitive trading environment. Working at the intersection of software and infrastructure...
Senior
Acquire Me
New York, NY
1 day ago
Senior Site Reliability Engineer
...New York City or Chicago (Hybrid) A technology-driven investment firm is expanding its Platform Engineering organization and is seeking an experienced Senior Site Reliability Engineer to help shape reliability practices across its infrastructure and production...
Senior
Mission Staffing
New York, NY
1 day ago
Senior Site Reliability Engineer
...Responsibilities Improve the reliability of mission-critical solutions, applications, and platforms Software development for enterprises... ...Windows and Linux Years of Experience: 5 Years of Software Engineering Seniority level Mid-Senior level Employment type Full-time Job...
Senior
Full time
Work experience placement
InterEx Group
New York, NY
1 day ago
Senior Site Reliability Engineer (SRE)
$100 per hour
...- join early. As our Senior SRE, you'll be in charge of... ...create impact Improve reliability of our systems Build & maintain... ...frameworks and solutions to engineering problems Fast-moving: you... ...~401k benefits ~ On-site team culture - high collaboration...
Senior
Immediate start
Weekend work
DualEntry
New York, NY
2 days ago
Senior Site Reliability Engineer
...About the job Senior Site Reliability Engineer About the Company Stellar is a decentralized, public blockchain that gives developers the tools to create experiences that are more like cash than crypto. The network is faster, cheaper, and far more energy-efficient...
Senior
TechChain Talent
New York, NY
6 days ago
Senior Site Reliability Engineer
$150k - $170k
...Senior Site Reliability Engineer – Zip Co Join to apply for the Senior Site Reliability Engineer role at Zip Co At Zip, we build cloud‑native software applications that serve millions of customers and process billions of dollars in payments. We’re looking for a seasoned...
Senior
Casual work
Work at office
Remote work
Flexible hours
ZIP
New York, NY
4 days ago
Senior Site Reliability Engineer
...the future of legal tech — we’re defining it. Ready to join us in building the intelligent future of law? The role As a Senior Site Reliability Engineer you'll join the founding SRE team at our new NYC engineering hub, sitting within Foundations. You'll own critical...
Senior
Work at office
Legora AB
New York, NY
2 days ago
Senior Site Reliability Engineer (SRE)
...human risk—the leading cause of cybersecurity breaches—and build safer, more resilient organizations. The Role: As a Senior Site Reliability Engineer (SRE) at Dune Security, you will play a critical role in ensuring our platform's stability, scalability, and security....
Senior
Full time
Work at office
Dune Security
New York, NY
5 days ago
Senior Site Reliability Engineer
...millions of people around the world. About the role Novellia is a Series A health tech startup, and we're hiring our first Site Reliability Engineer. You'll join Platform Engineering as its second member, working directly with the Head of Platform Engineering to...
Senior
Flexible hours
Novellia
New York, NY
2 days ago
Senior Site Reliability Engineer
$150k - $175k
...Site Reliability Engineer At ASAPP, our mission is simple: deliver the best AI-powered customer experience—faster than anyone else. To achieve that, we're guided by principles that shape how we think, build, and execute. We value customer obsession, purposeful speed...
Senior
Remote work
ASAPP
New York, NY
2 days ago
Senior Site Reliability Engineer
$164k - $205k
...ensuring high availability and performance Design intelligent alerting and observability systems Collaborate with engineering teams to embed reliability into the development lifecycle, shifting left on operational concerns Automate incident response workflows and build...
Senior
Work experience placement
Summer holiday
Work at office
Local area
Flexible hours
Shift work
2 days per week
BetterUp
New York, NY
1 day ago
Senior Site Reliability Engineer, Fleet Management
$127k - $249k
...The Team Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational... ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper). As...
Senior
Work at office
Local area
Remote work
Flexible hours
MongoDB
New York, NY
2 days ago
Senior Software Engineer, Site Reliability Engineering
$153k - $210k
...Senior Software Engineer, Site Reliability Engineering Reno, NV; San Ramon, CA; NYC - Hybrid Are you passionate about building resilient, highly available cloud platforms that enable engineering teams to move quickly and confidently? Do you enjoy automating complex operational...
Senior
Ridge Line Services
New York, NY
1 day ago
Senior Site Reliability Engineer
$189k - $283.6k
...the SRE team, you will proactively and reactively improve the reliability of Block's platform and critical infrastructure. You are metrics... ...~ A strong desire to perform and grow as an engineer ~5+ years of software development experience Technologies...
Senior
Full time
Local area
Remote work
Relocation package
Flexible hours
Shift work
Block USA
New York, NY
1 day ago
Senior DevOps Engineer / Site Reliability Engineer
...Job Description A major financial services company in NYC is growing its team rapidly, and they are looking for a Senior DevOps Engineer / Site Reliability Engineer who can join. If you’re passionate about high-availability, reliability, automation, we’d be excited...
Senior
The Greene Group
New York, NY
19 days ago
Senior Site Reliability Engineer
$140k - $170k
...prioritize data security, navigate complex regulatory compliance and optimize business interactions. Role Description: As a Site Reliability Engineer, you will work with Agile engineering teams to provide production insight into running and operating software at-scale in...
Senior
Full time
Local area
Symphony Communication Services
New York, NY
2 days ago
Senior Project Manager - Luxury Restaurant Design
A hospitality management firm in New York is seeking a Project Manager with luxury experience to oversee restaurant development projects in the US and Canada. The ideal candidate will manage project budgets, schedules, and collaborate with various departments to ensure...
Senior
SA Hospitality Group
New York, NY
4 days ago
Sr. Site Reliability Engineer I
$89k - $178k
...industry. Learn more at What You’ll Do Build and maintain the reliability, scalability, and performance of our digital media... ...prevent recurrence Required Experience & Skills 4+ years in Site Reliability Engineering, DevOps, or related operational roles with proven...
Senior
DoubleVerify
New York, NY
1 day ago
Senior Staff Attorney, Family Defense Practice, Bronx
$100.8k
...CFR designed the Senior Staff Attorney position to create additional professional development opportunities for experienced staff attorneys that go beyond direct client representation and case management and to provide CFR with additional supervisory, administrative,...
Senior
Internship
Office of the Appellate Defender
New York, NY
1 day ago
Senior Staff Attorney Insurance Defense
$132.4k - $198.6k
...Sr Staff Attorney – LM07DE The Hartford currently has an in‑house opportunity for a Senior Staff Attorney to litigate cases throughout Houston and surrounding counties. This position... ...and litigation experience in construction site accidents, premise liability, products...
Senior
Temporary work
Work at office
Local area
The Hartford
New York, NY
4 days ago
VIP Casino Host — Senior Guest Relations & Rewards
...Executive Casino Host who will build and maintain relationships with domestic VIP players. The role involves supervising Casino Host staff, monitoring comp issuance, and ensuring adherence to company policies. The ideal candidate should have three years of experience in...
Senior
Resorts World New York
New York, NY
3 days ago
Senior Site Reliability Engineer - Banking & Finance
$400k
...in financial markets, the organization combines innovation, engineering excellence, and data-driven insights to support complex trading operations worldwide. This opportunity is for a Senior Site Reliability Engineer to join a high-performance infrastructure...
Senior
Permanent employment
Worldwide
New York, NY
27 days ago
Lead Site Reliability Engineer
...exceptional professionals for this role. JOB DESCRIPTION As a Site Reliability Engineering at JPMorgan Chase within the Enterprise technology,... ...system stability Partner with engineering peers and senior stakeholders to drive strong, shared outcomes Scale SRE...
J.P. Morgan
New York, NY
7 days ago
SENIOR STAFF ASSISTANT
...incumbent reports to the Deputy Director and the Director of Division. Responsibilities Within limits of delegated authority, the Senior Staff Assistant will be responsible for the following duties: • Assists in the overall administration of the division, i.e. provides...
Senior
Permanent employment
Full time
Fixed term contract
Work experience placement
Work at office
Local area
Relocation
United Nations
New York, NY
1 day ago
Dining Server - Senior Living Community
...Why Join Sunrise Senior Living At Sunrise Senior Living, we believe meaningful work starts with purpose. Our team members are passionate about making a positive difference in the lives of residents every day. At Sunrise Senior Living, we champion the quality of...
Senior
Shift work
Sunrise Senior Living
New York, NY
4 days ago
Food & Beverage - Senior Barista/Barista, A-OK Café (Front of House)
$20 - $30 per hour
...providing a world-class Café program to Aritzia clients. As the Senior Barista/Barista, A-OK Café, you will support with delivering... ...working at Aritzia: A-OK Café - Our world-class café located on-site Product Discount - Maybe you've heard of our famous product...
Senior
Hourly pay
Aritzia
New York, NY
2 days ago
Senior BARTENDER - Premier Professional Services Firm
$85k - $105k
...Job Description Job Description Senior BARTENDER – Premier Professional Services Firm $85,000–$105,000 Base Salary DOE + Exceptional... ...you're supporting a high-profile client reception, training staff, or managing day-to-day lounge operations, you consistently...
Senior
Seasonal work
Work at office
Remote work
Afternoon shift
Early shift
Career Group
New York, NY
a month ago
Senior Staff Attorney Family Law
...Responsibilities May Include: Direct supervision of one to two staff attorneys and holding/writing professional development reviews... ...Eligibility and Qualifications: Candidates for the Senior Staff Attorney position must have at least three years of experience...
Senior
Work from home
Flexible hours
LEGACY LEGAL RECRUITING LLC
New York, NY
7 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior/Staff Site Reliability Engineer. Be the first to apply!