Sr. Site Reliability Engineer
$160k - $250kStandard Template Labs
Standard Template Labs is an AI-native startup reimagining the future of IT Service and Configuration Management. Backed by leading investors, we're leveraging AI to transform how enterprises manage and engage with their IT ecosystems.
About the Role We're looking for a Senior Site Reliability Engineer (SRE) to own the reliability, performance, and scalability of our AI-native platform. You'll operate at the intersection of software engineering and infrastructure, building systems that keep our platform highly available, observable, and resilient in production. This is a hands-on engineering role where you'll write production code (primarily in Python) while also owning on-call operations and incident response. ResponsibilitiesReliability & Production Ownership
- Own the availability, latency, and performance of critical production systems
- Participate in and improve a 24/7 on-call rotation, responding to incidents and driving resolution
- Lead incident response, root cause analysis (RCA), and postmortems
- Design systems that fail gracefully and recover automatically
- Write production-grade Python code to:
- Automate infrastructure workflows
- Build internal reliability tools
- Improve deployment, rollback, and recovery systems
- Eliminate manual operational work through automation and self-healing systems
- Design and implement:
- Metrics, logging, tracing
- Alerting systems (reduce noise, improve signal)
- Build dashboards and tooling to give real-time visibility into system health
- Operate and improve systems running on:
- Cloud platforms (AWS/GCP/Azure)
- Containers (Docker, Kubernetes)
- Scale systems to handle enterprise workloads and high-throughput traffic
- Improve deployment pipelines, CI/CD, and infrastructure-as-code
- Define and enforce:
- SLAs / SLOs / error budgets
- Conduct:
- Load testing
- Chaos testing
- Build resilient systems that can tolerate failure
- Partner with product and backend engineers to:
- Improve system reliability
- Embed observability into services
- Help teams design production-ready systems from day one
Core Requirements
- Strong software engineering background (not just ops)
- Proficiency in Python (required) for building tools and services
- Experience operating production systems at scale
- Experience with:
- Kubernetes / Docker
- Cloud platforms (AWS/GCP/Azure)
- Distributed systems
- Experience with:
- On-call rotations and incident response
- Monitoring tools (Grafana, Prometheus, etc.)
- Debugging production issues under pressure
- Experience with:
- AI/ML systems or data pipelines
- Event-driven architectures
- High-availability systems
- Build foundational product features for an AI-first enterprise platform
- The opportunity to take ownership of critical systems that scale to millions of users
- A culture that values craftsmanship, autonomy, and technical excellence
- Competitive compensation, equity, and benefits package
- Work from our Flatiron District, Manhattan office, where you'll be side-by-side with the founding team in a supportive, collaborative setting. Our team works on-site five days a week, growing and building together, and the location is easy to reach with plenty of public transportation options.
Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Sr. Site Reliability Engineer in New York, NY vacancy
$89k - $178k
...Sr. Site Reliability Engineer I NYC Global HQ Hybrid (3 days per week in office) DV is the leader in digital performance solutions, helping our advertiser and agency partners verify the quality of their digital campaigns, optimize to improve performance and prove...SeniorWork at office3 days per week$150k - $200k
This position is posted by Jobgether on behalf of a partner company. We are looking for a Sr. Site Reliability Engineer in the United States. The role offers a unique opportunity to ensure the stability, scalability, and reliability of critical systems in a fast‑paced,...SeniorWork from homeFlexible hours- ...remote role, we will consider applicants based in LATAM. Our Engineering team is having a blast while delivering the most... ...engineers building and maintaining Kraken's infrastructure. As a Site Reliability Engineer, you will keep one of the fastest growing companies...SeniorLocal areaRemote work
$170k - $225k
About The Role Zora is looking for an experienced infrastructure / site reliability software engineer to work closely with the development team to ensure that the infrastructure / site reliability meets the needs of the business and is scalable and highly available, including...SeniorLocal areaRemote workHome officeFlexible hours$180k - $200k
...Objectives (SLOs) and Service Level Agreements (SLAs) to ensure reliable and consistent service delivery Incident Response and... ...support and guidance on infrastructure‑related issues Software Engineering for Operations: Develop and maintain internal tools and services...SeniorFor contractorsWork at officeWork from homeFlexible hours- ...systems. We’re looking for a experienced SRE to take ownership of reliability across our multi-region, cloud-native platform. You’ll have... ...and failure simulations to harden the platform. Mentor engineers and set best practices for SRE across the company. What You Bring...SeniorRemote work
- Sr. SRE (Engineering & Administration Background) St Louis, MO (Hybrid, 3 days onsite/Week) Long Term Contract Preferably looking for 7+ years of experience candidate Card Payment knowledge We are looking for a site reliability engineer (SRE).SeniorLong term contract3 days per week
- ...Senior Site Reliability Engineer (SRE) Our client is a global technology consulting and digital solutions company that enables enterprises across industries to reimagine business models, accelerate innovation, and maximize growth by harnessing digital technologies....SeniorLocal area
- ...About the job Senior Site Reliability Engineer About the Company Stellar is a decentralized, public blockchain that gives developers the tools to create experiences that are more like cash than crypto. The network is faster, cheaper, and far more energy-efficient...Senior
$182.3k - $220k
...healthcare by putting patients first - and that mission depends on reliable, secure, and scalable systems. As a Senior SRE on the... ...hardening infrastructure and building tools that empower our engineers to ship safely and confidently. You will work across teams...SeniorLocal areaFlexible hours- Sr Site Reliability Engineer (Linux, UNIX, Reliability Engineering, Python, C, C++, Java, DevOps) in New York City C, C++, DevOps Engineer, Java, Linux, Perl, Python, Reliability Engineering, SQL, Unix Location: New York Job Function: Reliability Engineering Date Of Job...Senior
$150k - $175k
...Site Reliability Engineer At ASAPP, our mission is simple: deliver the best AI-powered customer experience—faster than anyone else. To achieve that, we're guided by principles that shape how we think, build, and execute. We value customer obsession, purposeful speed...SeniorRemote work$127k - $249k
...The Team Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational... ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper)....SeniorWork at officeLocal areaRemote workWorldwideFlexible hours$189k - $283.6k
...the SRE team, you will proactively and reactively improve the reliability of Block's platform and critical infrastructure. You are metrics... ...~ A strong desire to perform and grow as an engineer ~5+ years of software development experience Technologies...SeniorFull timeLocal areaRemote workRelocation packageFlexible hoursShift work$175k - $230k
...This role is critical to ensure Sage can live up to its mission to be a 24x7, highly available platform for elder care. As a Site Reliability Engineer, you'll partner with engineering teams across the organization to achieve four 9s of uptime for our platform....SeniorApprenticeshipWork at officeLocal areaRemote work2 days per week$182.3k - $220k
...healthcare by putting patients first - and that mission depends on reliable, secure, and scalable systems. As a Senior SRE on the... ...hardening infrastructure and building tools that empower our engineers to ship safely and confidently. You will work across teams...SeniorLocal areaFlexible hours$175k - $245k
A leading asset management firm in New York is seeking a Site Reliability Engineer to ensure high availability of technology services. The ideal candidate will have experience with AWS, Docker, and various operating systems. This role includes responsibilities like streamlining...Senior- Tavily Inc. in New York City is seeking a Senior Site Reliability Engineer to manage Kubernetes clusters and own the full infrastructure. You will improve CI/CD pipelines and ensure systems are reliable and scalable. This role offers the chance to work on real scaling challenges...Senior
- A leading technology firm is seeking a Sr. Site Reliability Engineer in the United States. The ideal candidate will enhance system reliability and stability and should possess over 8 years of relevant experience in site reliability engineering. The position covers cloud...Senior
- ...Description A major financial services company in NYC is growing its team rapidly, and they are looking for a Senior DevOps Engineer / Site Reliability Engineer who can join. If you’re passionate about high-availability, reliability, automation, we’d be excited to talk...Senior
$180k - $200k
Parabola is looking for a Senior Site Reliability Engineer to improve performance and reliability of its software systems in New York. This role requires 5+ years of SRE or DevOps experience and expertise in AWS and containerization tools. Offering a salary of $180,000...SeniorWork at office3 days per week- ...requirements unforgiving, and the impact immediate. This isn’t a reactive firefighting role. It’s proactive, engineering-focused SRE where you’ll automate reliability, engineer for performance, and shape infrastructure strategy at the firm level. What they’re looking for:...SeniorImmediate start
- Legora-Ab is seeking a Senior Site Reliability Engineer to join our NYC engineering hub. You will own critical services, enhancing reliability across our platform and collaborating closely with engineering teams in Stockholm. This is a full-time, in-office position focused...SeniorFull timeWork at office
- Curated careers, resources, tips and trends from the DevOps World. As a Senior Site Reliability Engineer, you will play a pivotal role in ensuring the reliability and performance of our cloud-based infrastructure. Your primary responsibilities will include monitoring system...SeniorRemote workFlexible hours
- ⚡ Senior Site Reliability Engineer (Azure) The Company Storm2's client is a fast-growing software company at the centre of one of the more credible enterprise blockchain ecosystems in market, supporting a proof-of-stake public network governed by major institutions across...Senior
- ...forward to hearing from passionate, goal-oriented applicants ready to make their mark in the blockchain space. As a Senior Site Reliability Engineer, you'll work at the intersection of cloud infrastructure and blockchain, building the platform that our product teams...Senior
- ...customer acquisition, and Connor was a machine learning research engineer at Scale AI. The rest of our team comes from companies like... ...of-the-art AI. As a Senior SRE, you'll tackle the scaling and reliability challenges that come with adding terabytes of data monthly and...Senior
$165k - $242k
...Senior Site Reliability Engineer, Data Infrastructure CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading...SeniorPermanent employmentTemporary workCasual workWork at officeFlexible hours$65 - $75 per hour
...virtualization technologies. Knowledge of ITIL frameworks, Jira, Confluence, and IT Service Management tools. Description: As an Engineer 2, you will collaborate with management, departments, and customers to identify end-user requirements for infrastructure monitoring...SeniorContract workRemote work$116.63k - $181.24k
Summary The Wikimedia Foundation is looking for a Senior Site Reliability Engineer to join our team, reporting to the Sr. Engineering Manager. As the Site Reliability Engineer, you will play a key role in designing, developing, and maintaining reliable, scalable, and highly...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Sr. Site Reliability Engineer. Be the first to apply!
Related searches
- site reliability engineering manager New York, NY
- site reliability engineer remote New York, NY
- site reliability engineer sre New York, NY
- site reliability engineer New York, NY
- senior cloud service delivery manager New York, NY
- senior business analyst contract New York, NY
- senior product design engineer New York, NY
- senior game producer New York, NY
- senior software manager New York, NY
- senior creative strategist New York, NY


