Senior Site Reliability Engineer, Observability
Framework Ventures
Overview

About Chainlink

Chainlink is the industry-standard oracle platform bringing the capital markets onchain and powering the majority of decentralized finance (DeFi). The Chainlink stack provides essential data, interoperability, compliance, and privacy standards needed to power advanced blockchain use cases for institutional tokenized assets, lending, payments, stablecoins, and more. Chainlink has enabled tens of trillions in transaction value and now secures the vast majority of DeFi. Many of the world’s largest financial services institutions have adopted Chainlink’s standards and infrastructure, including Swift, Euroclear, Mastercard, Fidelity International, UBS, S&P Dow Jones Indices, FTSE Russell, WisdomTree, ANZ, and top protocols such as Aave, Lido, GMX and many others. Chainlink leverages a novel fee model where offchain and onchain revenue from enterprise adoption is converted to LINK tokens and stored in a strategic Chainlink Reserve. Learn more at chain.link.

The Observability Team enables Chainlink development and empowers engineers to continue building and supporting crucial products and services that have a profound impact in the blockchain industry. Reliability is vital to the success of our company. As a Senior SRE, you will help us accelerate and enable other engineering teams by increasing self-service and decreasing cognitive load. This role is ideal for someone with a strong DevOps mindset, a passion for building and maintaining a mature GitOps environment, and experience focusing on observability. The entire engineering team is expanding, offering opportunities to build, learn, and grow.

We welcome applicants from diverse backgrounds. If you think you would do a great job at Chainlink, we look forward to speaking with you, even if you don't match 100% of the job requirements: those describe people we've usually had a great time working with, but they're not a tick-box exercise.

Your Impact

- Build and orchestrate a modern OTEL-based observability platform
- Support multiple telemetry types, like metrics, logs and traces
- Define and support modern governance in observability and problems at scale
- Ensure reliability, security, and performance exceed our defined SLAs
- Collaborate with engineers across the company to troubleshoot issues, deploy new products and services, and increase velocity while decreasing cognitive load
- Lead the design and deployment of monitoring/observability services to detect and alert the team of needed action
- Ingest, aggregate, transform, and utilize data from multiple sources in our real-time data pipeline
- Oversee availability, performance, and supportability of our observability infrastructure
- Create processes around alert response operations and support the team to ensure reliable delivery of oracle data
- Suggest metrics to enable alerts with every new feature release
- Champion reliability and security by doing work right the first time

Requirements

- 7+ years of relevant professional experience in DevOps, infrastructure, SRE, and/or platform teams
- Ability to develop software beyond typical infrastructure requirements and configurations
- Experience programming in C, C++, Java, Python, Go, Perl, or Ruby
- Expert knowledge in designing, developing, and managing large real-time systems
- Experience with monitoring and logging; exporting metrics with Prometheus; Grafana dashboards; and centralized logging solutions like ELK Stack, Splunk, or Grafana Stack
- Experience with distributed systems and container orchestration; maintaining or building Kubernetes clusters; deploying new services on Kubernetes
- Strong communication skills with comfort in planning meetings and code reviews

Desired Qualifications

- Excitement for blockchain, Web 3.0, and decentralized technologies
- Experience running infrastructure in the blockchain/web3 space
- Ability to scale systems sustainably through automation and evolving systems for reliability and velocity
- Experience working remotely in a distributed team
- Desire to grow and automate services to reduce toil

Tools and Services

- AWS; Terraform/Terragrunt; Kubernetes, Calico and ArgoCD; Prometheus and Grafana; GitHub Actions; Packer
- We expect proficiency with these tools and related ones

All roles with Chainlink Labs are global and remote-based. Unless otherwise stated, overlap with Eastern Standard Time (EST) is encouraged.

We carefully review all applications and aim to provide a response to every candidate within two weeks after the job posting closes. The closing date is listed on the job advert, so please prepare your application thoughtfully. We will fully consider your experience and skills, and you will hear from us regarding the status of your application shortly after the closing date.

Commitment to Equal Opportunity

Chainlink Labs is an equal opportunity employer. All qualified applicants will receive equal consideration for employment in compliance with applicable laws, regulations, or ordinances. If you need assistance or accommodation due to a disability or special need when applying for a role or in our recruitment process, please contact us via this form.

Global Data Privacy Notice for Job Candidates and Applicants

Information collected and processed as part of your Chainlink Labs Careers profile and any job applications you submit is subject to our Privacy Policy. By submitting your application, you are agreeing to our use and processing of your data as required.



