Senior Site Reliability Engineer, Observability
Chainlink Labs
Join Chainlink Labs as a Senior Site Reliability Engineer focused on Observability. The role supports our engineering teams by building a modern, OTEL‑based observability platform, driving reliability, security, and performance across a rapidly growing suite of blockchain services.

About Chainlink

Chainlink is the industry‑standard oracle platform bringing capital markets onchain and powering the majority of decentralized finance (DeFi). The Chainlink stack provides the essential data, interoperability, compliance, and privacy standards required for institutional tokenized assets, lending, payments, stablecoins, and more. Since inventing decentralized oracle networks, Chainlink has enabled tens of trillions in transaction value and now secures the vast majority of DeFi.

About the Observability Team

The Observability Team enables Chainlink development and empowers engineers to continue building and supporting crucial products and services that have a profound impact in the blockchain industry. Reliability is vital to the success of our company. As a Senior SRE you will accelerate and enable other engineering teams by increasing self‑service and decreasing cognitive load.

Your Impact

- Build and orchestrate a modern OTEL‑based observability platform.
- Support multiple telemetry types such as metrics, logs and traces.
- Define and enforce governance for observability and problem management at scale.
- Ensure reliability, security, and performance exceed defined SLAs.
- Collaborate with engineers across the company to troubleshoot issues, deploy new products, and increase velocity while reducing cognitive load.
- Lead the design and deployment of monitoring/observability services to detect and alert the team of needed actions.
- Ingest, aggregate, transform, and utilize data from a multitude of sources in our real‑time data pipeline.
- Oversee the availability, performance, and supportability of our observability infrastructure.
- Create processes around alert‑response operations and support the team to ensure reliable delivery of oracle data.
- Make recommendations to ensure sufficient metrics are collected to create alerts with every new feature release.
- Champion reliability and security by taking the time to do your work right the first time.

Requirements

- 7+ years of relevant professional experience; typically on devops, infrastructure, SRE, or platform teams.
- Ability to develop software beyond typical infrastructure configurations.
- Experience programming in C, C++, Java, Python, Go, Perl, or Ruby.
- Expert knowledge in designing, developing, and managing large real‑time systems.
- Experience with monitoring and logging: exporting metrics with Prometheus, building Grafana dashboards, and using a centralized logging solution such as ELK Stack, Splunk, or Grafana Stack.
- Experience with distributed systems and container orchestration, including maintaining or building Kubernetes clusters and deploying new services on them.
- Strong communication skills; capable of giving and receiving constructive feedback and participating in planning meetings and code reviews.

Desired Qualifications

- Excitement for blockchain, Web 3.0, and similar decentralized technologies.
- Experience running any infrastructure in the blockchain/web3 space.
- Ability to scale systems sustainably through automation and evolution of reliability and velocity.
- Experience working remotely in a distributed team.
- A strong desire to grow and challenge yourself by continuously improving and automating services to reduce toil.

Tools and Services we use daily

AWS, Terraform/Terragrunt, Kubernetes, Calico, ArgoCD, Prometheus, Grafana, GitHub Actions, Packer. Comfortable and proficient use of the above tools is expected.

Commitment to Equal Opportunity

Chainlink Labs is an equal opportunity employer. All qualified applicants will receive equal consideration for employment in compliance with applicable laws, regulations, or ordinances. If you need assistance or accommodation due to a disability or special need when applying for a role or in our recruitment process, please contact us via the designated form.

Global Data Privacy Notice for Job Candidates and Applicants

Information collected and processed as part of your Chainlink Labs Careers profile, and any job applications you choose to submit, is subject to our Privacy Policy. By submitting your application, you are agreeing to our use and processing of your data as required.

Seniority level: Mid‑Senior level
Employment type: Full‑time
Job function: Engineering and Information Technology
Industries: Technology, Information and Internet

Location

All roles with Chainlink Labs are global and remote‑based. Unless otherwise stated, we ask that you overlap some working hours with Eastern Standard Time (EST).

Recruitment Process

We carefully review all applications and aim to provide a response to every candidate within two weeks after the job posting closes. The closing date is listed on the job advert, so we encourage you to take the time to thoughtfully prepare your application and you will hear from us regarding the status of your application shortly after the closing date.


