Senior Software Engineer - Site Reliability Engineering (Remote)
$90k - $180kHome Depot
Position Purpose:
The Senior Software Engineer for Site Reliability Engineering (Store Systems Enablement) builds and operates the internal platforms that keep Home Depot's store systems observable, reliable, and automated. This is a platform engineering role: you will design, develop, and maintain the tools that hundreds of development and reliability teams depend on, not just use them.
The team owns and operates a portfolio of reliability platforms, including a custom-built synthetic testing system that runs inside physical Home Depot stores, operational automation infrastructure serving dozens of teams, and the full observability stack (logging, tracing, and profiling) for Store Systems. You will write code, deploy infrastructure, tune distributed systems, and reduce operational toil through automation, including AI-assisted workflows.
Key focus areas include:
Platform Development: Build and extend internal reliability tools using Kubernetes, Terraform, and modern infrastructure-as-code patterns on Google Cloud Platform.
Observability Operations: Deploy, configure, and maintain production logging, tracing, and profiling systems. Own the SLO/CUJ platform that enables multi-window, multi-burn-rate alerting and automated tracking dashboards for RE teams across Store Systems.
Toil Reduction & Automation: Identify repetitive operational work and engineer it away. Build self-service capabilities, Copilot skills, and automation pipelines so teams can operate independently.
SLO & CUJ Enablement: Maintain and extend the platform that powers SLO and Critical User Journey definition across the organization. Educate RE teams on what good SLOs and CUJs look like, assist with onboarding, and build automation and documentation so teams can self-serve. You will have strong opinions on the right way to measure reliability and the tooling to back them up.
Synthetic Monitoring: Extend our in-store synthetic testing platform: onboard teams, enable them to write and deploy their own tests, and evolve the platform's orchestration, alerting, and self-service capabilities.
Incident Response & Resilience: Participate in on-call rotation for observability infrastructure. Lead and contribute to blameless post-mortems. Design and execute destructive tests to validate platform resilience.
You will work on a small, high-impact team where the work is varied: some weeks you're writing Terraform and Helm charts, others you're debugging Loki query performance or building a Copilot skill to automate a support workflow. You will be expected to own problems end-to-end, from investigation through implementation to production deployment.
Key Responsibilities:
50% Delivery and Execution - Develops, tests, deploys, and maintains software, with a clear understanding of the value the software is to provide; Takes on new opportunities and tough challenges with a sense of urgency, high energy and enthusiasm; Consistently achieves results, even under tough circumstances; Develops test suites (functional, destructive, etc) to enable success, rapid deployment of code to production; Takes a broad view when approaching issues; using a global lens
20% Learns and Grows - Learns through successful and failed experiment when tackling new problems; Actively seeks ways to grow and be challenged using both formal and informal development channels
20% Plans and Aligns - Collaborates with other team members in agile processes; Creates new and better ways for the organization to be successful; Works the Product Team to ensure user stories are valuable, developer ready, easy to understand and testable; Delivers multi-mode communications that convey a clear understanding of the unique needs of different audiences; Adapts approach and demeanor in real time to match the shifting demands of different situations; Relates openly and comfortably with diverse groups of people
10% Supports and Enables - Helps grow junior engineers by providing guidance on modern software development frameworks, and leading technical discussions
Direct Manager/Direct Reports:
This position typically reports to Software Engineer Manager or Sr. Manager
This position has 0 Direct Reports
Travel Requirements:
- No travel required.
Physical Requirements:
- Most of the time is spent sitting in a comfortable position and there is frequent opportunity to move about. On rare occasions there may be a need to move or lift light articles.
Working Conditions:
- Located in a comfortable indoor area. Any unpleasant conditions would be infrequent and not objectionable.
Minimum Qualifications:
Must be eighteen years of age or older.
Must be legally permitted to work in the United States.
Preferred Qualifications:
3-5 years of experience in Site Reliability Engineering, Platform Engineering, DevOps, or Infrastructure Engineering
Hands-on experience with Google Cloud Platform (GCP), including GKE, GCS, BigQuery, Cloud Pub/Sub, Cloud Logging, IAM, and Workload Identity. Experience with other major cloud providers (AWS, Azure) is also valuable.
Strong Kubernetes experience: deploying and managing workloads on GKE or similar managed Kubernetes services, writing and debugging Helm charts, managing namespaces, RBAC, service accounts, and troubleshooting issues
Experience with infrastructure-as-code tools, particularly Terraform for cloud resource management. Familiarity with cdk8s (CDK for Kubernetes) or similar programmatic IaC tools is a plus.
Proficiency in one or more of: Go, Python, JavaScript/TypeScript, YAML. You don't need all of them, but you should be comfortable reading Go, writing YAML and HCL, and scripting in Python or JavaScript.
Experience with observability platforms: deploying, configuring, or operating log aggregation, distributed tracing, metrics, dashboarding, or continuous profiling
Practical understanding of SLOs, SLIs, and error budgets. Experience defining Service Level Objectives, instrumenting services for SLI measurement, and configuring burn-rate alerting is highly preferred.
Experience with synthetic monitoring or performance testing frameworks (k6, Playwright, Selenium, Locust, or similar). Bonus if you've built or operated a synthetic testing platform rather than just consumed one.
Familiarity with incident management and on-call practices: Blameless post-mortems, runbook development, and incident communication
Experience with CI/CD pipelines using GitHub Actions, Spinnaker, ArgoCD, or similar. Understanding of deployment strategies (blue/green, canary, rolling).
Experience with automation to reduce operational toil: building self-service tooling, writing scripts or bots to handle repetitive tasks, or developing internal developer platforms
Familiarity with AI-assisted development tools (GitHub Copilot, LLM-based automation, MCP servers) is a plus. We are actively building AI skills and automation into our workflows.
Experience writing clear technical documentation, runbooks, and onboarding guides
Comfort working on a small team with broad ownership: you will context-switch between writing code, debugging production systems, and onboarding partner teams
Minimum Education:
The knowledge, skills and abilities typically acquired through the completion of a bachelor's degree program or equivalent degree in a field of study related to the job.
Preferred Education:
- No additional education
Minimum Years of Work Experience:
- 3
Preferred Years of Work Experience:
- No additional years of experience
Minimum Leadership Experience:
- None
Preferred Leadership Experience:
- None
Certifications:
- None
Competencies:
Global Perspective
Manages Ambiguity
Nimble Learning
Self-Development
Collaborates
Cultivates Innovation
Situational Adaptability
Communicates Effectively
Drives Results
Interpersonal Savvy
We are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, color, sex, age, national origin, religion, sexual orientation, gender identity, status as a veteran, and basis of disability or any other federal, state or local protected class.
Apply End Date: 06/18/2026
- $90,000.00 - $180,000.00
- ...Bright Vision Technologies is seeking a Site Reliability Engineer to ensure the high availability of... ...distributed systems. This role requires strong software engineering principles and operational... ...tasks.This position is a full-time, remote opportunity within the Continental...Remote workSeniorFull time
- ...A leading company in crypto and Web3 is seeking a Senior Site Reliability Engineer to join their Onchain infrastructure team. The role involves automating and managing the infrastructure for digital asset access, focusing on cloud-based solutions. Ideal candidates have...Remote workSenior
- ...Seeking a highly skilled Senior Site Reliability Engineer, this full-time remote position will design and build observability pipelines for datacenter infrastructure, ensuring actionable visibility and operational efficiency across Vultr's global footprint. Key Responsibilities...Remote workSeniorFull time
- ...Do you want to shape reliability practices for a new AI inference platform? Are you a senior technical leader who drives solutions... ...architecture decisions with product engineering teams, and shape SRE... ...energize and inspire you! #LI-Remote Job Info Job Identification 2...Remote workSeniorPermanent employmentWork at officeWork from homeWorldwideFlexible hours
$131.46k - $158.8k
...Senior Site Reliability Engineer I Ensure that our database migration from the Boca data center to Microsoft Azure is a success. Deliver resilient... ...needed. Bachelor's degree (or foreign equivalent) in Software Engineering, Computer Science, Information Technology, or...Remote workSeniorWork at office$100k - $115k
...Software And Systems Engineer Why you will love this job: Great opportunity to use software and... ...benefits including 6% match on 401K! Remote WFH role. Salary: $100,000 - $115... ...strategy across platform Troubleshoot site down issues and respond to emergency...Remote workSeniorWork from home- ...New York, United States | Posted on 11/13/2025 Title: Senior Site Reliability Engineer (SRE) Location: Remote AboutJanuary AtJanuary, we’re transforming the lives of borrowers by bringing humanity to consumer finance. Our data-driven products empower financial institutions...Remote workSenior
- ...FullStack is one of the fastest-growing software consultancy companies in the... ...Position We're looking to hire a Senior Site Reliability Engineer to join our team. You'll work with our... ...parental leave, holidays). ~100% remote work. ~ The ability to work with leading...Remote workSenior
$65 - $75 per hour
...on our W2- no C2C, no exceptions Fully remote Key Responsibilities: Process customer requests... ...Management tools. Description: As an Engineer 2, you will collaborate with management,... ...automation across the IT organization. Seniority level Mid-Senior level Employment type...Remote workSeniorContract work$75 per hour
...Contract Compensation: $55–$75/hour Location: Remote Role Responsibilities Build a realistic digital workspace... ...team . ~ Background in cloud architecture, site reliability engineering, platform engineering, DevOps/DevSecOps, or cloud FinOps....Remote workSeniorHourly payFull timeContract workFor contractorsSummer work$153k - $190k
...and not open to contract work. This role is open to fully remote work. Company Description b.well is on a mission to... ...to change healthcare for the better! Job Description As a Senior Site Reliability Engineer you will be tasked with making sure we build a reliable,...Remote workSeniorFull timeContract workLive in$104.9k - $174.7k
...Senior Site Reliability Engineer (SRE) About the Business: LexisNexis Risk Solutions is the essential partner in the assessment of risk. Within... ...may work a hybrid schedule. If not, this role is fully remote. We do not restrict applicants based on job site or posting...Remote workSeniorFull timeWork at officeLocal areaFlexible hours$91.7k - $163.7k
...Senior Observability Engineer Opportunities with Logistics Health Incorporated (LHI), part of the... ..., a satellite office in Chicago and remote employees throughout the country, we... ...role is focused on maintaining the reliability, scalability and availability of our...Remote workSeniorMinimum wageFull timeWork experience placementWork at officeLocal area- ...No H1 or C2C. Must be Permanent Resident or US Citizen Senior Site Reliability Engineer Description and Requirements About Our Team We... ...high engineering standards. Location: Open to remote work in the US. The preferred work location is Chicago, IL...Remote workSeniorPermanent employment
$150k - $200k
...Join to apply for the Senior Site Reliability Engineer role at Gradle Inc. Develocity is a first‑of‑its‑... ...and acceleration platform that helps software teams adopt and improve DORA capabilities... ..., and optimize local developer and remote CI feedback loops. Our software is...Remote workSeniorFull timeLocal areaWork from home- ...era of financial, creative, and personal freedom. The Department: Onchain The Role: Senior Site Reliability Engineer The Onchain infrastructure team at Gemini creates and manages software tools and platforms, automates the creation and support of this infrastructure,...Remote workSeniorFlexible hours
- ...Senior Site Reliability Engineer (Senior SRE) Cybercrime is rising, reaching record highs in 2024. According to the FBI's IC3 report, total losses... ...paid leave ~ Wellness reimbursement of $300/year ~ Remote worker reimbursement of $300/year ~ Professional development...Remote workSeniorFlexible hoursNight shift
$91.7k - $163.7k
...Sr. Site Reliability Engineer For those who want to invent the future of health care, here's your... ...You'll enjoy the flexibility to work remotely from anywhere within the U.S. as you... ...clouds. The role will work closely with software engineers, architects, and DevOps...Remote workSeniorMinimum wageFull timeWork experience placementWork at officeLocal area- ...Senior Site Reliability Engineer At Jamf, we believe in an open, flexible culture based on respect and... ...at Jamf. This role is offered as remote in Minneapolis, MN; Eau Claire, WI;... ...~ Minimum of 5 years experience in software engineering, SRE or production operations...Remote workSeniorWork at officeFlexible hours
- ...Senior Site Reliability Engineer – Azure Cloud Join to apply for the Senior Site Reliability Engineer role at Concord Technologies Concord Technologies... ...engineering solutions in Azure Cloud, as part of a remote role, with occasional travel to headquarters in Seattle,...Remote workSeniorFull timeLocal areaImmediate startFlexible hours
- ...for a highly motivated,diligent,andskillfulSite Reliability Engineerto join theCyber SecurityEngineering(CSE... ...impacts on people’s lives. This position can be remote anywhere in the U.S. TheSeniorSite Reliability Engineer willbe responsible forensuring production systems...Remote workSeniorTemporary work
$100k - $120k
...Senior Site Reliability Engineer Austin, TX, US; Remote, US; San Francisco, CA, US UJET leads the way in AI-powered contact center innovation, delivering a future-proof, cloud platform that redefines the customer experience with cutting-edge AI, true multimodality...Remote workSeniorWork experience placementLocal area$86.9k - $198k
...Job Description Remote Work: No Job Number: R023... ...Share job via: Share Site Reliability Engineer, Senior The Opportunity: Engineering... ...engineering, systems administration, or software development, if you have a passion for...Remote workSeniorFull timeContract workPart timeWork at officeLocal area- ...Senior Site Reliability Engineer - Operations As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered... ...Kansas City, MO | Hybrid | MO, TX, GA, and FL | remote Get To Know The Team We are seeking a highly skilled...Remote workSeniorOngoing contractCasual workFlexible hours
$185k - $227k
...Remote - United States; United States of America THE COMPANY Juul Labs's mission is... ...purpose and we are hiring the world’s best engineers, scientists, designers, product... ...details. ROLE AND RESPONSIBILITIES A Senior Site Reliability Engineer (SRE) is expected to own the...Remote workSenior- ...governments realize their greatest potential. Senior Site Reliability EngineerThe B&MI BizOps team is looking for a Senior BizOps Engineer who can help us solve problems, build our... ...during the application build phase in software run principals that includes operational design...Remote workSeniorFull timePart timeWorldwideFlexible hoursShift work
$170k - $200k
...Great Place to Work®. THE JOB We are looking for a Senior Site Reliability Engineer who understands that at VEG, "reliability" is a medical... ...opportunity to work at our VQ in White Plains or could be open to remote work. WHAT YOU'LL DO Formulate short- and long-...Remote workSeniorFull timeTemporary workCasual workWork at office$160k - $210k
...growing! The Role We are looking for a senior site reliability engineer to work on expanding our global... ...in office (Mon/Tue/Wed) and 2 days remote (Thursday/Friday). Responsibilities... ...years of experience in operations, software engineering, or as an SRE. ~...Remote workSeniorFull timeWork at officeImmediate startWork from home$149.1k - $157.8k
...of unmatched reverse engineering, teardown, and market... ...TechInsights is building the reliability and AI operations... ...We're looking for a Senior Site Reliability Engineer... .... This role is a remote role for candidates... ...reliability liaison to Software and AI Engineering,...Remote workSeniorPermanent employmentFlexible hours- ...businesses and governments realize their greatest potential. Senior Site Reliability Engineer The Biz Ops team is looking for a Senior Site Reliability... .... • Support the application CI/CD pipeline for promoting software into higher environments through validation and...Remote workSeniorFull timePart timeWorldwideFlexible hoursShift work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Software Engineer - Site Reliability Engineering (Remote). Be the first to apply!
- graduate software developer United States
- rust software engineer United States
- senior software design engineer United States
- software engineer student United States
- software engineer amazon United States
- software developer positions United States
- software engineer full time United States
- software qa engineer United States
- new graduate software engineer United States
- junior software developer United States


