Senior Software Engineer - Site Reliability Engineering (Remote)
$90k - $180k
Home Depot
- Remote job
Position Purpose: The Senior Software Engineer for Site Reliability Engineering (Store Systems Enablement) builds and operates the internal platforms that keep Home Depot's store systems observable, reliable, and automated. This is a platform engineering role: you will design, develop, and maintain the tools that hundreds of development and reliability teams depend on, not just use them. The team owns and operates a portfolio of reliability platforms, including a custom-built synthetic testing system that runs inside physical Home Depot stores, operational automation infrastructure serving dozens of teams, and the full observability stack (logging, tracing, and profiling) for Store Systems. You will write code, deploy infrastructure, tune distributed systems, and reduce operational toil through automation, including AI‑assisted workflows.

Key Focus Areas:
- Platform Development: Build and extend internal reliability tools using Kubernetes, Terraform, and modern infrastructure-as-code patterns on Google Cloud Platform.
- Observability Operations: Deploy, configure, and maintain production logging, tracing, and profiling systems. Own the SLO/CUJ platform that enables multi-window, multi-burn-rate alerting and automated tracking dashboards for RE teams across Store Systems.
- Toil Reduction & Automation: Identify repetitive operational work and engineer it away. Build self-service capabilities, Copilot skills, and automation pipelines so teams can operate independently.
- SLO & CUJ Enablement: Maintain and extend the platform that powers SLO and Critical User Journey definition across the organization. Educate RE teams on what good SLOs and CUJs look like, assist with onboarding, and build automation and documentation so teams can self‑serve. You will have strong opinions on the right way to measure reliability and the tooling to back them up.
- Synthetic Monitoring: Extend our in-store synthetic testing platform: onboard teams, enable them to write and deploy their own tests, and evolve the platform's orchestration, alerting, and self‑service capabilities.
- Incident Response & Resilience: Participate in the on-call rotation for observability infrastructure. Lead and contribute to blameless post‑mortems. Design and execute destructive tests to validate platform resilience.

You will work on a small, high‑impact team where the work is varied: some weeks you’re writing Terraform and Helm charts, others you’re debugging Loki query performance or building a Copilot skill to automate a support workflow. You will be expected to own problems end‑to‑end, from investigation through implementation to production deployment.

Key Responsibilities:
- 50% Delivery and Execution – Develops, tests, deploys, and maintains software, with a clear understanding of the value the software is to provide; Takes on new opportunities and tough challenges with a sense of urgency, high energy and enthusiasm; Consistently achieves results, even under tough circumstances; Develops test suites (functional, destructive, etc.) to enable successful, rapid deployment of code to production; Takes a broad view when approaching issues, using a global lens.
- 20% Learns and Grows – Learns through successful and failed experiments when tackling new problems; Actively seeks ways to grow and be challenged using both formal and informal development channels.
- 20% Plans and Aligns – Collaborates with other team members in agile processes; Creates new and better ways for the organization to be successful; Works with the Product Team to ensure user stories are valuable, developer ready, easy to understand and testable; Delivers multi‑mode communications that convey a clear understanding of the unique needs of different audiences; Adapts approach and demeanor in real time to match the shifting demands of different situations; Relates openly and comfortably with diverse groups of people.
- 10% Supports and Enables – Helps grow junior engineers by providing guidance on modern software development frameworks and leading technical discussions.

Direct Manager / Direct Reports: This position typically reports to Software Engineer Manager or Sr. Manager. This position has 0 Direct Reports.

Travel Requirements: No travel required.

Physical Requirements: Most of the time is spent sitting in a comfortable position and there is frequent opportunity to move about. On rare occasions there may be a need to move or lift light articles.

Working Conditions: Located in a comfortable indoor area. Any unpleasant conditions would be infrequent and not objectionable.

Minimum Qualifications:
- Must be eighteen years of age or older.
- Must be legally permitted to work in the United States.

Preferred Qualifications:
- 3‑5 years of experience in Site Reliability Engineering, Platform Engineering, DevOps, or Infrastructure Engineering.
- Hands‑on experience with Google Cloud Platform (GCP), including GKE, GCS, BigQuery, Cloud Pub/Sub, Cloud Logging, IAM, and Workload Identity. Experience with other major cloud providers (AWS, Azure) is also valuable.
- Strong Kubernetes experience: deploying and managing workloads on GKE or similar managed Kubernetes services, writing and debugging Helm charts, managing namespaces, RBAC, service accounts, and troubleshooting issues.
- Experience with infrastructure-as-code tools, particularly Terraform for cloud resource management.
- Familiarity with cdk8s (CDK for Kubernetes) or similar programmatic IaC tools is a plus.
- Proficiency in one or more of: Go, Python, JavaScript/TypeScript, YAML. You don’t need all of them, but you should be comfortable reading Go, writing YAML and HCL, and scripting in Python or JavaScript.
- Experience with observability platforms: deploying, configuring, or operating log aggregation, distributed tracing, metrics, dashboarding, or continuous profiling.
- Practical understanding of SLOs, SLIs, and error budgets. Experience defining Service Level Objectives, instrumenting services for SLI measurement, and configuring burn‑rate alerting is highly preferred.
- Experience with synthetic monitoring or performance testing frameworks (k6, Playwright, Selenium, Locust, or similar). Bonus if you’ve built or operated a synthetic testing platform rather than just consumed one.
- Familiarity with incident management and on‑call practices: Blameless post‑mortems, runbook development, and incident communication.
- Experience with CI/CD pipelines using GitHub Actions, Spinnaker, ArgoCD, or similar. Understanding of deployment strategies (blue/green, canary, rolling).
- Experience with automation to reduce operational toil: building self‑service tooling, writing scripts or bots to handle repetitive tasks, or developing internal developer platforms.
- Familiarity with AI‑assisted development tools (GitHub Copilot, LLM‑based automation, MCP servers) is a plus. We are actively building AI skills and automation into our workflows.
- Experience writing clear technical documentation, runbooks, and onboarding guides.
- Comfort working on a small team with broad ownership: you will context‑switch between writing code, debugging production systems, and onboarding partner teams.

Minimum Education: The knowledge, skills and abilities typically acquired through the completion of a bachelor's degree program or equivalent degree in a field of study related to the job.
Preferred Education: No additional education.

Minimum Years of Work Experience: 3

Preferred Years of Work Experience: No additional years of experience.

Minimum Leadership Experience: None

Preferred Leadership Experience: None

Certifications: None

Competencies:
- Global Perspective
- Manages Ambiguity
- Nimble Learning
- Self-Development
- Collaborates
- Cultivates Innovation
- Situational Adaptability
- Communicates Effectively
- Drives Results
- Interpersonal Savvy

We are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, color, sex, age, national origin, religion, sexual orientation, gender identity, status as a veteran, and basis of disability or any other federal, state or local protected class.

Apply End Date: 06/18/2026
$90,000.00 - $180,000.00




