Senior Site Reliability Engineer
$125.04k - $187.56kViziRecruiter,LLC.
Introduction Ahold Delhaize USA, a division of global food retailer Ahold Delhaize, is part of the U.S. family of brands, which also includes five leading omnichannel grocery brands – Food Lion, Giant Food, The GIANT Company, Hannaford and Stop & Shop. Ahold Delhaize USA associates support the brands with a wide range of services, including Finance, Legal, Sustainability, Commercial, Digital and E-commerce, Technology and more. Overview The Site Reliability Engineer (SRE) III is responsible for ensuring the scalability, reliability, and performance of production systems through automation, observability, incident response, and infrastructure engineering. This role involves designing and implementing robust operational processes and tooling to support highly available, fault-tolerant systems in a cloud-native environment. The SRE III collaborates closely with engineering squads, product teams, and stakeholders to embed reliability best practices across the software delivery lifecycle. The role includes ownership of system uptime, service level objectives (SLOs), and operational excellence, along with mentoring junior engineers and leading cross-functional initiatives that improve system resilience. Our flexible/hybrid work schedule includes 3 in-person days at our Chicago office and 2 remote days. Applicants must be currently authorized to work in the United States on a full-time basis. Responsibilities Design and implement infrastructure solutions that ensure system availability, scalability, and reliability across cloud-native environments like AKS and Kubernetes. Develop automation for provisioning, deployment, configuration, monitoring, and incident remediation using tools such as Terraform, ArgoCD, and GitHub Actions. Collaborate with engineering teams to define and track service level objectives (SLOs) and service level indicators (SLIs). Build and manage microservices-based platforms leveraging Spring Boot, Java, Tomcat, and Redis. Monitor production environments using Datadog and proactively address performance and reliability issues. Perform root cause analysis and lead post-incident reviews to drive continual improvement. Manage CI/CD pipelines and deployment automation using GitHub, Docker, and container orchestration technologies. Create and maintain infrastructure as code (IaC) using Terraform, with deployment pipelines integrated into GitOps workflows. Lead and support operational readiness reviews, game days, chaos engineering practices, and failure mode analysis. Build scalable observability and alerting frameworks with Datadog. Implement resilient, asynchronous architectures using Kafka for event-driven services. Reduce operational toil through self-healing automation and proactive system tuning. Troubleshoot Linux-based environments such as Ubuntu and optimize them for reliability. Provide on-call support and ensure 24/7/365 system reliability for mission-critical applications. Collaborate with the security team to enforce secure operational practices and cloud compliance. Mentor junior engineers and contribute to documentation, technical design, and knowledge-sharing across the organization. Requirements Bachelor's Degree in Computer Science, Information Systems, or a related technical field; equivalent training, certifications, or experience will be considered. 5+ years of experience in a Site Reliability Engineering, or DevOps, or Java programming role. Experience managing production-grade systems and services on AKS/Kubernetes in distributed environments. Proficiency in programming and scripting languages including Python, Java, Bash, or Go. Proven experience with Spring Boot, Tomcat, Redis, and microservices architecture. Hands‑on experience in managing Linux environments, particularly Ubuntu. Proficiency with observability stacks and performance monitoring using Datadog, Prometheus, and ELK. Deep understanding of containerization and orchestration using Docker, Kubernetes, and ArgoCD. Experience managing event‑driven systems using Kafka. Expertise in IaC and automation using Terraform and GitHub Actions. Familiarity with networking concepts, DNS, load balancing, and cloud infrastructure (AWS, Azure, or GCP). Strong analytical, debugging, and problem‑solving skills. Excellent verbal and written communication skills and the ability to collaborate effectively across teams. Salary Range: $125,040 - $187,560 Actual compensation offered to a candidate may vary based on their unique qualifications and experience, internal equity, and market conditions. Final compensation decisions will be made in accordance with company policies and applicable laws. #J-18808-Ljbffr ViziRecruiter,LLC.
$140k - $210.9k
...environments, strong communication, and a background in infrastructure or software engineering. Successful candidates will be responsible for producing CI/CD automation and ensuring reliability in distributed systems. A salary range of $140,000 - $210,900 is offered for...Senior- An innovative technology firm in Boston is seeking a Site Reliability Engineer to join their Cloud Infrastructure Team. This role involves working in high-scale environments, handling significant data processing and ensuring robust operation of FedRAMP cloud products. The...Senior
$140k - $210.9k
...States. The position will be primarily on‑site with residency commutable to one of our... .../DevOps backgrounds or software engineering backgrounds (e.g., Java, Python, Go) with... ...strong interest in operating and improving reliability of distributed production systems. Responsibilities...Senior$134.25k - $214.8k
...matters at a company where you matter. Your Impact Are you an engineer who gets excited about the challenge of making complex... ...it. You will be part of the Observability team within Axon’s Site Reliability organization - a focused team responsible for Axon’s metrics,...SeniorWork at officeRemote work$127k - $249k
The Team Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational... ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper). As...SeniorWork at officeLocal areaRemote workWorldwideFlexible hours$134.25k - $214.8k
...you matter. Your Impact As a Senior SRE on the APX SRE CloudOps... ...platforms that Axon's product engineering teams depend on. You will architect... ...experience to drive reliability improvements and inform platform... ...engineering, cloud infrastructure, or site reliability engineering....SeniorWork experience placementWork at officeRemote work- A global food retailer is seeking a Site Reliability Engineer III to ensure system reliability, scalability, and performance in their cloud-native environment. Responsibilities include designing infrastructure solutions and mentoring junior engineers, while requirements...Senior
$165.75k - $224.45k
...are dedicated to solving complex problems and making a huge impact. Where You Fit We're looking for a skilled staff level Site Reliability Engineer focused on designing, building, and operating our hybrid cloud/on-prem environment. What You’ll Do If you\'re the right...SeniorHourly pay- ...proven success as founders of previous start-ups. Location Cambridge, MA — Kendall Square HQ (In-Office) The Role As a Senior Site Reliability Engineer at Blitzy's Kendall Square headquarters, you will be a foundational force behind the reliability, scalability, and...SeniorWork at office
$180k - $225k
Your Impact You are a Sr. Site Reliability Engineer II who will help define how Axon builds and operates its core platforms, with a primary focus... ...cross‑functional. You will collaborate with staff and senior engineers across product and platform teams to shape how we...SeniorWork at officeImmediate startRemote work$160k - $180k
Blitzy Inc., based in Cambridge, MA, is seeking a Senior Site Reliability Engineer to enhance the reliability and performance of our AI development platform. In this hands-on role, you will work on cloud infrastructure, maintain systems, and collaborate with engineering...Senior- ...improve software solutions to ensure system reliability and availability, mitigate operational... ...issues. # You will help lead chaos engineering efforts in a production-alike environment... ...professionals, with engineers focused on site reliability engineering and...SeniorPermanent employmentFlexible hours
$146.96k - $220.44k
...including Finance, Legal, Sustainability, Commercial, Digital and E-commerce, Technology and more. Overview The Site Reliability Engineer (SRE) IV is a senior technical leader responsible for designing, guiding, and scaling site reliability engineering practices across...Full timeWork at officeRemote workFlexible hours- ...Software Engineer, Front End The Software Engineer, Front End will build modular web applications that are easy to use and fully tested... ...: Implement user interfaces that are highly intuitive, reliable, and meet the needs of our customers Contribute to component...Senior
- ...Software Engineer, Back End We are a company dedicated to harnessing nature to help farmers sustainably feed the planet. With a vision... ...Engineer) and their API needs Deep commitment to quality, reliability, scalability and maintainability Works and interacts well...Senior
- ...DevOps/Site Reliability Engineer We are hiring DevOps/Site Reliability Engineers to innovate upon the way we deploy, test, and develop our industry-leading marketing and analytics software. Engineers here solve problems in distributed computing, infrastructure automation...
- ...Software Development Engineer We're creating a platform that will change the way organizations measure their software development efforts... ...teams can work and the tools they use. Location: on-site in Boston. We believe that it takes a diverse team to build the...Senior
- ## Site Reliability EngineerBoston, MA · Full-time · Senior#### About The PositionCoralogix is a modern, full-stack observability platform transforming how businesses... ...up to 70%.We are looking for a Site Reliability Engineer to work as part of our Cloud Infrastructure Team....Full time
$112k - $132.1k
...The Senior Software Engineer is responsible for developing research and/or clinical applications within DFCI, providing technical oversight of all aspects of one or more software product, and leading technical discussions with team members and stakeholders. Located...Senior- ...Description Job Description RESPONSIBILITIES: Own and drive reliability strategy across New Product Introduction (NPI), in-market... ...environments ~ Bachelor's or Master’s degree in Mechanical Engineering, Materials Science, Reliability Engineering, or a related...Senior
- A leading software company in Boston is seeking a Senior Manager, Principal Software Engineer to develop the Pharmacometrics product suite. This pivotal role requires collaboration with diverse teams, deploying AI/ML solutions and optimizing resource utilization. Ideal...SeniorFlexible hours
$137k - $170.9k
...coordinate support and resolve platform issues across CPU/radio SoCs, MCU/PIC, NPU/GPU, and peripheral devices. Support hardware engineering teams with deep technical debugging and contribute to OS/platform modernization efforts. What You'll Need Basic...SeniorFull timeTemporary workWork at officeImmediate startVisa sponsorshipWork visa$151k - $297k
The Team Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational... ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper). As...Local areaImmediate startRemote workFlexible hoursShift work- DevOps / Site Reliability Engineer ID70127 Full time | AgileEngine | United States Posted On 06/10/2026 Job Information City Boston State/Province... ...Place to Work awards. ABOUT THE ROLE We are looking for a Senior Site Reliability Engineer to maintain operational...Full timeWork at officeRemote workVisa sponsorshipWork visaFlexible hours
$141k
...matters at a company where you matter. Your Impact As a Senior Software Engineer on Axon’s Robotics team, you’ll be at the forefront of... ...planning, stand‑ups, and long‑term planning. Build robust and reliable mission critical software that meets high standards for...SeniorWork experience placementWork at officeRemote work$193.39k - $318.98k
Red Hat, Inc. is seeking a Senior Principal Software Engineer to join the Azure Red Hat OpenShift Engineering team in Boston, MA. This high-impact role demands extensive experience in software development, particularly in Linux and Golang, and expertise in Azure cloud...Senior$129k - $171k
Anduril Industries in Boston is seeking a Solutions Engineer to design and implement business capabilities for the people systems. You will collaborate closely with engineering, supply chain, and finance teams to align systems with the company’s goals. The ideal candidate...Senior- ...Software Engineer Opportunity We're looking for talented software engineers to join our rapidly growing team in Boston! Be a part of a company poised to dominate an untapped segment of the construction industry! We built a cloud-based construction logistics technology...SeniorCasual workFlexible hours
- ...Job Title: Generative AI Engineer (Senior / Lead / Principal)- Multiple openings Experience Level: 8+ to 13+ Years Location: Hybrid - Remote (India-based) with onsite every Thursday in Chennai Industry: AI/ML, Enterprise Applications, Healthcare...SeniorWork at officeRemote work
$119.8k - $234.7k
...Overview About the Role ~ We'rebuildingAIfirst engineering systemsthat power growth at Microsoft - designing, shipping,... ...client applications ~ Proficiencyin designingscalable, reliable systemsthat support rapid iteration and experimentation...SeniorOngoing contractLocal area
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Site Reliability Engineer. Be the first to apply!
- senior manager quality engineering Quincy, MA
- senior cloud solutions architect Quincy, MA
- sr operations manager Quincy, MA
- senior performance engineer Quincy, MA
- senior manager diversity & inclusion Quincy, MA
- senior robotics software engineer Quincy, MA
- senior customer service Quincy, MA
- senior mainframe developer Quincy, MA
- senior cloud security engineer Quincy, MA
- senior strategy analyst Quincy, MA



