Senior Site Reliability Engineer
$125.04k - $187.56k
ViziRecruiter, LLC.
Introduction
Ahold Delhaize USA, a division of global food retailer Ahold Delhaize, is part of the U.S. family of brands, which also includes five leading omnichannel grocery brands – Food Lion, Giant Food, The GIANT Company, Hannaford and Stop & Shop. Ahold Delhaize USA associates support the brands with a wide range of services, including Finance, Legal, Sustainability, Commercial, Digital and E-commerce, Technology and more.

Overview
The Site Reliability Engineer (SRE) III is responsible for ensuring the scalability, reliability, and performance of production systems through automation, observability, incident response, and infrastructure engineering. This role involves designing and implementing robust operational processes and tooling to support highly available, fault-tolerant systems in a cloud-native environment. The SRE III collaborates closely with engineering squads, product teams, and stakeholders to embed reliability best practices across the software delivery lifecycle. The role includes ownership of system uptime, service level objectives (SLOs), and operational excellence, along with mentoring junior engineers and leading cross-functional initiatives that improve system resilience.

Applicants must be currently authorized to work in the United States on a full-time basis. Our flexible/hybrid work schedule includes 3 in-person days at our Chicago office and 2 remote days.

Responsibilities
- Design and implement infrastructure solutions that ensure system availability, scalability, and reliability across cloud-native environments like AKS and Kubernetes.
- Develop automation for provisioning, deployment, configuration, monitoring, and incident remediation using tools such as Terraform, ArgoCD, and GitHub Actions.
- Collaborate with engineering teams to define and track service level objectives (SLOs) and service level indicators (SLIs).
- Build and manage microservices-based platforms leveraging Spring Boot, Java, Tomcat, and Redis.
- Monitor production environments using Datadog and proactively address performance and reliability issues.
- Perform root cause analysis and lead post-incident reviews to drive continual improvement.
- Manage CI/CD pipelines and deployment automation using GitHub, Docker, and container orchestration technologies.
- Create and maintain infrastructure as code (IaC) using Terraform, with deployment pipelines integrated into GitOps workflows.
- Lead and support operational readiness reviews, game days, chaos engineering practices, and failure mode analysis.
- Build scalable observability and alerting frameworks with Datadog.
- Implement resilient, asynchronous architectures using Kafka for event-driven services.
- Reduce operational toil through self-healing automation and proactive system tuning.
- Troubleshoot Linux-based environments such as Ubuntu and optimize them for reliability.
- Provide on-call support and ensure 24/7/365 system reliability for mission-critical applications.
- Collaborate with the security team to enforce secure operational practices and cloud compliance.
- Mentor junior engineers and contribute to documentation, technical design, and knowledge-sharing across the organization.

Requirements
- Bachelor's Degree in Computer Science, Information Systems, or a related technical field; equivalent training, certifications, or experience will be considered.
- 5+ years of experience in a Site Reliability Engineering, DevOps, or Java programming role.
- Experience managing production-grade systems and services on AKS/Kubernetes in distributed environments.
- Proficiency in programming and scripting languages including Python, Java, Bash, or Go.
- Proven experience with Spring Boot, Tomcat, Redis, and microservices architecture.
- Hands‑on experience in managing Linux environments, particularly Ubuntu.
- Proficiency with observability stacks and performance monitoring using Datadog, Prometheus, and ELK.
- Deep understanding of containerization and orchestration using Docker, Kubernetes, and ArgoCD.
- Experience managing event‑driven systems using Kafka.
- Expertise in IaC and automation using Terraform and GitHub Actions.
- Familiarity with networking concepts, DNS, load balancing, and cloud infrastructure (AWS, Azure, or GCP).
- Strong analytical, debugging, and problem‑solving skills.
- Excellent verbal and written communication skills and the ability to collaborate effectively across teams.

Salary Range: $125,040 - $187,560
Actual compensation offered to a candidate may vary based on their unique qualifications and experience, internal equity, and market conditions. Final compensation decisions will be made in accordance with company policies and applicable laws.
- ...No H1 or C2C. Must be Permanent Resident or US Citizen Senior Site Reliability Engineer Description and Requirements About Our Team We are building Quantum, a next‑generation hybrid AI platform that spans Windows, Android, and cloud. As part of this vision...
  Senior, Permanent employment, Remote work
$160k - $200k
Ripple is seeking a Senior Site Reliability Engineer in Chicago. In this role, you will enhance platform reliability by embedding with engineering teams and coaching them on CI/CD practices, observability, and application security. Your expertise will help us redefine...
Senior
$160k - $200k
Ripple is seeking a Senior Software Engineer, Site Reliability in Chicago, Illinois. This role involves ensuring the reliability and availability of Ripple's products while mentoring engineering teams on best practices. The ideal candidate has over 5 years of experience...
Senior, Flexible hours
$127k - $249k
...Eastern or Central time zones. We are looking for an experienced Senior Engineer for our SRE, Atlas team to support, maintain and grow the... ...workloads. Role Overview We are seeking a talented Site Reliability Engineer (SRE) with a strong infrastructure background....
Senior, Local area, Remote work, Worldwide, Flexible hours
$129k - $160k
...About the Company As a Senior Site Reliability Engineer (SRE) at TAG – The Aspen Group, you will be responsible for ensuring the reliability, performance, and scalability of our core systems. This role involves proactively building and managing, monitoring solutions...
Senior
$130k - $180k
...best of both work styles in a workplace that is intentional about belonging, collaboration, and accomplishment. Being a Senior Site Reliability Engineer at iManage Means… You are an engineer, a builder, and a systems thinker. You’ll create middleware and platform...
Senior, Work at office, Local area, Remote work, Worldwide, Monday to Friday, Flexible hours
- CME Chicago Mercantile Exchange Inc. is seeking a Site Reliability Engineer III to enhance stability for CME Clearing & Risk. In this role, you will ensure secure and reliable technology solutions, bridging development and operations while maintaining risk management services...
  Senior
- Senior Site Reliability Engineer - Google Distributed Cloud Edge (Edge SRE) Location: Hybrid - Chicago, IL (preferred) | Employment Type: W2, Contract to Hire, Direct Hire Overview Our client is seeking a highly skilled Edge Site Reliability Engineer (Edge SRE) to lead...
  Senior, Contract work
- About the job We are looking for a senior site reliability engineer to join the Cloud FinOps team at Hopper. We manage a large infrastructure in Google Cloud that is used by hundreds of engineers to provide a first class experience to millions of end users around the world...
  Senior, Remote job, Work from home, Sleeping nights
$94k - $163k
...Summary We’re looking for a Technical Senior Manager of SRE to play a central role in... ...approximately 70% of time to hands‑on engineering tasks, such as developing new deployments... ...: Proven ability to collaborate with Site Reliability Engineers and cross‑functional teams, facilitating...
Senior, Work at office, Flexible hours
$145k - $175k
...to help you gain your full potential. Job Overview The Site Reliability Engineer supports deployments, cloud infrastructure, and monitoring... ...and infrastructure improvements. You'll be joining a small, senior SRE team with broad ownership of the platforms and infrastructure...
Senior, Full time, Temporary work, Work at office, Local area, Flexible hours, 3 days per week
$130k - $140k
...GlobalLogic is seeking a Senior Infrastructure Engineer in Deer Park, IL, to design and operate the enterprise observability stack. The ideal candidate has 7+ years in SRE or cloud infrastructure engineering, deep expertise in Microsoft Azure, and strong skills in Infrastructure...
Senior
$130k - $165k
...Job Title: Senior Software Engineer Company: Snapsheet Job Location: USA, Remote Job Type: Full-time, direct hire Job Department: Technology Team: Site Reliability Engineering About Snapsheet: Snapsheet exists to simplify claims. We leverage...
Senior, Full time, Temporary work, Local area, Remote work, Visa sponsorship, Work visa, Flexible hours
$111k - $188k
...drives our business. Our team is made up of talented software engineers, infrastructure engineers, leaders and UX professionals. We... ...centers, infrastructure, design and grit. The Role: Senior Site Reliability Engineer with extensive experience in automation and...
Senior, Temporary work, Work at office, Immediate start, Remote work, 3 days per week
- Hitachi Vantara Corporation is looking for a Site Reliability Engineer (SRE) to design and operate the enterprise observability stack, including Azure Monitor and Managed Grafana. This position requires extensive experience in SRE and cloud infrastructure, with a focus...
  Senior
$130k - $150k
...Site Reliability Engineer - Disaster Recovery & Business Continuity Boston, MA, United States; Chicago, IL, United States About Charles River... ...career mentoring and performance coaching from an assigned senior colleague. Additional leadership and collaboration...
Work at office, Work from home, 3 days per week
$93.9k - $156.5k
Site Reliability Engineer II. Locations: Chicago - 20 S. Wacker. Time type: Full time. Posted on: Posted... ...trading days. The successful candidate will work alongside senior engineers to learn how we observe, monitor, automate, and improve...
Work at office, Local area, Worldwide, 2 days per week
$127.33k - $159.17k
...Service Management. It’s our goal to always provide an engaging, relevant, and simple experience for our customers. The Site Reliability Engineer (SRE) - Edge Platform is a key member of the Edge Operations and SRE team within Global Technology Infrastructure & Operations...
Local area, Flexible hours, Shift work
- Site Reliability Engineer (Chicago, IL; Dallas, TX; ...) Qualifications: 8+ years of Software Engineering experience, or equivalent demonstrated through one or a combination of: work experience, training, experience, education. Contractor will implement and maintain scalable...
  Contract work, For contractors, Work experience placement
$93.9k - $156.5k
...work model, requiring 2 days per week on‑site at our corporate office 20 S Wacker Dr,... ...low‑latency performance and rock‑solid reliability to seamlessly handle the world’s... ...Responsibilities Work alongside product teams and senior engineers to assist with building out...
Work at office, Local area, 2 days per week
$130k - $140k
...#LI-VK1 Requirements 7+ years of experience in SRE, platform engineering, or cloud infrastructure engineering in large‑scale enterprise... ...By joining GlobalLogic, you’re placing your trust in a safe, reliable, and ethical global company. Integrity and trust are a cornerstone...
Work experience placement, Work at office
- ...the process defined for an alerts/issue Contact external vendors if their integrations fail Measure the front-end metrics for the site with various tools available Qualifications Must have worked on support projects Must know GCP, Kubernetes and Dynatrace...
$130k - $225k
...expectations, integrity, innovation and a willingness to challenge consensus. The Algorithmic Trading Team is looking for a Site Reliability Engineer for our Chicago office. The SRE team is critical to the success of our trading - ensuring that our production trading...
Temporary work, Work at office, Flexible hours
- We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) to join our dynamic team. In this role, you will apply SRE principles to increase the reliability, scalability, and performance of critical enterprise applications. You will partner with cross...
$93.9k - $156.5k
CME Group Inc. is looking for a Site Reliability Engineer II in Chicago to assist in building, operating, and scaling systems. This role requires... ..., and problem-solving. Candidates will work with senior engineers and collaborate across teams to enhance service reliability...
- ...self-healing capabilities and platform automation using Logic Apps and Python. Requirements 7+ years of experience in SRE, platform engineering, or cloud infrastructure engineering in large-scale enterprise environments. Deep, hands-on expertise with Microsoft Azure (...
$118.3k - $219.8k
...Are you excited to lead Site Reliability Engineering teams that keep mission-critical, 24/7 services running reliably and securely? Do you enjoy building automated cloud platforms, hardening security, and driving ongoing cost optimization through strong FinOps practices...
Temporary work, Local area
- ...Partner, Good Morning, Greetings from Nukasani group Inc! We have below urgent long term contract project immediately available for Senior Systems Software Programmer, Chicago, IL, Onsite need submissions you please review the below role, if you are available,...
  Senior, Long term contract, For contractors, Local area, Immediate start, Day shift
- iManage is seeking a Senior Site Reliability Engineer in Chicago, IL to enhance their cloud platform's reliability and scalability. As an SRE, you will drive automation initiatives to reduce operational toil and mentor teammates. Your role includes leading architectural...
  Senior
- A leading financial technology firm in Chicago is seeking a Staff Site Reliability Engineer who will pioneer reliable infrastructure for critical clearing systems. The role involves architecting solutions, driving automation, and collaborating across teams to enhance performance...


