Senior Site Reliability Engineer
SDI International
No H1 or C2C. Must be Permanent Resident or US Citizen
Senior Site Reliability Engineer
Description and Requirements
About Our Team
We are building Quantum , a next‑generation hybrid AI platform that spans Windows, Android, and cloud. As part of this vision, we are expanding the reliability engineering organization that powers cross‑device Personal AI.
We are looking for Senior Site Reliability Engineers (SREs) to help us build and evolve the foundational reliability, observability, and operations capabilities that ensure fast, safe, and dependable for millions of users.
This role may support one of several teams within the SRE organization (e.g., Observability, Operations, or Service Reliability), depending on your strengths and interests.
Operating with the speed, ownership, and creative latitude of a startup —yet supported by the scale, resources, and technical depth. We are building new systems, new tooling, and new operational models from the ground up, and we are doing so with clarity, intention, and high engineering standards.
Location: Open to remote work in the US. The preferred work location is Chicago, IL.
What You Might Work On
As a Senior SRE, you may be responsible for a subset of the following, depending on team placement and skill alignment:
Reliability & Performance Engineering
- Improving the availability, scalability, and performance of distributed systems across device, edge, and cloud.
- Defining or refining SLIs, SLOs, and error budgets for critical services.
- Leading initiatives to remove single points of failure, improve resilience, and reduce operational risk.
Operational Excellence
- Participating in on‑call rotations and contributing to incident response, triage, and post-incident reviews.
- Developing automation, runbooks, and self‑healing systems to reduce alert noise and MTTR.
- Enhancing operational readiness and supporting incident prevention programs.
Observability & Insight
- Designing or improving observability systems using OpenTelemetry , Grafana , and modern signal pipelines.
- Building dashboards, analytics, and alerting that illuminate system health and AI service behavior.
- Ensuring telemetry is reliable, actionable, and tied to real‑world outcomes.
Deployments & Change Safety
- Improving reliability of CI/CD workflows, including phased rollouts, canaries, shadow testing, and safe rollback mechanisms.
- Contributing to the evolution of deployment tooling for device+edge+cloud hybrid systems.
Systems Design & Collaboration
- Influencing architectural decisions by injecting reliability, observability, and operational considerations early in design.
- Collaborating with AI/ML engineers, platform engineers, firmware teams, and product partners to deliver robust, dependable user experiences.
Basic Qualifications
- 10+ years of experience in Site Reliability Engineering, Production Engineering, DevOps, or large‑scale distributed systems operations
- Bachelor’s Degree in Computer Science, Engineering, or a related technical discipline
- Strong experience running production distributed systems at scale
- Proficiency in at least one modern programming language (e.g., Python, Go, Java, C++)
- Strong understanding of Linux systems , networking fundamentals, and system performance tuning
- Experience with monitoring/observability (metrics, logs, tracing)
- Hands‑on experience with cloud environments (Azure, AWS, or GCP)
- Experience in incident management, on‑call rotations, and postmortem processes
Preferred Qualifications
- Deep experience with Azure cloud services
- Experience with OpenTelemetry for end‑to‑end instrumentation
- Strong familiarity with Grafana , Prometheus, Loki, Tempo, or similar tools
- Experience supporting AI/ML systems , model serving, or data‑intensive workloads
- Background with hybrid architectures (device + edge + cloud)
- Experience improving deployment reliability and progressive delivery systems
- Passion for automation, reliability engineering, and reducing operational friction
What Success Looks Like
- Systems become more observable, reliable, and predictable.
- Incidents are resolved quickly, and follow‑up improvements prevent recurrence.
- Alerting becomes more accurate, actionable, and trusted.
- Deployments become safer and more consistent.
- Teams move faster because reliability foundations are strong and intuitive.
$145k - $175k
...help you gain your full potential. Job Overview The Site Reliability Engineer supports deployments, cloud infrastructure, and monitoring... ...and infrastructure improvements. You'll be joining a small, senior SRE team with broad ownership of the platforms and infrastructure...SeniorFull timeTemporary workWork at officeLocal areaFlexible hours3 days per week$130k - $180k
...of both work styles in a workplace that is intentional about belonging, collaboration, and accomplishment. Being a Senior Site Reliability Engineer at iManage Means… You are an engineer, a builder, and a systems thinker. You’ll create middleware and platform guardrails...SeniorWork at officeLocal areaRemote workWorldwideMonday to FridayFlexible hours$106.28k - $145k
CCC Information Services in Chicago is looking for a Senior Site Reliability Engineer to enhance and support their multi-cloud solutions. This hybrid position offers a salary range of $106,277.25 to $145,000.00, and candidates should have over two years of experience in...Senior$160k - $200k
Ripple is seeking a Senior Site Reliability Engineer in Chicago. In this role, you will enhance platform reliability by embedding with engineering teams and coaching them on CI/CD practices, observability, and application security. Your expertise will help us redefine...Senior$160k - $200k
Ripple in Chicago is seeking a Senior Site Reliability Engineer to enhance product reliability and performance. In this role, you will engage with engineering teams to implement observability practices and optimize CI/CD pipelines, ensuring robust security. The position...Senior$160k - $200k
Ripple is seeking a Senior Software Engineer, Site Reliability in Chicago, Illinois. This role involves ensuring the reliability and availability of Ripple's products while mentoring engineering teams on best practices. The ideal candidate has over 5 years of experience...SeniorFlexible hours$127k - $249k
...The Team Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational... ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper)....SeniorWork at officeLocal areaRemote workWorldwideFlexible hours$140k - $205k
...Senior Technology Site Reliability Engineer Cooley is seeking a Senior Site Reliability Engineer to join the Infrastructure & Development Operationsteam. Position summary: The Senior Technology Site Reliability Engineer("SRE") is responsible for ensuring the reliability...SeniorFull timeTemporary workWork at officeFlexible hoursWeekend work$129k - $160k
...About the Company As a Senior Site Reliability Engineer (SRE) at TAG – The Aspen Group, you will be responsible for ensuring the reliability, performance, and scalability of our core systems. This role involves proactively building and managing, monitoring solutions...Senior$125.04k - $187.56k
...Delhaize USA company team includes just over 100 associates across all East Coast office locations. Primary Purpose The Site Reliability Engineer (SRE) III is responsible for ensuring the scalability, reliability, and performance of production systems through...SeniorFull timeWork at officeLocal areaRemote workFlexible hours$130k - $165k
...Job Title: Senior Software Engineer Company: Snapsheet Job Location: USA, Remote Job Type: Full-time, direct hire Job Department: Technology Team : Site Reliability Engineering About Snapsheet: Snapsheet exists to simplify claims. We leverage our expertise...SeniorFull timeTemporary workLocal areaRemote workVisa sponsorshipWork visaFlexible hours$127k - $249k
...Eastern or Central time zones. We are looking for an experienced Senior Engineer for our SRE, Atlas team to support, maintain and grow the... ...workloads. Role Overview We are seeking a talented Site Reliability Engineer (SRE) with a strong infrastructure background....SeniorLocal areaRemote workWorldwideFlexible hours- TransUnion is seeking a Staff Site Reliability Engineer to enhance reliability strategies and elevate engineering standards. This critical role involves driving major technical initiatives within a hybrid work environment, ensuring optimal platform performance and reliability...Senior
- ...Senior Site Reliability Engineer – Google Distributed Cloud Edge (Edge SRE) Location: Hybrid – Chicago, IL (preferred) Employment Type: W2, Contract to Hire, Direct Hire Overview Our client is seeking a highly skilled Edge Site Reliability Engineer (Edge SRE...SeniorContract work
- CME Chicago Mercantile Exchange Inc. is seeking a Site Reliability Engineer III to enhance stability for CME Clearing & Risk. In this role, you will ensure secure and reliable technology solutions, bridging development and operations while maintaining risk management services...Senior
- About the job We are looking for a senior site reliability engineer to join the Cloud FinOps team at Hopper. We manage a large infrastructure in Google Cloud that is used by hundreds of engineers to provide a first class experience to millions of end users around the world...SeniorRemote jobWork from homeSleeping nights
$165k - $225k
...enterprises to deploy demanding AI workloads with enterprise-grade reliability and compliance. Your Role: You will be instrumental in... ...expertise at its core. Working closely with our systems engineers, network engineers, and platform engineering team, you'll architect...SeniorRemote workFlexible hours- ...have partnered with our client in their search for a Senior SRE to work CST hours. Responsibilities Applies software engineering practices to IT operations tasks to maintain a scalable and reliable production environment for running software services...SeniorWork experience placementRemote work
$106k - $130k
...sponsorship. Overall Purpose To create and maintain the next generation of application infrastructure and to be responsible for reliability, automation and scalability using and the latest best practices. Essential Functions Implement software and tools to...SeniorHourly payWork experience placementWork at officeImmediate startVisa sponsorshipWork visaFlexible hours- ...Senior/Staff Site Reliability Engineer, Consumer Apps Chicago, IL; Redwood City, CA About Attain Built for consumers and companies, alike Klover's engineering team powers one of the fastest-growing fintech platforms in the U.S., supporting over one million...SeniorWork at officeImmediate startRemote work
- Hitachi Vantara Corporation is looking for a Site Reliability Engineer (SRE) to design and operate the enterprise observability stack, including Azure Monitor and Managed Grafana. This position requires extensive experience in SRE and cloud infrastructure, with a focus...Senior
$130k - $140k
GlobalLogic is seeking a Senior Infrastructure Engineer in Deer Park, IL, to design and operate the enterprise observability stack. The ideal candidate has 7+ years in SRE or cloud infrastructure engineering, deep expertise in Microsoft Azure, and strong skills in Infrastructure...Senior$111k - $188k
...drives our business. Our team is made up of talented software engineers, infrastructure engineers, leaders and UX professionals. We... ...centers, infrastructure, design and grit. The Role: Senior Site Reliability Engineer with extensive experience in automation and...SeniorTemporary workWork at officeImmediate startRemote work3 days per week$194k - $237k
...Principal Site Reliability Engineer At Early Warning, we've powered and protected the U.S. financial system for over thirty years with cutting... ...approaches and techniques. Be a thought leader: a senior point of expertise on site reliability engineering issues,...Hourly payWork at officeImmediate startVisa sponsorshipWork visaFlexible hours- ...Qualifications: 8+ years of software engineering experience, or equivalent demonstrated through... ...implement and maintain scalable and reliable infrastructure on Google Cloud Platform... ...vendor resources. Willingness to work on-site at stated location in the job opening....For contractorsWork experience placement
- ...Edward Jones Site Reliability Engineer 100% remote Initial contract is 6 months, but will be a multi year engagement. Position Overview: As a Senior Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance...Contract workRemote work
$130k - $140k
...platform automation using Logic Apps and Python. #LI-VK1 Requirements 7+ years of experience in SRE, platform engineering, or cloud infrastructure engineering in large-scale enterprise environments. Deep, hands-on expertise with Microsoft Azure (minimum...Temporary workWork experience placementWork from homeFlexible hours- ...Site Reliability Engineer in Wealth Management Chicago (IL) / Tempe (AZ) Onsite Job ROLE: This role will be Responsible for application observability, maintenance, and support, identifying and implementing preventive measures proactively, evaluates and makes...Flexible hours
$175k - $225k
...Site Reliability Engineer Chicago, IL or New York, NY Old Mission is a global proprietary trading firm that leverages state-of-the-art technology and research to identify and execute profitable trading strategies across multiple asset classes around the world. Our...Full timeWork at officeRemote workMonday to FridayFlexible hoursRotating shift$160k - $200k
...build the future of corporate treasury and the infrastructure that powers the Internet of Value. THE WORK: As a Senior Site Reliability Engineer you will be a force multiplier at the intersection of platform reliability and engineering excellence. You will be...Full timeWork at officeLocal area
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Site Reliability Engineer. Be the first to apply!
- site reliability engineer remote Chicago, IL
- site reliability engineer sre Chicago, IL
- site reliability engineer Chicago, IL
- senior cost analyst Chicago, IL
- senior process manager Chicago, IL
- senior development engineer Chicago, IL
- senior program specialist Chicago, IL
- senior commissioning manager Chicago, IL
- senior manager quality engineering Chicago, IL
- senior software test automation engineer Chicago, IL



