Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Site Reliability Engineer

SDI International

No H1 or C2C. Must be Permanent Resident or US Citizen

Senior Site Reliability Engineer

Description and Requirements

About Our Team

We are building Quantum , a next‑generation hybrid AI platform that spans Windows, Android, and cloud. As part of this vision, we are expanding the reliability engineering organization that powers cross‑device Personal AI.

We are looking for Senior Site Reliability Engineers (SREs) to help us build and evolve the foundational reliability, observability, and operations capabilities that ensure fast, safe, and dependable for millions of users.

This role may support one of several teams within the SRE organization (e.g., Observability, Operations, or Service Reliability), depending on your strengths and interests.

Operating with the speed, ownership, and creative latitude of a startup —yet supported by the scale, resources, and technical depth. We are building new systems, new tooling, and new operational models from the ground up, and we are doing so with clarity, intention, and high engineering standards.

Location: Open to remote work in the US. The preferred work location is Chicago, IL.

What You Might Work On

As a Senior SRE, you may be responsible for a subset of the following, depending on team placement and skill alignment:

Reliability & Performance Engineering

  • Improving the availability, scalability, and performance of distributed systems across device, edge, and cloud.
  • Defining or refining SLIs, SLOs, and error budgets for critical services.
  • Leading initiatives to remove single points of failure, improve resilience, and reduce operational risk.

Operational Excellence

  • Participating in on‑call rotations and contributing to incident response, triage, and post-incident reviews.
  • Developing automation, runbooks, and self‑healing systems to reduce alert noise and MTTR.
  • Enhancing operational readiness and supporting incident prevention programs.

Observability & Insight

  • Designing or improving observability systems using OpenTelemetry , Grafana , and modern signal pipelines.
  • Building dashboards, analytics, and alerting that illuminate system health and AI service behavior.
  • Ensuring telemetry is reliable, actionable, and tied to real‑world outcomes.

Deployments & Change Safety

  • Improving reliability of CI/CD workflows, including phased rollouts, canaries, shadow testing, and safe rollback mechanisms.
  • Contributing to the evolution of deployment tooling for device+edge+cloud hybrid systems.

Systems Design & Collaboration

  • Influencing architectural decisions by injecting reliability, observability, and operational considerations early in design.
  • Collaborating with AI/ML engineers, platform engineers, firmware teams, and product partners to deliver robust, dependable user experiences.

Basic Qualifications

  • 10+ years of experience in Site Reliability Engineering, Production Engineering, DevOps, or large‑scale distributed systems operations
  • Bachelor’s Degree in Computer Science, Engineering, or a related technical discipline
  • Strong experience running production distributed systems at scale
  • Proficiency in at least one modern programming language (e.g., Python, Go, Java, C++)
  • Strong understanding of Linux systems , networking fundamentals, and system performance tuning
  • Experience with monitoring/observability (metrics, logs, tracing)
  • Hands‑on experience with cloud environments (Azure, AWS, or GCP)
  • Experience in incident management, on‑call rotations, and postmortem processes

Preferred Qualifications

  • Deep experience with Azure cloud services
  • Experience with OpenTelemetry for end‑to‑end instrumentation
  • Strong familiarity with Grafana , Prometheus, Loki, Tempo, or similar tools
  • Experience supporting AI/ML systems , model serving, or data‑intensive workloads
  • Background with hybrid architectures (device + edge + cloud)
  • Experience improving deployment reliability and progressive delivery systems
  • Passion for automation, reliability engineering, and reducing operational friction

What Success Looks Like

  • Systems become more observable, reliable, and predictable.
  • Incidents are resolved quickly, and follow‑up improvements prevent recurrence.
  • Alerting becomes more accurate, actionable, and trusted.
  • Deployments become safer and more consistent.
  • Teams move faster because reliability foundations are strong and intuitive.
Vacancy posted 9 hours ago
Similar jobs that could be interesting for youBased on the Senior Site Reliability Engineer in Chicago, IL vacancy
  • $145k - $175k

     ...help you gain your full potential. Job Overview The Site Reliability Engineer supports deployments, cloud infrastructure, and monitoring...  ...and infrastructure improvements. You'll be joining a small, senior SRE team with broad ownership of the platforms and infrastructure... 
    Senior
    Full time
    Temporary work
    Work at office
    Local area
    Flexible hours
    3 days per week

    Rewards Network

    Chicago, IL
    6 days ago
  • $130k - $180k

     ...of both work styles in a workplace that is intentional about belonging, collaboration, and accomplishment. Being a Senior Site Reliability Engineer at iManage Means…  You are an engineer, a builder, and a systems thinker. You’ll create middleware and platform guardrails... 
    Senior
    Work at office
    Local area
    Remote work
    Worldwide
    Monday to Friday
    Flexible hours

    iManage

    Chicago, IL
    8 days ago
  • $106.28k - $145k

    CCC Information Services in Chicago is looking for a Senior Site Reliability Engineer to enhance and support their multi-cloud solutions. This hybrid position offers a salary range of $106,277.25 to $145,000.00, and candidates should have over two years of experience in... 
    Senior

    CCC Information Services

    Chicago, IL
    1 day ago
  • $160k - $200k

    Ripple is seeking a Senior Site Reliability Engineer in Chicago. In this role, you will enhance platform reliability by embedding with engineering teams and coaching them on CI/CD practices, observability, and application security. Your expertise will help us redefine... 
    Senior

    jobr.pro

    Chicago, IL
    5 days ago
  • $160k - $200k

    Ripple in Chicago is seeking a Senior Site Reliability Engineer to enhance product reliability and performance. In this role, you will engage with engineering teams to implement observability practices and optimize CI/CD pipelines, ensuring robust security. The position... 
    Senior

    Ripple

    Chicago, IL
    3 days ago
  • $160k - $200k

    Ripple is seeking a Senior Software Engineer, Site Reliability in Chicago, Illinois. This role involves ensuring the reliability and availability of Ripple's products while mentoring engineering teams on best practices. The ideal candidate has over 5 years of experience... 
    Senior
    Flexible hours

    Ripple

    Chicago, IL
    5 days ago
  • $127k - $249k

     ...The Team Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational...  ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper).... 
    Senior
    Work at office
    Local area
    Remote work
    Worldwide
    Flexible hours

    MongoDB

    Chicago, IL
    5 days ago
  • $140k - $205k

     ...Senior Technology Site Reliability Engineer Cooley is seeking a Senior Site Reliability Engineer to join the Infrastructure & Development Operationsteam. Position summary: The Senior Technology Site Reliability Engineer("SRE") is responsible for ensuring the reliability... 
    Senior
    Full time
    Temporary work
    Work at office
    Flexible hours
    Weekend work

    Cooley

    Chicago, IL
    9 days ago
  • $129k - $160k

     ...About the Company As a Senior Site Reliability Engineer (SRE) at TAG – The Aspen Group, you will be responsible for ensuring the reliability, performance, and scalability of our core systems. This role involves proactively building and managing, monitoring solutions... 
    Senior

    TAG - The Aspen Group

    Chicago, IL
    3 days ago
  • $125.04k - $187.56k

     ...Delhaize USA company team includes just over 100 associates across all East Coast office locations. Primary Purpose The Site Reliability Engineer (SRE) III is responsible for ensuring the scalability, reliability, and performance of production systems through... 
    Senior
    Full time
    Work at office
    Local area
    Remote work
    Flexible hours

    Peapod Digital Labs

    Chicago, IL
    4 days ago
  • $130k - $165k

     ...Job Title: Senior Software Engineer Company: Snapsheet Job Location: USA, Remote Job Type: Full-time, direct hire Job Department: Technology  Team : Site Reliability Engineering About Snapsheet: Snapsheet exists to simplify claims. We leverage our expertise... 
    Senior
    Full time
    Temporary work
    Local area
    Remote work
    Visa sponsorship
    Work visa
    Flexible hours

    Snapsheet

    Chicago, IL
    3 days ago
  • $127k - $249k

     ...Eastern or Central time zones. We are looking for an experienced Senior Engineer for our SRE, Atlas team to support, maintain and grow the...  ...workloads. Role Overview We are seeking a talented Site Reliability Engineer (SRE) with a strong infrastructure background.... 
    Senior
    Local area
    Remote work
    Worldwide
    Flexible hours

    MongoDB

    Chicago, IL
    2 days ago
  • TransUnion is seeking a Staff Site Reliability Engineer to enhance reliability strategies and elevate engineering standards. This critical role involves driving major technical initiatives within a hybrid work environment, ensuring optimal platform performance and reliability... 
    Senior

    TransUnion

    Chicago, IL
    4 days ago
  •  ...Senior Site Reliability Engineer – Google Distributed Cloud Edge (Edge SRE) Location: Hybrid – Chicago, IL (preferred) Employment Type: W2, Contract to Hire, Direct Hire Overview Our client is seeking a highly skilled Edge Site Reliability Engineer (Edge SRE... 
    Senior
    Contract work

    CoSourcing Partners - Enterprise-AI and IT Services Company

    Chicago, IL
    4 days ago
  • CME Chicago Mercantile Exchange Inc. is seeking a Site Reliability Engineer III to enhance stability for CME Clearing & Risk. In this role, you will ensure secure and reliable technology solutions, bridging development and operations while maintaining risk management services... 
    Senior

    CME Chicago Mercantile Exchange Inc.

    Chicago, IL
    5 days ago
  • About the job We are looking for a senior site reliability engineer to join the Cloud FinOps team at Hopper. We manage a large infrastructure in Google Cloud that is used by hundreds of engineers to provide a first class experience to millions of end users around the world... 
    Senior
    Remote job
    Work from home
    Sleeping nights

    Hopper

    Chicago, IL
    5 days ago
  • $165k - $225k

     ...enterprises to deploy demanding AI workloads with enterprise-grade reliability and compliance. Your Role: You will be instrumental in...  ...expertise at its core. Working closely with our systems engineers, network engineers, and platform engineering team, you'll architect... 
    Senior
    Remote work
    Flexible hours

    Moonlite

    Chicago, IL
    24 days ago
  •  ...have partnered with our client in their search for a Senior SRE to work CST hours. Responsibilities Applies software engineering practices to IT operations tasks to maintain a scalable and reliable production environment for running software services... 
    Senior
    Work experience placement
    Remote work

    Korn Ferry

    Chicago, IL
    5 days ago
  • $106k - $130k

     ...sponsorship. Overall Purpose To create and maintain the next generation of application infrastructure and to be responsible for reliability, automation and scalability using and the latest best practices. Essential Functions Implement software and tools to... 
    Senior
    Hourly pay
    Work experience placement
    Work at office
    Immediate start
    Visa sponsorship
    Work visa
    Flexible hours

    Early Warning Services

    Chicago, IL
    3 days ago
  •  ...Senior/Staff Site Reliability Engineer, Consumer Apps Chicago, IL; Redwood City, CA About Attain Built for consumers and companies, alike Klover's engineering team powers one of the fastest-growing fintech platforms in the U.S., supporting over one million... 
    Senior
    Work at office
    Immediate start
    Remote work

    Attain

    Chicago, IL
    3 days ago
  • Hitachi Vantara Corporation is looking for a Site Reliability Engineer (SRE) to design and operate the enterprise observability stack, including Azure Monitor and Managed Grafana. This position requires extensive experience in SRE and cloud infrastructure, with a focus... 
    Senior

    Hitachi Vantara Corporation

    Chicago, IL
    3 days ago
  • $130k - $140k

    GlobalLogic is seeking a Senior Infrastructure Engineer in Deer Park, IL, to design and operate the enterprise observability stack. The ideal candidate has 7+ years in SRE or cloud infrastructure engineering, deep expertise in Microsoft Azure, and strong skills in Infrastructure... 
    Senior

    GlobalLogic

    Chicago, IL
    4 days ago
  • $111k - $188k

     ...drives our business. Our team is made up of talented software engineers, infrastructure engineers, leaders and UX professionals. We...  ...centers, infrastructure, design and grit. The Role: Senior Site Reliability Engineer with extensive experience in automation and... 
    Senior
    Temporary work
    Work at office
    Immediate start
    Remote work
    3 days per week

    Eskilstuna-Kuriren

    Chicago, IL
    more than 2 months ago
  • $194k - $237k

     ...Principal Site Reliability Engineer At Early Warning, we've powered and protected the U.S. financial system for over thirty years with cutting...  ...approaches and techniques. Be a thought leader: a senior point of expertise on site reliability engineering issues,... 
    Hourly pay
    Work at office
    Immediate start
    Visa sponsorship
    Work visa
    Flexible hours

    Early Warning Services

    Chicago, IL
    5 days ago
  •  ...Qualifications: 8+ years of software engineering experience, or equivalent demonstrated through...  ...implement and maintain scalable and reliable infrastructure on Google Cloud Platform...  ...vendor resources. Willingness to work on-site at stated location in the job opening.... 
    For contractors
    Work experience placement

    Cedent Life Talent

    Chicago, IL
    4 days ago
  •  ...Edward Jones Site Reliability Engineer 100% remote Initial contract is 6 months, but will be a multi year engagement. Position Overview: As a Senior Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance... 
    Contract work
    Remote work

    HCL Global Systems

    Chicago, IL
    2 days ago
  • $130k - $140k

     ...platform automation using Logic Apps and Python. #LI-VK1 Requirements 7+ years of experience in SRE, platform engineering, or cloud infrastructure engineering in large-scale enterprise environments. Deep, hands-on expertise with Microsoft Azure (minimum... 
    Temporary work
    Work experience placement
    Work from home
    Flexible hours

    GlobalLogic

    Chicago, IL
    1 day ago
  •  ...Site Reliability Engineer in Wealth Management Chicago (IL) / Tempe (AZ) Onsite Job ROLE: This role will be Responsible for application observability, maintenance, and support, identifying and implementing preventive measures proactively, evaluates and makes... 
    Flexible hours

    Info Way Solutions

    Chicago, IL
    4 days ago
  • $175k - $225k

     ...Site Reliability Engineer Chicago, IL or New York, NY Old Mission is a global proprietary trading firm that leverages state-of-the-art technology and research to identify and execute profitable trading strategies across multiple asset classes around the world. Our... 
    Full time
    Work at office
    Remote work
    Monday to Friday
    Flexible hours
    Rotating shift

    Old Mission Capital

    Chicago, IL
    2 days ago
  • $160k - $200k

     ...build the future of corporate treasury and the infrastructure that powers the Internet of Value. THE WORK: As a Senior Site Reliability Engineer you will be a force multiplier at the intersection of platform reliability and engineering excellence. You will be... 
    Full time
    Work at office
    Local area

    Ripple

    Chicago, IL
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Site Reliability Engineer. Be the first to apply!