Senior Site Reliability Engineer

SDI International

No H1 or C2C. Must be Permanent Resident or US Citizen

Description and Requirements

About Our Team

We are building Quantum, a next‑generation hybrid AI platform that spans Windows, Android, and cloud. As part of this vision, we are expanding the reliability engineering organization that powers cross‑device Personal AI.

We are looking for Senior Site Reliability Engineers (SREs) to help us build and evolve the foundational reliability, observability, and operations capabilities that ensure fast, safe, and dependable experiences for millions of users.

This role may support one of several teams within the SRE organization (e.g., Observability, Operations, or Service Reliability), depending on your strengths and interests.

We operate with the speed, ownership, and creative latitude of a startup, yet we are supported by scale, resources, and technical depth. We are building new systems, new tooling, and new operational models from the ground up, and we are doing so with clarity, intention, and high engineering standards.

Location: Open to remote work in the US. The preferred work location is Chicago, IL.

What You Might Work On

As a Senior SRE, you may be responsible for a subset of the following, depending on team placement and skill alignment:

Reliability & Performance Engineering

  • Improving the availability, scalability, and performance of distributed systems across device, edge, and cloud.
  • Defining or refining SLIs, SLOs, and error budgets for critical services.
  • Leading initiatives to remove single points of failure, improve resilience, and reduce operational risk.

Operational Excellence

  • Participating in on‑call rotations and contributing to incident response, triage, and post-incident reviews.
  • Developing automation, runbooks, and self‑healing systems to reduce alert noise and MTTR.
  • Enhancing operational readiness and supporting incident prevention programs.

Observability & Insight

  • Designing or improving observability systems using OpenTelemetry, Grafana, and modern signal pipelines.
  • Building dashboards, analytics, and alerting that illuminate system health and AI service behavior.
  • Ensuring telemetry is reliable, actionable, and tied to real‑world outcomes.

Deployments & Change Safety

  • Improving reliability of CI/CD workflows, including phased rollouts, canaries, shadow testing, and safe rollback mechanisms.
  • Contributing to the evolution of deployment tooling for device+edge+cloud hybrid systems.

Systems Design & Collaboration

  • Influencing architectural decisions by injecting reliability, observability, and operational considerations early in design.
  • Collaborating with AI/ML engineers, platform engineers, firmware teams, and product partners to deliver robust, dependable user experiences.

Basic Qualifications

  • 10+ years of experience in Site Reliability Engineering, Production Engineering, DevOps, or large‑scale distributed systems operations
  • Bachelor’s Degree in Computer Science, Engineering, or a related technical discipline
  • Strong experience running production distributed systems at scale
  • Proficiency in at least one modern programming language (e.g., Python, Go, Java, C++)
  • Strong understanding of Linux systems, networking fundamentals, and system performance tuning
  • Experience with monitoring/observability (metrics, logs, tracing)
  • Hands‑on experience with cloud environments (Azure, AWS, or GCP)
  • Experience in incident management, on‑call rotations, and postmortem processes

Preferred Qualifications

  • Deep experience with Azure cloud services
  • Experience with OpenTelemetry for end‑to‑end instrumentation
  • Strong familiarity with Grafana, Prometheus, Loki, Tempo, or similar tools
  • Experience supporting AI/ML systems, model serving, or data‑intensive workloads
  • Background with hybrid architectures (device + edge + cloud)
  • Experience improving deployment reliability and progressive delivery systems
  • Passion for automation, reliability engineering, and reducing operational friction

What Success Looks Like

  • Systems become more observable, reliable, and predictable.
  • Incidents are resolved quickly, and follow‑up improvements prevent recurrence.
  • Alerting becomes more accurate, actionable, and trusted.
  • Deployments become safer and more consistent.
  • Teams move faster because reliability foundations are strong and intuitive.
Vacancy posted 1 day ago