Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Site Reliability Engineer - Disaster Recovery & Business Continuity

$130k - $150k
Full-time

Charles River Associates

About Charles River Associates For over 50 years, Charles River Associates has been a premier consulting firm that offers employees a place to learn from a diverse group of consultants, industry experts, and academics. At CRA you will be exposed to leading minds who use economic, financial, and business analysis to solve complex world problems for an impressive roster of clients, including major law firms, Fortune 100 companies, and government agencies. Through a collegial environment, formal and informal training opportunities, and a broad array of professional development resources, your experience at CRA will open doors for you throughout your career. The Information Technology (ITS) department at Charles River Associates is currently a team of more than 40 professionals dedicated to enhancing, maintaining, and developing the firm's technology infrastructure and security. The team is comprised of four functions:

  • Service Delivery & Telecom
  • Enterprise Application Solutions
  • Infrastructure, Networking and Cloud Solutions
  • Information Security
Information Technology staff are based in the Boston, Chicago, London, Munich, New York, Oakland, San Francisco, College Station and Washington, DC offices. Mainly a Microsoft house, CRA is looking to maximize the performance of our on-premise systems and hybrid infrastructure, meaning experience with cloud technologies is essential for this role. Position Overview The Site Reliability Engineer (SRE) helps ensure CRA’s critical business services are reliable, scalable, and performant across on-premises and cloud environments. This role blends software engineering and operations practices to reduce manual toil through automation, improve service observability, and strengthen incident response. The SRE partners closely with infrastructure, security, application, and service delivery teams to define measurable reliability targets (SLIs/SLOs), implement resilient architectures, and drive continuous improvement through blameless post-incident learning. Key Responsibilities * Hands-on System Engineering experience with core enterprise infrastructure platforms and services, including Windows Server, VMware vSphere, VMware Site Recovery Manager (SRM), SAN technologies, and the Rubrik ecosystem, with the ability to understand dependencies, recovery workflows, and failure modes across on-premises and cloud environments * Service Ownership & Reliability Targets: Partner with service owners to define and maintain service level indicators (SLIs) and service level objectives (SLOs) for availability, latency, and performance; track error budgets and reliability risk. * Observability: Implement and continuously improve monitoring, logging, alerting, and dashboards to provide actionable, symptom-based signals and reduce mean time to detect/respond (MTTD/MTTR). * Blameless Postmortems & Continuous Improvement: Facilitate post-incident reviews, identify root causes and contributing factors, and drive remediation items to completion; standardize learnings into runbooks and operational practices. * DR Testing Program Build-Out: Design and launch a scalable DR testing program (scope, test types, cadence, success criteria, and evidence capture) in partnership with application, infrastructure, and security teams; maintain runbooks and lead regular tabletop and technical recovery exercises to validate RTO/RPO assumptions and improve recoverability. * DR Readiness: Contribute to reliability architecture and disaster recovery readiness for key services, including dependency mapping, recovery testing inputs, and validation of recovery procedures. * Cross-Functional Collaboration: Work day-to-day with infrastructure, network, cloud, security, and application teams to improve operational excellence, reliability culture, and shared ownership of production outcomes. Relevant Skills & Experience * Experience operating and improving reliability of production services (on-prem and/or cloud), including incident response, operational readiness, and service ownership * Working knowledge of SRE concepts and practices such as SLIs/SLOs, error budgets, monitoring/alerting strategy, and blameless postmortems * Experience with observability tooling and practices (logs, metrics, tracing, dashboards) and using data to drive reliability and performance improvements * Experience with disaster recovery orchestration and recovery testing using VMware Site Recovery Manager (SRM) and Azure Site Recovery (ASR) (or similar public cloud DR services) * Proven experience building and operating a DR testing program, including dependency mapping, test planning, coordination across stakeholders, execution of tabletop and technical failover tests, documentation of results, and tracking remediation actions to closure * Strong cross-functional communication and teamwork skills; comfortable partnering with engineering, security, and operations teams to drive shared outcomes * Ability to document and standardize operational procedures (runbooks), participate in on-call rotations, and manage multiple priorities in a fast-moving environment Career Growth and Benefits * CRA’s robust skills development programs [ including a commitment to offering 100 hours of training annually through formal and informal programs, encourage you to thrive as an individual and team member. Beginning with research and analysis skill building, training continues with technical training, presentation skills, internal seminars, and career mentoring and performance coaching from an assigned senior colleague. Additional leadership and collaboration opportunities exist through internal firm development activities. * We offer a comprehensive total rewards program including a superior benefits package, wellness programming [ to support physical, mental, emotional and financial well-being, and in-house immigration support [ for foreign nationals and international business travelers. Work Location Flexibility CRA creates a work environment that enables our colleagues to benefit from being together in the office to best deliver on our promise of career growth, mentorship and inclusivity. At the same time, we recognize that individuals realize a range of benefits when working from home periodically. We currently expect that individuals spend at least 3 to 4 days a week working in the office (which may include traveling to another CRA office or to client meetings), with specific days determined in coordination with your practice or team. Our Commitment to Equal Employment Opportunity Charles River Associates is an equal opportunity employer (EOE). All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, age, disability, status as a protected veteran, or any other protected characteristic under applicable law. Salary and other compensation A good-faith estimate of the annual base salary range for this position is $130,000 - $150,000. Stating pay within this range may vary based on factors such as education level, experience, skills, geographic location, market conditions, and other qualifications of the successful candidate. This position may be eligible for additional bonus incentive compensation. CRA offers a comprehensive benefits package, subject to eligibility requirements, which may include: medical, dental, and vision insurance; 401(k) retirement plan with employer match; life and disability insurance; paid time off (vacation, sick leave, holidays); paid parental leave; wellness programs and employee assistance resources; and commuter benefits.

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Site Reliability Engineer - Disaster Recovery & Business Continuity in Chicago, IL vacancy
  • $80k - $148k

     ...reorganizations, creating and maintaining disaster recovery procedures and overall support of...  ...SAG products license management at DR site & bringing up ADABAS & Natural at DR site...  ...you can do it. We are a client-facing business, but we do encourage clients to allow us... 
    Suggested
    Full time
    Temporary work
    Work experience placement
    Remote work
    Work from home
    Flexible hours

    Ensono

    Chicago, IL
    5 days ago
  •  ...our clients to achieve key business outcomes that reshape how our...  ...clients to keep up with continuous change and embrace innovation...  ...our purpose: Honesty, Reliability, Curiosity, Collaboration,...  ...team of technical Mainframe Disaster Recovery professionals supporting a... 
    Suggested
    Temporary work
    Work experience placement
    Work at office
    Remote work
    Flexible hours

    Ensono

    Chicago, IL
    3 days ago
  • $85k - $148k

     ...system backup/restore, tape encryption, disaster recovery planning Support Incident and Change...  ...system modifications with IT and business unit managers Support multiple annual...  ...you are not required to be on a client site, you can choose to work from home or in... 
    Suggested
    Full time
    Temporary work
    Remote work
    Work from home
    Flexible hours

    Ensono

    Chicago, IL
    4 days ago
  • $102k - $148k

     ...clients to achieve key business outcomes that...  ...clients to keep up with continuous change and embrace...  ...: Honesty, Reliability, Curiosity,...  ...multiple different Recovery Facilities. ~ Participate in Disaster Recovery Tests....  ...to be on a client site, you can choose to... 
    Suggested
    Full time
    Temporary work
    Remote work
    Work from home
    Flexible hours

    Ensono

    Chicago, IL
    6 hours ago
  • $103k - $159k

     ...and accomplishment. Being a Site Reliability Engineer at iManage Means… You are...  ...orchestration, observability, and disaster readiness into our products....  ...the full potential of their business content and communications.   We are continuously innovating to solve the most... 
    Suggested
    Work at office
    Local area
    Remote work
    Worldwide
    Monday to Friday
    Flexible hours

    iManage

    Chicago, IL
    2 days ago
  •  ...No H1 or C2C. Must be Permanent Resident or US Citizen Senior Site Reliability Engineer Description and Requirements About Our Team We are building Quantum , a next‑generation hybrid AI platform that spans Windows, Android, and cloud. As part of this vision... 
    Permanent employment
    Remote work

    SDI International

    Chicago, IL
    4 days ago
  • $130k - $165k

     ...Job Title: Senior Software Engineer Company: Snapsheet Job Location: USA, Remote...  ...Job Department: Technology Team: Site Reliability Engineering About Snapsheet...  ...service engineering teams to ensure smooth, continued delivery of our service to clients Work... 
    Full time
    Temporary work
    Local area
    Remote work
    Visa sponsorship
    Work visa
    Flexible hours

    Snapsheet

    Chicago, IL
    6 days ago
  •  ...offer strategic business and transactional...  ...& Infrastructure Engineer is responsible for...  ...operational and continuity requirements. Essential...  .... Monitoring & Reliability: Expert...  ...business continuity/disaster recovery (BC/DR) guidelines...  ...availability sets, Azure Site Recovery, Key... 
    Work at office
    Remote work
    Flexible hours
    Night shift
    Weekend work
    Afternoon shift

    Hinshaw & Culbertson

    Chicago, IL
    6 days ago
  •  ...Senior Kafka Platform Engineer (Automation & Kubernetes) Chicago...  ...-managed offerings, ensure reliability and performance, and partner...  ..., rebalancing, replication, recovery). Strong Kubernetes...  ...architectures, cluster linking strategies, and disaster recovery drills.... 

    Benton Partners

    Chicago, IL
    3 days ago
  • $139.23k - $163.8k

     ...Software Engineer Be a part of transformational change where integrity matters, success...  ...data platform supporting multiple business domains (Finance, HR, Supply Chain)...  ...architecture, governance, CI/CD automation, and disaster recovery Build and maintain multi‑account... 
    Temporary work
    Work experience placement
    Local area
    Remote work
    3 days per week

    U.S. Bancorp

    Chicago, IL
    1 day ago
  •  ...looking for a Senior Java Developer to join a cross-functional team. The primary goal of this contract will be to improve our disaster recovery capabilities. Position Responsibilities: The candidate will be part of a development team working with iOS, Android, Web, other... 
    Full time
    Contract work

    Software Technology Inc

    Chicago, IL
    3 days ago
  •  ...Azure Infrastructure/Platform Terraform Engineer Location: Chicago, IL or Remote...  ...Azure SQL Database • Experience in Azure Site Recovery (ASR) for disaster recovery solutions, ensuring high availability and business continuity during migration. • Azure Associate level... 
    Remote work

    Staffing the Universe

    Chicago, IL
    1 day ago
  • $55 - $60 per hour

     ...hands-on experience with IBM BPM / IBM Business Automation Workflow (BAW)...  ...experience in IBM BPM / BAW platform engineering and administration. Experience managing...  ...Experience with high availability, disaster recovery, and backup strategies. Experience... 

    Cynet Systems

    Chicago, IL
    4 days ago
  •  ...a hands-on Senior Manager, IT & Engineering to build and lead Adoreal’s IT function...  ...the organization  ~ Ensure business continuity through disaster recovery planning and regular testing...  ...secure SDLC, and production system reliability  ~ Experience with human resources... 
    Full time
    For contractors
    Local area
    Remote work
    Flexible hours

    Adoreal

    Chicago, IL
    a month ago
  •  ...IBM BPM Platform Engineer / Administrator We are seeking an experienced IBM BPM Platform...  ...optimize enterprise-grade IBM BPM / Business Automation Workflow (BAW) environments....  .... Knowledge of high availability, disaster recovery, backup, and platform resiliency concepts... 

    eTeam

    Chicago, IL
    4 days ago
  •  ...provide them with a Systems / Network Engineer (Remote). Please review the below description...  ...system performance, availability and reliability. Implement and maintain appropriate...  ...security. Manage system backup / disaster recovery procedures; participate in disaster... 
    Immediate start
    Remote work

    Allnessjobs

    Chicago, IL
    6 days ago
  • $165k - $225k

     ...enterprises to deploy demanding AI workloads with enterprise-grade reliability and compliance. Your Role: You will be instrumental in...  ...expertise at its core. Working closely with our systems engineers, network engineers, and platform engineering team, you'll architect... 
    Remote work
    Flexible hours

    Moonlite

    Chicago, IL
    12 days ago
  • $160k - $200k

     ...the next stage of our journey as we continue to grow. Job Description The...  ...looking for an experienced Manager II, Site Reliability Engineering to join our team. In this  role, you...  ...to enhance them Lean into our business domain and needs as well as our company... 
    Full time
    Temporary work
    Local area
    Immediate start
    Remote work
    Shift work

    Flywire

    Chicago, IL
    2 days ago
  •  ...Role: Sr. DevOps Engineer Long Term Location: Either Chicago OR Houston...  ...working collaboratively across IT, business, and third-party suppliers from around...  ...technical architecture Designed robust disaster recovery (DR) AWS Warm Standby strategy Mandatory... 
    Flexible hours

    Futran Tech Solutions Pvt. Ltd.

    Chicago, IL
    2 days ago
  •  ...experienced Principal Engineer to join the...  ...engineering quality and reliability through strong...  ...the Line of Business. Key Responsibilities...  ...and mean time to recovery. Provide...  ...gates, and continuous improvement of test...  ...Define and validate disaster recovery (DR) and... 
    Full time
    Immediate start

    Ritchie Bros.

    Westchester, IL
    3 days ago
  •  ...Overview: Cyber Recovery Engineer Location: Chicago, IL Work Model: Hybrid (onsite 3 days per week) Long Term Contract...  ...healthcare, utilities, government). ~ Hands-on exposure to disaster recovery or backup operations, including DR testing, backup... 
    Long term contract
    3 days per week

    Stellar IT Group

    Chicago, IL
    2 days ago
  •  ...We're seeking a Lead Database Engineer to join our Data Security Team . This...  ...technical solutions to support evolving business and security needs. Conduct 360-degree...  ...growth; ensure high availability and disaster recovery readiness. Collaborate with... 
    Contract work

    Equiliem

    Chicago, IL
    2 days ago
  • $214.5k

     ...Management Platform team sits within Platform Engineering and is responsible for the systems that...  ...connects design and engineering with business users; enabling scalable, multi-brand...  ...systems while maintaining business continuity Familiarity with modern approaches to... 
    Local area
    Flexible hours

    Expedia Group

    Chicago, IL
    4 days ago
  •  ...Responsible for translating business/technical...  ...software and DevSecOps engineers. Ensures that the...  ...opportunities and drive continuous improvement in cloud...  ...Availability, Scalability, Reliability, Recoverability,...  ...Testing, Operations, Disaster Recovery Strong leadership, team... 
    Work experience placement

    Samprasoft

    Chicago, IL
    4 days ago
  •  ...support for the SAP • applications in the Pricing area by resolving • change requests and malfunction/incident tickets. • Discuss business requirements with the Business Process Owners and Key Users and provide satisfying SAP system solution with the aim to... 

    3B Staffing LLC

    Chicago, IL
    1 day ago
  • $82.08k - $193.44k

     ...effective cloud architectures tailored to business needs. Your role Develop cloud...  ...leveraging GCP services such as Compute Engine, BigQuery, Cloud Storage, Pub/Sub,...  ...Implement solutions for high availability, disaster recovery, and auto-scaling. Leverage tools like... 
    Permanent employment
    Full time
    Contract work
    Local area

    Capgemini

    Chicago, IL
    3 days ago
  • $70.35k - $205.8k

     ...Practice, you'll be delivering major SAP engagements (for example, Business Transformation Strategy & Roadmaps, migrations to SAP S/4HANA,...  ...the recruiting process are not a guarantee of future or continued accommodations once hired. If you would like to be considered... 
    Work experience placement
    Live in
    Work at office
    Local area

    Accenture

    Chicago, IL
    5 days ago
  •  ...Senior Lead Network Engineer II Employment Type: Full Time...  ...Engineering, Computer Science, Business, Information systems or a related...  ...Experience utilizing (RF) site survey tools such as Ekahau,...  ...3 years experience with disaster recovery plan creation / implementation... 
    Full time
    Local area
    Remote work
    Monday to Friday
    Flexible hours

    Contact Government Services LLC

    Chicago, IL
    3 days ago
  •  ...Cloud and Storage Engineer Employment Type: Full-Time, Experienced CGS is seeking...  ...SAN: performance, capacity, replication, disaster recovery, backup disk storage, and backup &...  ...Bachelor's in computer science, business, or other relevant discipline. Eight... 
    Full time
    Work experience placement
    Flexible hours

    Contact Government Services LLC

    Chicago, IL
    2 days ago
  •  ...upgrade migration approach and options aligned to customer business needs Support Data Volume Management processes DVM...  ...Provide expertise on SAP Monitoring Patching Backup Recovery High Availability Disaster Recovery approaches and solutions for application health... 

    InterSources

    Chicago, IL
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer - Disaster Recovery & Business Continuity. Be the first to apply!