Site Reliability Engineer – Digital Technology Job Description

Site Reliability Engineer – Digital Technology Job Description Template

Our company is looking for a Site Reliability Engineer – Digital Technology to join our team.

Responsibilities:

  • Participate in the 24×7 support coverage as needed;
  • Troubleshoot priority incidents, facilitate blameless post-mortems;
  • Design self-healing and resiliency patterns;
  • Identify application patterns and analytics in support of better service level objectives;
  • Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents;
  • Work with development teams throughout the software life cycle ensuring sustainable software releases;
  • Design automated software and product upgrades, change management, and release management solutions;
  • Design, code, test and deliver software to automate manual operational work;
  • Build and drive adoption for greater self-healing and resiliency patterns;
  • Develop, Test and debug automated tasks (Apps, Systems, Infrastructure);
  • Perform analytics on previous incidents and usage patterns to better predict issues and take proactive actions;
  • Lead and participate in performance tests; identify bottlenecks, opportunities for optimization, and capacity demands;
  • Coach or manage teams as applicable.

Requirements:

  • Bachelor’s degree or equivalent experience in an software engineering discipline;
  • Proficiency in service-level changes to a system and troubleshooting components;
  • Adept in the development of automated tools, systems, and services in multiple technology domains.