Site Reliability Engineer – Digital Technology Job Description Template
Our company is looking for a Site Reliability Engineer – Digital Technology to join our team.
Responsibilities:
- Participate in 24×7 support coverage as needed;
- Design self-healing and resiliency patterns;
- Identify application patterns and analytics that support better service-level objectives;
- Troubleshoot priority incidents, facilitate blameless post-mortems, and ensure permanent closure of incidents;
- Work with development teams throughout the software life cycle to ensure sustainable software releases;
- Design automated software and product upgrades, change management, and release management solutions;
- Design, code, test, and deliver software to automate manual operational work;
- Build self-healing and resiliency patterns and drive their adoption;
- Develop, test, and debug automated tasks (apps, systems, infrastructure);
- Perform analytics on previous incidents and usage patterns to better predict issues and take proactive actions;
- Lead and participate in performance tests; identify bottlenecks, opportunities for optimization, and capacity demands;
- Coach or manage teams as applicable.
Requirements:
- Bachelor’s degree or equivalent experience in a software engineering discipline;
- Proficiency in making service-level changes to a system and troubleshooting its components;
- Adept at developing automated tools, systems, and services in multiple technology domains.