Manager Site Reliability Engineering
$51.9 per hourHighmark Health
Company :
Allegheny Health Network
Job Description :
GENERAL OVERVIEW:
This job is responsible for the reliability, availability, and performance of critical healthcare IT systems, principally in the Environment of Care (EOC), enabling seamless access to essential services for patients, providers, and the people we serve. Proactively identifies and mitigates potential disruptions to maintain the highest standards of care and operational efficiency. This role blends software engineering, clinical engineering, and security principles with a deep understanding of healthcare operations to minimize downtime, improve system resilience, and to support clinical workflows and continuity of hospital operations. Works cross-functionally with AHN site leaders and teams to navigate and to monitor and support building automation and facility systems, clinical engineering / IoT, healthcare delivery technology architecture, infrastructure and platform operations, and cybersecurity. Fosters a culture of automation, continuous improvement, collaboration, and patient safety. Develops core metrics for monitoring and maintaining system health for SRE practitioners (e.g., latency, traffic, errors, and saturation) leveraging industry practices, manufacturer guidance, and other service delivery metrics.
ESSENTIAL RESPONSIBILITIES
Perform management responsibilities to include, but are not limited to: involved in hiring and termination decisions, coaching and development, rewards and recognition, performance management and staff productivity.Plan, organize, staff, direct and control the day-to-day operations of the department; develop and implement policies and programs as necessary; may have budgetary responsibility and authority. (25%)
Oversees the partnership with clinical engineering, cybersecurity, device manufacturers, suppliers, and Information Technology SMEs to oversee and to implement strategies for managing, monitoring, and securing a diverse range of clinical devices and other technology equipment (e.g., IoT), ensuring compliance with HIPAA and other relevant regulations (e.g., FDA, TJC, PCI). Keeps current on healthcare IT trends, including AI, security patching, and best practices for device hardening. Oversees and assists with network segmentation and access controls to isolate and to protect clinical and other critical devices. Automates monitoring tasks to improve efficiency and reduce errors. Identifies and remediates vulnerabilities in clinical devices and related infrastructure. Manages and reports issues with assets, devices, integration services, and other equipment. Engages the appropriate parties to develop and deploy a fix/solution or oversees ownership of resolution actions. Utilizes observability practices to gain deep insights into system behavior, enabling faster identification and resolution of issues. (15%)
Oversees the SRE partnership with Clinical Engineering and Cybersecurity Engineering to troubleshoot technical issues related to medical equipment and systems. Participates in the medical device technology lifecycle – from product/device evaluation, discovery, to implementation, maintenance, and through retirement. Develops the framework and structure to maintain documentation related to the IT infrastructure supporting clinicaland other critical devices. Participates in the planning and oversees the execution of preventative maintenance activities. Provides direction and guidance to team members on how to analyze complex problems and develop effective solutions, how to troubleshoot system outages and performance issues, and how to work collaboratively with other IT, cybersecurity, facility, AI and application teams to resolve issues and to conduct root cause analyses. (15%)
Oversees the SRE partnership with facility leaders to optimize the performance and monitoring of building automation systems (BAS), including HVAC, lighting, fire suppression, security systems, etc. Manages processes and procedures used to monitor BAS performance metrics and proactively identifies potential issues. Works with facilities management to implement improvements to the BAS infrastructure. Works with cybersecurity, vendors/manufacturers, et. al. to ensure the security of building automation systems and oversees monitoring of performance, service delivery, and support. (15%)
Oversees the SRE partnership with IT teams including, but not limited to platform / product management, disaster recovery services, infrastructure and architecture, storage management, and release management. Participates in the planning and execution of downtime drills and system / device recovery exercises. Supports other emergency preparedness drills and exercises, as needed. Leads or participates in post-incident reviews to identify root causes and implement corrective actions. Works with cross-functional stakeholders to Implement and to maintain redundant systems and failover mechanisms to minimize downtime. Reviews and provides feedback on emergency operations plans and other materials which are used to respond to emergency situations (e.g., Continuity of Operations Plans, Incident Response Guides, Downtime Procedures). Manages team members who are supporting the planning and execution of system migrations, releases, and upgrades to ensure minimal disruption to clinical operations. Oversees detailed migration or installation plans, including risk assessments, rollback procedures, and communication strategies. Assists local site leaders with navigating shared services (e.g., AI, IT, Information Security, Clinical Engineering, Platform Operations, Technology Acquisition). (15%)
Establishes core metrics for monitoring and maintaining system health for SRE practitioners (e.g., latency, traffic, errors, and saturation). Manages the processes and procedures used for documentation and knowledge sharing including maintaining detailed documentation of systems, device inventories, processes, and procedures.Leads by example by sharing knowledge and best practices with other staff and cross-functional teams. Provides training and mentorship to junior or less experienced team members. Stays current with the latest technologies and trends in site reliability engineering. Leads or participates in briefings with cross-functional stakeholders to manage priorities and team assignments, support ticket queues, etc. (10%)
Other duties as assigned or requested. (5%)
Q UALIFICATIONS:
Required
Bachelor’s degree in Computer Science, Engineering, Management Information Systems, IT, or related field or relevant experience and/or education as determined by the company in lieu of bachelor's degree.
3 years with Management or leadership role
Preferred
Master's degree in Computer Science, Engineering, Management Information Systems, IT, or related field
5 years of experience with Site Reliability Engineering (SRE), Systems Administration, or DevOps particularly in healthcare IT
5 years of experience in Medical device management lifecycle, network / device segmentation, vulnerability and patch management
5 years of experience in Healthcare IT experience in architecture, automation, IoT, telemetry, telehealth, security, system development lifecycle, capacity planning, networking, continuous integration / continuous delivery pipelines (CI/CD), incident management, scripting, metrics, monitoring, redundancy, etc.
3 years of experience working in highly regulated environments
3 years of experience with Progressive leadership roles, preferably inclinical engineering, IT, business continuity, backup and storage management, building automation, or cybersecurity discipline in healthcare
SKILLS:
Problem-Solving: Excellent analytical and troubleshooting skills; High capacity to think analytically, interpret information / observations, apply judgment and to assist with making effective, strategic decisions.
Collaboration: Ability to work effectively in a team environment; demonstrated ability to support multiple sites and locations while maintaining consistency in service delivery processes and procedures.
Communication: Strong written and verbal communication skills.
Flexibility: Willingness to participate in activities or incidents which may occur outside of regular work schedules.
Leadership: Demonstrated resource and project planning capabilities, decision making skills, history of results-oriented delivery, and effective team building across multiple locations and a diverse team of staff, partners, and stakeholders.
Security Awareness: Understanding of security best practices and how to apply them in a healthcare IT environment.
Delivery and Execution: Demonstrated competency in the execution of multiple projects, including managing resources across multiple projects to meet goals.
Relationships: Strong relationship building skills and ability to influence with and without authority in a matrixed organization.
Disclaimer: The job description has been designed to indicate the general nature and essential duties and responsibilities of work performed by employees within this job title. It may not contain a comprehensive inventory of all duties, responsibilities, and qualifications required of employees to do this job.
Compliance Requirement : This job adheres to the ethical and legal standards and behavioral expectations as set forth in the code of business conduct and company policies.
As a component of job responsibilities, employees may have access to covered information, cardholder data, or other confidential customer information that must be protected at all times. In connection with this, all employees must comply with both the Health Insurance Portability Accountability Act of 1996 (HIPAA) as described in the Notice of Privacy Practices and Privacy Policies and Procedures as well as all data security guidelines established within the Company’s Handbook of Privacy Policies and Practices and Information Security Policy.
Furthermore, it is every employee’s responsibility to comply with the company’s Code of Business Conduct. This includes but is not limited to adherence to applicable federal and state laws, rules, and regulations as well as company policies and training requirements.
Pay Range Minimum:
$51.90
Pay Range Maximum:
$83.84
Base pay is determined by a variety of factors including a candidate’s qualifications, experience, and expected contributions, as well as internal peer equity, market, and business considerations. The displayed salary range does not reflect any geographic differential Highmark may apply for certain locations based upon comparative markets.
Highmark Health and its affiliates prohibit discrimination against qualified individuals based on their status as protected veterans or individuals with disabilities and prohibit discrimination against all individuals based on any category protected by applicable federal, state, or local law.
We endeavor to make this site accessible to any and all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process, please contact the email below.
For accommodation requests, please contact HR Services Online at View email address on click.appcast.io
California Consumer Privacy Act Employees, Contractors, and Applicants Notice
Req ID: J280531
- ...Role: Site Reliability Engineer (SRE) Location: Brentwood, TN (Onsite) Contract Experience: 6-8+ years Role Description: Combines software engineering and IT operations to ensure the reliability, scalability, and performance of systems, with...SuggestedContract work
- ...Job Title: Site Reliability Engineer - Compute Focus Duration: 6 Months to Hire Location: On-Site 2-3 days/week in Nashville, TN Job... ...processes. Key Responsibilities: Focus on the management and optimization of compute infrastructure. Develop...Suggested2 days per week3 days per week
$84.9k - $209.5k
...architects infrastructure and service to ensure reliability and functionality. Forecasts demands and... ...and maintains advanced knowledge of site reliability trends. Responsibilities... ...Capacity Ingestion and Management: - Designs and architects infrastructure...SuggestedTemporary workImmediate startFlexible hoursShift work- Site Reliability Engineer II About the Role This role focuses on enhancing system reliability and scalability for PROS’s platform, contributing... ...party applications to the cloud and contribute to release management documentation. Gain an understanding of application...Suggested
- Designs and architects infrastructure and service to ensure reliability and functionality. Forecasts demands and responds to capacity needs... ...new tools and develops and maintains advanced knowledge of site reliability trends. Only Oracle brings together the data, infrastructure...SuggestedFull timeFlexible hours
$124k - $280k
...Requirements: Up to 80% At PwC, our people in data and analytics engineering focus on leveraging advanced technologies and techniques to... ...data storage solutions using cloud services Designing and managing data warehouses and data lakes Implementing IAM roles and...Full timeH1b$73.5k - $212.28k
...Requirements: Up to 60% At PwC, our people in data and analytics engineering focus on leveraging advanced technologies and techniques to... ...for coaching, leveraging team member's unique strengths, and managing performance to deliver on client expectations. With your...Full timeH1b$142.6k - $261.5k
...team of product leaders, data scientists, designers, and software engineers enable our clients to solve their most complex product... ...align with business requirements. Your key responsibilities As a Manager in Application Design and Development, you will lead the effective...Summer holidayFlexible hours- Teradata Corporation (SE) in Nashville, TN seeks qualified candidates for a devops role focused on site reliability engineering. Responsibilities include designing and implementing software solutions, leading chaos engineering efforts, and leveraging AI technologies to...
- The Metropolitan Nashville Airport Authority seeks a Project Manager, Design to oversee engineering and architectural design for capital projects. This role ensures that projects meet quality control and are completed on schedule and within budget. Ideal candidates will...
- Release Train Engineer (Remote w Travel) Our History: From our start in 2009, Conexess has established itself in 3 markets, employing nearly... ...package. Responsibilities: Works closely with Product Managers to understand priorities and works closely with scrum masters to...Remote jobFull timeContract workWork experience placement
- ...Senior SharePoint & Power Platform Engineer Consultant (Manager) - Migrations & Modern Intranet EY advises... ...define information architecture, hub/site strategy, navigation, search approach... ..., practical recommendations, and reliable delivery. Communicating effectively...
$142.6k - $261.5k
Ernst & Young Oman is seeking a Technology Business Analysis Manager to bridge business needs and technical solutions in Nashville, Tennessee. Responsibilities include managing SAP Project Systems, driving continuous improvement, and ensuring alignment with business objectives...$140k - $210k
...Job Description Salary: $140,000 - $210,000 Senior/Staff Site Reliability Engineer, Platforms Location: Nashville, TN (Hybrid, 3 days in... ...team. This is a high-impact role on a small, capable team (1 Manager + 2 ICs) where you'll have significant ownership over the infrastructure...Work at officeLocal area$46.92k
...development so they can reach their full potential. Responsibilities include: Providing daily supervision and mentorship Managing household routines and student schedules Administering medications and ensuring student wellness Driving students to...Full timeWork from homeRelocationRelocation packageFlexible hoursWeekday work$85k - $148k
...optimizing all operating systems assigned. This position reports to the Manager of the z/VM, z/Linux team and is responsible for assisting in... ...most of the time so if you are not required to be on a client site, you can choose to work from home or in our Ensono offices....Full timeTemporary workRemote workWork from homeFlexible hours$80k - $148k
...all the tasks completed on a regular basis and adhere the change management policy. Position reports to Senior Manager - Mainframe... ...keys to SAG products o SAG products license management at DR site & bringing up ADABAS & Natural at DR site o Installation & upgrade...Full timeTemporary workWork experience placementRemote workWork from homeFlexible hours- ...Stringfellow Technology Group is hiring an IT Systems Engineer to join our Professional Services team. The majority of your time will be spent leading IT project delivery for our managed services clients, including Azure deployments, Microsoft 365 tenant builds, and cloud...Local area
$140k - $170k
...Services, Docker, Kubernetes, and familiar with Git development. The Engineer is expected to provide strategy and implement enterprise‑scale... ...maturity levels. AWS Security & Networking: Implement and manage IAM policies, permission sets, Security Groups, NACLs, and VPC...Full timeLocal areaRemote work- ...the enterprise. What you will do We are looking for a mid-level engineer who will be responsible for delivering robust, performant and... ...with Teradata Advanced Development, Architects and Product Management to understand system requirements and test new platform infrastructure...Permanent employmentFlexible hours
- ...looking for a highly skilled Sr. Platform Engineer with extensive experience in... ...Responsibilities Design, implement, and manage virtualization platforms, with a focus on... ...cross-functional teams to ensure platform reliability, scalability, and performance. Maintain...
- ...A cloud analytics company in Nashville is seeking a mid-level engineer to deliver robust infrastructure for cloud and on-premise platforms. The ideal candidate will have experience with public cloud computing, strong Linux skills, and a relevant degree with significant...Flexible hours
$120k - $135k
...: As a member of the Platform Engineering organization, you will be part of a team responsible for managing the large footprint of our application... ..., networking, and application reliability. As a Platform Network Engineer within our Site Reliability Engineering (SRE)...Immediate start- ...TekWissen is a global management consulting, technological service and outsourcing company delivering technology-driven business solutions... ...our numerous clients. Job Description Role: Cloud Solution Engineer Duration: 12+ Months Location: Nashville TN Pay rate: Can be Discussed...
- Schneider Electric has an exciting opportunity for an Engineering Software Product Manager to drive the lifecycle of EcoSet Design software for the NAM market. You will engage with stakeholders to define the product vision and roadmap, monitor performance, and work within...
- Hitachi Automotive Systems Americas, Inc. is seeking a Technical Sales Expert - Bushings to lead the technical sales efforts and manage product homologation for transformer components in the Tennessee region. This role requires strong technical knowledge of transformer...Remote job
- LATICRETE International is seeking a Technical Support Manager in Tennessee to oversee daily operations of technical service representatives in the construction products industry. Ideal candidates will have at least 7 years of construction industry experience, including...
$58k - $68k
ASM Research, An Accenture Federal Services Company, located in Nashville, TN, is seeking a candidate to manage the analysis, development, and delivery of training and communication materials. This role requires a Bachelor’s degree and 3-5 years of relevant experience...$70k - $80k
Company Overview At Motorola Solutions, we believe that everything starts with our people. We're a global close-knit community, united by the relentless pursuit to help keep people safer everywhere. We build and connect technologies to help protect people, property and...Relocation- Accenture is seeking motivated consultants and managers in Nashville to lead treasury transformation initiatives, support modernization projects, and enhance decision-making processes. The ideal candidates have strong backgrounds in treasury operations, technology, and...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Manager Site Reliability Engineering. Be the first to apply!
- on-site clinical research associate (traveling/remote) Nashville, TN
- junior website developer Nashville, TN
- IT site lead Nashville, TN
- site leader Nashville, TN
- site safety Nashville, TN
- site recruiter Nashville, TN
- on site coordinator Nashville, TN
- site services specialist Nashville, TN
- website coordinator Nashville, TN
- website content developer Nashville, TN


