Manager Site Reliability Engineering
$51.90 per hour
Highmark Health
Company:
Allegheny Health Network
Job Description:
GENERAL OVERVIEW:
This job is responsible for the reliability, availability, and performance of critical healthcare IT systems, principally in the Environment of Care (EOC), enabling seamless access to essential services for patients, providers, and the people we serve. Proactively identifies and mitigates potential disruptions to maintain the highest standards of care and operational efficiency. This role blends software engineering, clinical engineering, and security principles with a deep understanding of healthcare operations to minimize downtime, improve system resilience, and support clinical workflows and continuity of hospital operations. Works cross-functionally with AHN site leaders and teams to navigate, monitor, and support building automation and facility systems, clinical engineering / IoT, healthcare delivery technology architecture, infrastructure and platform operations, and cybersecurity. Fosters a culture of automation, continuous improvement, collaboration, and patient safety. Develops core metrics for monitoring and maintaining system health for SRE practitioners (e.g., latency, traffic, errors, and saturation), leveraging industry practices, manufacturer guidance, and other service delivery metrics.
ESSENTIAL RESPONSIBILITIES
Performs management responsibilities including, but not limited to: involvement in hiring and termination decisions, coaching and development, rewards and recognition, performance management, and staff productivity. Plans, organizes, staffs, directs, and controls the day-to-day operations of the department; develops and implements policies and programs as necessary; may have budgetary responsibility and authority. (25%)
Oversees the partnership with clinical engineering, cybersecurity, device manufacturers, suppliers, and Information Technology SMEs to oversee and to implement strategies for managing, monitoring, and securing a diverse range of clinical devices and other technology equipment (e.g., IoT), ensuring compliance with HIPAA and other relevant regulations (e.g., FDA, TJC, PCI). Keeps current on healthcare IT trends, including AI, security patching, and best practices for device hardening. Oversees and assists with network segmentation and access controls to isolate and to protect clinical and other critical devices. Automates monitoring tasks to improve efficiency and reduce errors. Identifies and remediates vulnerabilities in clinical devices and related infrastructure. Manages and reports issues with assets, devices, integration services, and other equipment. Engages the appropriate parties to develop and deploy a fix/solution or oversees ownership of resolution actions. Utilizes observability practices to gain deep insights into system behavior, enabling faster identification and resolution of issues. (15%)
Oversees the SRE partnership with Clinical Engineering and Cybersecurity Engineering to troubleshoot technical issues related to medical equipment and systems. Participates in the medical device technology lifecycle, from product/device evaluation and discovery through implementation, maintenance, and retirement. Develops the framework and structure to maintain documentation related to the IT infrastructure supporting clinical and other critical devices. Participates in the planning and oversees the execution of preventative maintenance activities. Provides direction and guidance to team members on how to analyze complex problems and develop effective solutions, how to troubleshoot system outages and performance issues, and how to work collaboratively with other IT, cybersecurity, facility, AI, and application teams to resolve issues and to conduct root cause analyses. (15%)
Oversees the SRE partnership with facility leaders to optimize the performance and monitoring of building automation systems (BAS), including HVAC, lighting, fire suppression, security systems, etc. Manages processes and procedures used to monitor BAS performance metrics and proactively identifies potential issues. Works with facilities management to implement improvements to the BAS infrastructure. Works with cybersecurity, vendors/manufacturers, et al. to ensure the security of building automation systems and oversees monitoring of performance, service delivery, and support. (15%)
Oversees the SRE partnership with IT teams including, but not limited to, platform / product management, disaster recovery services, infrastructure and architecture, storage management, and release management. Participates in the planning and execution of downtime drills and system / device recovery exercises. Supports other emergency preparedness drills and exercises, as needed. Leads or participates in post-incident reviews to identify root causes and implement corrective actions. Works with cross-functional stakeholders to implement and maintain redundant systems and failover mechanisms to minimize downtime. Reviews and provides feedback on emergency operations plans and other materials used to respond to emergency situations (e.g., Continuity of Operations Plans, Incident Response Guides, Downtime Procedures). Manages team members who support the planning and execution of system migrations, releases, and upgrades to ensure minimal disruption to clinical operations. Oversees detailed migration or installation plans, including risk assessments, rollback procedures, and communication strategies. Assists local site leaders with navigating shared services (e.g., AI, IT, Information Security, Clinical Engineering, Platform Operations, Technology Acquisition). (15%)
Establishes core metrics for monitoring and maintaining system health for SRE practitioners (e.g., latency, traffic, errors, and saturation). Manages the processes and procedures used for documentation and knowledge sharing, including maintaining detailed documentation of systems, device inventories, processes, and procedures. Leads by example by sharing knowledge and best practices with other staff and cross-functional teams. Provides training and mentorship to junior or less experienced team members. Stays current with the latest technologies and trends in site reliability engineering. Leads or participates in briefings with cross-functional stakeholders to manage priorities and team assignments, support ticket queues, etc. (10%)
Other duties as assigned or requested. (5%)
QUALIFICATIONS:
Required
Bachelor’s degree in Computer Science, Engineering, Management Information Systems, IT, or related field or relevant experience and/or education as determined by the company in lieu of bachelor's degree.
3 years in a management or leadership role
Preferred
Master's degree in Computer Science, Engineering, Management Information Systems, IT, or related field
5 years of experience with Site Reliability Engineering (SRE), Systems Administration, or DevOps, particularly in healthcare IT
5 years of experience with the medical device management lifecycle, network / device segmentation, and vulnerability and patch management
5 years of healthcare IT experience in architecture, automation, IoT, telemetry, telehealth, security, system development lifecycle, capacity planning, networking, continuous integration / continuous delivery (CI/CD) pipelines, incident management, scripting, metrics, monitoring, redundancy, etc.
3 years of experience working in highly regulated environments
3 years of experience in progressive leadership roles, preferably in a clinical engineering, IT, business continuity, backup and storage management, building automation, or cybersecurity discipline in healthcare
SKILLS:
Problem-Solving: Excellent analytical and troubleshooting skills; high capacity to think analytically, interpret information / observations, apply judgment, and assist with making effective, strategic decisions.
Collaboration: Ability to work effectively in a team environment; demonstrated ability to support multiple sites and locations while maintaining consistency in service delivery processes and procedures.
Communication: Strong written and verbal communication skills.
Flexibility: Willingness to participate in activities or incidents which may occur outside of regular work schedules.
Leadership: Demonstrated resource and project planning capabilities, decision making skills, history of results-oriented delivery, and effective team building across multiple locations and a diverse team of staff, partners, and stakeholders.
Security Awareness: Understanding of security best practices and how to apply them in a healthcare IT environment.
Delivery and Execution: Demonstrated competency in the execution of multiple projects, including managing resources across multiple projects to meet goals.
Relationships: Strong relationship building skills and ability to influence with and without authority in a matrixed organization.
Disclaimer: The job description has been designed to indicate the general nature and essential duties and responsibilities of work performed by employees within this job title. It may not contain a comprehensive inventory of all duties, responsibilities, and qualifications required of employees to do this job.
Compliance Requirement: This job adheres to the ethical and legal standards and behavioral expectations as set forth in the code of business conduct and company policies.
As a component of job responsibilities, employees may have access to covered information, cardholder data, or other confidential customer information that must be protected at all times. In connection with this, all employees must comply with both the Health Insurance Portability and Accountability Act of 1996 (HIPAA) as described in the Notice of Privacy Practices and Privacy Policies and Procedures as well as all data security guidelines established within the Company’s Handbook of Privacy Policies and Practices and Information Security Policy.
Furthermore, it is every employee’s responsibility to comply with the company’s Code of Business Conduct. This includes but is not limited to adherence to applicable federal and state laws, rules, and regulations as well as company policies and training requirements.
Pay Range Minimum: $51.90
Pay Range Maximum: $83.84
Base pay is determined by a variety of factors including a candidate’s qualifications, experience, and expected contributions, as well as internal peer equity, market, and business considerations. The displayed salary range does not reflect any geographic differential Highmark may apply for certain locations based upon comparative markets.
Highmark Health and its affiliates prohibit discrimination against qualified individuals based on their status as protected veterans or individuals with disabilities and prohibit discrimination against all individuals based on any category protected by applicable federal, state, or local law.
We endeavor to make this site accessible to any and all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process, please contact the email below.
For accommodation requests, please contact HR Services Online.
California Consumer Privacy Act Employees, Contractors, and Applicants Notice
Req ID: J280531


