Manager Site Reliability Engineering
$51.9 per hourHighmark Health
Company :
Allegheny Health Network
Job Description :
GENERAL OVERVIEW:
This job is responsible for the reliability, availability, and performance of critical healthcare IT systems, principally in the Environment of Care (EOC), enabling seamless access to essential services for patients, providers, and the people we serve. Proactively identifies and mitigates potential disruptions to maintain the highest standards of care and operational efficiency. This role blends software engineering, clinical engineering, and security principles with a deep understanding of healthcare operations to minimize downtime, improve system resilience, and to support clinical workflows and continuity of hospital operations. Works cross-functionally with AHN site leaders and teams to navigate and to monitor and support building automation and facility systems, clinical engineering / IoT, healthcare delivery technology architecture, infrastructure and platform operations, and cybersecurity. Fosters a culture of automation, continuous improvement, collaboration, and patient safety. Develops core metrics for monitoring and maintaining system health for SRE practitioners (e.g., latency, traffic, errors, and saturation) leveraging industry practices, manufacturer guidance, and other service delivery metrics.
ESSENTIAL RESPONSIBILITIES
Perform management responsibilities to include, but are not limited to: involved in hiring and termination decisions, coaching and development, rewards and recognition, performance management and staff productivity.Plan, organize, staff, direct and control the day-to-day operations of the department; develop and implement policies and programs as necessary; may have budgetary responsibility and authority. (25%)
Oversees the partnership with clinical engineering, cybersecurity, device manufacturers, suppliers, and Information Technology SMEs to oversee and to implement strategies for managing, monitoring, and securing a diverse range of clinical devices and other technology equipment (e.g., IoT), ensuring compliance with HIPAA and other relevant regulations (e.g., FDA, TJC, PCI). Keeps current on healthcare IT trends, including AI, security patching, and best practices for device hardening. Oversees and assists with network segmentation and access controls to isolate and to protect clinical and other critical devices. Automates monitoring tasks to improve efficiency and reduce errors. Identifies and remediates vulnerabilities in clinical devices and related infrastructure. Manages and reports issues with assets, devices, integration services, and other equipment. Engages the appropriate parties to develop and deploy a fix/solution or oversees ownership of resolution actions. Utilizes observability practices to gain deep insights into system behavior, enabling faster identification and resolution of issues. (15%)
Oversees the SRE partnership with Clinical Engineering and Cybersecurity Engineering to troubleshoot technical issues related to medical equipment and systems. Participates in the medical device technology lifecycle – from product/device evaluation, discovery, to implementation, maintenance, and through retirement. Develops the framework and structure to maintain documentation related to the IT infrastructure supporting clinicaland other critical devices. Participates in the planning and oversees the execution of preventative maintenance activities. Provides direction and guidance to team members on how to analyze complex problems and develop effective solutions, how to troubleshoot system outages and performance issues, and how to work collaboratively with other IT, cybersecurity, facility, AI and application teams to resolve issues and to conduct root cause analyses. (15%)
Oversees the SRE partnership with facility leaders to optimize the performance and monitoring of building automation systems (BAS), including HVAC, lighting, fire suppression, security systems, etc. Manages processes and procedures used to monitor BAS performance metrics and proactively identifies potential issues. Works with facilities management to implement improvements to the BAS infrastructure. Works with cybersecurity, vendors/manufacturers, et. al. to ensure the security of building automation systems and oversees monitoring of performance, service delivery, and support. (15%)
Oversees the SRE partnership with IT teams including, but not limited to platform / product management, disaster recovery services, infrastructure and architecture, storage management, and release management. Participates in the planning and execution of downtime drills and system / device recovery exercises. Supports other emergency preparedness drills and exercises, as needed. Leads or participates in post-incident reviews to identify root causes and implement corrective actions. Works with cross-functional stakeholders to Implement and to maintain redundant systems and failover mechanisms to minimize downtime. Reviews and provides feedback on emergency operations plans and other materials which are used to respond to emergency situations (e.g., Continuity of Operations Plans, Incident Response Guides, Downtime Procedures). Manages team members who are supporting the planning and execution of system migrations, releases, and upgrades to ensure minimal disruption to clinical operations. Oversees detailed migration or installation plans, including risk assessments, rollback procedures, and communication strategies. Assists local site leaders with navigating shared services (e.g., AI, IT, Information Security, Clinical Engineering, Platform Operations, Technology Acquisition). (15%)
Establishes core metrics for monitoring and maintaining system health for SRE practitioners (e.g., latency, traffic, errors, and saturation). Manages the processes and procedures used for documentation and knowledge sharing including maintaining detailed documentation of systems, device inventories, processes, and procedures.Leads by example by sharing knowledge and best practices with other staff and cross-functional teams. Provides training and mentorship to junior or less experienced team members. Stays current with the latest technologies and trends in site reliability engineering. Leads or participates in briefings with cross-functional stakeholders to manage priorities and team assignments, support ticket queues, etc. (10%)
Other duties as assigned or requested. (5%)
Q UALIFICATIONS:
Required
Bachelor’s degree in Computer Science, Engineering, Management Information Systems, IT, or related field or relevant experience and/or education as determined by the company in lieu of bachelor's degree.
3 years with Management or leadership role
Preferred
Master's degree in Computer Science, Engineering, Management Information Systems, IT, or related field
5 years of experience with Site Reliability Engineering (SRE), Systems Administration, or DevOps particularly in healthcare IT
5 years of experience in Medical device management lifecycle, network / device segmentation, vulnerability and patch management
5 years of experience in Healthcare IT experience in architecture, automation, IoT, telemetry, telehealth, security, system development lifecycle, capacity planning, networking, continuous integration / continuous delivery pipelines (CI/CD), incident management, scripting, metrics, monitoring, redundancy, etc.
3 years of experience working in highly regulated environments
3 years of experience with Progressive leadership roles, preferably inclinical engineering, IT, business continuity, backup and storage management, building automation, or cybersecurity discipline in healthcare
SKILLS:
Problem-Solving: Excellent analytical and troubleshooting skills; High capacity to think analytically, interpret information / observations, apply judgment and to assist with making effective, strategic decisions.
Collaboration: Ability to work effectively in a team environment; demonstrated ability to support multiple sites and locations while maintaining consistency in service delivery processes and procedures.
Communication: Strong written and verbal communication skills.
Flexibility: Willingness to participate in activities or incidents which may occur outside of regular work schedules.
Leadership: Demonstrated resource and project planning capabilities, decision making skills, history of results-oriented delivery, and effective team building across multiple locations and a diverse team of staff, partners, and stakeholders.
Security Awareness: Understanding of security best practices and how to apply them in a healthcare IT environment.
Delivery and Execution: Demonstrated competency in the execution of multiple projects, including managing resources across multiple projects to meet goals.
Relationships: Strong relationship building skills and ability to influence with and without authority in a matrixed organization.
Disclaimer: The job description has been designed to indicate the general nature and essential duties and responsibilities of work performed by employees within this job title. It may not contain a comprehensive inventory of all duties, responsibilities, and qualifications required of employees to do this job.
Compliance Requirement : This job adheres to the ethical and legal standards and behavioral expectations as set forth in the code of business conduct and company policies.
As a component of job responsibilities, employees may have access to covered information, cardholder data, or other confidential customer information that must be protected at all times. In connection with this, all employees must comply with both the Health Insurance Portability Accountability Act of 1996 (HIPAA) as described in the Notice of Privacy Practices and Privacy Policies and Procedures as well as all data security guidelines established within the Company’s Handbook of Privacy Policies and Practices and Information Security Policy.
Furthermore, it is every employee’s responsibility to comply with the company’s Code of Business Conduct. This includes but is not limited to adherence to applicable federal and state laws, rules, and regulations as well as company policies and training requirements.
Pay Range Minimum:
$51.90
Pay Range Maximum:
$83.84
Base pay is determined by a variety of factors including a candidate’s qualifications, experience, and expected contributions, as well as internal peer equity, market, and business considerations. The displayed salary range does not reflect any geographic differential Highmark may apply for certain locations based upon comparative markets.
Highmark Health and its affiliates prohibit discrimination against qualified individuals based on their status as protected veterans or individuals with disabilities and prohibit discrimination against all individuals based on any category protected by applicable federal, state, or local law.
We endeavor to make this site accessible to any and all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process, please contact the email below.
For accommodation requests, please contact HR Services Online at View email address on click.appcast.io
California Consumer Privacy Act Employees, Contractors, and Applicants Notice
Req ID: J280531
$127.1k - $198.58k
...position contributes to team efforts in engineering, analytics, and technical planning, applying... ...leading scientific, engineering, and management expertise in a culture grounded in... ...visiting the Benefits ( page on our Careers ( site. Compensation at Noblis is determined...SuggestedPermanent employmentFull timeContract workPart timeLocal areaRemote work$80k - $148k
...all the tasks completed on a regular basis and adhere the change management policy. Position reports to Senior Manager - Mainframe... ...keys to SAG products o SAG products license management at DR site & bringing up ADABAS & Natural at DR site o Installation & upgrade...SuggestedFull timeTemporary workWork experience placementRemote workWork from homeFlexible hours$140k - $170k
...Services, Docker, Kubernetes, and familiar with Git development. The Engineer is expected to provide strategy and implement enterprise‑scale... ...maturity levels. AWS Security & Networking: Implement and manage IAM policies, permission sets, Security Groups, NACLs, and VPC...SuggestedFull timeLocal areaRemote work$120k - $135k
...: As a member of the Platform Engineering organization, you will be part of a team responsible for managing the large footprint of our application... ..., networking, and application reliability. As a Platform Network Engineer within our Site Reliability Engineering (SRE)...SuggestedImmediate start$105k - $141.75k
...porting application code across platforms. Familiarity with agile engineering practices like Test Automation, Test-Driven Development (TDD... ...(CI), Continuous Delivery (CD), DevOps, and Test Data Management, etc. Experience developing, deploying, and tuning data modernization...SuggestedRemote workWorldwide$80k
...and Responsibilities: Provide Tier‑3 engineering support for Microsoft 365 GCC, Exchange Online... ..., performance, and security. Manage, monitor, restore, and optimize enterprise... ...SharePoint Online platform operations, including site collections, permissions, integrations,...Contract work$94.1k - $150k
...The Platform Engineer (Ops Technology Lead) is responsible for designing, implementing, and... ...within the CASTLE-NET program, ensuring reliability, scalability, and security. This role supports application deployment and management, ensures compliance with CASTLE-NET policies...Contract workWork at office$197.4k - $232k
...Type: FullTime Location Type: Remote Department Engineering Compensation: $197.4K – $232K • Offers Equity At... ...environment. Make architecture and technical decisions that balance reliability, scalability, performance, and operability, and clearly...Full timeRemote work- ...with AI. What you will do We are looking for a mid-level engineer who will be responsible for delivering robust, performant and... ...with Teradata Advanced Development, Architects and Product Management to understand system requirements and test new platform infrastructure...Permanent employmentFlexible hours
$99.6k - $223.4k
...Job Description Job Title: Senior Software Engineer and CMTS - Exadata Location: In-office position in Redwood City, CA... ...intelligent flow control, and Ethernet-based RDMA performance and reliability. That makes this role especially relevant for engineers with...Temporary workWork at officeFlexible hours- ...enterprise platforms and IT Service Management (ITSM). This position... ...supporting the delivery of reliable, high-quality IT services.... ...ServiceNow platform usingApp Engine Studio Support implementation... ...impact every day across 100+ sites in the areas of Defense, Citizen...Full timeContract workPart timeInternshipLocal areaImmediate startFlexible hours
$163.9k - $235.55k
...work matters—and so do you. Director, Go-To-Market Product Engineering – Salesforce (M5 Level) The Director, GTM Engineering, is a... ...and culture, inspiring confidence, and strengthening the management team. • Foster strong cross-functional partnerships across...Local area$96.8k - $306.4k
...technical and business challenges. Oracle Kubernetes Engine (OKE) is OCI's managed Kubernetes service. OKE enables customers to create, run,... ...cluster lifecycle management, orchestration, scalability, reliability, performance, automation, observability, security, and integration...Temporary workRemote workFlexible hours$140k
...provides payment technology, education services, and learning management solutions to education and faith-based organizations, serving... ...people where they live, learn and work. The Senior Software Engineer designs, creates, maintains, audits and improves software applications...Temporary workLocal area$30 per hour
...unique opportunities for smart, hands-on engineers with the expertise and passion to solve... ...Being empowered with the flexibility, reliability, and scalability of Virtual Networking,... ...technical and non-technical audiences (management, peers) Understanding of agile...Hourly payTemporary workInternshipFlexible hours$118k - $178k
...As the world’s number 1 job site*, our mission is to help people... ...Day to Day As a Software Engineer III on the AI Gateway & Guardrails... ...decisions, drive service reliability through SLOs and operational... ...Collaborate with engineers, product managers, and governance partners...Work experience placementLocal area$100.32k
...Maximus is currently seeking a Software Engineer. In this role, you will provide expertise in the areas of managed file transfer and EDI X12 translations. In addition, they must configure, support and maintain environments and procedures for all supported applications...Remote work$79.2k - $209.5k
...Job Description We’re looking for highly skilled AI engineers to design and build high-scale, cloud-based data processing pipelines... ...solutions to rapidly prototype, test, iterate, and deliver reliable code. ~ Experience using the ChatGPT, Claude or similar models...Temporary workFlexible hours- ...Job Title: Senior Windows Engineer (Endpoint Management & Modern Workplace) Job Location: Durham, NC Overview We are seeking an experienced... ...or limited in your ability to use or access our career site as a result of your disability, you may request reasonable...Full time
$79.2k - $209.5k
...performance monitoring, troubleshooting, security remediation, compliance support, vendor coordination, and infrastructure lifecycle management activities. Responsibilities The position is responsible for the administration, support, and operational management of...Temporary workFlexible hours$103.71k - $138.28k
...and experience in system architecture and engineering disciplines. Specific technical... ...Supports due diligence activities including site surveys, design, design review, bill of... ...experience to include indexing, clustering, managing, and troubleshooting. •5+ years with automation...Full timeTemporary workRemote work- A leading financial services firm is seeking a Sr. Distinguished Machine Learning Engineer to define and drive the technical strategy for personalized product experiences. The ideal candidate has extensive experience in machine learning, data-intensive solutions, and leading...Remote work
$104.5k - $234.6k
...operational goals, sharing results with manager upon completion. Adheres to and improves... ...; provides guidance and coaching to engineers to drive improvements. Utilizes advanced... ...availability, health, support, and reliability. Core Responsibilities Planning &...Temporary workFlexible hoursShift work$238.7k - $365.7k
...The Role The Vehicle Experiences Engine software team is a dynamic and fast paced... ...requirements such as scalability, maintainability, reliability, extensibility, usability, and security.... ...presentations to senior and executive management. Stays updated on new technology and...Local areaRemote workWork from homeRelocation package- ...The Systems Administrator, Senior manages and optimizes complex enterprise infrastructure spanning Windows and Linux servers, VMware environments, and cloud services that support mission-critical government systems. The role leads major changes, automation, and incident...Contract workWork at office
- ...more at . Overview of Job Function: As a Senior Software Engineer, you will take deep technical ownership of significant product... ...work on software that directly impacts how enterprise customers manage workforce performance and customer interactions at scale....Contract workLocal areaShift work
$79.2k - $209.5k
...complex business and technology challenges. Engineers at OCI have deep technical ownership and... ...used by OCI services to persist and manage critical control-plane metadata. It is a... ...to simplify how OCI service teams build reliable control planes by abstracting database complexity...Temporary workFlexible hours$99.6k - $234.6k
...Edge Security team as a Principal Software Engineer focused on building and scaling Oracle... ...advanced traffic inspection and policy management capabilities across OCI’s global infrastructure... ...to deliver secure, performant, and reliable services while helping define the long-...Temporary workFlexible hours$79.2k - $209.5k
...unencumbered and will need your contribution to make it a premier engineering center with the focus on excellence. Health Data Intelligence... ...work closely with multi-functional teams, including product management, submissions operations teams, and other software development...Temporary workImmediate startFlexible hours$180k - $220k
...creating transformative change in healthcare. Senior Software Engineer The Role As a Senior Software Engineer, you will lead... ...initiatives that advance Datavant's platform scalability and reliability. You'll drive technical design, coach peers, and ensure system...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Manager Site Reliability Engineering. Be the first to apply!
- on-site clinical research associate (traveling/remote) Dover, DE
- junior website developer Dover, DE
- site safety Dover, DE
- site reliability engineering manager
- site reliability engineer remote
- lead site reliability engineer
- site reliability engineer sre
- site reliability engineer
- junior site reliability engineer
- site investigation


