Manager Site Reliability Engineering
Highmark Health
Job Title
This job is responsible for the reliability, availability, and performance of critical healthcare IT systems, principally in the Environment of Care (EOC), enabling seamless access to essential services for patients, providers, and the people we serve. Proactively identifies and mitigates potential disruptions to maintain the highest standards of care and operational efficiency. This role blends software engineering, clinical engineering, and security principles with a deep understanding of healthcare operations to minimize downtime, improve system resilience, and to support clinical workflows and continuity of hospital operations. Works cross-functional with AHN site leaders and teams to navigate and to monitor and support building automation and facility systems, clinical engineering / IoT, healthcare delivery technology architecture, infrastructure and platform operations, and cybersecurity. Fosters a culture of automation, continuous improvement, collaboration, and patient safety. Develops core metrics for monitoring and maintaining system health for SRE practitioners (e.g., latency, traffic, errors, and saturation) leveraging industry practices, manufacturer guidance, and other service delivery metrics.
Essential Responsibilities
- Perform management responsibilities to include, but are not limited to: involved in hiring and termination decisions, coaching and development, rewards and recognition, performance management and staff productivity. Plan, organize, staff, direct and control the day-to-day operations of the department; develop and implement policies and programs as necessary; may have budgetary responsibility and authority. (25%)
- Oversees the partnership with clinical engineering, cybersecurity, device manufacturers, suppliers, and Information Technology SMEs to oversee and to implement strategies for managing, monitoring, and securing a diverse range of clinical devices and other technology equipment (e.g., IoT), ensuring compliance with HIPAA and other relevant regulations (e.g., FDA, TJC, PCI). Keeps current on healthcare IT trends, including AI, security patching, and best practices for device hardening. Oversees and assists with network segmentation and access controls to isolate and to protect clinical and other critical devices. Automates monitoring tasks to improve efficiency and reduce errors. Identifies and remediates vulnerabilities in clinical devices and related infrastructure. Manages and reports issues with assets, devices, integration services, and other equipment. Engages the appropriate parties to develop and deploy a fix/solution or oversees ownership of resolution actions. Utilizes observability practices to gain deep insights into system behavior, enabling faster identification and resolution of issues. (15%)
- Oversees the SRE partnership with Clinical Engineering and Cybersecurity Engineering to troubleshoot technical issues related to medical equipment and systems. Participates in the medical device technology lifecycle – from product/device evaluation, discovery, to implementation, maintenance, and through retirement. Develops the framework and structure to maintain documentation related to the IT infrastructure supporting clinical and other critical devices. Participates in the planning and oversees the execution of preventative maintenance activities. Provides direction and guidance to team members on how to analyze complex problems and develop effective solutions, how to troubleshoot system outages and performance issues, and how to work collaboratively with other IT, cybersecurity, facility, AI and application teams to resolve issues and to conduct root cause analyses. (15%)
- Oversees the SRE partnership with facility leaders to optimize the performance and monitoring of building automation systems (BAS), including HVAC, lighting, fire suppression, security systems, etc. Manages processes and procedures used to monitor BAS performance metrics and proactively identifies potential issues. Works with facilities management to implement improvements to the BAS infrastructure. Works with cybersecurity, vendors/manufacturers, et. al. to ensure the security of building automation systems and oversees monitoring of performance, service delivery, and support. (15%)
- Oversees the SRE partnership with IT teams including, but not limited to platform / product management, disaster recovery services, infrastructure and architecture, storage management, and release management. Participates in the planning and execution of downtime drills and system / device recovery exercises. Supports other emergency preparedness drills and exercises, as needed. Leads or participates in post-incident reviews to identify root causes and implement corrective actions. Works with cross-functional stakeholders to implement and to maintain redundant systems and failover mechanisms to minimize downtime. Reviews and provides feedback on emergency operations plans and other materials which are used to respond to emergency situations (e.g., Continuity of Operations Plans, Incident Response Guides, Downtime Procedures). Manages team members who are supporting the planning and execution of system migrations, releases, and upgrades to ensure minimal disruption to clinical operations. Oversees detailed migration or installation plans, including risk assessments, rollback procedures, and communication strategies. Assists local site leaders with navigating shared services (e.g., AI, IT, Information Security, Clinical Engineering, Platform Operations, Technology Acquisition). (15%)
- Establishes core metrics for monitoring and maintaining system health for SRE practitioners (e.g., latency, traffic, errors, and saturation). Manages the processes and procedures used for documentation and knowledge sharing including maintaining detailed documentation of systems, device inventories, processes, and procedures. Leads by example by sharing knowledge and best practices with other staff and cross-functional teams. Provides training and mentorship to junior or less experienced team members. Stays current with the latest technologies and trends in site reliability engineering. Leads or participates in briefings with cross-functional stakeholders to manage priorities and team assignments, support ticket queues, etc. (10%)
- Other duties as assigned or requested. (5%)
Qualifications
Required
- Bachelor's degree in Computer Science, Engineering, Management Information Systems, IT, or related field or relevant experience and/or education as determined by the company in lieu of bachelor's degree.
- 3 years with Management or leadership role
Preferred
- Master's degree in Computer Science, Engineering, Management Information Systems, IT, or related field
- 5 years of experience with Site Reliability Engineering (SRE), Systems Administration, or DevOps particularly in healthcare IT
- 5 years of experience in Medical device management lifecycle, network / device segmentation, vulnerability and patch management
- 5 years of experience in Healthcare IT experience in architecture, automation, IoT, telemetry, telehealth, security, system development lifecycle, capacity planning, networking, continuous integration / continuous delivery pipelines (CI/CD), incident management, scripting, metrics, monitoring, redundancy, etc.
- 3 years of experience working in highly regulated environments
- 3 years of experience with Progressive leadership roles, preferably in clinical engineering, IT, business continuity, backup and storage management, building automation, or cybersecurity discipline in healthcare
Skills
- Problem-Solving: Excellent analytical and troubleshooting skills; High capacity to think analytically, interpret information / observations, apply judgment and to assist with making effective, strategic decisions.
- Collaboration: Ability to work effectively in a team environment; demonstrated ability to support multiple sites and locations while maintaining consistency in service delivery processes and procedures.
- Communication: Strong written and verbal communication skills.
- Flexibility: Willingness to participate in activities or incidents which may occur outside of regular work schedules.
- Leadership: Demonstrated resource and project planning capabilities, decision making skills, history of results-oriented delivery, and effective team building across multiple locations and a diverse team of staff, partners, and stakeholders.
- Security Awareness: Understanding of security best practices and how to apply them in a healthcare IT environment.
- Delivery and Execution: Demonstrated competency in the execution of multiple projects, including managing resources across multiple projects to meet goals.
- Relationships: Strong relationship building skills and ability to influence with and without authority in a matrixed organization.
- ...Skills Kubernetes and Docker knowledge will be considered high Bachelor’s degree or equivalent experience in a software engineering discipline Mastery in at least two or more software languages (e.g. Python, Java, Go, etc.) with respect to designing, coding,...Suggested
$80k - $133k
## Site Reliability EngineerApplylocations: US - TX, San Antonio: US - VA, McLean: US - Remote... ...scalability.* Participate in incident management ceremonies to analyze the root cause... ...experience in IT administration, software engineering, or platform engineering, with a focus...SuggestedPermanent employmentContract workTemporary workRemote workFlexible hours- ...Responsible for leading a talented team of SREs/DevOps Engineers across a wide variety of Cloud Services to ensure the reliability, availability, and performance of software... ...leadership Ability to delegate tasks and manage others effectively, especially in times of...SuggestedFull time
- Dovel Technologies, Inc is seeking a Site Reliability Engineer to establish and maintain SRE practices within Agile teams. The candidate will... ...reliability and participate in code reviews and incident management. The role requires a BA/BS Degree or equivalent experience...SuggestedRemote jobFlexible hours
$91k - $321.5k
...Requirements: Up to 60% At PwC, our people in data and analytics engineering focus on leveraging advanced technologies and techniques to... ...AI solutions that drive operational excellence. As a Senior Manager you will serve as a strategic advisor, leveraging your...SuggestedFull timeH1b$119.3k - $159.1k
...Senior Manager, AI-Enabled Engineering Las Vegas, Nevada The SHOW comes alive at MGM Resorts International. Have you ever wondered what it... ...performing engineering team that delivers scalable, secure, and reliable solutions through AI-augmented development practices....Shift work$73.5k - $212.28k
...Requirements: Up to 60% At PwC, our people in data and analytics engineering focus on leveraging advanced technologies and techniques to... ...for coaching, leveraging team member's unique strengths, and managing performance to deliver on client expectations. With your...Full timeH1b- ...Job Description Job Description Job Description Job Title: Site Reliability Engineer Location: San Antonio, TX (Lackland AFB / Onsite 5x per week) Security Clearance: TS/SCI (or SCI Eligibility) We question. We listen. We adapt. Be honest. Be pragmatic...Full time
$142.6k - $261.5k
...team of product leaders, data scientists, designers, and software engineers enable our clients to solve their most complex product... ...align with business requirements. Your key responsibilities As a Manager in Application Design and Development, you will lead the effective...Summer holidayFlexible hours- ...Job Description Job Description SUMMARY Seeking a Engineering Manager in the San Antonio, TX area! The Lead Engineer will serve as a technical lead for the design and execution of custom and standardized industrial control panel enclosures and engineered mechanical...Relocation package
$46.92k
...development so they can reach their full potential. Responsibilities include: Providing daily supervision and mentorship Managing household routines and student schedules Administering medications and ensuring student wellness Driving students to...Full timeWork from homeRelocationRelocation packageFlexible hoursWeekday work$107.12k - $182.1k
Overview Our senior partner managers build and strengthen relationships with business partners and systems integrators to collaboratively sell and promote the adoption of Esri’s technology. This Sr. Partner Manager role will be responsible for strategic partnerships with...Full time- Ernst & Young Oman seeks a Senior Manager for Technology Consulting in SAP services. This role involves leading managed services, developing client relationships, and ensuring operational excellence in service delivery. The ideal candidate will have 8-10 years of SAP experience...
- Ernst & Young Oman is seeking a Technology Business Analyst Manager to lead SAP projects while bridging business needs and technical solutions. Based in San Antonio, TX, this role encompasses responsibilities like analyzing business models and ensuring alignment between...
- ...What WE are and where WE are going: At ST Engineering, we apply our technology and innovation... ..., ensuring that SAP systems are stable, reliable, and efficiently supporting business... ...with SAP best practices and standards, and manage any custom developments or enhancements...Temporary workLocal area
- A veteran-owned solutions provider is seeking a DevOps Engineer to design and maintain cloud-based applications while ensuring compliance with regulatory standards. The role requires collaboration with various teams and extensive experience in cloud services and automation...Remote job
$76.2k - $174.1k
...want it to go. Join EY and help to build a better working world. Manager - Financial Services Organization – AI and Data - Service... ...majors Minimum of 5 years of related work experience in AI/ML engineering or MLE/ML Ops Experience working with business, management, and...Work experience placementSummer holidayFlexible hours$91.27k - $121.69k
...Join us in building the future. The Role The IT Systems Engineer II provides advanced Tier II support by troubleshooting and repairing... ...II is operationally responsible for all IP and network management applications associated with the IP and transport platforms and...Temporary workShift workNight shift$107.9k - $195.05k
...integrating commercial products to provide a comprehensive digital engineering approach to IT transformation. Our team is solving the world’s... ...Aids the scrum team by eliminating any external blockers and manages internal roadblocks through process improvement. Basic...Contract workWork experience placementLocal areaImmediate start$125.5k - $230.2k
Ernst & Young Oman is seeking an AI/Machine Learning Engineer, Manager Consultant. This role involves leading teams to develop scalable AI solutions in the Power & Utilities sector. Candidates should have 6-10 years of experience, strong skills in Python, and a Bachelor...$80 per hour
...tests for large codebases in various source languages Create and manage Docker environments to ensure 100% reproducible builds and test... ...code quality Requirements 5+ years of experience as a Software Engineer (primarily Python ) Deep experience with pytest (fixtures,...Hourly payFreelanceRemote workFlexible hours- ...Partner cross-functionally with Product, Marketing, Data, and Engineering to translate business needs into scalable technical solutions... ...understanding of marketing automation, customer journey management, CRM, and customer care workflows . Hands‑on ability to configure...Local areaNight shift
- Join a leading healthcare provider dedicated to exceptional patient experiences as a Nuclear Med Tech PRN. This role offers the opportunity to make a significant impact on patient care while working in a supportive and innovative environment. You will supervise a skilled...Relief
$5,000 per month
...Platform Engineer Position Overview We are seeking a Cloud Platform Engineer with experience in cloud technologies, particularly... ...practices and modern automation tools to define, build, and manage virtual infrastructure in the cloud. Design and implement core...Temporary workLocal areaRemote workFlexible hours2 days per week1 day per week- Pfluger Architects in San Antonio is seeking a BIM Manager who will lead the development and implementation of firmwide BIM standards and workflows. This role encompasses training, troubleshooting, and supporting project teams to enhance their use of BIM technologies. The...Local area
- ...Energy in San Antonio, Texas, is seeking a Plant Technical Support Manager to provide technical support for combustion and gas turbines.... ...operations. Candidates should have a Bachelor of Science in Engineering, extensive knowledge in turbine technologies, and be results-...
$141.1k - $204.7k
Boeing is seeking a Mechanical Systems and Power First-Line Engineering Manager to lead a dynamic team in San Antonio, TX. The role involves overseeing the development and certification of Mechanical Systems and Power Systems, ensuring safety and quality throughout engineering...- LATICRETE International is looking for a Technical Support Manager in San Antonio, Texas, to oversee technical service operations in the construction products industry. The ideal candidate will manage a team of technical representatives, ensure product quality, and communicate...
- Capital Factory is seeking a motivated Technical Sales Manager in San Antonio, Texas to drive revenue growth and support the sales team. In this role, you will bridge engineering and sales, ensuring customer needs are met and solutions provided. You will manage CRM systems...
$142.6k - $261.5k
...career wherever you want it to go. Join EY and help to build a better working world. Technology – Engineering & Systems Integration – Technology Business Analysis – Manager SAP – Finance Project Systems – Manager Our objective is to provide clients with a unique...Summer holidayFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Manager Site Reliability Engineering. Be the first to apply!
- on-site clinical research associate (traveling/remote) San Antonio, TX
- junior website developer San Antonio, TX
- IT site lead San Antonio, TX
- site leader San Antonio, TX
- site safety San Antonio, TX
- site recruiter San Antonio, TX
- on site coordinator San Antonio, TX
- site services specialist San Antonio, TX
- website coordinator San Antonio, TX
- website content developer San Antonio, TX




