Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Manager Site Reliability Engineering

$51.9 per hour

Highmark Health

Company :

Allegheny Health Network

Job Description :

GENERAL OVERVIEW:

This job is responsible for the reliability, availability, and performance of critical healthcare IT systems, principally in the Environment of Care (EOC), enabling seamless access to essential services for patients, providers, and the people we serve. Proactively identifies and mitigates potential disruptions to maintain the highest standards of care and operational efficiency. This role blends software engineering, clinical engineering, and security principles with a deep understanding of healthcare operations to minimize downtime, improve system resilience, and to support clinical workflows and continuity of hospital operations. Works cross-functionally with AHN site leaders and teams to navigate and to monitor and support building automation and facility systems, clinical engineering / IoT, healthcare delivery technology architecture, infrastructure and platform operations, and cybersecurity. Fosters a culture of automation, continuous improvement, collaboration, and patient safety. Develops core metrics for monitoring and maintaining system health for SRE practitioners (e.g., latency, traffic, errors, and saturation) leveraging industry practices, manufacturer guidance, and other service delivery metrics.

ESSENTIAL RESPONSIBILITIES

  • Perform management responsibilities to include, but are not limited to: involved in hiring and termination decisions, coaching and development, rewards and recognition, performance management and staff productivity.Plan, organize, staff, direct and control the day-to-day operations of the department; develop and implement policies and programs as necessary; may have budgetary responsibility and authority. (25%)

  • Oversees the partnership with clinical engineering, cybersecurity, device manufacturers, suppliers, and Information Technology SMEs to oversee and to implement strategies for managing, monitoring, and securing a diverse range of clinical devices and other technology equipment (e.g., IoT), ensuring compliance with HIPAA and other relevant regulations (e.g., FDA, TJC, PCI). Keeps current on healthcare IT trends, including AI, security patching, and best practices for device hardening. Oversees and assists with network segmentation and access controls to isolate and to protect clinical and other critical devices. Automates monitoring tasks to improve efficiency and reduce errors. Identifies and remediates vulnerabilities in clinical devices and related infrastructure. Manages and reports issues with assets, devices, integration services, and other equipment. Engages the appropriate parties to develop and deploy a fix/solution or oversees ownership of resolution actions. Utilizes observability practices to gain deep insights into system behavior, enabling faster identification and resolution of issues. (15%)

  • Oversees the SRE partnership with Clinical Engineering and Cybersecurity Engineering to troubleshoot technical issues related to medical equipment and systems. Participates in the medical device technology lifecycle – from product/device evaluation, discovery, to implementation, maintenance, and through retirement. Develops the framework and structure to maintain documentation related to the IT infrastructure supporting clinicaland other critical devices. Participates in the planning and oversees the execution of preventative maintenance activities. Provides direction and guidance to team members on how to analyze complex problems and develop effective solutions, how to troubleshoot system outages and performance issues, and how to work collaboratively with other IT, cybersecurity, facility, AI and application teams to resolve issues and to conduct root cause analyses. (15%)

  • Oversees the SRE partnership with facility leaders to optimize the performance and monitoring of building automation systems (BAS), including HVAC, lighting, fire suppression, security systems, etc. Manages processes and procedures used to monitor BAS performance metrics and proactively identifies potential issues. Works with facilities management to implement improvements to the BAS infrastructure. Works with cybersecurity, vendors/manufacturers, et. al. to ensure the security of building automation systems and oversees monitoring of performance, service delivery, and support. (15%)

  • Oversees the SRE partnership with IT teams including, but not limited to platform / product management, disaster recovery services, infrastructure and architecture, storage management, and release management. Participates in the planning and execution of downtime drills and system / device recovery exercises. Supports other emergency preparedness drills and exercises, as needed. Leads or participates in post-incident reviews to identify root causes and implement corrective actions. Works with cross-functional stakeholders to Implement and to maintain redundant systems and failover mechanisms to minimize downtime. Reviews and provides feedback on emergency operations plans and other materials which are used to respond to emergency situations (e.g., Continuity of Operations Plans, Incident Response Guides, Downtime Procedures). Manages team members who are supporting the planning and execution of system migrations, releases, and upgrades to ensure minimal disruption to clinical operations. Oversees detailed migration or installation plans, including risk assessments, rollback procedures, and communication strategies. Assists local site leaders with navigating shared services (e.g., AI, IT, Information Security, Clinical Engineering, Platform Operations, Technology Acquisition). (15%)

  • Establishes core metrics for monitoring and maintaining system health for SRE practitioners (e.g., latency, traffic, errors, and saturation). Manages the processes and procedures used for documentation and knowledge sharing including maintaining detailed documentation of systems, device inventories, processes, and procedures.Leads by example by sharing knowledge and best practices with other staff and cross-functional teams. Provides training and mentorship to junior or less experienced team members. Stays current with the latest technologies and trends in site reliability engineering. Leads or participates in briefings with cross-functional stakeholders to manage priorities and team assignments, support ticket queues, etc. (10%)

  • Other duties as assigned or requested. (5%)

Q UALIFICATIONS:

Required

  • Bachelor’s degree in Computer Science, Engineering, Management Information Systems, IT, or related field or relevant experience and/or education as determined by the company in lieu of bachelor's degree.

  • 3 years with Management or leadership role

Preferred

  • Master's degree in Computer Science, Engineering, Management Information Systems, IT, or related field

  • 5 years of experience with Site Reliability Engineering (SRE), Systems Administration, or DevOps particularly in healthcare IT

  • 5 years of experience in Medical device management lifecycle, network / device segmentation, vulnerability and patch management

  • 5 years of experience in Healthcare IT experience in architecture, automation, IoT, telemetry, telehealth, security, system development lifecycle, capacity planning, networking, continuous integration / continuous delivery pipelines (CI/CD), incident management, scripting, metrics, monitoring, redundancy, etc.

  • 3 years of experience working in highly regulated environments

  • 3 years of experience with Progressive leadership roles, preferably inclinical engineering, IT, business continuity, backup and storage management, building automation, or cybersecurity discipline in healthcare

SKILLS:

  • Problem-Solving: Excellent analytical and troubleshooting skills; High capacity to think analytically, interpret information / observations, apply judgment and to assist with making effective, strategic decisions.

  • Collaboration: Ability to work effectively in a team environment; demonstrated ability to support multiple sites and locations while maintaining consistency in service delivery processes and procedures.

  • Communication: Strong written and verbal communication skills.

  • Flexibility: Willingness to participate in activities or incidents which may occur outside of regular work schedules.

  • Leadership: Demonstrated resource and project planning capabilities, decision making skills, history of results-oriented delivery, and effective team building across multiple locations and a diverse team of staff, partners, and stakeholders.

  • Security Awareness: Understanding of security best practices and how to apply them in a healthcare IT environment.

  • Delivery and Execution: Demonstrated competency in the execution of multiple projects, including managing resources across multiple projects to meet goals.

  • Relationships: Strong relationship building skills and ability to influence with and without authority in a matrixed organization.

Disclaimer: The job description has been designed to indicate the general nature and essential duties and responsibilities of work performed by employees within this job title. It may not contain a comprehensive inventory of all duties, responsibilities, and qualifications required of employees to do this job.

Compliance Requirement : This job adheres to the ethical and legal standards and behavioral expectations as set forth in the code of business conduct and company policies.

As a component of job responsibilities, employees may have access to covered information, cardholder data, or other confidential customer information that must be protected at all times. In connection with this, all employees must comply with both the Health Insurance Portability Accountability Act of 1996 (HIPAA) as described in the Notice of Privacy Practices and Privacy Policies and Procedures as well as all data security guidelines established within the Company’s Handbook of Privacy Policies and Practices and Information Security Policy.

Furthermore, it is every employee’s responsibility to comply with the company’s Code of Business Conduct. This includes but is not limited to adherence to applicable federal and state laws, rules, and regulations as well as company policies and training requirements.

Pay Range Minimum:

$51.90

Pay Range Maximum:

$83.84

Base pay is determined by a variety of factors including a candidate’s qualifications, experience, and expected contributions, as well as internal peer equity, market, and business considerations. The displayed salary range does not reflect any geographic differential Highmark may apply for certain locations based upon comparative markets.

Highmark Health and its affiliates prohibit discrimination against qualified individuals based on their status as protected veterans or individuals with disabilities and prohibit discrimination against all individuals based on any category protected by applicable federal, state, or local law.

We endeavor to make this site accessible to any and all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process, please contact the email below.

For accommodation requests, please contact HR Services Online at View email address on click.appcast.io

California Consumer Privacy Act Employees, Contractors, and Applicants Notice

Req ID: J280531

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Manager Site Reliability Engineering in Austin, TX vacancy
  •  ...exceptional interactions, smarter decision-making, and accelerated growth in the AI-driven world. We’re looking for a Senior Site Reliability Engineer to help build and scale a high-impact SRE function. You’ll be a technical leader on a team responsible for improving... 
    Suggested

    Elea Ecuador

    Austin, TX
    1 day ago
  • $152k - $195k

     ...5,000 organizations for self‑monitoring, third‑party risk management, board reporting, and cyber insurance underwriting; making...  ...Capital, GV and Riverwood Capital. About the Team As a Senior Site Reliability Engineer, you will be a key technical leader driving the design and... 
    Suggested

    Zoomcar

    Austin, TX
    4 days ago
  • $110.7k - $171.8k

     ...Participation in oncall rotation as a platform reliability escalation point Incident response, postincident reviews, and problem management Improve day2 operations by...  ...control requirements. Collaborate with engineering teams across the organization to influence... 
    Suggested
    Work experience placement
    Work at office
    Local area

    Visa

    Austin, TX
    1 day ago
  •  ...Site Reliability Engineer, Enterprise Technology Services Austin, Texas, United States Software and Services Imagine what we could do together...  ...role in supporting the Apple ecosystem by offering identity management, factory and device support, infrastructure support,... 
    Suggested

    Apple

    Austin, TX
    5 days ago
  • $98.58k - $138.02k

     ...office locations: Austin, TX; Irvine, CA; or Akron, OH. Role Site Reliability Engineer II will be responsible for supporting, enhancing, and...  ...Terraform, Ansible, or CloudFormation. Work within change management protocols to provide maximum uptime for production systems... 
    Suggested
    Work at office

    Restaurant365

    Austin, TX
    5 days ago
  • $110.7k - $171.8k

     ...Description As a part of the Product Reliability Engineering (PRE) Organization of VISA , you will...  ..., performance, efficiency, change management, monitoring, emergency response, and capacity...  ...and software that help increase site reliability and performance. Site reliability... 
    Permanent employment
    Work experience placement
    Work at office
    Local area
    Immediate start
    Flexible hours
    Weekend work

    Visa

    Austin, TX
    1 day ago
  •  ...responders. And this is where you come in. We're seeking a Senior Site Reliability Engineer who can own our data tier at high availability while also...  ..., construction, and public safety. When a hotel manager radios housekeeping or a trucker calls dispatch, they're on... 
    Permanent employment
    Local area
    Flexible hours

    Zello

    Austin, TX
    5 days ago
  •  ...constantly striving to make the most reliable and scalable systems possible to ensure...  ...ahead and we’re looking for a passionate Site Reliability Engineer to join our team in Dallas, TX or...  ...preferably AWS), using CI/CD to deploy, manage and operate production systems, focusing... 
    Local area

    Traveltechessentialist

    Austin, TX
    5 days ago
  •  ...to match. The role We’re looking for a Senior SRE to own the reliability, scalability, and operational posture of Satsuma’s multi-cloud...  ...using AI‑assisted development workflows Partner closely with engineering on reliability reviews and architecture decisions 5‑8 years... 

    Satsuma

    Austin, TX
    5 days ago
  •  ...passionate about building unified IT solutions that simplify the way IT organizations work. We are currently looking for a Senior Site Reliability Engineer to join our SRE team in the Platform Engineering organization and help us scale our products to millions of end-users. The... 
    Permanent employment
    Remote work
    Work from home
    Flexible hours

    NinjaOne

    Austin, TX
    1 day ago
  •  ...mission-critical software to enterprise clients. As customer adoption increases, they are expanding their SRE function to improve reliability, scalability, and performance across their cloud-native environment. This is a core growth function for the business. What you'll... 

    Involved Solutions

    Austin, TX
    5 days ago
  •  ...the selected candidate for this role to work on site in the specified location(s). As a Senior Site Reliability Engineer within the CETSAvE organization, you will play...  ...: Production Operations & Incident Management Respond to system alerts and production incident... 
    Full time
    Work at office

    Charles Schwab

    Austin, TX
    14 hours ago
  • $131.6k - $210.3k

     ...Progress starts with you. Job Description The Staff Site Reliability Engineer (Azure)is responsible for designing, building, and...  ...including cloud networking, compute, storage, identity and access management, observability, and container orchestration. The ideal... 
    Work experience placement
    Work at office
    Local area
    Remote work

    Visa

    Austin, TX
    3 days ago
  •  ...Site Reliability Engineer (Edge Services), Infrastructure Services Austin, Texas, United States Software and Services We are seeking a proactive...  ...workflows using Python or Go. Experience configuring and managing modern monitoring suites (e.g., Prometheus, Grafana,... 
    Shift work

    Apple

    Austin, TX
    5 days ago
  •  ...that acts as a force multiplier for our engineering organization. Our mission is to be the...  ...console; we write code, build tools, and manage our infrastructure through GitOps. As a...  ...be a key driver of our architecture, reliability, and developer enablement strategy. This... 
    Temporary work
    Immediate start
    Flexible hours

    FloSports

    Austin, TX
    5 days ago
  •  ...DevOps / Site Reliability Engineer ID70127 Full time | AgileEngine | United States Posted On 06/17/2026 Job Information City: Austin State/Province...  ...monitoring alerts, utilizing Cloud Security Posture Management (CSPM) tools like Wiz to secure workloads MUST HAVES You must... 
    Full time
    Work at office
    Remote work
    Visa sponsorship
    Work visa
    Flexible hours

    AgileEngine

    Austin, TX
    3 days ago
  • Sr Site Reliability Engineer, Customer Systems Austin, Texas, United States Software and Services Imagine what you could do here. Apple is a...  ...Storage, and Network 3+ years of experience with deploying/managing Kubernetes using Helm Experience with Shell Scripting,... 

    Apple Inc.

    Austin, TX
    4 days ago
  •  .... Hands-on experience deploying and managing services with Helm in Kubernetes environments...  ...: Build and operate scalable and reliable infrastructure. Collaborate with...  ...insurance 401(k) Get notified about new Site Reliability Engineer jobs in Austin, Texas Metropolitan... 
    Full time
    Remote work

    Altimetrik

    Austin, TX
    4 days ago
  • Capacity Ingestion and Management Participates and listens in on discussions for the design...  ...and/or service according to terms for reliability and functionality. Assists team members...  ...deployments. Gains basic knowledge of site reliability trends and shares relevant information... 
    Immediate start
    Shift work

    Ll Oefentherapie

    Austin, TX
    13 hours ago
  • $152k - $241.5k

    Senior Site Reliability Engineer - HPC page is loaded## Senior Site Reliability Engineer - HPClocations: US, CA, Santa Clara: US, TX, Austin:...  ...network fabrics.* Use IaC(Infrastructure‑as‑Code) and config management to standardize and automate provisioning everywhere.*... 

    NVIDIA Corporation

    Austin, TX
    1 day ago
  • Teacher Retirement System of Texas is hiring a Site Reliability Engineer for its Austin office. The role requires expertise in maintaining IT infrastructure...  ...will work collaboratively with IT staff to design and manage complex systems, utilizing tools such as PowerShell and... 
    Work at office

    Teacher Retirement System of Texas

    Austin, TX
    4 days ago
  •  ...developer tooling ecosystem that shapes how engineers work day to day, including Python and ....  ...them. What You’ll Work On Operations & Reliability: Serve as a primary escalation point for...  ...developer tooling configurations and manage software vulnerabilities Contribute to on... 

    Dimensional Fund Advisors

    Austin, TX
    4 days ago
  • Site Reliability Engineer, Teamcenter, Enterprise Technology Services Austin, Texas, United States Software and Services Description As an SRE,...  ...efficiency and reducing manual toil. Documentation: Maintain and manage runbooks and best practices to foster knowledge sharing and... 

    Apple Inc.

    Austin, TX
    3 days ago
  • A leading technology company is looking for a Site Reliability Engineer to join their Enterprise Technology Services in Austin, Texas. The role involves automating operations, optimizing infrastructure, and collaborating with engineering teams to ensure system reliability... 

    Apple Inc.

    Austin, TX
    13 hours ago
  • Senior Site Reliability Engineer - Trustwise (Austin) About Trustwise: At Trustwise, we are deeply committed to building an AI Trust layer that...  ...optimization, Trustwise enables developers and enterprises to manage risks, meet compliance requirements, and accelerate AI... 
    Remote work

    trustwise Inc.

    Austin, TX
    4 days ago
  •  ...for a Senior SRE to join our Platform Engineering team as the operations owner of our observability...  .... You’ll be responsible for the reliability, scalability, and continued evolution...  ...infrastructure - including Elasticsearch cluster management, index lifecycle policies, and... 

    Dimensional Fund Advisors

    Austin, TX
    4 days ago
  • Site Reliability Engineer (Associate / Intermediate / Senior) Site Reliability Engineer Associate (SRE) is responsible for assisting to ensure...  ...Information Technology Infrastructure. The incumbent assists in managing a complex application and infrastructure environment that... 
    Full time
    Work experience placement

    Teacher Retirement System of Texas

    Austin, TX
    4 days ago
  • $127k - $249k

    We are looking for an experienced Senior or Staff Engineer for our SRE, InfraSec team, to guide the security of our cloud-based infrastructure...  ...Azure, GCP), including network and compute security, identity management, and cloud security posture management (CSPM). Automation and... 
    Local area
    Remote work

    MongoDB

    Austin, TX
    4 days ago
  • A leading company is seeking a Site Reliability Engineer to join their Platform Infrastructure team. This remote role involves building reliable infrastructure, collaborating with development teams, and ensuring robust integrations with third-party services. Ideal candidates... 
    Remote work

    Altimetrik

    Austin, TX
    4 days ago
  • Charles Schwab Corporation is seeking a Senior Site Reliability Engineer to lead efforts in enhancing the reliability, scalability, and performance...  ...should demonstrate expertise in automation, incident management, and cloud-native architectures. This position is based in... 
    Work at office

    Charles Schwab Corporation

    Austin, TX
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Manager Site Reliability Engineering. Be the first to apply!