Manager, Site Reliability Engineering
Paradigm
Paradigm is a software company transforming the way that the residential, construction & building product industries operate across the globe. We are looking for a Manager, Site Reliability Engineering to be part of revolutionizing these industries: a hands‑on SRE leader who will build and develop a high‑performing team that oversees reliability across our Azure‑based platform. You'll promote modern SRE practices, drive down incident response times, and shape a culture where automation replaces toil and every incident becomes a learning opportunity.

This role combines technical depth with people leadership. You'll design reliability frameworks, lead incident response, coach engineers, and partner with product teams to embed reliability into everything we build. Working closely with the Senior Director of SRE & Cloud Operations, you'll transform reactive operations into proactive, data‑driven service management, with increasing use of AI and automation to get there faster.

What You Will Do:
- Lead and grow a team of site reliability engineers. Provide guidance, mentorship, and career development.
- Contribute to and mature SRE practices across production services: SLOs, SLIs, error budgets, toil reduction, and blameless post‑mortems that turn incidents into lasting improvements.
- Oversee the incident management lifecycle end‑to‑end, including detection, response, resolution, post‑incident review, and systemic improvement.
- Design on‑call rotations, runbooks, and escalation procedures that balance service reliability with engineer well‑being and sustainable work practices.
- Drive measurable reductions in MTTR and MTTD through improved observability, intelligent automation, and predictive monitoring.
- Build automation to eliminate manual operational work, including provisioning, deployment, scaling, self‑healing, and reporting.
- Implement chaos engineering practices to validate system resilience and surface weaknesses before they cause outages.
- Partner with engineering and product teams to embed reliability requirements into the development lifecycle, from design through deployment.
- Collaborate with the observability team to ensure comprehensive instrumentation, smart alerting, and actionable dashboards across all critical services.
- Measure, report, and advocate for reliability improvements with both technical and executive stakeholders, using data to drive investment decisions.

What You Need to Succeed:
- Bachelor’s degree in Engineering or a related field, or equivalent experience.
- 7+ years in site reliability engineering, DevOps, or infrastructure engineering, with at least 1 year in people management (or demonstrated tech lead experience with direct influence over team processes and career growth).
- Hands‑on experience running production systems on Azure (including proficiency with key services such as AKS, App Services, Service Bus, Event Grid, and Azure Monitor) or comparable cloud platforms.
- Proven track record implementing SRE practices with measurable reliability improvements, and familiarity with modern observability platforms (Datadog, Prometheus/Grafana, or equivalent). AI‑enhanced observability experience is preferred.
- Experience leading incident response for high‑severity production issues and running effective post‑mortems.
- Strong background in automation, infrastructure as code (Terraform, Bicep, or similar), and systematically eliminating manual operational work.
- Production‑grade operational experience with Kubernetes container orchestration.
- Ability to automate workflows and build scripts using Python, Bash, PowerShell, or Go.
- Experience with AI coding assistants and CI/CD systems (GitHub Actions, Azure DevOps, ArgoCD) with automation capabilities is preferred.
- Knowledge of distributed systems patterns is preferred.
- Exposure to AIOps platforms or using LLMs for operational automation is preferred.
- Strong communication, with the ability to make complex technical issues clear for both engineers and executives.
- Data‑driven approach. You use metrics and telemetry to guide decisions, not gut feel.
- Cross‑functional collaboration. You build trust and alignment naturally.


