Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Manager, Site Reliability Engineering

Paradigm

Paradigm is a software company transforming the way that the residential, construction & building product industries operate across the globe. We are looking for a Manager, Site Reliability Engineering to be part of revolutionizing these industries. We're looking for a hands‑on SRE leader to build and develop a high‑performing team that oversees reliability across our Azure‑based platform. You'll promote modern SRE practices, drive down incident response times, and shape a culture where automation replaces toil and every incident becomes a learning opportunity. This role combines technical depth with people leadership. You'll design reliability frameworks, lead incident response, coach engineers, and partner with product teams to embed reliability into everything we build. Working closely with the Senior Director of SRE & Cloud Operations, you'll transform reactive operations into proactive, data‑driven service management with increasing use of AI and automation to get there faster. What You Will Do: Lead and grow a team of site reliability engineers. Provide guidance, mentorship, and career development. Contribute to and mature SRE practices across production services: SLOs, SLIs, error budgets, toil reduction, and blameless post‑mortems that turn incidents into lasting improvements. Oversee the incident management lifecycle end‑to‑end including detection, response, resolution, post‑incident review, and systemic improvement. Design on‑call rotations, runbooks, and escalation procedures that balance service reliability with engineer well‑being and sustainable work practices. Drive measurable reductions in MTTR and MTTD through improved observability, intelligent automation, and predictive monitoring. Build automation to eliminate manual operational work including provisioning, deployment, scaling, self‑healing, and reporting. Implement chaos engineering practices to validate system resilience and surface weaknesses before they cause outages. Partner with engineering and product teams to embed reliability requirements into the development lifecycle, from design through deployment. Collaborate with the observability team to ensure comprehensive instrumentation, smart alerting, and actionable dashboards across all critical services. Measure, report, and advocate for reliability improvements with both technical and executive stakeholders using data to drive investment decisions. What You Need to Succeed: Bachelor’s degree in Engineering, or a related field or equivalent experience. 7+ years in site reliability engineering, DevOps, or infrastructure engineering, with at least 1 year in people management (or demonstrated tech lead experience with direct influence over team processes and career growth). Hands‑on experience running production systems on Azure (including proficiency with key services such as AKS, App Services, Service Bus, Event Grid, and Azure Monitor) or comparable cloud platforms. Proven track record implementing SRE practices with measurable reliability improvements and familiarity with modern observability platforms (Datadog, Prometheus/Grafana, or equivalent). AI‑enhanced observability experience is preferred. Experience leading incident response for high‑severity production issues and running effective post‑mortems. Strong background in automation, infrastructure as code (Terraform, Bicep, or similar), and systematically eliminating manual operational work. Experience with Kubernetes container orchestration with production‑grade operational experience. Ability to automate workflows and build scripts using Python, Bash, PowerShell, or Go. Experience with AI coding assistants and CI/CD systems (GitHub Actions, Azure DevOps, ArgoCD) with automation capabilities is preferred. Knowledge of distributed systems patterns is preferred. Exposure to AIOps platforms or using LLMs for operational automation is preferred. Strong communication with the ability to make complex technical issues clear for both engineers and executives. Data‑driven approach. You use metrics and telemetry to guide decisions, not gut feel. You are collaborative cross‑functionally and build trust and alignment naturally. #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Manager, Site Reliability Engineering in Irving, TX vacancy
  •  ...candidate for this role to work on site in the specified location(s). Workplace Services Engineering (WSE) is an organization...  ...The Senior Technical Project Manager (TPM) leads cross‑functional coordination...  ...across initiatives tied to reliability, automation, platform... 
    Suggested
    Work at office

    Charles Schwab

    Southlake, TX
    13 hours ago
  • Role: Senior SRE Engineer Location: Washington DC - Hybrid Job Description...  ...and Grail to drive proactive reliability, mentoring cross-functional...  ..., or AWS CDK. Log Management: Manage high-volume log ingest...  ...Flexibility: Ability to work on-site in the Washington, DC area as... 
    Suggested
    Work from home
    Flexible hours

    Vytwo

    Dallas, TX
    1 day ago
  • Position Overview: The primary responsibility of the Senior Site Reliability Engineer (SRE) is to lead reliability engineering initiatives...  ..., App Services, Functions, VMSS, Storage, Front Door, API Management, Load Balancers, Monitor, Log Analytics, App Insights, Key... 
    Suggested
    Shift work
    Night shift

    Las Vegas Sands Corp.

    Dallas, TX
    4 days ago
  • PNC Financial Services Group, Inc. is seeking a Senior Site Reliability Engineer for its SRC Lending organization in Dallas, TX. The role focuses on engineering stability, performance, and resiliency across production environments. Qualifications include a university degree... 
    Suggested

    PNC Financial Services Group, Inc.

    Dallas, TX
    2 days ago
  •  ...Technology Services enables the future of how clients manage their money by providing innovative and reliable technology products and services as part of our...  ..., Schwab Infrastructure teams, and application engineering teams to efficiently resolve incidents and operational... 
    Suggested

    Charles Schwab

    Southlake, TX
    13 hours ago
  • Site Reliability Engineer (Chicago, IL; Dallas, TX; ...) Qualifications: 8+ years of Software Engineering experience, or equivalent demonstrated...  ..., and ability to work effectively with the client, IT management and staff, and other groups in Information Technology, including... 
    Contract work
    For contractors
    Work experience placement

    Cedent

    Dallas, TX
    2 days ago
  • We are seeking an experienced Site Reliability Engineer to lead the migration of on‑prem applications to Cloud and to maintain the Cloud applications...  ...Technical Experience Experience in setting up and managing environments on Azure using Terraform infrastructure as code... 
    Permanent employment
    Contract work
    Local area

    Robotics Technologies LLC

    Dallas, TX
    3 days ago
  •  ...Information Technology group delivers secure, reliable technology solutions that enable DTCC...  ...Director of Identity and Access Management (IAM), you will serve as the technical leader responsible for Site Reliability Engineering across IAM platform, overseeing and managing... 
    Remote work
    Flexible hours

    Dtcc

    Coppell, TX
    1 day ago
  • Sr. Manager, Site Reliability Engineering Coppell, TX Job Summary We are seeking a highly skilled and motivated Sr. Manager, Site Reliability Engineering to build and lead our Site Reliability Engineering capability from the ground up. In this role, you will contribute... 
    Casual work
    Work at office
    Local area
    Work from home
    Relocation
    Flexible hours

    Brinker International

    Coppell, TX
    13 hours ago
  •  ...enables the future of how clients manage their money by providing innovative and reliable technology products and services...  ...architecture and application engineering teams to ensure infrastructure and...  ..., safe deployments. Practice Site Reliability Engineering mindset... 
    Work experience placement

    Charles Schwab Corporation

    Southlake, TX
    2 days ago
  • $103.5k - $172.5k

    Overview SeniorManager, Site Reliability Engineering The Site Reliability Engineering Manager is responsible for overseeing the daily operations and delivery of the Site Reliability Engineering teams. This role plays a key part in driving team productivity and ensuring... 
    Contract work
    Temporary work
    Shift work

    JCPenney

    Dallas, TX
    1 day ago
  •  ...candidate for this role to work on site in the specified location(s). Workplace Services Engineering (WSE) is an organization...  ...As a Principal Architect, Site Reliability Engineering for Schwab's Technology...  ...observability, incident management, resilience engineering, and capacity... 
    Work at office

    Charles Schwab

    Southlake, TX
    1 day ago
  • Compliance Engineering, Site Reliability Engineering, Vice President, Dallas Job Description We are Compliance Engineering, a global team of more...  ...Engineering SRE. Job Responsibilities Proactive management of our production services by measuring and monitoring availability... 
    Full time
    Work at office

    Goldman Sachs Group, Inc.

    Dallas, TX
    1 day ago
  • $46.92k

     ...development so they can reach their full potential. Responsibilities include: Providing daily supervision and mentorship Managing household routines and student schedules Administering medications and ensuring student wellness Driving students to... 
    Full time
    Work from home
    Relocation
    Relocation package
    Flexible hours
    Weekday work

    Milton Hershey School

    Irving, TX
    10 days ago
  • $68 - $73 per hour

     ...: $68.00 USD Hourly - $73.00 USD Hourly Description: Software Engineer, Contact Center Technology We are not accepting C2C or 1099 arrangements...  ...scalable, high-performance systems that enhance workforce management and customer experience. You will design and develop... 
    Hourly pay
    Early shift

    The Judge Group

    Irving, TX
    1 day ago
  •  ...see our Privacy Policy.#Senior Software Engineer page is loaded## Senior Software EngineerApplylocations...  ..., collaborating closely with product managers and business stakeholders to turn high-...  ...flows using **NoSQL (MongoDB)**. Ensure reliable, real-time data processing and... 
    Hourly pay

    SuperMom

    Irving, TX
    4 days ago
  •  ...a journey towards organisational excellence. With our global management approach, we offer empowering experiences to drive your professional...  ...across multiple levels for the following positions: Test Engineer (Job Code TE) – software quality assurance & testing of applications... 

    Global Bridge InfoTech

    Irving, TX
    13 hours ago
  • $112.92k - $158.48k

     ...Career Area Technology, Digital and Data Position Summary Position: Software Engineer Location: 5205 N. O'Connor Boulevard, Irving, TX 75039. Responsibilities Perform all programming, project management, and development assignments. Work directly on complex application/... 
    Part time
    Currently hiring
    Remote work
    Flexible hours
    Shift work
    Weekend work

    Caterpillar

    Irving, TX
    13 hours ago
  • $102.4k - $179k

     ...collaborate with cross‑functional teams to build modern full‑stack solutions using .NET, Angular, and Azure technologies while driving engineering excellence, automation, and software quality. You will contribute to system architecture, mentor engineers, and support the... 

    Vizient

    Irving, TX
    3 days ago
  • $88.9k - $165.1k

     ...workflows into enterprise systems Lead system design and architecture discussions Ensure high code quality through reviews, testing, and engineering best practices Collaborate with product, data, platform, and cross‑functional teams Own and resolve complex production issues.... 
    Local area
    Flexible hours

    Unavailable

    Irving, TX
    13 hours ago
  •  ...Staffing Firm has multiple openings for JOB ID12427: Software Engineer. Design complex software solutions, ensuring scalability, security...  .... Engage in cross-functional collaboration with product managers, quality assurance teams, and other stakeholders to deliver high... 
    Relocation

    One-on-One Support

    Irving, TX
    8 hours ago
  •  ...Senior Software Engineer (Promotions Platform) We are seeking a highly skilled and strategic...  ..., collaborating closely with product managers and business stakeholders to turn high‑level...  ...flows using NoSQL (MongoDB). Ensure reliable, real‑time data processing and integration... 

    7-Eleven

    Irving, TX
    3 days ago
  • $120k - $130k

    Technologies C#, .NET Angular, Angular Material, TypeScript SQL/T-SQL, Entity Framework Azure, REST APIs Responsibilities Build responsive user interfaces Develop backend REST APIs Maintain Azure pipelines including ADF and Databricks Perform unit and system testing and...

    Tata Consultancy Services

    Irving, TX
    2 days ago
  •  ...JOB SUMMARY The ASU Reliability Engineering Manager will manage the reliability engineering program, manage engineering projects, and provide technical expertise to support a safe, efficient, and reliable ASU (Air Separation Unit) production operation. This role leads... 
    Work experience placement
    Work at office
    Work from home
    Weekend work
    Afternoon shift

    Matheson Tri-Gas

    Irving, TX
    1 day ago
  • $73.5k - $212.28k

     ...Requirements: Up to 60% At PwC, our people in data and analytics engineering focus onleveraging advanced technologies and techniques to...  ...for coaching, leveraging team member's unique strengths, and managing performance to deliver on client expectations. With your growing... 
    Full time
    H1b

    PwC

    Dallas, TX
    1 day ago
  • $140.8k - $232.5k

    The Options Clearing Corporation is seeking a Manager for Platform Engineering focused on Developer Experience. This role involves defining the DevX roadmap, engaging with engineering teams, and leading Agile practices to enhance development workflows. Key responsibilities... 

    The Options Clearing Corporation

    Dallas, TX
    2 days ago
  •  ...report progress. Suggest improvements to software architecture, development processes, and new technologies. Strictly follow Citi’s engineering standards across all project modules. Consistently perform code and design reviews. Define operating standards and processes,... 

    Virtusa

    Irving, TX
    4 days ago
  •  ...formatting practices. + Ability to understand and utilize JSON and XML + Must be proactive, detail oriented and possess strong time management skills. + Self-starter. Ability to work autonomously with little support.* **Education:** + Bachelors Degree in Computer Science*... 
    Full time
    Work at office
    Local area

    Neighborly

    Irving, TX
    2 days ago
  • $112.71k - $183.14k

     ..., and compliance of supported tools. In this role, the DevOps engineer will support our Autonomy engineering teams in Cat Technology...  ...our customers. Competent to perform all programming, project management, and development assignments without close supervision; normally... 
    Part time
    Relocation package
    Flexible hours

    Caterpillar Financial Service Corp

    Irving, TX
    1 day ago
  • $60k - $80k

     ...Typescript. Design performance, accessible, and secure UI architectures with Material UI components, setting standards for complex state management, authentication flows, and UX consistency. Back End: Architect scalable services using Node.js or Python, design REST and event‑... 

    Tata Consultancy Services

    Irving, TX
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Manager, Site Reliability Engineering. Be the first to apply!