Site Reliability Engineer

Analytic Partners Inc

Analytic Partners is a global leader in commercial measurement and optimization, turning data into expertise for the world’s largest brands for almost 25 years. Our holistic approach to decisioning is powered by our industry-leading platform and team of experts, who help leaders make better decisions, faster – unlocking business growth and creating powerful customer connections. With clients in 50+ countries and global offices across New York City, Miami, Dallas, Dublin, London, Paris, Singapore, Shanghai, Munich, Poznan, Sydney, Melbourne, Charlottesville and Denver, we’re growing fast. And we’re looking for top talent to join us in shaping the future of analytics. To learn more about what we do, visit analyticpartners.com – and see why we’re recognized as a Leader in the industry by independent research firms Forrester and Gartner. What You’ll Be Doing Own the Internal Developer Platform (IDP) as a product, treating engineering teams as customers and optimizing for reliability, usability, and delivery velocity. Define and execute a platform roadmap aligned with business priorities, developer needs, and long‑term scalability. Design, build, and evolve paved roads for application delivery, including CI/CD pipelines, infrastructure templates, service scaffolding, and standardized deployment patterns. Build self‑service capabilities that enable teams to provision, deploy, observe, and operate services with minimal friction. Create and maintain reusable platform abstractions across AWS and Azure that standardize security, reliability, networking, and observability. Reduce developer cognitive load by abstracting unnecessary complexity while enforcing clear guardrails for security, cost, and compliance. Partner closely with application, product, and security teams to embed reliability, scalability and security by design. Establish and evolve platform standards for logging, monitoring, alerting, tracing, and incident response workloads. Define, measure, and manage SLIs, SLOs, and error budgets for shared platform services. Drive the reduction of operational toil through automation, standardization and platform‑first solutions. Ensure shared platform services meet high standards for availability, performance, resilience, and scalability. Own system‑to‑system integration and messaging patterns used across the platform. Lead capacity planning, demand forecasting and performance tuning for platform services. Plan and execute zero‑downtime upgrades, migrations and releases of platform components. Lead platform‑level incident response workflows, post‑incident reviews and drive systemic improvements rather than one‑off fixes. Evaluate incoming platform requests and translate them into scalable, productized capabilities. Mentor engineers and drive platform adoption through documentation, enablement and technical evangelism. Participate in a 24x7 on‑call rotation as an escalation point for platform reliability and availability issues. Operate effectively in ambiguous problem spaces, making sound architectural and product decisions with limited guidance. What We Look For In You Bachelor’s degree in Computer Science or equivalent practical experience. 4+ years of experience in Platform Engineering, Site Reliability Engineering, DevOps or Systems Engineering roles. Strong expertise in Linux and Windows operating systems. Advanced automation and scripting skills using Python, Bash or PowerShell. Deep, hands‑on experience designing and operating AWS and Azure platforms at scale. Strong experience building and operating CI/CD platforms (Jenkins, GitHub Actions or equivalent). Strong experience with Infrastructure as Code and configuration management (Terraform, CloudFormation, ARM or similar). Production experience with containerized and orchestration platforms such as Docker and Kubernetes. In‑depth experience with the HashiCorp ecosystem (Nomad, Consul, Vault). Strong understanding of distributed systems, cloud‑native architectures and reliability patterns. Experience designing and operating observability platforms (e.g., Splunk, Sumo Logic or similar). Familiarity with security and compliance practices, including vulnerability scanning and enterprise security tooling. Strong understanding of the software delivery lifecycle, release engineering and platform lifecycle management. Experience working in Agile / DevOps environments with a strong product mindset. Demonstrated ability to influence without authority, set standards and drive adoption across teams. Excellent communication skills, able to translate platform capabilities into clear developer value. Strong problem‑solving skills with a bias toward durable, scalable solutions over short‑term fixes. A mindset of continuous improvement, curiosity and learning. Comfortable supporting a global, follow‑the‑sun operation when needed. How We Measure Success Strong developer adoption and satisfaction with the platform (DX). Reduced deployment friction, lead time and operational toil. Platform reliability and performance meeting or exceeding defined SLOs. Consistent, high‑quality service delivery across engineering teams. Reduced incident frequency and severity driven by systemic platform improvements. Increased standardisation, automation and self‑service adoption across the organization. Our differentiator is – Our People! We hire the brightest talent and develop them into leaders. We foster a culture of PEOPLE, PASSION and GROWTH. People: We value our people, customers and partners. Passion: We love what we do. Growth: Unlimited growth means unlimited potential. AP is a customer‑focused, team‑oriented organisation where innovation and results are rewarded and individuals can chart their own careers. As a woman‑founded and led company, this has meant supporting a meritocracy where everyone has opportunities to achieve their best and we foster an environment of diversity, equity and inclusion. In practice this means we will not only work to recruit a diverse workforce, but also maximise the full potential of all our people. You can read more about our commitment to DEI. #J-18808-Ljbffr Analytic Partners

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Site Reliability Engineer in Dallas, TX vacancy

Site Reliability Engineer
...About the Company Role: SRE RunOps Engineer Location: Irving, TX Onsite job About the Role Production Support... .... Implement infrastructure best practices around reliability, scalability, and cost efficiency. Assist with deployments,...
Suggested
Work experience placement
Resolve Tech Solutions
Irving, TX
1 day ago
Site Reliability Engineering
...Job Description Job Description Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas... ...evangelize cloud best practices while building a culture of reliability and observability Engage in and improve the end to end lifecycle...
Suggested
Forhyre
Dallas, TX
21 days ago
Site Reliability Engineer (Chicago, IL; Dallas, TX; San Jose, CA)
Site Reliability Engineer (Chicago, IL; Dallas, TX; ...) Qualifications: 8+ years of Software Engineering experience, or equivalent demonstrated through one or a combination of: work experience, training, experience, education. Contractor will implement and maintain scalable...
Suggested
Contract work
For contractors
Work experience placement
Cedent
Dallas, TX
20 hours ago
Senior Site Reliability Engineer
Position Overview: The primary responsibility of the Senior Site Reliability Engineer (SRE) is to lead reliability engineering initiatives across our Azure estate and Command Center operations. This role focuses on scripting, automation, and observability to ensure uptime...
Suggested
Shift work
Night shift
Las Vegas Sands Corp.
Dallas, TX
2 days ago
Senior SRE (Site Reliability Engineer)
Role: Senior SRE Engineer Location: Washington DC - Hybrid Job Description We are seeking... ...Davis AI and Grail to drive proactive reliability, mentoring cross-functional DevOps teams... ...Location/Flexibility: Ability to work on-site in the Washington, DC area as required and...
Suggested
Work from home
Flexible hours
Vytwo
Dallas, TX
4 days ago
Site Reliability Engineer
Required Skills AWS/Azure/GCP (GCP is not used very much) Kubernetes Helm Docker Gitlab Grafana Cyberark/Hashicorp Vault Terraform etc. Experience Experience utilizing Java, Perl, Python, Go and scripting experience in Shell and Perl to automate reports and monitor enterprise...
TechDigital Group
Dallas, TX
1 day ago
Principal Site Reliability Engineer (SRE)
...Job Description Job Description About the Role We're seeking an exceptional Principal Site Reliability Engineer to architect, design, and build our SRE foundation from the ground up at InfiniteChoice. This is a rare greenfield opportunity to establish SRE practices...
Remote work
INFINITE CHOICE LLC
Dallas, TX
a month ago
Senior Manager, Site Reliability Engineering
$103.5k - $172.5k
Overview SeniorManager, Site Reliability Engineering The Site Reliability Engineering Manager is responsible for overseeing the daily operations and delivery of the Site Reliability Engineering teams. This role plays a key part in driving team productivity and ensuring...
Contract work
Temporary work
Shift work
JCPenney
Dallas, TX
4 days ago
Associate Principal, Site Reliability Engineering
$122.1k - $198.3k
Associate Principal, Site Reliability Engineering Responsibilities Collaborate with development, operations and infrastructure teams to ensure availability of services, and to work through implementation issues Develop automation for incident response and to prevent...
Work experience placement
Remote work
2 days per week
The Options Clearing Corporation
Dallas, TX
2 days ago
Compliance Engineering, Site Reliability Engineering, Vice President, Dallas
Compliance Engineering, Site Reliability Engineering - Vice President, Dallas, TX, United States We are Compliance Engineering, a global team of more than 300 engineers and scientists who work on the most complex, mission‑critical problems. We build and operate a suite...
Goldman Sachs Bank AG
Dallas, TX
4 days ago
Site Reliability Engineer - Cloud & Microservices
A leading technology solutions provider is seeking an experienced software developer to work on cloud migration and automation tools. The role primarily involves utilizing AWS, Azure, or GCP, with strong skills in Kubernetes, Docker, and microservices. Candidates should...
TechDigital Group
Dallas, TX
1 day ago
Cloud Site Reliability Engineer
$85 - $90 per hour
Join us to co-create solutions for a better future! Job Details Information Technology Cloud Site Reliability Engineer, Dallas, TX (Hybrid) Posted: 6/5/2026 Job ID: 64014 Job Category: Information Technology Position Type: Contract Duration: Long Term Remaining Positions...
Contract work
Work experience placement
Stefanini, Inc
Dallas, TX
4 days ago
Senior Cloud Platform & Site Reliability Engineering Lead
$136.88k - $200.75k
...good. Please note that we do not offer visa sponsorship for this position. ROLE SUMMARY The Senior Cloud Platform & Site Reliability Engineering Lead partners with business and technical stakeholders to lead cloud platform design, engineering, and integration...
Hourly pay
Full time
Work at office
Flexible hours
National Life Insurance Company
Addison, TX
13 hours ago
SRE/Support Developer (Production Engineering)
...growing company, we are building the infrastructure, tooling, and engineering culture to scale both our platform and our impact. The... ...of production support, software engineering, and site reliability. This is not a traditional support role. You will act as...
Work at office
Immediate start
3 days per week
Wellfit Technologies
Irving, TX
a month ago
Lead Pre-Construction Solutions Engineer - AV & Low Voltage
...high-performing, cohesive team. The Lead Pre-Construction Engineer – AV & Low Voltage is responsible for supporting new... ...cost while maintaining performance, compliance, and long-term reliability. Maintain and refine historical pricing, labor models, and...
Temporary work
For subcontractor
Local area
Flexible hours
Lockstep Technology Group
Dallas, TX
13 days ago
Lead SRE / DevOps Engineer: Observability & Resilience
$125k - $135k
...A global consulting firm is seeking a Lead Site Reliability Engineer (SRE) to enhance reliability and monitoring across their platforms. This mid-senior level position aims to improve service availability through SRE practices and incident management. Responsibilities...
Synechron
Dallas, TX
3 days ago
Lead SRE/DevOps Engineer
$125k - $135k
...$125,000.00/yr - $135,000.00/yr We are hiring Sr. Data Engineer with ITIL certification - NYC. Please share resumes to parimal... ...key global markets. We are seeking a highly skilled Lead Site Reliability Engineer (SRE) / DevOps Engineer to drive the reliability, observability...
Full time
Temporary work
Flexible hours
Synechron
Dallas, TX
3 days ago
DevOps / Site Reliability Engineer ID70127
...make an impact, and work with people who care, we'd love to meet you! ABOUT THE ROLE We are looking for a Senior Site Reliability Engineer to maintain operational resilience and 24/7 stability across a multi-cloud security program spanning Azure, AWS, and GCP....
Full time
Work at office
Remote work
Visa sponsorship
Work visa
Flexible hours
AgileEngine
Dallas, TX
1 day ago
Blockchain Site Reliability Engineer
...Job Position: Blockchain Site Reliability Engineer Location: Dallas, TX, USA (Remote Acceptable) Company: []( Contact: [ ****@*****.*** ]( About Company InfStones is an advanced, enterprise-grade Platform as a Service (PaaS) blockchain infrastructure provider...
Contract work
Remote work
Worldwide
InfStones
Dallas, TX
28 days ago
Software Systems Engineer - IV /Java Developer
...Software Systems Engineer - IV /Java Developer America Networks is a leading sensor and networking solutions partner for companies in any Industrial, Manufacturing, and Waste management space. We design and manufacture sensors for storage tanks, water metering, energy...
America Networks
Irving, TX
1 day ago
Release Engineer
...Consulting services in the US. We are actively seeking a Release Engineer for one of our clients in Dallas, TX. The role is hybrid.... ...version control systems, and other automation tools to support a reliable, scalable, and efficient deployment pipeline, while also working...
Rootshell Inc
Dallas, TX
1 day ago
Release Train Engineer
...Release Train Engineer Remote Contract to Hire Pay Rate: 70-80 per hour A growing technology organization focused on modernizing digital workflows and building scalable, secure platforms is seeking a Release Train Engineer to support its Agile delivery efforts...
Hourly pay
Contract work
Remote work
ConsultNet
Dallas, TX
14 days ago
Senior SRE Engineer - AKS, Terraform, Kubernetes, Azure
$90 per hour
CorGTA is seeking a Senior SRE Engineer in Dallas, Texas, to support production infrastructure. This role offers a contract to design and implement Kubernetes clusters, manage CI/CD pipelines, and ensure system observability using Azure and Terraform. Candidates should...
Hourly pay
Contract work
CorGTA
Dallas, TX
20 hours ago
RTE (Release Train Engineer)
...Role: Release Train Engineer (RTE) Location: Dallas, TX or Louisville, KY (Onsite) Duration: 6 Months Required Qualifications: Scaled Agile (SAFe) certification(s): RTE 6.0 (active license) Typically, 5 years related experience including 2 on large scale, technically...
Alrek Business Solutions
Dallas, TX
1 day ago
Release Train Engineer
$128.47k - $192.71k
...specific processes. • Leads a variety of types of business process design initiatives. • Assesses potential implications of re-engineering for multiple functions or departments. • Demonstrates mastery of re-engineering concepts, methods, and tools. Growth and...
Part time
Worldwide
Flexible hours
Caterpillar
Irving, TX
9 days ago
Senior SRE Engineer (AKS, Azure, Terraform, Kubernetes, and PowerShell.)
$85 - $90 per hour
Senior SRE Engineer (AKS, Azure, Terraform, Kubernetes, and PowerShell.) JOB ID - 7933 Role: Senior SRE Engineer Location: Dallas / Fort... ...$85-$90 per hour INC Structure: 8 Month contract *** 4 days on-site *** -- We have a great new opportunity to support one of our Consulting...
Hourly pay
Contract work
Work experience placement
CorGTA
Dallas, TX
20 hours ago
Asset & Wealth Management-Cloud SRE Engineer-Associate-Dallas
Job Description Cloud SRE Engineer - Associate Who We Look For Goldman Sachs Engineers are innovators and problem-solvers who thrive... ...-paced global environments. We are seeking a motivated Cloud Site Reliability Engineer (SRE) to support the WM Data Engineering ecosystem....
Goldman Sachs
Dallas, TX
4 days ago
Systems Test Engineer - Platform Verification
...supervised autonomy and driverless autonomy operations requires systems engineering practices like tracing product requirements to system... ...Regularly scheduled team building activities and social events both on-site, off-site & virtually. - As we grow, this list continues to...
Full time
Work at office
Work from home
Flexible hours
Waabi
Dallas, TX
28 days ago
Reliability Engineer HROC
$107.48k - $143.31k
...employees. What You'll Be Doing Lead and apply regional reliability engineering strategies to improve equipment performance, uptime, and... ...management skills with the ability to support multiple sites remotely. Willingness to travel to the plant and corporate...
Temporary work
Remote work
Flexible hours
51905 HROC LLC
Irving, TX
2 days ago
Senior .NET Developer (Financial Systems / Lending Platform)
...fintech environment supporting mission-critical lending and transaction systems . This is a unique hybrid role blending software engineering + deep Nortridge Loan System (NLS) expertise —with direct impact on production stability and system performance. What You’ll...
Staffing Technologies
Irving, TX
29 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer. Be the first to apply!