Site Reliability Engineer
Analytic Partners Inc
Analytic Partners is a global leader in commercial measurement and optimization, turning data into expertise for the world’s largest brands for almost 25 years. Our holistic approach to decisioning is powered by our industry-leading platform and team of experts, who help leaders make better decisions, faster – unlocking business growth and creating powerful customer connections. With clients in 50+ countries and global offices across New York City, Miami, Dallas, Dublin, London, Paris, Singapore, Shanghai, Munich, Poznan, Sydney, Melbourne, Charlottesville and Denver, we’re growing fast. And we’re looking for top talent to join us in shaping the future of analytics. To learn more about what we do, visit analyticpartners.com – and see why we’re recognized as a Leader in the industry by independent research firms Forrester and Gartner.

What You’ll Be Doing
- Own the Internal Developer Platform (IDP) as a product, treating engineering teams as customers and optimizing for reliability, usability, and delivery velocity.
- Define and execute a platform roadmap aligned with business priorities, developer needs, and long‑term scalability.
- Design, build, and evolve paved roads for application delivery, including CI/CD pipelines, infrastructure templates, service scaffolding, and standardized deployment patterns.
- Build self‑service capabilities that enable teams to provision, deploy, observe, and operate services with minimal friction.
- Create and maintain reusable platform abstractions across AWS and Azure that standardize security, reliability, networking, and observability.
- Reduce developer cognitive load by abstracting unnecessary complexity while enforcing clear guardrails for security, cost, and compliance.
- Partner closely with application, product, and security teams to embed reliability, scalability and security by design.
- Establish and evolve platform standards for logging, monitoring, alerting, tracing, and incident response workloads.
- Define, measure, and manage SLIs, SLOs, and error budgets for shared platform services.
- Drive the reduction of operational toil through automation, standardization and platform‑first solutions.
- Ensure shared platform services meet high standards for availability, performance, resilience, and scalability.
- Own system‑to‑system integration and messaging patterns used across the platform.
- Lead capacity planning, demand forecasting and performance tuning for platform services.
- Plan and execute zero‑downtime upgrades, migrations and releases of platform components.
- Lead platform‑level incident response workflows, post‑incident reviews and drive systemic improvements rather than one‑off fixes.
- Evaluate incoming platform requests and translate them into scalable, productized capabilities.
- Mentor engineers and drive platform adoption through documentation, enablement and technical evangelism.
- Participate in a 24x7 on‑call rotation as an escalation point for platform reliability and availability issues.
- Operate effectively in ambiguous problem spaces, making sound architectural and product decisions with limited guidance.

What We Look For In You
- Bachelor’s degree in Computer Science or equivalent practical experience.
- 4+ years of experience in Platform Engineering, Site Reliability Engineering, DevOps or Systems Engineering roles.
- Strong expertise in Linux and Windows operating systems.
- Advanced automation and scripting skills using Python, Bash or PowerShell.
- Deep, hands‑on experience designing and operating AWS and Azure platforms at scale.
- Strong experience building and operating CI/CD platforms (Jenkins, GitHub Actions or equivalent).
- Strong experience with Infrastructure as Code and configuration management (Terraform, CloudFormation, ARM or similar).
- Production experience with containerized and orchestration platforms such as Docker and Kubernetes.
- In‑depth experience with the HashiCorp ecosystem (Nomad, Consul, Vault).
- Strong understanding of distributed systems, cloud‑native architectures and reliability patterns.
- Experience designing and operating observability platforms (e.g., Splunk, Sumo Logic or similar).
- Familiarity with security and compliance practices, including vulnerability scanning and enterprise security tooling.
- Strong understanding of the software delivery lifecycle, release engineering and platform lifecycle management.
- Experience working in Agile / DevOps environments with a strong product mindset.
- Demonstrated ability to influence without authority, set standards and drive adoption across teams.
- Excellent communication skills, able to translate platform capabilities into clear developer value.
- Strong problem‑solving skills with a bias toward durable, scalable solutions over short‑term fixes.
- A mindset of continuous improvement, curiosity and learning.
- Comfortable supporting a global, follow‑the‑sun operation when needed.

How We Measure Success
- Strong developer adoption and satisfaction with the platform (DX).
- Reduced deployment friction, lead time and operational toil.
- Platform reliability and performance meeting or exceeding defined SLOs.
- Consistent, high‑quality service delivery across engineering teams.
- Reduced incident frequency and severity driven by systemic platform improvements.
- Increased standardisation, automation and self‑service adoption across the organization.

Our differentiator is – Our People! We hire the brightest talent and develop them into leaders. We foster a culture of PEOPLE, PASSION and GROWTH.
- People: We value our people, customers and partners.
- Passion: We love what we do.
- Growth: Unlimited growth means unlimited potential.

AP is a customer‑focused, team‑oriented organisation where innovation and results are rewarded and individuals can chart their own careers. As a woman‑founded and led company, this has meant supporting a meritocracy where everyone has opportunities to achieve their best and we foster an environment of diversity, equity and inclusion. In practice this means we will not only work to recruit a diverse workforce, but also maximise the full potential of all our people. You can read more about our commitment to DEI.



