Site Reliability Engineer
$60 - $65 per hourZealogics.com
Role Overview
The Site Reliability Engineer will support Cyber Data Risk & Resilience by ensuring the reliability, availability, performance, and operational visibility of critical cybersecurity platforms and services. This role is responsible for keeping production systems running, instrumenting infrastructure and application layers, building meaningful monitoring and actionable alerting, supporting incident response, and continuously improving dashboards used by engineering, operations, risk, and executive stakeholders.
Responsibilities
Maintain and improve the reliability, availability, scalability, and performance of cybersecurity platforms, services, and supporting infrastructure
Support day-to-day operational stability by monitoring system health, identifying risks, responding to incidents, and driving timely resolution of service-impacting issues
Instrument infrastructure, applications, services, APIs, data pipelines, and cloud components to provide end-to-end visibility into system behavior and service health
Design, build, and continuously refine monitoring, alerting, logging, tracing, and observability capabilities across distributed systems and cloud environments
Develop meaningful and actionable alerts that reduce noise, improve signal quality, and enable teams to respond quickly to emerging issues
Define and track key reliability metrics, including availability, latency, throughput, error rates, saturation, service-level indicators, service-level objectives, and operational risk indicators
Build, maintain, and enhance dashboards for engineering, operations, product, risk, and executive stakeholders, ensuring information is accurate, timely, and decision-ready
Continuously modify and improve executive dashboards to support regular leadership reviews of service health, reliability trends, incidents, risks, and operational performance
Partner with engineering, cybersecurity, infrastructure, cloud, and application teams to identify reliability gaps and implement long-term improvements
Participate in incident response, root-cause analysis, problem management, and post-incident reviews to prevent recurrence and improve operational maturity
Automate operational tasks, health checks, reporting, deployment validation, and recovery procedures to improve efficiency and reduce manual effort
Collaborate with application and platform teams to embed reliability, monitoring, and supportability requirements into the software development lifecycle
Support CI/CD, DevOps, and release management practices by validating operational readiness, monitoring coverage, rollback plans, and production support requirements
Contribute to resiliency engineering efforts, including capacity planning, performance tuning, failover validation, disaster recovery readiness, and chaos/resilience testing where applicable
Ensure monitoring, alerting, dashboards, and operational processes align with enterprise security, risk, compliance, and governance standards
Required Qualifications
7 to 10+ years of experience in site reliability engineering, systems engineering, software engineering, DevOps, infrastructure engineering, or production operations
Strong experience supporting highly available, distributed, cloud-based, or mission-critical technology platforms
Hands-on experience with observability practices, including monitoring, alerting, logging, metrics, tracing, dashboards, and service health reporting
Experience instrumenting applications, services, APIs, infrastructure, databases, and cloud components to enable end-to-end operational visibility
Strong understanding of reliability engineering concepts, including SLIs, SLOs, SLAs, error budgets, incident management, capacity management, and operational readiness
Experience designing actionable alerts that support rapid issue detection, triage, escalation, and resolution
Experience building and maintaining operational dashboards for technical teams, support teams, and senior/executive stakeholders
Strong scripting or programming skills using Python, Java, Bash, PowerShell, or similar languages for automation and operational tooling
Experience with cloud platforms such as AWS, Azure, or GCP
Experience with Infrastructure-as-Code tools such as Terraform or similar technologies
Experience working with CI/CD pipelines, DevOps workflows, release processes, and production support models
Experience troubleshooting distributed systems, REST services, event-driven architectures, messaging platforms, and service-to-service integrations
Familiarity with relational and non-relational databases, such as PostgreSQL, MSSQL, MongoDB, or similar platforms
Strong analytical, troubleshooting, and problem-solving skills with the ability to diagnose complex technical issues across multiple layers of the stack
Strong written and verbal communication skills, including the ability to translate technical issues into clear business and executive-level updates
Preferred Skills
Experience supporting cybersecurity, risk, resilience, compliance, or enterprise security platforms
Experience with observability and monitoring tools such as Splunk, Grafana, Prometheus, Datadog, Dynatrace, New Relic, Azure Monitor, CloudWatch, OpenTelemetry, or similar platforms
Experience creating executive-level service health dashboards, reliability scorecards, operational risk reporting, or incident trend reporting
Experience developing automated health checks, synthetic monitoring, service dependency maps, and operational runbooks
Experience with incident response, major incident management, postmortems, root-cause analysis, and problem management practices
Experience with containerized and cloud-native environments, including Kubernetes, Docker, serverless services, or managed cloud platforms
Experience with distributed messaging or streaming platforms such as Apache Kafka
Familiarity with cloud-native security, governance, and policy tooling such as Azure Policy, AWS SCP, GCP constraints, or related controls
Familiarity with Cloud Security Posture Management tools such as Wiz, Prisma, CloudGuard, or similar platforms
Experience with cloud-based AI services such as Azure AI, AWS Bedrock, or Google Vertex AI, particularly from an operational monitoring, reliability, or governance perspective
Experience supporting Linux and Windows environments through scripting, automation, monitoring, and operational troubleshooting
Exposure to web technologies, APIs, front-end services, or user-facing application monitoring
Additional Skills
Strong ownership mindset with a focus on operational excellence and service reliability
Ability to operate effectively in fast-paced, production-focused environments with minimal supervision
Strong ability to prioritize issues based on customer impact, business risk, service criticality, and operational urgency
Effective collaboration skills across engineering, operations, cybersecurity, infrastructure, risk, and executive stakeholder groups
Ability to communicate service health, operational risks, incidents, and reliability trends clearly to both technical and non-technical audiences
Proactive and continuous-improvement mindset with a focus on automation, simplification, resilience, and measurable outcomes
Strong attention to detail when building dashboards, defining metrics, tuning alerts, and preparing executive-level operational reporting
Rate range -$60-$65
Vacancy posted 8 hours ago
Similar jobs that could be interesting for youBased on the Site Reliability Engineer in Alpharetta, GA vacancy
$129k - $161k
...Job Description Job Description Job title: Senior Site Reliability Engineer Reports to: Director, Site Reliability Engineering Department: Cloud Platforms Location: Remote Grade: 20 About Priority: Priority Technology Holdings, Inc. is a leading...SuggestedRemote work- ...security to responsibly propel the global lottery industry ever forward. Position Summary We are looking for a skilled Site Reliability Engineer (SRE) to enhance the stability, performance, and reliability of our production systems. The SRE will work closely with...SuggestedPermanent employmentWork experience placementLocal area
- ...your work. You are visible, your talents are valued, and you are empowered to shape the future of payments. As a Principal Site Reliability Engineer in Norcross, GA or Omaha, NE , you will join a diverse, passionate team, dedicated to powering the world’s payments...SuggestedWork at officeLocal areaWorldwide
- ...shape the future of our communities. This is a Software Engineering position at Director level, which is part of the job family... ...businesses. This role is for an experienced and driven Site Reliability Engineer (SRE) to join our AI Platform team to help support...SuggestedFull time
- ...Software Systems Engineer - IV America Networks is a leading sensor and networking solutions partner for companies in any Industrial, Manufacturing, and Waste management space. We design and manufacture sensors for storage tanks, water metering, energy metering, gas...Suggested
$63.22k - $94.55k
...maintain Windows Server environments with a focus on performance, reliability, and scalability Develop and maintain PowerShell scripts... ...excellence and process optimization Mentor junior engineers and contribute to knowledge sharing across the team Required...Remote work$29.75 - $35 per hour
...Job Title Mobile Building Engineer Job Description Summary The position involves maintaining and repairing HVAC, plumbing, electrical, and building mechanical systems to ensure maximum efficiency of building systems. The role requires expertise in various types...Hourly payMinimum wageApprenticeshipWork at officeLocal areaFlexible hoursShift workWeekend workAfternoon shift- ...Lead Building Engineer Alpharetta, GA The Lead Operating Engineer is responsible for the HVAC system and all mechanical equipment within the building. The position works very closely with the Chief Engineer to ensure that the building systems are functioning properly...For contractors
- ...Job Description Job Description The Lead Operating Engineer is responsible for the HVAC system and all mechanical equipment within the building. The position works very closely with the Chief Engineer to ensure that the building systems are functioning properly....For contractorsWork at office
$105k - $160k
...Job Purpose and Impact The Senior Professional, Platform Engineering job designs, develops and maintains digital technology infrastructure... ...to automate the deployment process, ensuring smooth and reliable releases. COLLABORATION: Partners with cross functional...Work experience placement$100k - $150k
...technologies to create scalable, secure, and user-friendly applications. As we continue to grow, we’re looking for a skilled Site Reliability Engineer (SRE) to join our dynamic team and contribute to our mission of transforming business processes through technology. This...Full timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa- ...Job Title: OpenShift Platform Engineer Location: Alpharetta, GA (Hybrid - Partially Onsite) Job Description: We are seeking an experienced OpenShift Platform Engineer to support, manage, and enhance enterprise-grade Red Hat OpenShift environments...
$55k - $68.5k
...and technologies in UX design to continuously improve design processes and outcomes. Qualifications Bachelors - Computer Engineering, Bachelors - Computer Science, Bachelors - Information Technology, Masters - Software Engineering Certifications AZ-204:...Remote workShift work$69.9k - $111.7k
...optical networking systems. This role plays a key part in building reliable, high-performance embedded software that enables scalable and... ...to complex systems development within a collaborative engineering environment. How you will make an impact: Design, implement...Flexible hours$44k - $185k
...S&TO, Supply Chain, and Infrastructure Engineering. As part of this collaborative environment... ...growth. Your work ensures high-quality, reliable data infrastructure that supports Cisco'... ...insurance. Please see the Cisco careers site to discover more benefits and perks....Full timeTemporary workApprenticeshipInternshipLocal areaFlexible hours$85.39k - $116.98k
...Syms Strategic Group (SSG) is seeking a talented Senior Systems Engineer (Angular) Location: Remote Department: Veterans Affairs... ...services to deliver live and historical EDI transaction data reliably and performantly Support and contribute to an Angular-based...Full timeRemote work- ...QA Engineer – Terabit (Networking / ISP Platform) We are looking for a skilled QA Engineer to ensure the quality, performance, and reliability of Terabit-scale networking and ISP platforms. The role involves testing APIs, backend systems, monitoring tools, and high...
- ...Job Title: Senior Software Engineer - UDDI DNS Job ID: 85763 Location: Alpharetta, Georgia What you will be doing: Develop... ...of Infoblox's Platform Recommend ways to improve system reliability, efficiency, and quality Work closely with various cross-...
$98.5k - $134.24k
...Job title: Software Engineer Reports to: Director, Engineering Department: Development Location: Alpharetta, GA or Remote... ...software in an agile environment. You will focus on delivering reliable, well-tested solutions while strengthening platform stability,...Contract workRemote workShift work$102.3k - $147.05k
...Because at UKG, your work matters-and so do you. The Software Engineer III- Eng is a mid-level role within UKG's Payroll engineering... ...at scale and must meet strict standards for correctness, reliability, and compliance. You will work independently on moderately complex...Temporary work- ...Work closely with DevOps and cloud infrastructure architects and engineers to design, implement and manage secure, scalable and reliable cloud infrastructure environments for Foghorn customers. Propose and implement cloud infrastructure transformation and automation based...
$110k - $186k
...Senior Software Engineer - C Programming Calling all innovators - find your future at Fiserv... ...millions of times a day - quickly, reliably, and securely. Any time you swipe your credit... ...about this role: This role is on-site Monday through Friday. Fiserv considers in...Full timeContract workTemporary workFor contractorsH1bWork at officeLocal areaMonday to Friday- ...the purpose of providing basic information about technical designs and system requirements Note: this is an entry level software engineer position Preferred Qualifications: Knowledge of specific applications, systems, or business segments Solid knowledge of...Full timeTemporary workShift workDay shift
- ...named among the 50 best-managed firms in the nation, this is the firm for you. Doeren Mayhew is seeking a Senior Software Engineer . This position is available in Troy, Michigan, Atlanta, Georgia or Dallas, Texas. Responsibilities: Act as a...
- ...Overview: We hire the best employees to serve our customers. We are looking for an experienced Software Developers and Engineers who has a passion for building software solutions, asking questions to solve customer problems and is comfortable working in a product...Full timeWork at officeLocal areaImmediate start
- ...Science, or related field. Foreign equivalent accepted. Must have 5 years of experience in job offered or as a Software Engineer, Technical Lead, or any combination thereof performing the following: Microsoft BI technology stack. Design, development, and...Local area
$80.5k - $174.3k
...JOB DESCRIPTION We are seeking a Senior Salesforce Engineer to design, build, and scale enterprise-grade solutions on the Salesforce platform. This role requires strong technical depth, architectural thinking, and sound judgment in choosing between low-code, no...Local areaVisa sponsorshipWork visa- ...Job Role - Sr. Software Engineer Location: ALPHARETTA, Georgia ,United States,30009 Position Type - Contract Mandatory Skills: Java Full stack. OpenShift, API/Web service, Azure integration, SQL Server Job Description: The Sr. Application Developer...Contract work
- ...Software Engineer Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop... ...drive engineering excellence in areas of security, quality, reliability, performance, cost optimization, efficiency Generative AI:...Full timeImmediate startShift work
$152.13k - $154.2k
...Senior Software Engineer Employer: Delta Dental Insurance Company Location: 1130 Sanctuary Parkway, Alpharetta, GA 30009; Must... ...authentication, and data encryption. Optimize the performance, reliability, and scalability of both ReactJS and Spring Boot applications...Work at officeLocal areaVisa sponsorshipWork visa
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Site Reliability Engineer. Be the first to apply!
Related searches
- site leader Alpharetta, GA
- site safety Alpharetta, GA
- on-site clinical research associate (traveling/remote) Alpharetta, GA
- junior website developer Alpharetta, GA
- site services specialist Alpharetta, GA
- IT site lead Alpharetta, GA
- lead site reliability engineer
- site reliability engineer remote
- site reliability engineer sre
- site reliability engineer


