Site Reliability Engineer (Edge Services), Infrastructure Services
Apple
Role Number: 200663929-0776
Summary
We are seeking a proactive Site Reliability Engineer to champion the evolution of our production ecosystems. In this role, you will help drive the vision for our visibility, moving beyond simple uptime metrics to build a sophisticated, data-driven reliability framework. You will play a pivotal role in ensuring our services are resilient, scalable, and observable, bridging the gap between complex distributed systems and seamless user experiences.
Description
As a key member of the SRE team, your mission is to treat operations as a software problem. You will focus on designing and implementing a next-generation observability and alerting strategy that prioritizes high-cardinality data and meaningful signals over noise. You will spend your time building "self-healing" systems, reducing toil through aggressive automation, and partnering with development teams to bake reliability into the CI/CD pipeline. Your goal is to move us toward a proactive stance where performance bottlenecks are identified and mitigated before they impact the customer.
Minimum Qualifications
B.S. in Computer Science, Computer Engineering, or a related technical field, or equivalent professional work experience.
Understanding of Linux internals and deep networking expertise, including (QUIC), and You should be comfortable debugging protocol-level issues and optimizing traffic flow.
Proven ability to automate repetitive tasks and complex workflows using Python or Go
Experience configuring and managing modern monitoring suites (e.g., Prometheus, Grafana, ClickHouse) with a focus on creating actionable, high-signal quality alerting.
Grasp of Data Structures and Algorithms (DSA) to write efficient, performant code and troubleshoot complex system bottlenecks.
Practical knowledge of SLIs, SLOs, Error Budgets, Release Management and Incident Management to drive engineering priorities.
Preferred Qualifications
Experience managing cloud environments (AWS, GCP, or Azure) using Terraform, Ansible, or Pulumi.
Orchestration: Hands-on experience scaling and securing containerized workloads via Kubernetes.
A track record of leading "blameless post-mortems" and using those insights to harden the system against future failures.
Ability to consult with product teams on service design to improve long-term maintainability.
A proactive engineering mindset focused on shifting from "fixing things when they break" to "designing things so they don't break" (or so they fail gracefully).
Practical fluency in applying Generative AI tools within SRE and software engineering workflows — from accelerating observability query construction and alert design to building AI-assisted debugging and triage capabilities that encode institutional knowledge into repeatable, context-aware workflows — with the engineering rigour to validate, own, and iterate on AI-assisted outputs in production-adjacent contexts
$132.1k - $244.6k
Site Reliability Engineer (Edge Services), Infrastructure Services Elk Grove, California, United States Software and Services We are seeking a proactive Site Reliability Engineer to champion the evolution of our production ecosystems. In this role, you will help drive...SuggestedRelocationShift work$154.6k - $274.9k
...United States Software and Services Do you want to help build some... ...T) organization.IS&T is the engine behind everything Apple does... ...what you could do here.Infrastructure Services is part of IS&T and... ...for cost, performance, and reliability. Lead cluster lifecycle management...SuggestedRelocation$128.3k - $193.2k
Telecom Expense Management Engineer, Infrastructure Services job at Apple Inc.. Elk Grove, CA. Apple is where individual imaginations capture together, contributing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience...SuggestedWork experience placementWorldwideRelocation$99.6k - $234.6k
...Overview Oracle Cloud Infrastructure (OCI) is building Oracle Video @ Edge (OVE), a next-... ...distributed systems-focused engineering team Responsibilities... ...for low latency and high reliability Collaborate cross-... ...across our products and services, we help customers turn...SuggestedTemporary workFlexible hours$78.4k - $129.4k
...SharePoint environments that host collaboration sites, content repositories, and business... ..., site collections, and supporting web services to ensure availability, performance, and... ...‑services landscape, collaborating with infrastructure teams as needed. Monitor performance,...SuggestedContract workWork at office$86.5k - $142.7k
...proofs‑of‑concept, and guiding engineering teams through complex... ...Digital Engineering Managed Services. Hands‑on solution architecture... ...‑functional architecture & reliability Define performance,... ...working directly in code, infrastructure and pipelines—not just producing...Summer holidayFlexible hours$86.5k - $142.7k
...proofs‑of‑concept, and guiding engineering teams through complex... ...Digital Engineering Managed Services. Your key responsibilities... ...‑functional architecture & reliability • Define performance,... ...working directly in code, infrastructure and pipelines—not just producing...Summer holidayFlexible hours$126.2k - $207.33k
...the better. We believe building engineering is more than systems and... ...of HDR’s Building Engineering Services Group, you’ll help design the critical infrastructure that supports the digital age and... ...Regular BusinessClass Building Site Civil Job Posting Mar 21, 2026...Full timeContract workTemporary work$45 - $65 per hour
...seeking a Consulting GIS Specialist - Service Center Manager and Site Supervisor/Application Developer to... ...staff, manage enterprise GIS infrastructure, and develop custom applications that... ...management Collaborate with planners, engineers, scientists, and IT staff to...Hourly payFull timeContract workLocal areaRemote work$99.6k - $223.4k
...Description OCI (Oracle Cloud Infrastructure) AI Infrastructure is at the... ...of building a cutting-edge, ultra-high-performance GPU... ...automation, and diagnostic services. These are essential for running... ...Looking For: Adaptable Engineers: Self-motivated individuals...Temporary workFlexible hours$109.5k - $150.55k
...Renaissance is looking for an experienced Sr Site Reliability Engineer to be part of the Engineering... ...with a focus on Application and Infrastructure Availability, Reliability, Observability... ...Recovery exercises. Implementing service level objectives (SLO/SLI/SLA's) &...For contractorsLocal areaRemote workWorldwideWork visaFlexible hoursWeekend work$83k - $187k
...Description We are looking for a Senior Site Reliability Engineer to join our OCI team. This role is... ..., triaging, and mitigating OCI service-impacting events as quickly as possible... ...Solve complex problems related to infrastructure cloud services and automate common...Temporary workWork experience placementFlexible hours- ...Senior Site Reliability Engineer Location: West Lake, CA or Carrolton, TX (ONSITE) FTE ONLY... ..., Lambda, VPC, IAM, CloudWatch). Infrastructure as Code (IaC): Strong proficiency with... ...CloudWatch, or similar. Define and track Service Level Objectives (SLOs) and Service...Permanent employment
$86.5k - $142.7k
...to go. Join EY and help to build a better working world. Job Summary As a Senior Consultant within EY’s Digital Engineering Managed Services team, you will design and build scalable full-stack applications for Media & Entertainment platforms, including...Summer holidayFlexible hours- Quest Technology Management is looking for a proactive Site Reliability Engineer in Elk Grove, California. In this role, you will champion the... ...production ecosystems, ensuring resilience and scalability of services. You will design observability and alerting strategies,...
$99.6k - $234.6k
...Job Description As a Principal Site Reliability Engineer, you will play a pivotal role in building... ...operate highly reliable, scalable infrastructure that supports Commercial and Federal... ...) team to take shared ownership of services and platform components. Develop a strong...Temporary workFlexible hours$119k - $170k
...Role We are looking for a Staff Site Reliability Engineer to join our team. This is a hybrid (... ...the Zscaler production data center services, including servers, operating systems... ...experience in software engineering, infrastructure software, and/or platform engineering...Permanent employmentFull timeWork at officeLocal areaRemote work3 days per week- Agilent Technologies is looking for a dedicated Field Service Engineer based in California, preferably near San Francisco or Los Angeles. In... ...you will support security and detection systems at customer sites, requiring extensive travel (75% overnight). The ideal candidate...Full timeRemote workNight shift
- Sikich LLC is seeking an AWS Connect and Salesforce Service Cloud Voice Developer/Administrator to collaborate closely with clients in Sacramento. The ideal candidate will configure and enhance solutions on Salesforce to improve operations. This position requires a minimum...Flexible hours
$83k - $95k
...Controls Commissioning - Lead Field Service Technician here at Honeywell,... ...' buildings (commercial sites). In this role, you will have... ...Install, configure, and test pre-engineered software for control systems... ...performance-driven salary, cutting-edge work, and developing solutions...Temporary workFor contractorsWork experience placementLocal areaFlexible hours$110k - $140k
...are seeking a highly skilled Network Engineer to join our Managed Service Provider (MSP) support team. The... ...implementing, and managing complex network infrastructures with a particular focus on Palo... ...Juniper Mist and Meraki, including site surveys, RF planning, and...Remote workFlexible hoursAfternoon shift$102k - $125k
...value, convenience and exceptional service to our members. Job Title Host Systems Engineer I Position Details Status:... ...Mgr - IT Systems Department: Infrastructure Systems Job Code: 12016 Pay Range... ...contributing to a culture of reliability and compliance. What You'll Do...Remote job$45 - $65 per hour
Dewberry seeks a Consulting GIS Specialist for a role based in Sacramento, CA. This position involves leading a GIS Service Center, managing staff, and delivering advanced geospatial services for federal clients. The ideal candidate will have significant experience in GIS...Hourly pay$160k
AWS Connect and Salesforce Service Cloud Voice Developer/Administrator - Staff Mandatory Qualifications (SMQs) AWS Connect and Salesforce... ...for this experience. Current AWS Certified CloudOps Engineer - Associate Certification required. Strong communication and collaboration...Full timeLocal areaFlexible hours$91.27k - $114.09k
...! Job Description Summary: The Manager, Professional Services Engineering, leads a team of experienced engineers specializing in enterprise... ...complex business challenges by powering their critical infrastructure, business processes, and data. We help extend the value of...Remote workWorldwideFlexible hours$160k
Position Summary Sikich is seeking an AWS Connect and Salesforce Service Cloud Voice Developer/Administrator to join our team. In this... ...substituted for this experience. Current AWS Certified CloudOps Engineer - Associate Certification required. Strong communication and...Full timeFlexible hours- A leading solutions provider in electrical services is seeking a Technical Service Manager in Sacramento, CA. The candidate will oversee operations, ensuring client satisfaction and high-quality standards in technical services. Responsibilities include mentoring staff,...Hourly payFull time
$102.5k - $187.9k
Location: Anywhere in Country Overview Oracle Services practice assists our national consulting practices in planning, pursuing, delivering and managing large, complex full lifecycle initiatives along with providing experience in leading practices, methods, and resources...Summer holidayFlexible hours- Los Rios Community College District is seeking an IT Technical Services Supervisor who will be responsible for planning, organizing,... ...Services Team activities within various fields such as Network Engineering, Cybersecurity, and Cloud Services. This role requires significant...
- Los Rios Community College District is seeking an IT Technical Services Supervisor to lead IT operations, focusing on network engineering, cybersecurity, and cloud services. This role requires a bachelor's degree in a relevant field and supervisory experience. The salary...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Site Reliability Engineer (Edge Services), Infrastructure Services. Be the first to apply!
- site leader Elk Grove, CA
- site safety Elk Grove, CA
- on-site clinical research associate (traveling/remote) Elk Grove, CA
- IT site lead Elk Grove, CA
- site reliability engineer remote
- site reliability engineer
- lead site reliability engineer
- junior site reliability engineer
- site reliability engineer sre
- site reliability engineering manager

