Site Reliability Engineer (Edge Services), Infrastructure Services
Apple
Role Number: 200663929-3956
Summary
We are seeking a proactive Site Reliability Engineer to champion the evolution of our production ecosystems. In this role, you will help drive the vision for our observability, moving beyond simple uptime metrics to build a sophisticated, data-driven reliability framework. You will play a pivotal role in ensuring our services are resilient, scalable, and observable, bridging the gap between complex distributed systems and seamless user experiences.
Description
As a key member of the SRE team, your mission is to treat operations as a software problem. You will focus on designing and implementing a next-generation observability and alerting strategy that prioritizes high-cardinality data and meaningful signals over noise. You will spend your time building "self-healing" systems, reducing toil through aggressive automation, and partnering with development teams to bake reliability into the CI/CD pipeline. Your goal is to move us toward a proactive stance where performance bottlenecks are identified and mitigated before they impact the customer.
Minimum Qualifications
B.S. in Computer Science, Computer Engineering, or a related technical field, or equivalent professional work experience.
Understanding of Linux internals and deep networking expertise, including QUIC. You should be comfortable debugging protocol-level issues and optimizing traffic flow.
Proven ability to automate repetitive tasks and complex workflows using Python or Go.
Experience configuring and managing modern monitoring suites (e.g., Prometheus, Grafana, ClickHouse) with a focus on creating actionable, high-signal alerting.
Grasp of Data Structures and Algorithms (DSA) to write efficient, performant code and troubleshoot complex system bottlenecks.
Practical knowledge of SLIs, SLOs, Error Budgets, Release Management and Incident Management to drive engineering priorities.
Preferred Qualifications
Experience managing cloud environments (AWS, GCP, or Azure) using Terraform, Ansible, or Pulumi.
Orchestration: Hands-on experience scaling and securing containerized workloads via Kubernetes.
A track record of leading "blameless post-mortems" and using those insights to harden the system against future failures.
Ability to consult with product teams on service design to improve long-term maintainability.
A proactive engineering mindset focused on shifting from "fixing things when they break" to "designing things so they don't break" (or so they fail gracefully).
Practical fluency in applying Generative AI tools within SRE and software engineering workflows, from accelerating observability query construction and alert design to building AI-assisted debugging and triage capabilities that encode institutional knowledge into repeatable, context-aware workflows, with the engineering rigour to validate, own, and iterate on AI-assisted outputs in production-adjacent contexts.
$147.4k - $272.1k


