Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Site Reliability Engineer (Edge Services), Infrastructure Services

Apple

Role Number: 200663929-3956

Summary

We are seeking a proactive Site Reliability Engineer to champion the evolution of our production ecosystems. In this role, you will help drive the vision for our visibility, moving beyond simple uptime metrics to build a sophisticated, data-driven reliability framework. You will play a pivotal role in ensuring our services are resilient, scalable, and observable, bridging the gap between complex distributed systems and seamless user experiences.

Description

As a key member of the SRE team, your mission is to treat operations as a software problem. You will focus on designing and implementing a next-generation observability and alerting strategy that prioritizes high-cardinality data and meaningful signals over noise. You will spend your time building "self-healing" systems, reducing toil through aggressive automation, and partnering with development teams to bake reliability into the CI/CD pipeline. Your goal is to move us toward a proactive stance where performance bottlenecks are identified and mitigated before they impact the customer.

Minimum Qualifications

  • B.S. in Computer Science, Computer Engineering, or a related technical field, or equivalent professional work experience.

  • Understanding of Linux internals and deep networking expertise, including (QUIC), and You should be comfortable debugging protocol-level issues and optimizing traffic flow.

  • Proven ability to automate repetitive tasks and complex workflows using Python or Go

  • Experience configuring and managing modern monitoring suites (e.g., Prometheus, Grafana, ClickHouse) with a focus on creating actionable, high-signal quality alerting.

  • Grasp of Data Structures and Algorithms (DSA) to write efficient, performant code and troubleshoot complex system bottlenecks.

  • Practical knowledge of SLIs, SLOs, Error Budgets, Release Management and Incident Management to drive engineering priorities.

Preferred Qualifications

  • Experience managing cloud environments (AWS, GCP, or Azure) using Terraform, Ansible, or Pulumi.

  • Orchestration: Hands-on experience scaling and securing containerized workloads via Kubernetes.

  • A track record of leading "blameless post-mortems" and using those insights to harden the system against future failures.

  • Ability to consult with product teams on service design to improve long-term maintainability.

  • A proactive engineering mindset focused on shifting from "fixing things when they break" to "designing things so they don't break" (or so they fail gracefully).

  • Practical fluency in applying Generative AI tools within SRE and software engineering workflows — from accelerating observability query construction and alert design to building AI-assisted debugging and triage capabilities that encode institutional knowledge into repeatable, context-aware workflows — with the engineering rigour to validate, own, and iterate on AI-assisted outputs in production-adjacent contexts

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Site Reliability Engineer (Edge Services), Infrastructure Services in Sunnyvale, CA vacancy
  • $147.4k - $272.1k

    Site Reliability Engineer (Edge Services), Infrastructure Services Sunnyvale, California, United States Software and Services We are seeking a proactive Site Reliability Engineer to champion the evolution of our production ecosystems. In this role, you will help drive... 
    Suggested
    Relocation
    Shift work

    Apple Inc.

    Sunnyvale, CA
    18 hours ago
  •  ...Site Reliability Engineer, Enterprise Technology Services At Apple, groundbreaking ideas quickly transform into extraordinary products and services that...  ...through custom mechanisms. The role covers managing infrastructure, capacity planning, disaster recovery, and auto-... 
    Suggested
    Worldwide
    Relocation

    Apple

    Sunnyvale, CA
    1 day ago
  • $147.4k - $272.1k

    Site Reliability Engineer, Enterprise Technology Services Sunnyvale, California, United States Software and Services Imagine what we could do together. At...  ...identity management, factory and device support, infrastructure support, platform support, and collaboration tools... 
    Suggested
    Relocation

    Apple Inc.

    Sunnyvale, CA
    18 hours ago
  • $181.1k - $318.4k

    Sr Software Engineer (Infrastructure Applications), Infrastructure Services Sunnyvale, California, United States Software and Services Working with amazing people and awesome products not only makes your work meaningful, it drives you to make our world a better place.... 
    Suggested
    Work at office
    Relocation

    Apple Inc.

    Sunnyvale, CA
    8 hours ago
  • $198.3k - $342.8k

    Site Reliability Engineering Manager, eBusiness Services Sunnyvale, California, United States Software and Services Imagine what we could do together. At Apple...  ...in a Site Reliability Engineering, DevOps, or an Infrastructure‑focused role. Proficiency in one or more... 
    Suggested
    Relocation

    Apple Inc.

    Sunnyvale, CA
    4 days ago
  • $228.1k - $393.8k

    Site Reliability Engineering Manager, Storage - Apple Services Engineering Cupertino, California, United States Software and Services Are you a talented Engineering...  ...distributed storage technologies to Apple's infrastructure? At Apple, scale is huge and impact is enormous.... 
    Relocation

    Apple Inc.

    Cupertino, CA
    4 days ago
  • $165k - $242k

     ...Senior Software Engineer - Data Infrastructure Services Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers...  ...processing architecture and solve for scalability and reliability. Improve the performance, security, reliability, and... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    4 days ago
  • Apple Inc. is seeking a Site Reliability Engineer for its Enterprise Technology Services in Sunnyvale, California. In this role, you will collaborate with application teams to automate operations, optimize infrastructure, and ensure systems are reliable and high-performing... 

    Apple Inc.

    Sunnyvale, CA
    4 days ago
  • $190k - $220k

    LandingAI is building the infrastructure to make the world’s...  ...of the strongest AI Engineers and Machine Learning...  ...and production‑grade reliability. A data‑centric...  ...approach to model quality, edge cases, evaluation,...  ...powered applications and services, focusing on high... 
    Work at office

    LandingAI, Inc.

    Mountain View, CA
    1 day ago
  • $147k - $211k

     ...Experience developing large-scale infrastructure and distributed systems....  ...in building managed services, such as databases or storage...  ...About the job Google's software engineers develop the next-generation...  ...that leverage Google’s cutting-edge technology, and tools that... 

    Google Inc.

    Sunnyvale, CA
    3 days ago
  • $147k - $237.5k

     ...solving real‑world problems with cutting‑edge technology and bold thinking. Here,...  ...Job Summary Your Career: Strata Logging Service (SLS) powers advanced cybersecurity innovations...  ...or identifiable information. Its infrastructure is secured with industry‑standard best practices... 
    Full time
    Work at office
    Local area
    Visa sponsorship
    Work visa

    Palo Alto Networks, Inc.

    Santa Clara, CA
    1 day ago
  • $172.1k - $305.6k

     ...United States Software and Services The Apple Services Engineering team is one of the most...  ...solutions. The Service Reliability Engineering (SRE) team...  ...responsible for service infrastructure that ensures our customers...  ...project management for Site Reliability Engineering... 
    Relocation

    Apple Inc.

    Cupertino, CA
    18 hours ago
  •  ...in Santa Clara is seeking a skilled software engineer to develop a cloud-native stack for managing data infrastructures. This role focuses on building and shipping services around Kubernetes and engaging with cutting-edge technologies. The ideal candidate will have a... 

    NVIDIA Corporation

    Santa Clara, CA
    8 hours ago
  • $170.7k - $300.2k

     .../10/2023 Apple's iCloud SRE team is seeking a Service Reliability Engineer (SRE) to contribute to the design, development...  ...should have at least 3 years of experience in Site Reliability Engineering, DevOps, or infrastructure-focused roles, along with proficiency in supporting... 

    Career-Mover

    Cupertino, CA
    2 days ago
  • At NVIDIA, Site Reliability Engineering provides a rare chance to define, develop, and support large-...  ...engineering efforts to guarantee flawless service operation with consistent...  ...performant, and supportable. Background with infrastructure automation. Experience running... 

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $172.1k - $258.6k

    Site Reliability Engineer, Physical Infrastructure Cupertino, California, United States Software and Services We are looking for a creative and highly motivated Site Reliability Engineer to join our team. Having depth and breadth of knowledge working in physical infrastructure... 
    Worldwide
    Relocation

    Apple Inc.

    Cupertino, CA
    3 days ago
  • $123k - $190k

    About The HIL Platform and Services team is responsible for developing...  ...and scaling high-fidelity, reliable Hardware-in-Loop validation...  ...Reliability. Role As a Software Engineer on the HIL Platform and...  ...Will Give You a Competitive Edge (Preferred Qualifications) Experience... 
    Work experience placement
    Flexible hours

    General Motors

    Sunnyvale, CA
    18 hours ago
  • $172.53k - $201.38k

    Software Engineer III - Trust Service Team Develop and scale the Trust Service backend to support identity...  ...where we build and operate the core infrastructure that decides how much trust to...  ...captures what was verified and how reliably it was verified. Every access decision... 
    Contract work

    jobs.frontdoordefense.com - Jobboard

    Mountain View, CA
    18 hours ago
  • $181.1k - $318.4k

    Senior Software Engineer - Messaging Identity Services Cupertino, California, United States Software and Services...  ..., with each other in a secure, reliable, and privacy-protecting way. We...  ...to build messaging experiences and infrastructure that scale to the next billion... 
    Relocation

    Apple Inc.

    Cupertino, CA
    1 day ago
  • $90k - $176k

    About the team MongoDB Technical Services Engineers use their outstanding problem solving and customer service skills, along with their deep...  ...business pain points, application architectures, and Cloud infrastructure configurations Collaborate with enterprise customers to... 
    Permanent employment
    Work at office
    Local area
    Monday to Friday
    Flexible hours
    Shift work
    Weekend work

    Insider, Inc.

    Palo Alto, CA
    1 day ago
  • $138.9k - $256.5k

    As a software engineer on the Training Platform team, you have the following...  ..., and their web portal & web service components. Partner with data scientists...  ...deploying on a Kubernetes based infrastructure is required Experience running reliable production services with focus on... 
    Work experience placement
    Relocation

    Apple Inc.

    Cupertino, CA
    8 hours ago
  • $128.16k - $159.91k

    The Field Service Engineer provides high-level technical support and application...  ...application, process, or reliability issues across development...  ...20‑40%) to customer sites, fabs, or OSAT locations as...  ...competitive salary and leading‑edge work, Solstice Advanced Materials... 
    Permanent employment
    Temporary work
    Work experience placement
    Flexible hours

    Solstice Advanced Materials

    Santa Clara, CA
    2 days ago
  •  ...resilience for the infrastructure, systems, and...  ...Location: 5 on-site days a week in Sunnyvale...  ...Vision: Our Engineering team is driven by...  ...with a cutting-edge technology stack...  ...for designing new services and applications...  ...enhancing system reliability and scalability of... 
    Work experience placement
    Immediate start

    Illumio

    Sunnyvale, CA
    6 days ago
  • $245k - $295k

     ...only vertically integrated AI infrastructure company built from the...  ...center construction, and cloud services. If you want to do the...  ...Join Crusoe as a Senior Engineering Manager and lead a talented...  ...Models (LLMs) to build cutting-edge AI solutions within our Command... 
    Full time
    Temporary work

    Crusoe

    Sunnyvale, CA
    17 days ago
  • $147.4k - $220.9k

    Software Engineer, Full stack , Retail Engineering Apps & Services Sunnyvale, California, United States Software and Services Join us, the team that serves as...  ...knowledge of networking concepts & protocols (e.g. CDN, edge computing, load balancing, OSI model, etc.). At... 
    Relocation

    Apple Inc.

    Sunnyvale, CA
    3 days ago
  • $175k - $225k

     ...enterprises, CoreWeave combines superior infrastructure performance with deep technical...  ...and innovative Software Engineer of Network Services to lead the architecture, scaling,...  ..., drive innovation, and ensure the reliability, security, and scalability of the CoreWeave... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    3 days ago
  • $152k - $241.5k

    The NVIDIA DGXC Data Services team is developing a cloud-native stack of software services...  ...data across hybrid and multi-cloud infrastructures. We are building the next-generation...  ...Science, Information Systems, or Computer Engineering (or equivalent experience) with 5+... 

    NVIDIA Corporation

    Santa Clara, CA
    18 hours ago
  • $217.57k - $271k

    Staff Software Engineer - Trust Service Team Location: Mountain View, California, United States...  ...where we build and operate the core infrastructure that decides how much trust to extend...  ...captures what was verified and how reliably it was verified. Every downstream access... 
    Contract work

    jobs.frontdoordefense.com - Jobboard

    Mountain View, CA
    18 hours ago
  • $174k - $252k

    Senior Software Engineer, Embedded Systems/Firmware, AI and Infrastructure Sunnyvale, CA, USA Bachelor’s...  ...unparalleled scale, efficiency, reliability and velocity. Our...  ...of our cutting‑edge AI models, delivering unparalleled...  ...power to global services, and providing the essential... 
    Full time
    Worldwide

    Google Inc.

    Sunnyvale, CA
    2 days ago
  •  ...Software Engineer - Platform Infrastructure BREV/AN is at the forefront of revolutionizing how businesses...  ...development team to integrate cloud services and solutions, specifically within the...  ...passion for this position at the cutting edge of AI security. #J-18808-Ljbffr... 
    Flexible hours

    BREVIAN

    Sunnyvale, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer (Edge Services), Infrastructure Services. Be the first to apply!