Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior SRE — Cloud Observability & Reliability Lead

$126k - $204.5k

Palo Alto Networks

Palo Alto Networks, Inc. is seeking a skilled DevOps/SRE engineer to join their Cortex team in Santa Clara, California. This role involves operating and maintaining large-scale GCP environments and requires expertise in observability tools such as Thanos, Prometheus, and Grafana. The ideal candidate should have over 5 years of experience, strong skills in cloud technologies, and a passion for high reliability. Compensation ranges from $126,000 to $204,500 annually, depending on experience and qualifications. #J-18808-Ljbffr Palo Alto Networks, Inc.

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Senior SRE — Cloud Observability & Reliability Lead in Santa Clara, CA vacancy
  • $120k - $145k

    Fortinet, Inc. is seeking a Staff SRE to scale FortiSASE’s cloud infrastructure. The ideal candidate will have...  ...systems. Responsibilities include leading initiatives across teams, optimizing performance, and improving reliability. The position offers a salary range of... 
    Cloud
    Senior

    Fortinet, Inc.

    Sunnyvale, CA
    3 days ago
  • $176k - $276k

    Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain...  ...and deployment and open source cloud enabling technologies like Kubernetes...  ...reliability aspects of large scale Observability & Telemetry collection platform with... 
    Cloud
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • donato technologies is seeking a Senior SRE / DevOps Engineer in Sunnyvale, CA. The successful candidate will focus on ensuring system reliability and scalability while automating operations across all teams. Candidates should have over 8 years of experience in DevOps,... 
    Cloud
    Senior

    donato technologies

    Sunnyvale, CA
    4 days ago
  • NVIDIA Corporation is looking for a Senior Systems Software Engineer (SRE) in Santa Clara, California. This role focuses on designing...  .... Key responsibilities include ensuring GPU cloud services run with maximum reliability, participating in service lifecycles, and leveraging... 
    Cloud
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $201.6k - $302k

     ...The Role: As the Senior Engineering Manager...  ...Hybrid Services & Reliability (HSR) within AV Core...  ...trust. You will lead a newly seeded team...  ...of the hybrid cloud systems that underlie...  ...Reliability Engineering (SRE) and defining SLO/...  ...view on automated observability, incident response... 
    Cloud
    Senior
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    3 days ago
  • $151.6k - $245.3k

    Palo Alto Networks, Inc. seeks a Principal Site Reliability Engineer in Santa Clara, CA. The role involves driving SRE and DevOps initiatives, architecting scalable solutions...  ...in AI productivity tools, and expertise in cloud-native application development on GCP or AWS. A... 
    Cloud

    Palo Alto Networks, Inc.

    Santa Clara, CA
    4 days ago
  •  ...obsessed**, and results-oriented Senior Product Manager to drive the **core reliability platforms and services** that...  ...Developer Tools, Platform Engineering, SRE, Observability, or a related technical field....  ...in the developer tools, cloud infrastructure, or observability... 
    Cloud
    Local area
    Flexible hours

    GEICO

    Palo Alto, CA
    1 day ago
  • $146.58k - $229.6k

     ...results‑oriented Senior Product Manager to drive the core reliability platforms and services...  ...strategy for the Observability, BCDR & Incident...  ...engineering team. Leading cross‑functional teams...  ...Engineering, SRE, Observability, or...  ...developer tools, cloud infrastructure, or... 
    Cloud
    Work experience placement
    Local area

    Government Employees Insurance Company

    Palo Alto, CA
    3 days ago
  • Senior DevOps & SRE Manager - Platform Reliability & Global Operations A senior technical leader responsible...  .... Responsibilities Lead and scale a global,...  ...Code, and automation. Observability and incident troubleshooting...  ...with Kubernetes, cloud platforms, and event‑driven... 
    Cloud
    Senior
    Work at office
    3 days per week

    Qcells North America

    Santa Clara, CA
    3 days ago
  • A leading cybersecurity firm in Santa Clara is seeking a Principal Site Reliability Engineer to design and optimize their cloud platforms. The successful candidate will lead automation strategies,...  ...over 10 years of experience in DevOps/SRE with expert-level skills in... 
    Cloud

    Fortinet, Inc.

    Santa Clara, CA
    2 days ago
  • Apple Inc. is seeking a proactive Site Reliability Engineer in Sunnyvale, California to enhance...  ...will include designing observability strategies and implementing automation...  ...Linux, Python, and have experience with cloud environments and monitoring tools. This... 
    Cloud

    Apple Inc.

    Sunnyvale, CA
    2 days ago
  • Illumio is seeking a Senior Site Reliability Engineer to enhance reliability and performance in their cloud-based systems in Sunnyvale, California. The ideal candidate will...  ...Responsibilities include monitoring systems, leading incident responses, and implementing... 
    Cloud
    Senior

    Illumio

    Sunnyvale, CA
    4 days ago
  • Qcells North America is seeking a Senior DevOps & SRE Manager to ensure the reliability, scalability, and operational...  ...platform ecosystem. This role requires leading a global team and managing...  ...strong expertise in Kubernetes, cloud platforms, and event-driven systems... 
    Cloud

    Qcells North America

    Santa Clara, CA
    4 days ago
  • NVIDIA Corporation is seeking a Reliability Engineer to build a robust operational framework across teams. The successful candidate will have over 10 years of experience in software engineering and operational excellence, focusing on chaos engineering and building production... 
    Cloud

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...Responsibilities Build org‑wide reliability strategy, guiding how NVIDIA...  ...high standards across teams. Lead incident response for high...  ...reliability function like Google SRE or Meta production...  ...organization. Proficiency with modern observability and operational tools such as... 
    Cloud
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $262k - $365k

    Google Inc. is seeking a Senior Staff Software Engineer, specializing in Site Reliability Engineering. This role involves leading projects, engaging through the entire lifecycle of services, and ensuring systems remain reliable and efficient. Candidates should have 8 years... 
    Senior

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • $110k - $140k

     ...professional in Sunnyvale, CA to manage secure Docker and Kubernetes environments. You will design and optimize high-availability cloud infrastructure and lead incident response for critical situations. The successful candidate will have strong Linux/Unix and Python skills,... 
    Cloud
    Senior

    Tata Consultancy Services

    Sunnyvale, CA
    3 days ago
  • $126k - $204.5k

     ...enhancement of our comprehensive observability systems. To meet the...  ...Utilize expertise in monitoring cloud platforms, particularly GCP,...  ...of the product and ensure the reliability and availability of our services...  ...of experience as a DevOps/SRE engineer with a passion for technology... 
    Cloud
    Senior

    Palo Alto Networks, Inc.

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

    NVIDIA Gruppe in Santa Clara, CA is seeking a Senior SRE to join the Compute Farm team. This role involves owning SRE solutions and ensuring system reliability while engaging in groundbreaking innovations in AI and HPC. Successful candidates will possess a strong technical... 
    Cloud
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...amazing people. NVIDIA is leading the way in groundbreaking...  ...We’re looking for a Senior SRE to join our Compute Farm...  ...globally distributed, multi‑cloud hybrid environment - On‑prem...  ...management, fleet reliability/auto‑healing, E2E observability or data‑driven operations... 
    Cloud
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • Palo Alto Networks, Inc. is looking for a Site Reliability Engineer to support its hybrid cloud infrastructure. You'll work closely with developers, researchers, and security experts to ensure applications are production-ready, scalable, and reliable. Your expertise with... 
    Cloud
    Senior

    Palo Alto Networks, Inc.

    Santa Clara, CA
    11 hours ago
  • A leading technology company is looking for a Java SRE Engineer to support large-scale cloud migrations and production systems on AWS and Kubernetes. You will lead migrations...  ...with various teams to ensure reliability. This position is onsite in the San Francisco... 
    Cloud
    Senior

    EITACIES Inc.

    Santa Clara, CA
    11 hours ago
  •  ...looking for a Principal DevOps, SRE & Application Infrastructure Architect...  ...environments, managing cloud infrastructure, and ensuring end-to-end production reliability. Candidates should have 12+ years...  ...DevOps practices. You will also lead incident management, optimize cloud... 
    Cloud
    Senior
    Contract work

    Tech Mirrors

    Sunnyvale, CA
    4 days ago
  • $201.6k - $302k

    General Motors in Sunnyvale is looking for a Senior Engineering Manager for Hybrid Services & Reliability. This role involves leading a team responsible for ensuring the reliability of hybrid cloud systems crucial for autonomous vehicle development. The ideal candidate... 
    Cloud

    General Motors

    Sunnyvale, CA
    3 days ago
  • $176k - $333.5k

    NVIDIA Corporation in Santa Clara is seeking a Site Reliability Engineer (SRE) to design and maintain large-scale production systems focusing on reliability and observability. Candidates should have a BS in Computer Science or related field and 8+ years' experience in... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • Apple Inc. is seeking a Senior Site Reliability Engineer based in Cupertino, California, to drive reliability standards across the Apple Data Platform. You will mentor engineers and ensure that large-scale infrastructures run reliably and efficiently. With a focus on technical... 
    Senior

    Apple Inc.

    Cupertino, CA
    2 days ago
  • $165k - $248k

     ...products. We deliver industry-leading silicon design, IP,...  ...infrastructure is observed, understood, and operated...  ...increase infrastructure reliability across environments...  ..., storage, networking, cloud services, and business-...  ...Partner with infrastructure, SRE, platform engineering,... 
    Cloud
    Senior

    Synopsys Inc

    Sunnyvale, CA
    12 days ago
  •  ...We're looking for a Senior Site Reliability Engineer to own the reliability...  ...we need a seasoned SRE to help us scale...  ..., and build the observability, alerting, and on‑call...  ...practices to support them Lead incident response and...  ...planning across our cloud infrastructure as the... 
    Cloud
    Senior

    Nectar

    Palo Alto, CA
    11 hours ago
  •  ...Albanese, Inc. is looking for a highly skilled Senior Systems Administrator to manage and...  ...role involves ensuring systems are secure, reliable, and scalable while supporting both...  ...employees and distributed field teams. You will lead system upgrades, support compliance with... 
    Cloud
    Senior
    Work at office

    Joseph J. Albanese

    Santa Clara, CA
    2 days ago
  •  ...Description About the Role Senior Site Reliability Engineer (Payments...  ...will own production observability, incident response,...  ...management, and cloud infrastructure reliability...  ...environments. Lead incident management during...  ...Strong knowledge of SRE principles, including... 
    Cloud
    Senior

    Kody

    Sunnyvale, CA
    11 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior SRE — Cloud Observability & Reliability Lead. Be the first to apply!