Senior SRE — Cloud Observability & Reliability Lead
$126k - $204.5k
Palo Alto Networks
Palo Alto Networks, Inc. is seeking a skilled DevOps/SRE engineer to join their Cortex team in Santa Clara, California. This role involves operating and maintaining large-scale GCP environments and requires expertise in observability tools such as Thanos, Prometheus, and Grafana. The ideal candidate should have over 5 years of experience, strong skills in cloud technologies, and a passion for high reliability. Compensation ranges from $126,000 to $204,500 annually, depending on experience and qualifications.
$120k - $145k
Fortinet, Inc. is seeking a Staff SRE to scale FortiSASE’s cloud infrastructure. The ideal candidate will have... ...systems. Responsibilities include leading initiatives across teams, optimizing performance, and improving reliability. The position offers a salary range of...
Cloud, Senior
$176k - $276k
Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain... ...and deployment and open source cloud enabling technologies like Kubernetes... ...reliability aspects of large scale Observability & Telemetry collection platform with...
Cloud, Senior
- Donato Technologies is seeking a Senior SRE / DevOps Engineer in Sunnyvale, CA. The successful candidate will focus on ensuring system reliability and scalability while automating operations across all teams. Candidates should have over 8 years of experience in DevOps,...
  Cloud, Senior
- NVIDIA Corporation is looking for a Senior Systems Software Engineer (SRE) in Santa Clara, California. This role focuses on designing... .... Key responsibilities include ensuring GPU cloud services run with maximum reliability, participating in service lifecycles, and leveraging...
  Cloud, Senior
$201.6k - $302k
...The Role: As the Senior Engineering Manager... ...Hybrid Services & Reliability (HSR) within AV Core... ...trust. You will lead a newly seeded team... ...of the hybrid cloud systems that underlie... ...Reliability Engineering (SRE) and defining SLO/... ...view on automated observability, incident response...
Cloud, Senior, Local area, Remote work, Work from home, Relocation, Relocation package, Flexible hours
$151.6k - $245.3k
Palo Alto Networks, Inc. seeks a Principal Site Reliability Engineer in Santa Clara, CA. The role involves driving SRE and DevOps initiatives, architecting scalable solutions... ...in AI productivity tools, and expertise in cloud-native application development on GCP or AWS. A...
Cloud
- ...obsessed, and results-oriented Senior Product Manager to drive the core reliability platforms and services that... ...Developer Tools, Platform Engineering, SRE, Observability, or a related technical field.... ...in the developer tools, cloud infrastructure, or observability...
  Cloud, Local area, Flexible hours
$146.58k - $229.6k
...results‑oriented Senior Product Manager to drive the core reliability platforms and services... ...strategy for the Observability, BCDR & Incident... ...engineering team. Leading cross‑functional teams... ...Engineering, SRE, Observability, or... ...developer tools, cloud infrastructure, or...
Cloud, Work experience placement, Local area
- Senior DevOps & SRE Manager - Platform Reliability & Global Operations A senior technical leader responsible... .... Responsibilities Lead and scale a global,... ...Code, and automation. Observability and incident troubleshooting... ...with Kubernetes, cloud platforms, and event‑driven...
  Cloud, Senior, Work at office, 3 days per week
- A leading cybersecurity firm in Santa Clara is seeking a Principal Site Reliability Engineer to design and optimize their cloud platforms. The successful candidate will lead automation strategies,... ...over 10 years of experience in DevOps/SRE with expert-level skills in...
  Cloud
- Apple Inc. is seeking a proactive Site Reliability Engineer in Sunnyvale, California to enhance... ...will include designing observability strategies and implementing automation... ...Linux, Python, and have experience with cloud environments and monitoring tools. This...
  Cloud
- Illumio is seeking a Senior Site Reliability Engineer to enhance reliability and performance in their cloud-based systems in Sunnyvale, California. The ideal candidate will... ...Responsibilities include monitoring systems, leading incident responses, and implementing...
  Cloud, Senior
- Qcells North America is seeking a Senior DevOps & SRE Manager to ensure the reliability, scalability, and operational... ...platform ecosystem. This role requires leading a global team and managing... ...strong expertise in Kubernetes, cloud platforms, and event-driven systems...
  Cloud
- NVIDIA Corporation is seeking a Reliability Engineer to build a robust operational framework across teams. The successful candidate will have over 10 years of experience in software engineering and operational excellence, focusing on chaos engineering and building production...
  Cloud
$184k - $287.5k
...Responsibilities Build org‑wide reliability strategy, guiding how NVIDIA... ...high standards across teams. Lead incident response for high... ...reliability function like Google SRE or Meta production... ...organization. Proficiency with modern observability and operational tools such as...
Cloud, Senior
$262k - $365k
Google Inc. is seeking a Senior Staff Software Engineer, specializing in Site Reliability Engineering. This role involves leading projects, engaging through the entire lifecycle of services, and ensuring systems remain reliable and efficient. Candidates should have 8 years...
Senior
$110k - $140k
...professional in Sunnyvale, CA to manage secure Docker and Kubernetes environments. You will design and optimize high-availability cloud infrastructure and lead incident response for critical situations. The successful candidate will have strong Linux/Unix and Python skills,...
Cloud, Senior
$126k - $204.5k
...enhancement of our comprehensive observability systems. To meet the... ...Utilize expertise in monitoring cloud platforms, particularly GCP,... ...of the product and ensure the reliability and availability of our services... ...of experience as a DevOps/SRE engineer with a passion for technology...
Cloud, Senior
$152k - $241.5k
NVIDIA Gruppe in Santa Clara, CA is seeking a Senior SRE to join the Compute Farm team. This role involves owning SRE solutions and ensuring system reliability while engaging in groundbreaking innovations in AI and HPC. Successful candidates will possess a strong technical...
Cloud, Senior
$152k - $241.5k
...amazing people. NVIDIA is leading the way in groundbreaking... ...We’re looking for a Senior SRE to join our Compute Farm... ...globally distributed, multi‑cloud hybrid environment - On‑prem... ...management, fleet reliability/auto‑healing, E2E observability or data‑driven operations...
Cloud, Senior
- Palo Alto Networks, Inc. is looking for a Site Reliability Engineer to support its hybrid cloud infrastructure. You'll work closely with developers, researchers, and security experts to ensure applications are production-ready, scalable, and reliable. Your expertise with...
  Cloud, Senior
- A leading technology company is looking for a Java SRE Engineer to support large-scale cloud migrations and production systems on AWS and Kubernetes. You will lead migrations... ...with various teams to ensure reliability. This position is onsite in the San Francisco...
  Cloud, Senior
- ...looking for a Principal DevOps, SRE & Application Infrastructure Architect... ...environments, managing cloud infrastructure, and ensuring end-to-end production reliability. Candidates should have 12+ years... ...DevOps practices. You will also lead incident management, optimize cloud...
  Cloud, Senior, Contract work
$201.6k - $302k
General Motors in Sunnyvale is looking for a Senior Engineering Manager for Hybrid Services & Reliability. This role involves leading a team responsible for ensuring the reliability of hybrid cloud systems crucial for autonomous vehicle development. The ideal candidate...
Cloud
$176k - $333.5k
NVIDIA Corporation in Santa Clara is seeking a Site Reliability Engineer (SRE) to design and maintain large-scale production systems focusing on reliability and observability. Candidates should have a BS in Computer Science or related field and 8+ years' experience in...
Senior
- Apple Inc. is seeking a Senior Site Reliability Engineer based in Cupertino, California, to drive reliability standards across the Apple Data Platform. You will mentor engineers and ensure that large-scale infrastructures run reliably and efficiently. With a focus on technical...
  Senior
$165k - $248k
...products. We deliver industry-leading silicon design, IP,... ...infrastructure is observed, understood, and operated... ...increase infrastructure reliability across environments... ..., storage, networking, cloud services, and business-... ...Partner with infrastructure, SRE, platform engineering,...
Cloud, Senior
- ...We're looking for a Senior Site Reliability Engineer to own the reliability... ...we need a seasoned SRE to help us scale... ..., and build the observability, alerting, and on‑call... ...practices to support them Lead incident response and... ...planning across our cloud infrastructure as the...
  Cloud, Senior
- ...Albanese, Inc. is looking for a highly skilled Senior Systems Administrator to manage and... ...role involves ensuring systems are secure, reliable, and scalable while supporting both... ...employees and distributed field teams. You will lead system upgrades, support compliance with...
  Cloud, Senior, Work at office
- ...Description About the Role Senior Site Reliability Engineer (Payments... ...will own production observability, incident response,... ...management, and cloud infrastructure reliability... ...environments. Lead incident management during... ...Strong knowledge of SRE principles, including...
  Cloud, Senior
