Senior SRE — Cloud Observability & Reliability Lead
$126k - $204.5kPalo Alto Networks, Inc.
Palo Alto Networks, Inc. is seeking a skilled DevOps/SRE engineer to join their Cortex team in Santa Clara, California. This role involves operating and maintaining large-scale GCP environments and requires expertise in observability tools such as Thanos, Prometheus, and Grafana. The ideal candidate should have over 5 years of experience, strong skills in cloud technologies, and a passion for high reliability. Compensation ranges from $126,000 to $204,500 annually, depending on experience and qualifications. #J-18808-Ljbffr Palo Alto Networks, Inc.
$120k - $145k
Fortinet, Inc. is seeking a Staff SRE to scale FortiSASE’s cloud infrastructure. The ideal candidate will have... ...systems. Responsibilities include leading initiatives across teams, optimizing performance, and improving reliability. The position offers a salary range of...CloudSenior$175k - $210k
...Senior Manager, DevOps & SRE – Platform Reliability & Global Operations Location: Santa Clara, CA... ...platforms. This role leads a blended DevOps and SRE... ...Code, and automation Observability and incident troubleshooting... ...with Kubernetes, cloud platforms, and event driven...CloudSeniorWork at office3 days per week$176k - $276k
Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain... ...and deployment and open source cloud enabling technologies like Kubernetes... ...reliability aspects of large scale Observability & Telemetry collection platform with...CloudSenior$201.6k - $302k
...The Role: As the Senior Engineering Manager... ...Hybrid Services & Reliability (HSR) within AV Core... ...trust. You will lead a newly seeded team... ...of the hybrid cloud systems that underlie... ...Reliability Engineering (SRE) and defining SLO/... ...view on automated observability, incident response...CloudSeniorLocal areaRemote workWork from homeRelocationRelocation packageFlexible hours$151.6k - $245.3k
Palo Alto Networks, Inc. seeks a Principal Site Reliability Engineer in Santa Clara, CA. The role involves driving SRE and DevOps initiatives, architecting scalable solutions... ...in AI productivity tools, and expertise in cloud-native application development on GCP or AWS. A...Cloud$146.58k - $229.6k
...results‑oriented Senior Product Manager to drive the core reliability platforms and services... ...strategy for the Observability, BCDR & Incident... ...engineering team. Leading cross‑functional teams... ...Engineering, SRE, Observability, or... ...developer tools, cloud infrastructure, or...CloudWork experience placementLocal area$146.58k - $229.6k
...and results-oriented Senior Product Manager to drive the core reliability platforms and services... ...incident management, observability, and cloud infrastructure into actionable... ...team. ~ Leading cross-functional teams... ...Platform Engineering, SRE, Observability, or a related...CloudHourly payWork experience placementLocal areaRemote workFlexible hours- Apple Inc. is seeking a proactive Site Reliability Engineer in Sunnyvale, California to enhance... ...will include designing observability strategies and implementing automation... ...Linux, Python, and have experience with cloud environments and monitoring tools. This...Cloud
- Illumio is seeking a Senior Site Reliability Engineer to enhance reliability and performance in their cloud-based systems in Sunnyvale, California. The ideal candidate will... ...Responsibilities include monitoring systems, leading incident responses, and implementing...CloudSenior
- NVIDIA Gruppe is looking for a Senior Manager of Site Reliability Engineering in Santa Clara, California, to lead IT operations with a focus on leveraging AI and automation.... ...ideal candidate has extensive experience in SRE, IT service management, and is skilled in applying...Senior
$135.6k - $200k
Vistance Networks, Inc. is seeking a hands-on Devops Architect in Sunnyvale, California, to lead the technical strategy for their cloud platform and spearhead DevOps operations. The ideal candidate will have over 10 years of experience in infrastructure engineering, deep...CloudSenior$126k - $204.5k
...enhancement of our comprehensive observability systems. To meet the... ...Utilize expertise in monitoring cloud platforms, particularly GCP,... ...of the product and ensure the reliability and availability of our services... ...of experience as a DevOps/SRE engineer with a passion for technology...CloudSenior$152k - $241.5k
NVIDIA Gruppe in Santa Clara, CA is seeking a Senior SRE to join the Compute Farm team. This role involves owning SRE solutions and ensuring system reliability while engaging in groundbreaking innovations in AI and HPC. Successful candidates will possess a strong technical...CloudSenior$152k - $241.5k
...amazing people. NVIDIA is leading the way in groundbreaking... ...We’re looking for a Senior SRE to join our Compute Farm... ...globally distributed, multi‑cloud hybrid environment - On‑prem... ...management, fleet reliability/auto‑healing, E2E observability or data‑driven operations...CloudSenior- SambaNova Systems is seeking a Senior Cloud Platform Engineer in Palo Alto, California. This role focuses on the reliability and scalability of our AI inferencing service, requiring experience in Site Reliability Engineering and cloud infrastructure. The ideal candidate...CloudSenior
- A leading technology company is looking for a Java SRE Engineer to support large-scale cloud migrations and production systems on AWS and Kubernetes. You will lead migrations... ...with various teams to ensure reliability. This position is onsite in the San Francisco...CloudSenior
$168k - $270.25k
NVIDIA is looking for a Senior Site Reliability Engineer (SRE) to join its GeForce Now (... ...and external-facing GPU cloud gaming services have reliability... ...tools to improve the SRE Observability. Be part of the... ...available VMI setup on K8's. Lead significant production improvements...CloudSeniorFull time$201.6k - $302k
General Motors in Sunnyvale is looking for a Senior Engineering Manager for Hybrid Services & Reliability. This role involves leading a team responsible for ensuring the reliability of hybrid cloud systems crucial for autonomous vehicle development. The ideal candidate...Cloud$176k - $333.5k
NVIDIA Corporation in Santa Clara is seeking a Site Reliability Engineer (SRE) to design and maintain large-scale production systems focusing on reliability and observability. Candidates should have a BS in Computer Science or related field and 8+ years' experience in...Senior- ...Observe By Snowflake AI Backend Engineer At Snowflake... ...the Snowflake AI Data Cloud and engineered for... ...Graph and chat-based AI SRE provide rich context and... ...resolution 10x faster. Leading engineering teams at... ...daily while maintaining reliability at enterprise scale. As...CloudSenior
$152k - $241.5k
...s most advanced computing workloads. Observability is at the heart of this transformation.... ...will design and develop high-throughput, reliable telemetry pipelines and modern data infrastructure... ...Experience working with Kubernetes and cloud-native infrastructure ~ Strong...CloudSenior$200k - $250k
...work gets done. Observe by Snowflake is an... ...Snowflake AI Data Cloud and engineered for... ...and chat-based AI SRE provide rich... ...resolution 10x faster. Leading engineering teams... ...while maintaining reliability at enterprise... ...About the Role As a Senior Product Manager at...CloudSeniorFlexible hours$235k - $295k
A leading data and AI infrastructure company is seeking a Sr. Staff Software Engineer to join their Observability team in Mountain View, California. In this role, you will develop... ...solutions and ensure product reliability across cloud regions. Candidates should have 1...CloudSenior$200k - $287.5k
...work gets done. Observe by Snowflake is an... ...Snowflake AI Data Cloud and engineered for... ...and chat-based AI SRE provide rich... ...resolution 10x faster. Leading engineering teams... ...while maintaining reliability at enterprise... ...We are hiring a Senior Software Engineer...CloudSeniorFlexible hours$200k - $287.5k
...how work gets done. Observe by Snowflake is an AI-powered... ...the Snowflake AI Data Cloud and engineered for... ...Graph and chat-based AI SRE provide rich context... ...resolution 10x faster. Leading engineering teams at... ...daily while maintaining reliability at enterprise scale. As...CloudSeniorFlexible hours$200k - $287.5k
...work gets done. Observe by Snowflake is an... ...Snowflake AI Data Cloud and engineered for... ...and chat-based AI SRE provide rich... ...faster. Leading engineering teams... ...while maintaining reliability at enterprise scale... ...customers. As a Senior Infrastructure Engineer...CloudSeniorImmediate startFlexible hours$210k - $300k
...Site Reliability Engineer (SRE) / DevOps Engineer Location: Onsite in NYC or San Francisco Compensation: $210,000–$300,000 Base Salary... ...Engineer to help build, scale, and operate highly reliable cloud infrastructure and developer platforms. In this role, you will...Cloud$200k - $287.5k
...Senior Software Engineer — Streaming Data... ...work gets done. Observe by Snowflake is an... ...Snowflake AI Data Cloud and engineered for... ...and chat-based AI SRE provide rich context... ...10x faster. Leading engineering teams... ...while maintaining reliability at enterprise scale...CloudSeniorTemporary workFlexible hours$188k - $275k
...Senior Manager, Observability Sunnyvale, CA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers... ...with confidence. Trusted by leading AI labs, startups, and... ...pipelines, and observability reliability, enabling teams to detect issues...CloudSeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hours$200k - $287.5k
...how work gets done. Observe by Snowflake is an AI-powered... ...the Snowflake AI Data Cloud and engineered for... ...Graph and chat-based AI SRE provide rich context... ...resolution 10x faster. Leading engineering teams at... ...daily while maintaining reliability at enterprise scale. As...CloudSeniorFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior SRE — Cloud Observability & Reliability Lead. Be the first to apply!
- senior game producer Santa Clara, CA
- senior manager process engineering Santa Clara, CA
- senior manufacturing engineer Santa Clara, CA
- senior manager clinical operations Santa Clara, CA
- senior optical engineer Santa Clara, CA
- senior lead project manager Santa Clara, CA
- senior manager quality engineering Santa Clara, CA
- senior device engineer Santa Clara, CA
- senior full stack developer Santa Clara, CA
- senior hvac project manager Santa Clara, CA


