Senior SRE: Observability & Telemetry Platform
$176k - $333.5kNVIDIA
NVIDIA Corporation in Santa Clara is seeking a Site Reliability Engineer (SRE) to design and maintain large-scale production systems focusing on reliability and observability. Candidates should have a BS in Computer Science or related field and 8+ years' experience in infrastructure automation, distributed systems design, and tools like Kubernetes and OpenStack. The role emphasizes performance and sustainability, aiming for a diverse and inclusive work environment. Competitive salary range is $176,000 - $333,500 depending on level, plus equity and benefits. #J-18808-Ljbffr NVIDIA Corporation
$176k - $276k
Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale... ...operational and reliability aspects of large scale Observability & Telemetry collection platform with a focus on performance at scale, real time monitoring...PlatformSenior$200k - $287.5k
...Senior Software Engineer — Streaming Data Products At... ...how work gets done. Observe by Snowflake is an AI-powered observability platform built on the Snowflake... ...Graph and chat-based AI SRE provide rich context and... ...of terabytes of telemetry daily while maintaining...PlatformSeniorTemporary workFlexible hours$200k - $287.5k
...how work gets done. Observe by Snowflake is an AI-powered observability platform built on the Snowflake... ...Graph and chat-based AI SRE provide rich context and... ...of terabytes of telemetry daily while maintaining... ...platforms. We are hiring a Senior Software Engineer for...PlatformSeniorFlexible hours$200k - $287.5k
...Senior Software Engineer At Snowflake, we are powering the era of the... ...of how work gets done. Observe by Snowflake is an AI-powered observability platform engineered for scale — ingesting... ...across hundreds of terabytes of telemetry data daily. As part of Snowflake...PlatformSeniorFlexible hours- Senior DevOps & SRE Manager - Platform Reliability & Global Operations A senior technical leader responsible... ...operation of event‑driven and telemetry pipelines. Govern and manage third... ...Infrastructure as Code, and automation. Observability and incident troubleshooting at...PlatformSeniorWork at office3 days per week
$126k - $204.5k
Palo Alto Networks, Inc. is seeking a skilled DevOps/SRE engineer to join their Cortex team in Santa Clara, California. This role... ...maintaining large-scale GCP environments and requires expertise in observability tools such as Thanos, Prometheus, and Grafana. The ideal...Senior$152k - $241.5k
...accelerated cluster team, you will turn telemetry and workload data into clear findings... ...infrastructure signals to find application and platform improvement opportunities. Work with... ...-to-end. Hands‑on use of telemetry / observability stacks (e.g., Grafana, Elasticsearch,...PlatformSenior$139k - $204k
...Senior Engineer, Network Observability Livingston, NJ / New York, NY / Sunnyvale, CA /... ...pioneers, CoreWeave delivers a platform of technology, tools, and... ...the monitoring, telemetry, and observability systems... ...Experience as a Network Engineer, SRE, Software Developer, or Systems...PlatformSeniorTemporary workCasual workWork at officeFlexible hours$201.6k - $302k
...Description The Role: As the Senior Engineering Manager for Hybrid... ...an inherent property of the platform, ensuring that all teams have... ...Site Reliability Engineering (SRE) and defining SLO/SLI... ...Opinionated view on automated observability, incident response, and MTTR...PlatformSeniorLocal areaRemote workWork from homeRelocationRelocation packageFlexible hours- A leading technology company is looking for a Java SRE Engineer to support large-scale cloud migrations and production systems on AWS... ...and Kubernetes. You will lead migrations, design robust AWS EKS platforms, and implement deployment strategies. The ideal candidate has...PlatformSenior
$184k - $287.5k
NVIDIA Gruppe is seeking a Senior System Software Engineer to lead the development of their next-generation Data & Observability Platform in Santa Clara, California. This role focuses on high-performance ingestion, governance systems, and user experience improvements while...PlatformSenior- ...Observe By Snowflake At Snowflake, we are powering the era of... ...an AI-powered observability platform built on the Snowflake AI Data... ...Context Graph and chat-based AI SRE provide rich context and... ...troubleshoot hundreds of terabytes of telemetry daily while maintaining...PlatformSenior
$146.58k - $229.6k
...obsessed, and results‑oriented Senior Product Manager to drive the core reliability platforms and services that empower our engineering... ...clear product strategy for the Observability, BCDR & Incident Management... ...Tools, Platform Engineering, SRE, Observability, or a related...PlatformWork experience placementLocal area- ...engineers to design solutions for next-generation AI supercomputing platforms. You will work in a collaborative environment to drive fleet... ...extensive experience in C/C++, Python, and familiarity with telemetry solutions. This role promotes innovation and aims to improve...PlatformSenior
- ...Data Analyst to join their GPU-accelerated cluster team. In this role, you will analyze complex datasets to drive application and platform improvements while applying machine learning and deep learning techniques to derive actionable insights. The ideal candidate will...PlatformSenior
$146.58k - $229.6k
...obsessed , and results-oriented Senior Product Manager to drive the core reliability platforms and services that empower our... ..., incident management, observability, and cloud infrastructure into... ...Developer Tools, Platform Engineering, SRE, Observability, or a related technical...PlatformHourly payWork experience placementLocal area$200k - $250k
...Observe By Snowflake At Snowflake, we are powering... ...-powered observability platform built on the Snowflake... ...Graph and chat-based AI SRE provide rich context and... ...hundreds of terabytes of telemetry daily while maintaining... ...About The Role As a Senior Product Manager at...PlatformSeniorFlexible hours$188k - $275k
...pioneers, CoreWeave delivers a platform of technology, tools, and... ...at What You'll Do: The Observability Engineering organization at... ...for metrics, logs, traces, telemetry pipelines, and observability... ...role: CoreWeave is seeking a Senior Manager, Observability Engineering...PlatformSeniorPermanent employmentTemporary workCasual workWork at officeFlexible hours$148.75k - $361k
...Roku is the #1 TV streaming platform in the U.S., Canada, and Mexico... ...a talented and experienced Senior Software Engineer, MLOps/DevOps... ...strong background in DevOps/SRE practices, cloud infrastructure... ...Define and enforce observability standards for ML systems, including...PlatformSeniorWork at officeLocal areaRemote workMonday to ThursdayFlexible hours$200k - $287.5k
...future of how work gets done. Observe by Snowflake is an AI-powered observability platform built on the Snowflake AI Data... ...Context Graph and chat-based AI SRE provide rich context and automated... ...hundreds of terabytes of telemetry daily while maintaining reliability...PlatformSeniorFlexible hours- ...manage the infrastructure for its high-scale, real-time speech AI platform. The candidate will design and implement robust deployment... ...Terraform. The role emphasizes operational excellence, developer velocity, and deep observability across systems. #J-18808-Ljbffr SanasPlatformSenior
- NVIDIA Gruppe in Santa Clara is seeking an experienced engineer to build an AI Data Center AIOps platform. The ideal candidate will have a strong background in Kubernetes and automation, ensuring the reliability of GPU fleet management. Key responsibilities include monitoring...PlatformSenior
$184k - $287.5k
...Infrastructure organization is seeking a Senior System Software Engineer to lead the... ...of our next-generation Data & Observability Platform. We serve and collaborate directly with... ...NVIDIA engineers rely on to visualize chip telemetry, debug distributed pipelines, and ensure...PlatformSenior$136.5k - $276.5k
...Assurance product line. This platform brings automation,... ...provider networks by integrating telemetry with real-time analytics in... ...class SaaS solution. As a Senior Software Engineer on the Wired... ...golang) Implement and improve observability using metrics, structured...PlatformSeniorWork experience placementWork at officeLocal areaImmediate start2 days per week$188k - $275k
...pioneers, CoreWeave delivers a platform of technology, tools, and... ...at What You'll Do: The Observability Engineering organization at... ...for metrics, logs, traces, telemetry pipelines, and observability... ...role: CoreWeave is seeking a Senior Manager, Observability Engineering...PlatformSeniorPermanent employmentTemporary workCasual workWork at officeFlexible hours- ...Powered by the Illumio AI Security Graph, our breach containment platform identifies and contains threats across hybrid multi-cloud... ...tenant system that process data and real time events and network telemetry from multiple public clouds to provide real time insights,...PlatformSeniorImmediate start
$120.3k - $194.53k
Palo Alto Networks, Inc. is looking for a Site Reliability Engineer to work on their Internet Security Platform team in Santa Clara, California. The role involves building reliable cloud infrastructure, supporting Advanced DNS Security services, and requires strong skills...PlatformSenior- A global technology leader is looking for an experienced SRE software engineer in Cupertino, California, to build and enhance compute infrastructure for Apple's services. The role involves developing AI-powered tooling, automating deployment, and ensuring that services...PlatformSenior
$235k - $295k
A data and AI company in Mountain View seeks a Software Engineer for the Observability team. This role involves developing solutions for product performance insights and managing cloud infrastructure. The ideal candidate has over 15 years of experience in software development...PlatformSenior$139k - $220k
...pioneers, CoreWeave delivers a platform of technology, tools, and... ...You'll Do: Join CoreWeave's Observability team, responsible for... ...About the role: As a Senior Software Engineer on the Observability... ...metrics, logging, tracing, and telemetry pipelines. Your day-to-day...PlatformSeniorPermanent employmentTemporary workCasual workWork at officeFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior SRE: Observability & Telemetry Platform. Be the first to apply!
- senior automation controls engineer Santa Clara, CA
- senior brand designer Santa Clara, CA
- senior business analyst contract Santa Clara, CA
- senior app developer Santa Clara, CA
- senior digital account manager Santa Clara, CA
- senior specialist Santa Clara, CA
- senior account executive Santa Clara, CA
- senior database analyst Santa Clara, CA
- legal senior counsel family office Santa Clara, CA
- senior aws cloud engineer Santa Clara, CA


