Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior SRE: Observability & Telemetry Platform

$176k - $333.5k

NVIDIA

NVIDIA Corporation in Santa Clara is seeking a Site Reliability Engineer (SRE) to design and maintain large-scale production systems focusing on reliability and observability. Candidates should have a BS in Computer Science or related field and 8+ years' experience in infrastructure automation, distributed systems design, and tools like Kubernetes and OpenStack. The role emphasizes performance and sustainability, aiming for a diverse and inclusive work environment. Competitive salary range is $176,000 - $333,500 depending on level, plus equity and benefits. #J-18808-Ljbffr NVIDIA Corporation

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Senior SRE: Observability & Telemetry Platform in Santa Clara, CA vacancy
  • $176k - $276k

    Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale...  ...operational and reliability aspects of large scale Observability & Telemetry collection platform with a focus on performance at scale, real time monitoring... 
    Platform
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $200k - $287.5k

     ...Senior Software Engineer — Streaming Data Products At...  ...how work gets done. Observe by Snowflake is an AI-powered observability platform built on the Snowflake...  ...Graph and chat-based AI SRE provide rich context and...  ...of terabytes of telemetry daily while maintaining... 
    Platform
    Senior
    Temporary work
    Flexible hours

    Streamlit

    Menlo Park, CA
    2 days ago
  • $200k - $287.5k

     ...how work gets done. Observe by Snowflake is an AI-powered observability platform built on the Snowflake...  ...Graph and chat-based AI SRE provide rich context and...  ...of terabytes of telemetry daily while maintaining...  ...platforms. We are hiring a Senior Software Engineer for... 
    Platform
    Senior
    Flexible hours

    Snowflake Computing

    Menlo Park, CA
    21 hours ago
  • $200k - $287.5k

     ...Senior Software Engineer At Snowflake, we are powering the era of the...  ...of how work gets done. Observe by Snowflake is an AI-powered observability platform engineered for scale — ingesting...  ...across hundreds of terabytes of telemetry data daily. As part of Snowflake... 
    Platform
    Senior
    Flexible hours

    Streamlit

    Menlo Park, CA
    2 days ago
  • Senior DevOps & SRE Manager - Platform Reliability & Global Operations A senior technical leader responsible...  ...operation of event‑driven and telemetry pipelines. Govern and manage third...  ...Infrastructure as Code, and automation. Observability and incident troubleshooting at... 
    Platform
    Senior
    Work at office
    3 days per week

    Qcells North America

    Santa Clara, CA
    1 day ago
  • $126k - $204.5k

    Palo Alto Networks, Inc. is seeking a skilled DevOps/SRE engineer to join their Cortex team in Santa Clara, California. This role...  ...maintaining large-scale GCP environments and requires expertise in observability tools such as Thanos, Prometheus, and Grafana. The ideal... 
    Senior

    Palo Alto Networks, Inc.

    Santa Clara, CA
    21 hours ago
  • $152k - $241.5k

     ...accelerated cluster team, you will turn telemetry and workload data into clear findings...  ...infrastructure signals to find application and platform improvement opportunities. Work with...  ...-to-end. Hands‑on use of telemetry / observability stacks (e.g., Grafana, Elasticsearch,... 
    Platform
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    21 hours ago
  • $139k - $204k

     ...Senior Engineer, Network Observability Livingston, NJ / New York, NY / Sunnyvale, CA /...  ...pioneers, CoreWeave delivers a platform of technology, tools, and...  ...the monitoring, telemetry, and observability systems...  ...Experience as a Network Engineer, SRE, Software Developer, or Systems... 
    Platform
    Senior
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    4 days ago
  • $201.6k - $302k

     ...Description The Role: As the Senior Engineering Manager for Hybrid...  ...an inherent property of the platform, ensuring that all teams have...  ...Site Reliability Engineering (SRE) and defining SLO/SLI...  ...Opinionated view on automated observability, incident response, and MTTR... 
    Platform
    Senior
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    21 hours ago
  • A leading technology company is looking for a Java SRE Engineer to support large-scale cloud migrations and production systems on AWS...  ...and Kubernetes. You will lead migrations, design robust AWS EKS platforms, and implement deployment strategies. The ideal candidate has... 
    Platform
    Senior

    EITACIES Inc.

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

    NVIDIA Gruppe is seeking a Senior System Software Engineer to lead the development of their next-generation Data & Observability Platform in Santa Clara, California. This role focuses on high-performance ingestion, governance systems, and user experience improvements while... 
    Platform
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    21 hours ago
  •  ...Observe By Snowflake At Snowflake, we are powering the era of...  ...an AI-powered observability platform built on the Snowflake AI Data...  ...Context Graph and chat-based AI SRE provide rich context and...  ...troubleshoot hundreds of terabytes of telemetry daily while maintaining... 
    Platform
    Senior

    Streamlit

    Menlo Park, CA
    4 days ago
  • $146.58k - $229.6k

     ...obsessed, and results‑oriented Senior Product Manager to drive the core reliability platforms and services that empower our engineering...  ...clear product strategy for the Observability, BCDR & Incident Management...  ...Tools, Platform Engineering, SRE, Observability, or a related... 
    Platform
    Work experience placement
    Local area

    Government Employees Insurance Company

    Palo Alto, CA
    21 hours ago
  •  ...engineers to design solutions for next-generation AI supercomputing platforms. You will work in a collaborative environment to drive fleet...  ...extensive experience in C/C++, Python, and familiarity with telemetry solutions. This role promotes innovation and aims to improve... 
    Platform
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    21 hours ago
  •  ...Data Analyst to join their GPU-accelerated cluster team. In this role, you will analyze complex datasets to drive application and platform improvements while applying machine learning and deep learning techniques to derive actionable insights. The ideal candidate will... 
    Platform
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    21 hours ago
  • $146.58k - $229.6k

     ...obsessed , and results-oriented Senior Product Manager to drive the core reliability platforms and services that empower our...  ..., incident management, observability, and cloud infrastructure into...  ...Developer Tools, Platform Engineering, SRE, Observability, or a related technical... 
    Platform
    Hourly pay
    Work experience placement
    Local area

    GEICO

    Palo Alto, CA
    4 days ago
  • $200k - $250k

     ...Observe By Snowflake At Snowflake, we are powering...  ...-powered observability platform built on the Snowflake...  ...Graph and chat-based AI SRE provide rich context and...  ...hundreds of terabytes of telemetry daily while maintaining...  ...About The Role As a Senior Product Manager at... 
    Platform
    Senior
    Flexible hours

    Streamlit

    Menlo Park, CA
    4 days ago
  • $188k - $275k

     ...pioneers, CoreWeave delivers a platform of technology, tools, and...  ...at What You'll Do: The Observability Engineering organization at...  ...for metrics, logs, traces, telemetry pipelines, and observability...  ...role: CoreWeave is seeking a Senior Manager, Observability Engineering... 
    Platform
    Senior
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    3 days ago
  • $148.75k - $361k

     ...Roku is the #1 TV streaming platform in the U.S., Canada, and Mexico...  ...a talented and experienced Senior Software Engineer, MLOps/DevOps...  ...strong background in DevOps/SRE practices, cloud infrastructure...  ...Define and enforce observability standards for ML systems, including... 
    Platform
    Senior
    Work at office
    Local area
    Remote work
    Monday to Thursday
    Flexible hours

    Roku

    San Jose, CA
    21 hours ago
  • $200k - $287.5k

     ...future of how work gets done. Observe by Snowflake is an AI-powered observability platform built on the Snowflake AI Data...  ...Context Graph and chat-based AI SRE provide rich context and automated...  ...hundreds of terabytes of telemetry daily while maintaining reliability... 
    Platform
    Senior
    Flexible hours

    Snowflake Computing

    Menlo Park, CA
    21 hours ago
  •  ...manage the infrastructure for its high-scale, real-time speech AI platform. The candidate will design and implement robust deployment...  ...Terraform. The role emphasizes operational excellence, developer velocity, and deep observability across systems. #J-18808-Ljbffr Sanas
    Platform
    Senior

    Sanas

    Palo Alto, CA
    1 day ago
  • NVIDIA Gruppe in Santa Clara is seeking an experienced engineer to build an AI Data Center AIOps platform. The ideal candidate will have a strong background in Kubernetes and automation, ensuring the reliability of GPU fleet management. Key responsibilities include monitoring... 
    Platform
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...Infrastructure organization is seeking a Senior System Software Engineer to lead the...  ...of our next-generation Data & Observability Platform. We serve and collaborate directly with...  ...NVIDIA engineers rely on to visualize chip telemetry, debug distributed pipelines, and ensure... 
    Platform
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    21 hours ago
  • $136.5k - $276.5k

     ...Assurance product line. This platform brings automation,...  ...provider networks by integrating telemetry with real-time analytics in...  ...class SaaS solution. As a Senior Software Engineer on the Wired...  ...golang) Implement and improve observability using metrics, structured... 
    Platform
    Senior
    Work experience placement
    Work at office
    Local area
    Immediate start
    2 days per week

    Hewlett Packard Enterprise

    Sunnyvale, CA
    4 days ago
  • $188k - $275k

     ...pioneers, CoreWeave delivers a platform of technology, tools, and...  ...at What You'll Do: The Observability Engineering organization at...  ...for metrics, logs, traces, telemetry pipelines, and observability...  ...role: CoreWeave is seeking a Senior Manager, Observability Engineering... 
    Platform
    Senior
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    21 days ago
  •  ...Powered by the Illumio AI Security Graph, our breach containment platform identifies and contains threats across hybrid multi-cloud...  ...tenant system that process data and real time events and network telemetry from multiple public clouds to provide real time insights,... 
    Platform
    Senior
    Immediate start

    Illumio

    Sunnyvale, CA
    1 day ago
  • $120.3k - $194.53k

    Palo Alto Networks, Inc. is looking for a Site Reliability Engineer to work on their Internet Security Platform team in Santa Clara, California. The role involves building reliable cloud infrastructure, supporting Advanced DNS Security services, and requires strong skills... 
    Platform
    Senior

    Palo Alto Networks, Inc.

    Santa Clara, CA
    1 day ago
  • A global technology leader is looking for an experienced SRE software engineer in Cupertino, California, to build and enhance compute infrastructure for Apple's services. The role involves developing AI-powered tooling, automating deployment, and ensuring that services... 
    Platform
    Senior

    Apple Inc.

    Cupertino, CA
    1 day ago
  • $235k - $295k

    A data and AI company in Mountain View seeks a Software Engineer for the Observability team. This role involves developing solutions for product performance insights and managing cloud infrastructure. The ideal candidate has over 15 years of experience in software development... 
    Platform
    Senior

    Menlo Ventures

    Mountain View, CA
    21 hours ago
  • $139k - $220k

     ...pioneers, CoreWeave delivers a platform of technology, tools, and...  ...You'll Do: Join CoreWeave's Observability team, responsible for...  ...About the role: As a Senior Software Engineer on the Observability...  ...metrics, logging, tracing, and telemetry pipelines. Your day-to-day... 
    Platform
    Senior
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    29 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior SRE: Observability & Telemetry Platform. Be the first to apply!