Senior System Software Engineer - Data Platform Observability

$184k - $287.5k

NVIDIA

NVIDIA's Hardware Infrastructure organization is seeking a Senior System Software Engineer to lead the evolution of our next-generation Data & Observability Platform. We serve and collaborate directly with NVIDIA's rapidly growing AI, HW, and SW engineering and research teams across the company. We are looking for a Full-Stack technical lead who is not afraid to dig deep into infrastructure. You will be the technical anchor for our Observability stack, driving the transition to modern tooling that best fits our customers and use cases. You will build the centralized platform that thousands of NVIDIA engineers rely on to visualize chip telemetry, debug distributed pipelines, and ensure platform reliability.

What you'll be doing:
  • Architect High-Performance Ingestion: Design and build centralized telemetry pipelines capable of handling massive scale. You will solve global latency challenges by implementing modern, push-based edge collection architectures to replace legacy proxy models.
  • Build Policy Enforcement Systems: Design and implement the technical infrastructure for data governance, policy engines, access control enforcement points, secure credential management, and audit logging. We are looking for someone who has built governance controls into a platform, not just administered them.
  • Focus on User Experience: Develop a modern web interface and APIs that unify distinct observability signals into a seamless, consolidated user experience.
  • Optimize Storage & Cost: Implement cost-effective tiered storage architectures. You will define strategies for routing high-volume data to cold storage solutions to reduce costs while maintaining multi-year data retention.
  • Drive Platform Automation: Architect workflow orchestration systems to automate platform maintenance, data lifecycle management, and complex pipeline operations.
  • Work in a diverse team to provide operational and strategic data that empowers our engineers and researchers to improve performance, productivity, and efficiency, and to continuously improve quality, workloads, and processes through better observability.
What we need to see:
  • BS or MS in Computer Science, Electrical Engineering, or related field (or equivalent experience).
  • 8+ years of full-stack software development experience with a focus on Data Platforms or Infrastructure Tools.
  • Strong Full-Stack Fluency: Proficiency in high-performance backend systems programming and modern frontend web frameworks for building responsive user interfaces (Python, JS, Java, Rust, Go, React, or similar).
  • Observability Expertise: Experience with observability platforms such as Apache Spark, Elastic/OpenSearch, Grafana, Prometheus, and other similar open-source tools. Hands-on experience operating and extending the Grafana ecosystem or ELK stack at scale. You understand the internals of time-series databases and inverted indexes.
  • Infrastructure-as-Code: Experience deploying complex stateful services on Kubernetes using Helm, Terraform, or Ansible.
  • Streaming & Storage: Familiarity with event streaming and modern data lake formats.
Ways to stand out from the crowd:
  • Experience writing custom Grafana data source plugins or backend plugins in Go.
  • Background in migrating legacy monoliths to microservices or Vector-based pipelines.
  • Experience with OpenTelemetry (OTEL) collector configuration, writing custom processors, or instrumentation SDKs.
  • Background in Data Governance, including implementation of Policy-as-Code or compliance frameworks in a regulated environment.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until March 1, 2026.

This posting is for an existing vacancy.


NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Vacancy posted 4 days ago