Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Platform Telemetry Engineer

$152k - $241.5k

NVIDIA Gruppe

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We are looking to grow our company and establish teams with the most thoughtful people in the world. NVIDIA GH200 superchip provides performance and productivity required for strong scaling for HPC and generative AI workload. Scale out is inherent to the design of this massive superchip. We are looking for expert engineers to come and help design rack level solutions for next generation scaling AI supercomputing platforms. What you will be doing Drive next generation fleet management solutions for scaling AI infrastructure using GPUs and Grace solution from Nvidia. Work with customers, product management and other architects to narrow down on requirements for implementation to ensure speed of light product development. Bring up clarity on architecture for fleet health monitoring and fault‑remediation solution at scale. Work with customers and other architects, understand their requirements on health monitoring, making best use of available capabilities in‑band as well as out of band. Detailed architecture, do POCs to validate architecture. Educate customers about product architecture and take feedback to make necessary changes. Write architecture specs, design documents and own end to end delivery of product by working across the teams. Do code review for the code produced because of architecture specs. Ensure product is properly tested by working with the development team to enhance unit testing and proper test plan in place. Drive product life cycles with QA teams to productize the code and be responsible as a product owner. Articulate requirements as part of Jira and bug management tools and work out an end‑to‑end execution plan in collaboration with other managers. Contribute to all phases of product development, from product definition, architecture, and design, through implementation, debugging, testing and early customer support. What we need to see BS, MS, or PhD in EE/CS or related field of education (or equivalent experience). 5+ years hands‑on coding experience. Strong knowledge of time series databases like Influxdb & Prometheus. Strong knowledge of building and consuming REST APIs (Redfish is big plus). Strong knowledge of telemetry visualization solutions like Grafana & Influx. Strong knowledge of firmware architecture, optimize firmware for low latency APIs. Strong knowledge of analyzing algorithms for time & space complexity and project system resource requirements. Proven record of solutions for scalability. Strong and demonstrable skill in C/C++ and Python. Experience programming and debugging skills for server platforms. Experience in SCM (e.g., Git, Perforce) and project management tools like Jira. You should possess excellent written and oral communication skills, excellent work ethics, a great sense of teamwork, love to produce quality work and commitment to finish your tasks every single day. You are a self‑starter who loves to find creative solutions to complicated problems and hands on with coding. Ways to stand out from the crowd Experience building telemetry collection & analysis engines. Experience with Redfish. Experience with notification systems like PagerDuty. Active Open Compute (OCP) and DMTF contributor in relevant areas. Hands on with x86 or ARM system architecture. Familiarity with Confidential Compute. Experience with ML and multi‑variable optimization techniques. Applications for this job will be accepted at least until June 1, 2026. This posting is for an existing vacancy. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD – 241,500 USD for Level 3, and 184,000 USD – 287,500 USD for Level 4. You will also be eligible for equity and benefits. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr NVIDIA Gruppe

Vacancy posted 17 hours ago
Similar jobs that could be interesting for youBased on the Senior Platform Telemetry Engineer in Santa Clara, CA vacancy
  • NVIDIA Gruppe in Santa Clara is seeking expert engineers to design solutions for next-generation AI supercomputing platforms. You will work in a collaborative environment...  ...in C/C++, Python, and familiarity with telemetry solutions. This role promotes innovation and... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    17 hours ago
  • $184k - $287.5k

    Senior Systems Software Engineer (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems...  ...reliability aspects of large scale Observability & Telemetry collection platform with a focus on performance at scale, real time... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • A leading technology company is looking for a Java SRE Engineer to support large-scale cloud migrations and production systems on AWS...  ...and Kubernetes. You will lead migrations, design robust AWS EKS platforms, and implement deployment strategies. The ideal candidate has... 
    Senior

    EITACIES Inc.

    Santa Clara, CA
    2 days ago
  • $184k - $356.5k

    NVIDIA Gruppe is seeking a Senior Engineer to lead the evolution of the core NIM Platform SDK and microservice framework in Santa Clara, California. This hands-on role involves architecting significant new features and solving complex engineering challenges. The ideal... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    17 hours ago
  • A tech company is seeking a Senior DevOps Engineer to enhance and automate its infrastructure for a site-builder platform. This position focuses on creating robust CI/CD pipelines and production-ready Kubernetes environments. The ideal candidate will have substantial experience... 
    Senior

    TechDigital Group

    Santa Clara, CA
    3 days ago
  • $91.4k - $187k

     ...a Sr. Network Developer to design and deploy network hardware platforms. This role involves collaboration with manufacturers and internal...  ...ideal candidate will have 3-5 years of experience in network engineering, a Bachelor's degree in a related field, and strong problem-... 
    Senior

    Ll Oefentherapie

    Santa Clara, CA
    1 day ago
  • $126k - $203.5k

    Palo Alto Networks, Inc. is seeking a Senior Staff Production Engineer to design and build foundational cloud platform capabilities. This role involves working with infrastructure, software engineering, and production reliability to improve developer productivity and system... 
    Senior

    Palo Alto Networks, Inc.

    Santa Clara, CA
    17 hours ago
  • $224k - $431.25k

    NVIDIA Gruppe is looking for a Senior System Software Engineer for Cloud in Santa Clara, California. You will design, build, and deploy cloud-based solutions for GeForce NOW, focusing on scalability and reliability. The ideal candidate has 12+ years of experience in software... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • Palo Alto Networks, Inc. is seeking a Sr. Manager, Platform Engineering, to lead the execution layer for the Identity Domain. In this role, you will ensure high-quality product representation and manage a team of Platform Engineers while interfacing with Product Management... 
    Senior
    Remote job

    Palo Alto Networks, Inc.

    Santa Clara, CA
    17 hours ago
  • $166k - $244k

    A leading tech company is seeking a Senior Software Engineer to develop next-generation technologies in Sunnyvale, CA. The role involves software development and managing project priorities within a global team. Candidates should have a Bachelor's degree and 5 years of... 
    Senior
    Full time

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • CrowdStrike Holdings, Inc. is seeking a Backend Engineer in Sunnyvale, CA. In this hybrid role, you'll join the cloud backend team, responsible...  ...lifecycle and enhancing product experience on the Falcon platform. Your efforts will include leading engineering projects,... 
    Senior

    CrowdStrike Holdings, Inc.

    Sunnyvale, CA
    17 hours ago
  • $129.3k - $193.9k

    Qualcomm is seeking a Software Virtual Platform / Simulation Senior Engineer to design and develop SystemC TLM models that accurately represent SoC architectures. The role requires collaboration with hardware and software engineers and extensive experience in C++ programming... 
    Senior

    Qualcomm

    Santa Clara, CA
    4 days ago
  • $165k - $242k

    A cloud service provider is seeking a Senior Software Engineer II for their Inference team in Sunnyvale, California. In this role, you'll lead design reviews, implement optimizations, and improve service reliability. The ideal candidate has extensive experience with distributed... 
    Senior

    CoreWeave

    Sunnyvale, CA
    4 days ago
  • $120.5k - $243k

    Hobbsnews is seeking a Senior Platform Software Engineer located in Sunnyvale, California. This hybrid role requires on-site work two days per week, focusing on high-performance networking and security systems development. The ideal candidate will hold a Master’s or Ph.... 
    Senior
    2 days per week

    Hobbsnews

    Sunnyvale, CA
    4 days ago
  • Apple Inc. is looking for a Sr Application Engineer to join its IT Inventory Platform team in Sunnyvale, California. This role requires strong customer engagement and technical skills to improve systems that manage applications and services across various environments.... 
    Senior

    Apple Inc.

    Sunnyvale, CA
    4 days ago
  • Apple Inc. is seeking a Senior Software Engineer focused on building large-scale voice and real-time communication platforms in Sunnyvale, California. This role involves leading the design and development of distributed systems that enhance customer and agent interactions... 
    Senior

    Apple

    Sunnyvale, CA
    1 day ago
  • $185k - $298k

    Palo Alto Networks, Inc. is seeking a Senior Manager, Software Engineering to lead teams building cloud platform services for machine identities at scale. You will be responsible for overseeing the development of distributed systems, mentoring engineering managers, and... 
    Senior

    Palo Alto Networks, Inc.

    Santa Clara, CA
    2 days ago
  • A prominent tech company is seeking a Java / Platform Developer for an onsite position in Santa Clara, CA. The ideal candidate should have 5-7 years of Java web development experience, strong skills in multi-threading, and a deep understanding of performance tuning. This... 
    Senior

    Idealforce

    Santa Clara, CA
    3 days ago
  • $152k - $222k

    Google is hiring a Platform Customer Engineer in Sunnyvale, CA to help customers leverage the Google Cloud Platform. The ideal candidate will engage with technical sales teams to troubleshoot and create cloud solutions. This role requires 10+ years of experience in cloud... 
    Senior

    Google

    Sunnyvale, CA
    17 hours ago
  • Cerebras is seeking a Software Engineer to join our Inference Platform team in Sunnyvale, California. This role involves developing and leading projects that integrate cloud and ML components. You will contribute to shaping the technical direction and improve system performance... 
    Senior

    Cerebras

    Sunnyvale, CA
    17 hours ago
  • $166.5k - $319k

    GlobalFoundries seeks an Advanced Photonics Engineer to drive the definition and realization of next-generation photonic devices. This role requires a deep understanding of photonic devices and excellent customer engagement skills. Candidates should have a PhD in a related... 
    Senior

    GlobalFoundries

    Santa Clara, CA
    4 days ago
  • $168k - $322k

    NVIDIA Gruppe is seeking a Senior AI Platform Engineer to improve engineering efficiency and data security through AI-powered products. The role involves working with Cloud and AI/ML teams to build and scale infrastructure and shape the technological future of the organization... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    17 hours ago
  •  .... We’re seeking a world‑class Principal Engineer (Sr Manager‑equivalent) to lead the evolution...  ...within our Cloud Infrastructure and Platform Engineering (CIPE) organization. You...  ...AI‑augmented cloud platforms, mentoring senior engineers and infusing industry‑leading... 
    Senior
    Full time
    Work at office
    3 days per week

    Palo Alto Networks

    Santa Clara, CA
    2 days ago
  • United States Digital Space LLC is seeking a Sr. Software Engineer for the Platform Team to design and implement secure AI systems. This role involves leading complex initiatives, mentoring engineers, and optimizing AI capabilities across the organization. The ideal candidate... 
    Senior

    United States Digital Space LLC

    Sunnyvale, CA
    1 day ago
  • $160k - $322k

    NVIDIA Gruppe in Santa Clara is seeking a Senior Technical Marketing Engineer focused on GPUs and scale-up architecture. The role involves showcasing NVIDIA's GPU architecture and server-level platforms, aiming to maximize performance for AI applications. The ideal candidate... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    17 hours ago
  • Qualcomm in Santa Clara is looking for a Senior Software Engineer for their robotics software platform. You will shape the architecture and lead technical developments while collaborating across teams to deliver high-performance solutions. Successful candidates will possess... 
    Senior

    Qualcomm

    Santa Clara, CA
    17 hours ago
  • Dormont Manufacturing Co is looking for a Senior ML Infrastructure Engineer to help build and scale robust platforms for ML inference workflows. You will collaborate with ML engineers and researchers while shaping the future of AI infrastructure at GM. The ideal candidate... 
    Senior
    Remote job

    Dormont Manufacturing Co

    Sunnyvale, CA
    2 days ago
  • $210k - $295k

    SPACE EXPLORATION TECHNOLOGIES CORP in Sunnyvale, CA, is seeking a Principal Software Engineer for the Platform Team. This role focuses on building foundational AI tooling and security infrastructure to enhance engineering workflows at SpaceX. The ideal candidate will have... 
    Senior

    SPACE EXPLORATION TECHNOLOGIES CORP

    Sunnyvale, CA
    2 days ago
  •  ...consultant, offering strategic direction and leadership to the engineering teams. Engage with ODMs and cross‑functional teams to collect...  ...scripting. Experience programming and debugging skills for GPU platforms. Excellent written and oral communication skills. Excellent... 
    Senior

    NVIDIA

    Santa Clara, CA
    17 hours ago
  • NVIDIA Gruppe is looking for an experienced GPU Deployment Engineer to tackle end-to-end AI deployment challenges on the NVIDIA RTX AI platform. The role involves analyzing GPU-accelerated applications, improving user experiences, and collaborating with teams to influence... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    17 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Platform Telemetry Engineer. Be the first to apply!