Senior Platform Telemetry Engineer
$152k - $241.5k
NVIDIA Gruppe
NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI, the next era of computing, with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We are looking to grow our company and establish teams with the most thoughtful people in the world.

The NVIDIA GH200 superchip provides the performance and productivity required for strong scaling of HPC and generative AI workloads. Scale-out is inherent to the design of this massive superchip. We are looking for expert engineers to help design rack-level solutions for scaling next-generation AI supercomputing platforms.

What you will be doing
- Drive next-generation fleet management solutions for scaling AI infrastructure using GPUs and the Grace solution from NVIDIA.
- Work with customers, product management, and other architects to narrow down implementation requirements and ensure speed-of-light product development.
- Bring clarity to the architecture for fleet health monitoring and fault-remediation solutions at scale.
- Work with customers and other architects to understand their health monitoring requirements, making the best use of available in-band and out-of-band capabilities.
- Detail the architecture and build POCs to validate it.
- Educate customers about the product architecture and take their feedback to make necessary changes.
- Write architecture specs and design documents, and own end-to-end delivery of the product by working across teams.
- Review the code produced from the architecture specs.
- Ensure the product is properly tested by working with the development team to enhance unit testing and put a proper test plan in place.
- Drive product life cycles with QA teams to productize the code, and take responsibility as product owner.
- Articulate requirements in Jira and bug management tools, and work out an end-to-end execution plan in collaboration with other managers.
- Contribute to all phases of product development, from product definition, architecture, and design through implementation, debugging, testing, and early customer support.

What we need to see
- BS, MS, or PhD in EE/CS or a related field (or equivalent experience).
- 5+ years of hands-on coding experience.
- Strong knowledge of time series databases such as InfluxDB and Prometheus.
- Strong knowledge of building and consuming REST APIs (Redfish is a big plus).
- Strong knowledge of telemetry visualization solutions such as Grafana and Influx.
- Strong knowledge of firmware architecture, including optimizing firmware for low-latency APIs.
- Strong knowledge of analyzing algorithms for time and space complexity and projecting system resource requirements.
- Proven record of delivering scalable solutions.
- Strong and demonstrable skill in C/C++ and Python.
- Experience programming and debugging on server platforms.
- Experience with SCM (e.g., Git, Perforce) and project management tools such as Jira.
- Excellent written and oral communication skills, an excellent work ethic, a great sense of teamwork, a love of producing quality work, and a commitment to finishing your tasks every single day.
- A self-starter who loves to find creative solutions to complicated problems and is hands-on with coding.

Ways to stand out from the crowd
- Experience building telemetry collection and analysis engines.
- Experience with Redfish.
- Experience with notification systems such as PagerDuty.
- Active Open Compute (OCP) and DMTF contributor in relevant areas.
- Hands-on experience with x86 or ARM system architecture.
- Familiarity with Confidential Compute.
- Experience with ML and multi-variable optimization techniques.

Applications for this job will be accepted at least until June 1, 2026. This posting is for an existing vacancy.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD – 241,500 USD for Level 3, and 184,000 USD – 287,500 USD for Level 4. You will also be eligible for equity and benefits.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

