Senior Platform Telemetry Engineer
$152k - $241.5k
NVIDIA Gruppe
NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI, the next era of computing, with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We are looking to grow our company and establish teams with the most thoughtful people in the world.

The NVIDIA GH200 superchip provides the performance and productivity required for strong scaling of HPC and generative AI workloads. Scale-out is inherent to the design of this massive superchip. We are looking for expert engineers to help design rack-level solutions for scaling next-generation AI supercomputing platforms.

What you will be doing
- Drive next-generation fleet management solutions for scaling AI infrastructure using GPUs and the Grace solution from NVIDIA.
- Work with customers, product management, and other architects to narrow down requirements for implementation and ensure speed-of-light product development.
- Bring clarity to the architecture for a fleet health monitoring and fault-remediation solution at scale.
- Work with customers and other architects to understand their health monitoring requirements, making the best use of available in-band and out-of-band capabilities.
- Define the detailed architecture and build POCs to validate it.
- Educate customers about the product architecture and take feedback to make necessary changes.
- Write architecture specs and design documents, and own end-to-end delivery of the product by working across teams.
- Review the code produced from the architecture specs.
- Ensure the product is properly tested by working with the development team to enhance unit testing and put a proper test plan in place.
- Drive product life cycles with QA teams to productize the code, and be responsible as a product owner.
- Articulate requirements in Jira and bug management tools, and work out an end-to-end execution plan in collaboration with other managers.
- Contribute to all phases of product development, from product definition, architecture, and design through implementation, debugging, testing, and early customer support.

What we need to see
- BS, MS, or PhD in EE/CS or a related field (or equivalent experience).
- 5+ years of hands-on coding experience.
- Strong knowledge of time series databases like InfluxDB and Prometheus.
- Strong knowledge of building and consuming REST APIs (Redfish is a big plus).
- Strong knowledge of telemetry visualization solutions like Grafana and Influx.
- Strong knowledge of firmware architecture, including optimizing firmware for low-latency APIs.
- Strong knowledge of analyzing algorithms for time and space complexity and of projecting system resource requirements.
- Proven record of delivering scalable solutions.
- Strong and demonstrable skill in C/C++ and Python.
- Experience programming and debugging server platforms.
- Experience with SCM (e.g., Git, Perforce) and project management tools like Jira.

You should possess excellent written and oral communication skills, an excellent work ethic, a great sense of teamwork, a love of producing quality work, and a commitment to finishing your tasks every single day. You are a self-starter who loves to find creative solutions to complicated problems and is hands-on with coding.

Ways to stand out from the crowd
- Experience building telemetry collection and analysis engines.
- Experience with Redfish.
- Experience with notification systems like PagerDuty.
- Active Open Compute (OCP) and DMTF contributor in relevant areas.
- Hands-on experience with x86 or ARM system architecture.
- Familiarity with Confidential Compute.
- Experience with ML and multi-variable optimization techniques.

Applications for this job will be accepted at least until June 1, 2026. This posting is for an existing vacancy.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD – 241,500 USD for Level 3, and 184,000 USD – 287,500 USD for Level 4. You will also be eligible for equity and benefits.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and is proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.
