Principal Firmware Engineer - Server Manageability and Observability
$272k - $431.25kNVIDIA
Technical Architect For Data Center Systems
NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We're looking for a strong technical architect to own the end-to-end architecture of these products, at the system software level. Including firmware, kernel drivers, operating systems, and user mode drivers. You will work with component leads internally and engage with industry leading cloud service providers on taking these products to market.
What you'll be doing:
- Serve as the primary technical point of contact for major customers, leading technological discussions, defining KPIs, gathering requirements, and addressing complex technical queries.
- As a system software architect, lead technical innovation and strategic collaborations with major hyperscalers to architect next-generation data center products.
- Align NVIDIA's roadmap with major customers' requirements through direct engagement.
- Develop and drive adoption of new technologies and protocols.
- Make critical technical decisions in ambiguous situations, mitigating risks through left-shift strategies.
What we need to see:
- Deep expertise in scalable and performant server system architecture, focusing on SW/HW interfaces.
- Extensive experience with complex system software for accelerators (GPUs, DPUs, FPGAs).
- Mastery of system firmware (SBIOS, OpenBMC), embedded systems, and Linux kernel internals.
- Proficiency in Out-of-Band and In-Band management architectures, device management protocols (e.g., MCTP, PLDM, SPDM, RDE) and system management protocols (Redfish, IPMI).
- Extensive knowledge of networking technologies and protocols, including TCP/IP, Ethernet, InfiniBand, as well as advanced switching and routing concepts
- Experience collaborating with platform security experts to define tradeoffs between security and ease of use.
- Demonstrated success in leading complex, cross-functional projects to completion, showcasing the ability to influence and achieve results without direct authority in large-scale, collaborative environments. Demonstrable experience in implementing left shift strategy to de-risk program execution.
- BS or MS degree in Computer Science, Electrical Engineering or related field (or equivalent experience).
- 15+ years in the area of System architecture and design.
Ways to stand out from the crowd:
- Knowledge of cloud and cluster level deployment and management systems. Participation and contributions in standards bodies such as OCP and DMTF.
- Familiarity with NVIDIA HPC programming models and libraries (CUDA, cuDNN, DOCA)
- Knowledge of enterprise storage architectures and distributed parallel processing paradigms
NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative, passionate and self-motivated, we want to hear from you!
NVIDIA's invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as "the AI computing company." We're looking to grow our company and establish teams with the most thoughtful people in the world. Are you ready to change the next generation of computing? Join us at the forefront of technological advancement.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 431,250 USD.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until May 20, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
$272k - $431.25k
.... We are looking for expert engineers to come and help design rack... ...architect to own end to end manageability architecture for these products... ...you’ll be doing: Drive server management for large... ...implemented in right way with each firmware and software module Collaborate...SuggestedWork at office$272k - $431.25k
Responsibilities Drive server management for large clusters and data centers deploying GPUs and... ...and implemented correctly in each firmware and software module. Design and build... ...development. BS, MS, or PhD in Electrical Engineering, Computer Science, or a related field,...SuggestedWork at office- A leading technology company in Sunnyvale seeks a Software Engineer Manager II to lead embedded systems projects. You will set team priorities and align strategies with organizational goals. The role requires strong technical leadership and at least 8 years of software...Suggested
$211.8k - $317.8k
...Inc. Job Area: Engineering Group, Engineering Group... ...production-ready ARM server platforms. By joining... ...silicon enablement, firmware, OS integrations,... ...Performance, and Limits Management Software & Firmware... ...including fleet management, observability, and policy-based...SuggestedWork experience placementWork from home- ...design specifications and develop firmware applications for low-power,... ...for embedded systems. Manage and maintain source code repositories... ...teams in Digital and beyond. Engineering for device systems spans deep... ...to products. The Principal Firmware Engineer provides thought...SuggestedTemporary workLocal areaImmediate startRemote work
- ...Senior / Principal Firmware Engineer Location: Santa Clara, CA Duration: Full-time/Perm Responsible... ...complex SoC/silicon products for Server, Storage, and/or Networking applications... ...kits (SDKs) to execute on system management controllers (e.g. BMC). Experience...Permanent employmentFull time
$211.8k - $317.8k
...Technologies, Inc.Job Area:Engineering Group, Engineering Group > Software... ...on developing CPU platform firmware for Qualcomm's Snapdragon... ...Verification and Validation Manager to lead a global, multi-role... ...Code Management System.Principal Duties and Responsibilities:...Work experience placementWork from home- A leading tech firm in Santa Clara is seeking a firmware development leader for their Direct Flash Module. This role involves defining the firmware strategy, leading high-performance firmware design, and ensuring project delivery while collaborating with cross-functional...
$211.8k - $317.8k
...Qualcomm Technologies, Inc. Job Area: Engineering Group, Engineering Group Software... ...Summary: As a CPU Performance Management FW Developer, you are responsible for working... ...solution, and implement embedded firmware, to manage performance of the CPU subsystem...Work experience placementRemote workWork from homeRelocation- ...Software/Firmware Engineering Program Manager (EPM) The Software/Firmware EPM leads all engineering activities required for development testing and production release of software and firmware used with Client Labs' products. This is a high-impact position that is directly...Work at office
- ..., training, and enterprise deployment. As the Business Manager for AI Workstations & Servers, you will own the commercial success, channel strategy,... ...role partners cross-functionally with product management, engineering, manufacturing, global sales, and channel partners to...
$272k - $431.25k
...Software Engineer We are seeking software engineers to work on next-generation high-speed... ...a GPU or high-performance computing server will encounter in its lifecycle, by collaborating... ...and debugging skills Ability to self-manage, show leadership, and have good interpersonal...$224k - $356.5k
...improving efficiency, and scaling. As Technical Lead Manager, you will lead the engineering team within NVIDIA’s Dynamo organization. Your responsibility... ...including operators, Helm charts, and GPU observability tooling (DCGM, dcgm-exporter, PyNVML). Background in...Local areaWorldwide$160k - $250k
...starts with you. About the Role: This is a Technical Engineering Manager role (50% Management / 50% Technical) responsible for owning... ...- a lightweight sensor installed on client machines that observes system activity and recognizes malicious behavior, paired with...Work experience placementWork at officeLocal area2 days per week3 days per week- ...Engineering Manager, Inference ML Runtime Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times... ...optimization (latency, throughput, memory efficiency); observability and reliability across the inference stack. Ensure high-...
$206k - $303k
...Principal Engineer - Observability CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups...Permanent employmentTemporary workCasual workWork at officeRemote workFlexible hours- ...and insights to be collected - and the Firmware team is at the heart of this transformation... .... In this role, you will lead and manage the definition of architecture and implementation... ...multiple teams in Digital and beyond. Engineering for Brambles device systems spans deep...For contractorsLocal areaRemote work
$168k - $231k
...flexibility to do it in their own way. The Role: As a Principal Firmware Engineer, you will play a critical role in designing, developing,... ...systems. Project Leadership: Lead firmware projects, managing timelines, resources, and collaboration with hardware and...Immediate startRemote workWork from homeFlexible hours$200k - $287.5k
...redefine the future of how work gets done. Observe by Snowflake is an AI-powered... ...built on the Snowflake AI Data Cloud and engineered for scale. We ingest and store logs, metrics... ...Software Engineer for the Observe Data Management team. This team owns the core pipelines...Flexible hours$165k - $267.5k
...Job Summary We are seeking a highly motivated Software Engineering Manager to lead and grow development teams working on Cortex, Palo... ...Preferred Qualifications ~ Previous experience in cybersecurity, observability, or large-scale multi-tenant platforms is a plus....Remote work- A global electronics manufacturer based in Santa Clara is looking for a Technical Program Manager I to coordinate AI server and rack system requirements. This entry-level role involves managing the NPI lifecycle from design to mass production, leading cross-functional collaboration...
- A leading electronics manufacturer in Santa Clara seeks a Technical Program Manager II to coordinate with engineering teams to define requirements for AI servers. The role involves managing the NPI lifecycle, resolving technical issues, and supporting business development...
$230k - $375k
...predictably as the program accelerates. As Senior Manager of AV Cloud Capacity & Performance Engineering, you own the team and function responsible for... ...findings as engineering-grade recommendations, not observational reports. Own strategic vendor and cloud provider...Work experience placementWork at officeLocal areaWork from homeFlexible hours- ...CA Headquarters. Our Team's Vision: Our Engineering team is driven by a culture that... ...threats in history. Your Impact: The Senior Manager of Cloud Engineering will lead a team responsible... ..., ensuring high‑quality automation, observability, and operational excellence. Lead and...Immediate start
- A leading technology company in Santa Clara seeks a Senior Firmware Engineer to manage server firmware for large data centers using NVIDIA's GPUs. Candidates should have over 15 years of experience in server firmware development and a strong grasp of data center management...
$147k - $237.5k
...stronger relationships, and the kind of precision that drives great outcomes. Job Summary We are looking for a Principal Vulnerability Management Engineer to join the Cortex DevSecOps group and bolster our vulnerability management practices. This role focuses on securing...Full timeWork at officeVisa sponsorshipWork visa$248k - $391k
A leading technology company is seeking a Principal Data and Asset Management Engineer to lead the design of enterprise-scale CMDB and ensure data accuracy across systems. The ideal candidate has over 15 years of software engineering experience, strong skills in Python...Remote job$200k - $322k
...profound global impact. NVIDIA is seeking a Senior Manager of Site Reliability Engineering to lead and reshape how IT operations function at scale... ...into an intelligent, automated operating model using observability, AI insights, and orchestration. This leader will apply...$272k - $425.5k
...seeking a strong technology leader to manage our Server Software Technical Program Management... ...leading a team of Senior TPMs who drive the firmware and system software for NVIDIA's next-... ...Product Introduction) and sustaining engineering teams. Drive the end-to-end SDLC...$184k - $287.5k
...NVIDIA is seeking a Senior Firmware Engineer to join our CSP Engagements team, focusing on system software for Datacenter... ...: Design and develop firmware solutions for manageability and observability of data center servers. Actively participate in hardware bring-up...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Principal Firmware Engineer - Server Manageability and Observability. Be the first to apply!
- chief design engineer Santa Clara, CA
- principal infrastructure engineer Santa Clara, CA
- principal data engineer Santa Clara, CA
- chief engineer Santa Clara, CA
- principal developer Santa Clara, CA
- director data engineering Santa Clara, CA
- general engineer Santa Clara, CA
- director quality engineering Santa Clara, CA
- senior chief engineer Santa Clara, CA
- principal network engineer Santa Clara, CA


