Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Principal Firmware Engineer - Server Manageability and Observability

$272k - $431.25k

NVIDIA Gruppe

NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA’s rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We’re looking for a strong technical architect to own the end‑to‑end architecture of these products, at the system software level, covering firmware, kernel drivers, operating systems, and user mode drivers. You will work with component leads internally and engage with industry‑leading cloud service providers on taking these products to market. What you’ll be doing Serve as the primary technical point of contact for major customers, leading technological discussions, defining KPIs, gathering requirements, and addressing complex technical queries. As a system software architect, lead technical innovation and strategic collaborations with major hyperscalers to architect next‑generation data center products. Align NVIDIA’s roadmap with major customers’ requirements through direct engagement. Develop and drive adoption of new technologies and protocols. Make critical technical decisions in ambiguous situations, mitigating risks through left‑shift strategies. What we need to see Deep expertise in scalable and performant server system architecture, focusing on SW/HW interfaces. Extensive experience with complex system software for accelerators (GPUs, DPUs, FPGAs). Mastery of system firmware (SBIOS, OpenBMC), embedded systems, and Linux kernel internals. Proficiency in Out‑of‑Band and In‑Band management architectures, device management protocols (MCTP, PLDM, SPDM, RDE) and system management protocols (Redfish, IPMI). Extensive knowledge of networking technologies and protocols, including TCP/IP, Ethernet, InfiniBand, as well as advanced switching and routing concepts. Experience collaborating with platform security experts to define tradeoffs between security and ease of use. Demonstrated success in leading complex, cross‑functional projects to completion, showcasing the ability to influence and achieve results without direct authority in large‑scale, collaborative environments. Demonstrable experience in implementing left‑shift strategy to de‑risk program execution. BS or MS degree in Computer Science, Electrical Engineering or related field (or equivalent experience). 15+ years in the area of system architecture and design. Ways to stand out from the crowd Knowledge of cloud and cluster level deployment and management systems. Participation and contributions in standards bodies such as OCP and DMTF. Familiarity with NVIDIA HPC programming models and libraries (CUDA, cuDNN, DOCA). Knowledge of enterprise storage architectures and distributed parallel processing paradigms. Benefits Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000USD–431,250USD. You will also be eligible for equity and benefits. Application Information Applications for this job will be accepted at least until May20,2026. Equal Opportunity Employment NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr NVIDIA Gruppe

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Principal Firmware Engineer - Server Manageability and Observability in Santa Clara, CA vacancy
  • $272k - $431.25k

    What you’ll be doing: Drive server management for large clusters and data centers deploying GPUs...  ...implemented in right way with each firmware and software module. Collaborate with...  ...drive large complex problem with 50+ engineers working. Your base salary will be determined... 
    Suggested
    Work at office

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $211.8k - $317.8k

     ...future for all. As a Qualcomm Software Engineer, you will design, develop, create,...  ...and interfaces. As a SoC RAS and Manageability Controller Firmware Developer, you are responsible for working...  ...requirements on a datacenter server platform. Familiarity with ARM RAS specification... 
    Suggested
    Work experience placement
    Remote work
    Work from home

    Qualcomm

    Santa Clara, CA
    1 day ago
  • $218.8k - $335.3k

     ...maintaining the tools and services engineers here at GM use every day to...  ...looking for an Engineering Manager with an extensive...  ...start delivering impact through observability frameworks and will evolve depending...  ...as those developing AI/ML, firmware, and infrastructure—to... 
    Suggested
    Full time
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    4 days ago
  •  ...NVIDIA Corporation in Santa Clara, California is seeking a Software Engineering Manager to lead a distributed team in developing security-critical firmware. This role involves ensuring quality and delivery of root-of-trust firmware, fostering a high-performing team culture... 
    Suggested
    Remote job

    Jobleads-US

    Santa Clara, CA
    4 days ago
  • $211.8k - $317.8k

     ...team to collaborate with world‑class engineers and create innovative solutions...  ...scalability. We are seeking a Software and Firmware Verification and Validation Manager to lead a global, multi‑role team...  ...a source‑code management system Principal Duties and Responsibilities... 
    Suggested
    Work experience placement
    Work from home

    Qualcomm

    Santa Clara, CA
    4 days ago
  • A leading tech firm in Santa Clara is seeking a firmware development leader for their Direct Flash Module. This role involves defining the firmware strategy, leading high-performance firmware design, and ensuring project delivery while collaborating with cross-functional... 

    Pure Storage, Inc.

    Santa Clara, CA
    4 days ago
  • Netflix, Inc. is seeking an experienced Engineering Manager to lead the Client Delivery & Observability (CDO) team. In this role, you will ensure every client release, server canary, and A/B test is safely delivered while building a high-performing team of engineers. Responsibilities... 
    Flexible hours

    Netflix, Inc.

    Los Gatos, CA
    2 days ago
  • General Motors is seeking an Engineering Manager for its Observability team to enhance tools that support the autonomous vehicle program. This leadership role involves managing engineers, driving technical strategies, and ensuring the observability of complex systems. The... 
    Remote job

    General Motors

    Mountain View, CA
    2 days ago
  • $211.8k - $317.8k

     ...is seeking an experienced ARM Server Power, Performance, and Limits Management Software & Firmware Architect to define the end‑...  ...interfaces suitable for fleet‑level observability and automation. Collaborate...  ...Bachelor’s degree in Engineering, Information Systems, Computer... 

    Qualcomm

    Santa Clara, CA
    4 days ago
  • $147k - $237.5k

    Job Summary Principal Software Engineer to architect, build, and evolve our observability platform across infrastructure, applications,...  ...recommendations on open source, self‑managed, managed‑service, and hybrid...  ...skills, Codex tools, MCP servers, structured prompts) that... 
    Remote work
    Visa sponsorship
    Work visa

    Palo Alto Networks

    Santa Clara, CA
    3 days ago
  • $200k - $260k

     ...As we continue to grow, we’re looking for a skilled  Principal Firmware Engineer – Coherent Optical Modules & Embedded Platforms (CPO)  to...  ...design ~ Firmware upgrade frameworks ~ Flash memory management ~ Production firmware release processes ~ Proven track... 
    Local area
    Immediate start

    Bright Vision Technologies

    Santa Clara, CA
    3 days ago
  • $168k - $231k

     ...flexibility to do it in their own way. The Role: As a Principal Firmware Engineer, you will play a critical role in designing, developing,...  ...systems. Project Leadership: Lead firmware projects, managing timelines, resources, and collaboration with hardware and... 
    Immediate start
    Remote work
    Work from home
    Flexible hours

    Logitech

    San Jose, CA
    16 days ago
  • $147k - $237.5k

    Job Summary We are seeking a Principal Software Engineer to join our Machine Identity Management CyberArk team, focused on building and scaling frontend experiences that enable visibility, control, and orchestration of machine identities. Responsibilities Lead the design... 

    Palo Alto Networks, Inc.

    Santa Clara, CA
    5 days ago
  •  ...CA Headquarters. Our Team's Vision: Our Engineering team is driven by a culture that...  ...threats in history. Your Impact: The Senior Manager of Cloud Engineering will lead a team responsible...  ..., ensuring high‑quality automation, observability, and operational excellence. Lead and... 
    Immediate start

    Illumio

    Sunnyvale, CA
    1 day ago
  • Overview We are looking for an experienced Engineering Manager to lead the Client Delivery & Observability (CDO) team, a newly formed group that owns the release...  ...observability stack, ensuring every client release, server canary, and A/B test is safely delivered and... 
    Flexible hours

    Netflix, Inc.

    Los Gatos, CA
    4 days ago
  • $165k - $267.5k

     ...Job Summary We are seeking a highly motivated Software Engineering Manager to lead and grow development teams working on Cortex, Palo...  ...Preferred Qualifications ~ Previous experience in cybersecurity, observability, or large-scale multi-tenant platforms is a plus.... 
    Remote work

    Palo Alto Networks

    Santa Clara, CA
    3 days ago
  • $135.6k - $204.38k

     ...Manager, Software Engineering We have an opportunity for a Manager, Engineering to join our Universal Asset Insights team in Burnaby, BC,...  ...inventory via programmatic APIs, using AI for enhanced observability, policy enforcement, and adaptive security Belong— Your... 
    Flexible hours
    Shift work
    Night shift

    Infoblox

    Santa Clara, CA
    4 days ago
  • $233k - $349.6k

     ...Qualcomm Technologies, Inc. is hiring a Server Power Management Architect for its Data Center team,...  ...with hardware, software, and firmware architects to develop an optimal end‑...  ...sequencing. Collaborate with thermal engineers to optimize implementation. Maintain... 
    Work from home

    Jobleads-US

    Santa Clara, CA
    5 days ago
  • $272k - $431.25k

    We are seeking software engineers to work on next-generation high-speed interconnect technologies...  ...a GPU or high-performance computing server will encounter in its lifecycle, by collaborating...  ...and debugging skills. Ability to self-manage, show leadership, and have good... 

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $224k - $356.5k

     ...on leadership with expertise in systems engineering, inference infrastructure, and open‑...  ...and pushed forward. As Technical Lead Manager, you will lead the engineering team within...  ...including operators, Helm charts, and GPU observability tooling (DCGM, dcgm‑exporter, PyNVML).... 
    Local area
    Worldwide

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  •  ...and more connected. We are looking for a technically deep Engineering Manager to lead the AI team at Coram. This team is small, highly capable...  ...Establish strong engineering standards around reliability, observability, and model evaluation What We’re Looking For Several... 
    Shift work

    Coram AI

    Sunnyvale, CA
    3 days ago
  •  ...and insights to be collected - and the Firmware team is at the heart of this transformation...  .... In this role, you will lead and manage the definition of architecture and implementation...  ...multiple teams in Digital and beyond. Engineering for Brambles device systems spans deep... 
    For contractors
    Remote work

    CHEP UK Ltd.

    Santa Clara, CA
    3 days ago
  • $140k - $215k

     ...Role At CrowdStrike, Site Reliability Engineering (SRE) is at the forefront of ensuring the...  ...role, you'll have the opportunity to manage a team of talented engineers, providing...  ...multi‑cloud failover strategies. Advanced observability experience including Prometheus,... 
    Full time
    Work experience placement
    Work at office
    Local area
    2 days per week

    Koitecc Solutions

    Sunnyvale, CA
    1 day ago
  • $272k

    NVIDIA Gruppe is seeking an expert in server firmware development. This role requires over 15 years of experience, focusing on managing data center health and optimizing firmware...  ...has a strong educational background in engineering and expertise in C/C++, Python, and data... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • A leading electronics manufacturer in Santa Clara seeks a Technical Program Manager II to coordinate with engineering teams to define requirements for AI servers. The role involves managing the NPI lifecycle, resolving technical issues, and supporting business development... 

    Foxconn E BG Group

    Santa Clara, CA
    3 days ago
  • Tesla, located in Palo Alto, is seeking a Software Engineer for the Battery Management System Team. In this role, you will develop high-quality software, focusing on firmware drivers and real-time software algorithms that enhance vehicle performance and reliability. The... 

    Tesla

    Palo Alto, CA
    5 days ago
  • $140k - $300k

    Tesla is seeking an Embedded Software Engineer in Palo Alto to contribute to battery management systems for their energy products. This role involves developing and debugging real-time software in embedded RTOS environments and collaborating with hardware teams for design... 

    Tesla

    Palo Alto, CA
    1 day ago
  • $200k - $250k

     ...Description What You Can Expect We are looking for a Senior Principal Firmware Engineer to join Client's Optical Connectivity firmware team in...  ...(including hitless/in-service upgrade), and flash management Experience with SoC-based embedded platforms (MCU/DSP-... 

    Phizenix

    Santa Clara, CA
    21 days ago
  •  ...Description Role As an Applied ML Validation Manager on the Software Validation team within...  ..., Machine Learning, Robotics, Software Engineering, Data Science , or a related field. 2+...  ...on evaluation, automation, and ML observability . Benefits Overview From day one, we... 
    Local area
    Work from home

    Israelvcforum

    Sunnyvale, CA
    2 days ago
  • $200k - $322k

    Senior Manager, Site Reliability Engineering page is loaded## Senior Manager, Site Reliability Engineeringlocations: US, CA, Santa Claratime type:...  ...into an intelligent, automated operating model using observability, AI insights, and orchestration. This leader will apply... 

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Principal Firmware Engineer - Server Manageability and Observability. Be the first to apply!