Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Software Engineer - Datacenter Systems

$184k - $287.5k
Full-time

NVIDIA

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world. Join NVIDIA's software infrastructure team to design, build, and improve software systems for rack, networking, and datacenter provisioning and management. As a Senior Software Engineer - Datacenter Systems, you will work with innovative technology supporting large-scale GPU clusters connected through NVLink and InfiniBand. These clusters run today's fastest HPC and AI workloads. This role suits ambitious individuals eager to contribute meaningfully to our stable release train architectures and Site Reliability Engineering (SRE) practices. What you'll be doing: Develop and manage software for hands-off datacenter provisioning and lifecycle management, including rack installation, bare-metal networking configuration, and cluster scaling. Build and implement scalable release train architectures that modularize systems and enable independent, reliable release cycles. Define, monitor, and enforce Service Level Indicators (SLI), Objectives (SLO), and Agreements (SLA) for core infrastructure services to ensure high availability and reliability. Develop intuitive user interfaces (UIs) and APIs for internal provisioning and management tools, making cluster operations and visibility more straightforward. Lead the technical requirement definition process, clearly articulating requirements, inputs, outputs, and quantifiable outcomes for new infrastructure features and system improvements. Build and maintain CI/CD pipelines that support fast, reliable integration and deployment across complex systems. Build tools and automation workflows that simplify software releases, manage dependencies, and increase reliability. Automate software updates and monitor system health to improve reliability and availability. Resolve operational issues across distributed infrastructure as well as manage firmware and software rollouts to minimize downtime and ensure consistency. Work with global engineering teams to align infrastructure tools and support project achievements. What we need to see: BS or MS in Computer Science, Computer Engineering, or a related field or equivalent experience. 8+ years of experience managing infrastructure or systems in high-performance or distributed environments. Expertise in software programming using Python, Rust, C++, and Shell or similar high-level languages. Practical experience with modern CI/CD tools and infrastructure-as-code frameworks such as Jenkins, GitLab, Ansible, GitOps, and Kubernetes. Ability to use AI coding tools and agents effectively to increase your efficiency. Strong understanding of Linux, networking, and distributed system building. Ability to break down monolithic systems into scalable, loosely coupled components. Excellent communication and collaboration skills across multi-functional areas. Ways to stand out from the crowd: Demonstrated experience implementing SRE practices, specifically defining and tracking SLIs, SLOs, and SLAs. Proficiency with observability tools such as Prometheus and Grafana for system health monitoring and analysis. Experience crafting user-facing components (front-end or CLI) for infrastructure management tools. Experience with cluster management tools like Slurm as well as familiarity with NVIDIA DGX systems and GPU-based clusters such as GB200, GB300, and VR-NVL72. Consistent track record leading DevOps process improvements and drive team efficiency. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until June 13, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. NVIDIA pioneered accelerated computing. Today, our AI infrastructure powers global intelligence, transforming every industry. Learn more about NVIDIA.

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Senior Software Engineer - Datacenter Systems in Santa Clara, CA vacancy
  •  ...to PCs, gaming and embedded systems. Grounded in a culture of innovation...  ...your career. THE ROLE As a senior member of the LLM inference...  ...intersection of inference engines, distributed systems, and GPU...  ...architectures and kernel development. Software Engineering Expertise in... 
    Senior

    Advanced Micro Devices

    Santa Clara, CA
    3 hours ago
  • $174k - $252k

    Senior Software Engineer, Embedded Systems/Firmware, AI and Infrastructure Sunnyvale, CA, USA Bachelor’s degree or equivalent practical experience. 5 years of experience in low level systems programming languages (e.g., C++ or C). 3 years of experience testing, maintaining... 
    Senior
    Full time
    Worldwide

    Google Inc.

    Sunnyvale, CA
    3 days ago
  • $175k - $317k

     ...Senior Platform Software Engineer, System Engineering Santa Clara, California Join the Systems Software team to architect and deliver the core software that powers the industry's most innovative, high-performance, and highly available storage platforms. You will... 
    Senior
    Flexible hours

    Pure Storage

    Santa Clara, CA
    13 hours ago
  • $152k - $241.5k

    NVIDIA is looking for outstanding software engineers to help us expand our enterprise GPU management...  ...will span many aspects of GPU system integration, including telemetry and metrics...  ...trends in the enterprise, cloud and datacenter. Come join us as we craft the future... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $160k - $185k

    An innovative aerospace start-up in California is seeking a Senior Software Engineer to join their dynamic team. You will play a crucial role in...  ...ideal candidate will have extensive experience in embedded systems and real-time operating systems. This position offers a... 
    Senior
    Relocation package

    LTA Research

    Sunnyvale, CA
    2 days ago
  •  ...Senior Software Engineer, Systems/Solutions Test This role has been designed as 'Hybrid' with an expectation that you will work on average 2 days per week from an HPE office. Who We Are: Hewlett Packard Enterprise is the global edge-to-cloud company advancing... 
    Senior
    Work at office
    2 days per week

    Hewlett Packard Enterprise

    Sunnyvale, CA
    7 days ago
  • $166k - $244k

    A leading technology company based in Sunnyvale is looking for a Senior Software Engineer to develop next-generation software solutions. The ideal candidate will have 5 years of experience in software development and expertise in C++. Responsibilities include writing and... 
    Senior

    Google Inc.

    Sunnyvale, CA
    3 days ago
  • $224k - $356.5k

     ...At NVIDIA, our Financial Systems Engineering team is at the heart of ensuring that our massive scale operates with zero friction. We are responsible...  ...to-End System Design: Design, deploy, and maintain scalable software services that ensure transactional integrity and manage the... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  •  ...Software Engineer The NextGen OS team is focused on building Applied Intuition's operating system (OS) stack for future vehicles and new products. This is a unique opportunity to build and work on a new full-stack operating system. As a Software Engineer on the... 
    Senior
    For contractors
    For subcontractor

    Applied Intuition

    Sunnyvale, CA
    4 days ago
  • $175k - $317k

     ...opportunities and leave your mark, come join us. THE ROLE Join the Systems Software team to architect and deliver the core software that powers...  ...innovation. You will collaborate closely with hardware engineering and cross-functional software teams to define the future of... 
    Senior
    Full time
    Work at office
    Flexible hours

    Everpure

    Santa Clara, CA
    4 days ago
  • $170.6k - $261.3k

     ...standard -from breakthrough hardware and battery systems to intuitive design, intelligent software, and next-generation safety and entertainment...  ...MRM) to bring the vehicle to a safe stop. As a Senior Software Engineer on the Secondary Driving System team within Embodied... 
    Senior
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    1 day ago
  • $168k - $270.25k

     ...skills to build distributed and compute systems, backend services, microservices and cloud...  ...BS or MS in Computer Science, Computer Engineering or related field (or equivalent...  ...experience developing microservices, cloud software and/or tooling roles. Desirable Experience... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • Amphenol ICC is seeking an Application Engineer based in Santa Clara, CA, to support data center and high-performance computing customers in deploying and optimizing Active Cable Products. The ideal candidate will have over 5 years of experience with high-speed interconnect... 
    Senior

    Amphenol ICC

    Santa Clara, CA
    3 days ago
  • $168k - $270.25k

    Senior Software Engineer, Distributed Systems - NIM Factory page is loaded## Senior Software Engineer, Distributed Systems - NIM Factorylocations: US, CA, Santa Clara: US, TX, Remote: US, NY, Remote: US, CA, Remotetime type: Full timeposted on: Posted Todayjob requisition... 
    Senior
    Remote work

    NVIDIA Corporation

    Santa Clara, CA
    13 hours ago
  • We are looking for a Senior Software Engineer to help build NeMo Platform, NVIDIA’s product for developing, evaluating, deploying, and operating AI systems at scale. This role will focus on NeMo Evaluator, which helps teams understand whether changes to AI agents are making... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

    Job Overview NVIDIA is seeking a highly motivated Software Engineer to join our growing AI and Generative AI engineering team. In this role,...  ...to the design, development, and evaluation of large-scale AI systems powering next‑generation applications in LLMs, agentic AI, retrieval... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $184k - $287.5k

    Position Overview We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $160.36k - $240.54k

     ...a safer, richer, and more connected future. About the Role We’re looking for senior engineers to build/scale Nuro's large-scale computing infrastructure in the cloud/data center. This system is the foundation of many critical business applications throughout the company... 
    Senior

    Icehouseventures

    Mountain View, CA
    1 day ago
  • $224k - $356.5k

    NVIDIA Corporation is seeking a Senior Software Engineer in Santa Clara to define runtime intelligence and safety architecture for autonomous vehicles...  ...involves integrating AI with vehicle dynamics and safety systems, tackling complex problems in real-time robotics. The ideal... 
    Senior

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $141k - $202k

    A leading technology company in Sunnyvale is seeking a Software Engineer III to develop infrastructure solutions. The role involves programming in C++, tackling large-scale systems challenges, and ensuring software quality through testing and debugging. Candidates should... 
    Senior
    Full time

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • $140k - $210k

     ...computing, artificial intelligence, and software-defined networking to provide our...  ...prestigious awards, such as Best Engineering Team, Best Company for Diversity, Compensation...  ...Networks is looking for world-class Senior/Lead Network Systems software engineers. Network... 
    Senior
    Work experience placement

    Arista Networks, Inc.

    Santa Clara, CA
    4 days ago
  • $147.4k - $272.1k

    Senior iOS Software Engineer, Customer Systems Sunnyvale, California, United States Software and Services Do you have a passion for solving problems that impact hundreds of millions of users? At Apple, we are reimagining the next generation of Support, and by joining... 
    Senior
    Work experience placement
    Relocation

    Apple Inc.

    Sunnyvale, CA
    1 day ago
  •  ...centers, to PCs, gaming and embedded systems. Grounded in a culture of...  ...Together, we advance your career. SENIOR GPU FIRMWARE ENGINEER The Role Join AMD’s Datacenter firmware application team as a...  ...intersection of hardware and software. You enjoy collaborating... 
    Senior

    AMD

    Santa Clara, CA
    2 days ago
  • $174k - $252k

    Senior Software Engineer, Site Reliability Engineering X Applicants in San Francisco: Qualified applications with arrest or conviction records...  ...designing, analyzing, and troubleshooting large-scale distributed systems. 2 years of experience leading projects and providing... 
    Senior
    Full time

    Google Inc.

    Sunnyvale, CA
    3 days ago
  • NVIDIA Corporation is seeking a candidate to analyze large-scale datacenter workloads on GPU-accelerated clusters. Responsibilities include identifying application improvements and building visualizations for data analysis. The ideal candidate has 5+ years of experience... 
    Senior

    NVIDIA

    Santa Clara, CA
    1 day ago
  • We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced...  ...and NVIDIA accelerators from datacenter GPUs to edge SoCs. Achieve maximum...  ...Architectural knowledge of CPU and GPU systems. GPU programming experience (CUDA... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $152k - $241.5k

     ...Join a team that analyzes large‑scale datacenter workloads on GPU‑accelerated clusters....  ...partner with OS, container, GPU, and systems engineers, and apply machine learning or deep learning...  ...or prediction) within existing software workflows. Qualifications 5+ years of... 
    Senior

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $152k - $241.5k

    We are now looking for a Senior Deep Learning Software Engineer, TensorRT Performance! NVIDIA is seeking an experienced...  ...of NVIDIA accelerators, from datacenter GPUs to edge SoCs. Implement graph...  ...low‑latency, resource‑constrained systems or embedded AI pipelines (e.g.... 
    Senior

    NVIDIA

    Santa Clara, CA
    4 days ago
  •  ...A leading space systems provider in California seeks a Senior Software Engineer to drive complex projects and mentor team members. The ideal candidate excels in system design and programming, with a strong background in either robotics, autonomy, embedded software, mobile... 
    Senior

    Muon Space Inc

    San Jose, CA
    2 days ago
  • $170k - $240k

     ...Commure is seeking a skilled software engineer to enhance healthcare workflows through innovative technology. Based in Mountain View, California, the role involves designing, developing, and iterating on complex software features while collaborating with cross-functional... 
    Senior

    Commure

    Mountain View, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Software Engineer - Datacenter Systems. Be the first to apply!