Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Systems Software Engineer, Data Center Infrastructure Management - EngOps

$152k - $241.5k

NVIDIA

NVIDIA EngOps Engineer

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for phenomenal people like you to help us accelerate the next wave of artificial intelligence.

Join our team of innovative engineers who develop and maintain software facilitating GPU communication, driving groundbreaking solutions in High Performance Computing and Deep Learning. We are seeking a highly motivated EngOps Engineer (5+ years of experience) to join our advanced infrastructure software team. In this role, you will be responsible for maintaining high-performance, rack-scale management solutions for datacenter environments. You will work directly with our Infrastructure Service software development team to support deployment and debug of our hardware and Infrastructure Manager.

What you'll be doing:

  • Take ownership of daily cluster failures and issues, troubleshooting them promptly to maintain optimal cluster availability and performance.
  • Manage updates to the site controller management nodes.
  • Manage the rollout and rollback of cluster software and firmware updates, ensuring smooth transitions and minimal disruptions.

What we need to see:

  • BS or MS in Computer Science, Computer Engineering, Electrical Engineering, or a related field, or equivalent experience.
  • 5+ years of hands-on experience in deploying and administrating clusters, servers, switches, and related infrastructure.
  • Experience with deployment and configuration of operating systems, computer networks, and high-performance applications.
  • Proven ability to work effectively with developers and test engineers across different teams and time zones.
  • Experience deploying services in Kubernetes.
  • Datacenter or computer architecture experience is required—you should understand server, rack, and network topologies, as well as hardware/firmware/software interactions.
  • Background with hardware management protocols (Redfish, IPMI, BMC) and firmware update automation.
  • Experience configuring and debugging complex data center networks.
  • Experience developing scripts to automate recovery actions for management controllers and datacenter systems.

Ways to stand out from the crowd:

  • Direct experience with industry standard alerting tools and emergency response practices. Experience with observability tools such as Grafana.
  • Hands-on experience with GPU-focused hardware and software, such as DGX systems and Compute Clusters.
  • Proficiency in designing large scale networking technologies and the associated challenges. Experience with OpenStack and Foreman

NVIDIA is often recognized as one of the technology industry's most esteemed employers. We have some of the brightest and most driven individuals in the world working with us. If you are a self-motivated and imaginative individual, we encourage you to apply!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 1, 2026.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Senior Systems Software Engineer, Data Center Infrastructure Management - EngOps in United States vacancy
  • $152k - $241.5k

     ...of innovative engineers who develop and maintain software facilitating GPU...  ...motivated EngOps Engineer (5+ years...  ...our advanced infrastructure software team....  ..., rack-scale management solutions for...  ...of operating systems, computer networks...  ...complex data center networks. ~... 
    Senior
    Remote work

    NVIDIA

    United States
    3 days ago
  • $160k - $240k

    Senior Software Engineer - Data Center Infrastructure Management API Location New York Business Area Engineering and CTO Ref # 10043898 Description & Requirements...  ...excellence. We are dedicated to designing scalable systems that allow for efficient planning and maintenance... 
    Senior
    Temporary work
    For contractors
    Work experience placement

    Bloomberg

    New York, NY
    1 day ago
  • The Senior Manager, Power Generation Engineering provides strategic and organizational leadership...  ...of power generation infrastructure supporting hyperscale and enterprise data centers worldwide. This leader is...  ...generation, microgrids, fuel systems, renewable energy... 
    Senior
    Full time
    Worldwide
    Flexible hours

    Oracle

    Abilene, TX
    2 days ago
  •  ...impact, not just manage projects, but change...  .... At Accenture Infrastructure & Capital...  ..., grids, transit systems, and public infrastructure...  ...managers, engineers, technologists, and...  ...You’ll report to senior leadership on cost...  ...working in hyper-scale data centers is a plus... 
    Senior
    For subcontractor
    Work at office
    Local area
    Flexible hours

    Accenture Infrastructure and Capital Projects, LLC

    Abilene, TX
    27 days ago
  •  ...technology company in Austin, Texas is looking for an EngOps Engineer to maintain high-performance management solutions in datacenter environments. The...  ...of experience in deploying clusters and managing infrastructure, along with a degree in a related field. Candidates... 
    Senior

    NVIDIA Corporation

    Austin, TX
    4 days ago
  •  ...impact, not just manage projects, but change...  .... At Accenture Infrastructure & Capital Projects...  ...factories, grids, transit systems, and public...  ...project managers, engineers, technologists, and...  ...office or center. With all our roles...  ...Experience working with data centers or similar... 
    Senior
    Contract work
    Work at office
    Local area
    Remote work
    Flexible hours

    Accenture Infrastructure and Capital Projects, LLC

    Dallas, TX
    2 days ago
  •  ...Senior IT Infrastructure Project Manager Randstad is seeking a highly skilled and experienced...  ...managing Mainframe and Data Center Migration projects within...  ...of all hardware and software. Responsibilities:...  ...project sites within the EPM system. Ensure quality project... 
    Senior
    For contractors
    Work experience placement

    Samprasoft

    Washington DC
    11 hours ago
  •  ...Manufacturing Co is seeking an experienced HPC Infrastructure Planner Lead in Chicago. This role...  ...overseeing infrastructure projects, managing contractor relationships, and incorporating...  ...significant experience in managing data center operations and excellent communication... 
    Senior
    For contractors

    Dormont Manufacturing Co

    Chicago, IL
    1 day ago
  •  ...Senior IT Infrastructure Project Manager Bonita Springs, FL, USA, 34134 Employment Status...  ...may include hardware, software, network, security, cloud, or data center solutions. The IT Infrastructure...  ...science, information systems, engineering, or related field. PMP... 
    Senior
    Full time

    Herc Rentals

    Bonita Springs, FL
    22 hours ago
  • $60 - $70 per hour

    Eliassen Group is looking for a Project Manager III to lead network and infrastructure initiatives in Denver, CO. The ideal candidate will manage full lifecycle delivery for connectivity and data center projects, coordinate multiple technical teams, and drive scope, schedule... 
    Senior
    Hourly pay

    Eliassen Group

    Denver, CO
    1 day ago
  • $55 - $65 per hour

     ...For", has an immediate need for a Senior Infrastructure Project Manager to manage projects in the department...  ...front-line management and users of the system. Professionalism and change...  ...Wireless, Technology architecture, Data Center and infrastructure related projects... 
    Senior
    Remote job
    Contract work
    Immediate start

    Motion Recruitment

    New York, NY
    11 hours ago
  • $91.4k - $187k

     ...Job Description Deploys physical plant cabling systems. Coordinates infrastructure implementation efforts for Oracle data centers. Collaborates with Program Management teams to ensure project deliverables are completed on time. Collaborates with vendors to deliver... 
    Senior
    Temporary work
    Flexible hours
    Shift work

    Oracle

    Reston, VA
    6 hours ago
  • $224k - $356.5k

    We are seeking a senior system software engineer to work on next-generation Data Center GPU diagnostics for rack-scale AI supercomputer systems. Our charter is to...  ...crafting CUDA/C++ diagnostic workloads and software infrastructure required for new chip development, validation,... 
    Senior

    Dormont Manufacturing Co

    Raleigh, NC
    1 day ago
  •  ...We are Olsson. We engineer and design solutions...  ...Description As a Senior Project Manager for Large & Special...  ...companies on their Data Centers. Your engagement will...  ...will help develop the infrastructure that powers the...  ...participate in a bonus system that rewards performance... 
    Senior
    Full time
    Worldwide
    Flexible hours

    Olsson

    Dallas, TX
    more than 2 months ago
  • $272k - $431.25k

    Principal System Software Engineer - Data Center MODS page is loaded## Principal System Software Engineer - Data Center MODSlocations: US, CA, Santa...  ...challenges within their unique data center infrastructures.**What we need to see:*** Bachelor's degree in Computer... 

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $124k - $195.5k

     ...platforms are at the center of generative AI, autonomous...  ...instruments and data centers across the world...  ...vertically integrated software stacks, GPU...  ...dedicated and motivated System Software Engineer who is passionate about AI Infrastructure. You will collaborate... 
    Senior
    Full time

    NVIDIA

    Santa Clara, CA
    11 hours ago
  • $200k - $250k

     ...Vantage Data Centers Management Company LLC is seeking a Director of Global Category Strategy, responsible...  ...sourcing integrated and prefabricated infrastructure products. This highly strategic role involves collaborating with engineering and operations to drive supplier... 
    Senior
    Remote work

    Vantage Data Centers Management Company LLC

    Colorado
    4 days ago
  • IDR is seeking a Data Center Infrastructure Management Engineer II to join one of our top clients for a remote opportunity. This role is involved in managing...  ...center environments, supporting high‑availability systems and building automation across large‑scale facilities... 
    Remote work

    IDR, Inc.

    Atlanta, GA
    3 days ago
  • Data Center Infrastructure Management (DCIM) Engineer II is responsible for the management and administration of the Electric Power Monitoring Systems (EPMS) and Building Management Systems (BMS’s). Primary responsibilities include strategy, engineering, design, and innovation... 
    For contractors
    Work at office
    Flexible hours

    Quality Technology Services, LLC

    Atlanta, GA
    2 days ago
  • IDR, Inc. is seeking a Data Center Infrastructure Management Engineer II for a remote opportunity. This role is responsible for managing critical infrastructure...  ...of power infrastructure and troubleshooting cooling systems. The ideal candidate will have 5+ years of experience... 
    Remote job

    IDR, Inc.

    Atlanta, GA
    3 days ago
  • The Data Center Infrastructure Management (DCIM) Engineer II is responsible for the management and administration of the Electric Power Monitoring Systems (EPMS) and Building Management Systems (BMS’s). Their primary responsibilities include strategy, engineering, design... 
    For contractors
    Work at office
    Flexible hours

    QTS Realty Trust , Inc.

    Atlanta, GA
    3 days ago
  • IDR is seeking a Data Center Infrastructure Management Engineer III to join one of our top clients for an opportunity in Atlanta, GA. This role offers an...  ...data center industry, focusing on supporting critical systems and automation. Position Overview for the Data Center... 

    IDR, Inc.

    Atlanta, GA
    1 day ago
  • ViziRecruiter, LLC. is offering an exciting opportunity for a Sr. Data Center Infrastructure Engineer in North Carolina. This contract position involves hands-on deployment, maintenance, and support of physical data center infrastructure. The engineer will collaborate with... 
    Senior
    Contract work

    ViziRecruiter,LLC.

    Raleigh, NC
    1 day ago
  • TerraPower in Bellevue, Washington, seeks an IT System Engineer experienced in managing data center operations and VMware environments. This critical role involves ensuring physical infrastructure integrity and backup solution administration. The ideal candidate will have... 
    Senior
    Relocation package

    Dormont Manufacturing Co

    Bellevue, WA
    3 days ago
  •  ...Be Doing Develop diagnostic systems for NVIDIA data center platforms, which involve hardware and software tools to develop the worst...  ...failures, acting as a Level 2 engineering contact for critical issues...  ...Background in cloud‑scale infrastructure and partner engagement.... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    11 hours ago
  • Accenture Infrastructure and Capital Projects, LLC Infrastructure...  ...Capital Projects, Senior Construction Manager, ANS May 26, 2026...  ..., grids, transit systems, and public...  ...project managers, engineers, technologists, and...  ...management services for data center construction... 
    Senior
    Contract work
    For contractors
    Local area
    Flexible hours

    Dormont Manufacturing Co

    Kansas City, MO
    2 days ago
  • Ascendion is looking for a Connectivity Engineer to support the physical infrastructure of global data centers in New Albany, OH. This role involves managing full project lifecycles, from initiation to production turnover, ensuring quality and timeliness. The ideal candidate... 
    Senior

    Ascendion

    New Albany, OH
    2 days ago
  •  ...Digital Health is seeking a Technical Services Engineer II to ensure the availability, performance, and security of its corporate infrastructure in Boston. The role involves maintaining and optimizing cloud and data center environments as well as administering server platforms... 
    Senior

    Mass Digital Health

    Boston, MA
    2 days ago
  • $130k - $150k

     ...impact, not just manage projects, but change...  .... At Accenture Infrastructure & Capital...  ..., grids, transit systems, and public infrastructure...  ...managers, engineers, technologists, and...  ...You’ll report to senior leadership on cost...  ...working in hyper‑scale data centers is a plus.... 
    Senior
    For subcontractor
    Live in
    Work at office
    Local area
    Flexible hours

    Accenture

    Columbus, OH
    2 days ago
  • $140k - $165k

     ...an impact, not just manage projects, but change...  ...built. At Accenture Infrastructure & Capital Projects, you...  ...factories, grids, transit systems, and public...  ...- project managers, engineers, technologists, and strategists...  ...working with data centers or similar mission-critical... 
    Senior
    Contract work
    Work at office
    Local area
    Flexible hours

    Accenture Infrastructure & Capital Projects, LLC

    Columbus, OH
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Systems Software Engineer, Data Center Infrastructure Management - EngOps. Be the first to apply!