Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Systems Software Engineer, Data Center Infrastructure Management - EngOps

$152k - $241.5k

NVIDIA

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for phenomenal people like you to help us accelerate the next wave of artificial intelligence.

Join our team of innovative engineers who develop and maintain software facilitating GPU communication, driving groundbreaking solutions in High Performance Computing and Deep Learning. We are seeking a highly motivated EngOps Engineer (5+ years of experience) to join our advanced infrastructure software team. In this role, you will be responsible for maintaining high-performance, rack-scale management solutions for datacenter environments. You will work directly with our Infrastructure Service software development team to support deployment and debug of our hardware and Infrastructure Manager.

What you'll be doing:
  • Take ownership of daily cluster failures and issues, troubleshooting them promptly to maintain optimal cluster availability and performance.
  • Manage updates to the site controller management nodes.
  • Manage the rollout and rollback of cluster software and firmware updates, ensuring smooth transitions and minimal disruptions.
What we need to see:
  • BS or MS in Computer Science, Computer Engineering, Electrical Engineering, or a related field, or equivalent experience.
  • 5+ years of hands-on experience in deploying and administrating clusters, servers, switches, and related infrastructure.
  • Experience with deployment and configuration of operating systems, computer networks, and high-performance applications.
  • Proven ability to work effectively with developers and test engineers across different teams and time zones.
  • Experience deploying services in Kubernetes.
  • Datacenter or computer architecture experience is required-you should understand server, rack, and network topologies, as well as hardware/firmware/software interactions.
  • Background with hardware management protocols (Redfish, IPMI, BMC) and firmware update automation.
  • Experience configuring and debugging complex data center networks.
  • Experience developing scripts to automate recovery actions for management controllers and datacenter systems.
Ways to stand out from the crowd:
  • Direct experience with industry standard alerting tools and emergency response practices. Experience with observability tools such as Grafana.
  • Hands-on experience with GPU-focused hardware and software, such as DGX systems and Compute Clusters.
  • Proficiency in designing large scale networking technologies and the associated challenges. Experience with OpenStack and Foreman

NVIDIA is often recognized as one of the technology industry's most esteemed employers. We have some of the brightest and most driven individuals in the world working with us. If you are a self-motivated and imaginative individual, we encourage you to apply!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 1, 2026.

This posting is for an existing vacancy.


NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Senior Systems Software Engineer, Data Center Infrastructure Management - EngOps in United States vacancy
  • Pacific Asset Management, LLC in Charlotte, NC is seeking a talented Senior Windows Infrastructure Engineer to enhance their IT infrastructure across multiple regions. This role...  ...implementing, and maintaining Windows Server-based systems while ensuring security and reliability.... 
    Senior
    Flexible hours

    Pacific Asset Management, LLC

    Charlotte, NC
    3 days ago
  • $160k - $240k

    Senior Software Engineer - Data Center Infrastructure Management API Location New York Business Area Engineering and CTO Ref # 10043898 Description & Requirements...  ...excellence. We are dedicated to designing scalable systems that allow for efficient planning and maintenance... 
    Senior
    Temporary work
    For contractors
    Work experience placement

    Bloomberg

    New York, NY
    5 days ago
  • $137.8k - $170k

     ...make an impact: The Senior Technical Program Manager will work cross...  ...Information Security and Infrastructure Engineering teams to deliver world...  ...Infrastructure Engineering and Data Center teams. Partner...  ...release management, and system verification and testing... 
    Senior
    Temporary work
    Local area

    Sirius XM Radio Inc

    Atlanta, GA
    2 days ago
  • $65k - $150k

    A leading financial institution in New York is seeking a Network Engineer to manage and implement network infrastructure projects. Candidates should have at least 4 years of experience in network engineering with a strong understanding of network security and operations... 
    Senior

    Bocusa

    New York, NY
    2 days ago
  •  ...Distributed Systems Software Engineer - Public Cloud (Senior/Lead/Principal) Our Public Cloud...  ...solving real-world data management challenges, a strong understanding...  ...Deliver cloud infrastructure automation tools,...  ...nodes in multiple data centers Use and contribute... 
    Senior

    Salesforce, Inc..

    San Francisco, CA
    5 days ago
  • $55 - $65 per hour

     ...For", has an immediate need for a Senior Infrastructure Project Manager to manage projects in the department...  ...front-line management and users of the system. Professionalism and change...  ...Wireless, Technology architecture, Data Center and infrastructure related projects... 
    Senior
    Contract work
    Immediate start
    Remote work

    Motion Recruitment

    United States
    4 days ago
  •  ...opening for a full-time, Senior Director, IT Infrastructure. This will be a...  ...services across data centers, cloud platforms, networks...  ...business-critical systems including SAP S/4...  ...teams and vendor-managed services, and...  ...Information Technology, Engineering, or related field... 
    Senior
    Full time
    Contract work
    Work at office
    Immediate start
    Remote work
    3 days per week

    Arclin

    Alpharetta, GA
    4 days ago
  •  ...Job Description Enterprise Infrastructure Services -Mergers Integration Role -Senior Infrastructure Technical Project Manager Years of Experience: 10+ years Location:...  .... Lead planning and execution for data center and application migrations, including dependency... 
    Senior
    Work at office
    Local area

    LanceSoft

    New York, NY
    4 days ago
  • $165k - $242k

     ...Senior Software Engineer, Data Center Infrastructure Tooling CoreWeave is The Essential Cloud for AI™. Built for pioneers...  ...ability to plan, visualize, and manage massive amounts of infrastructure...  ...with internal/external systems and data sources that feed infrastructure... 
    Senior
    Temporary work
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    3 days ago
  • $160k

     ...About the role As an Infrastructure Senior Project Manager, you will lead various infrastructure...  ...work 30-hour shutdowns / system start-ups that can occur...  .... Ensure project data integrity and documentation...  ...experience working in Data Center, Mission Critical or Colocation... 
    Senior
    Visa sponsorship
    Weekend work
    3 days per week

    CBRE

    New York, NY
    2 days ago
  • $224k - $356.5k

     ...We are seeking a senior system software engineer to work on next-generation Data Center GPU diagnostics for rack-scale AI supercomputer systems. Our charter is to...  ...crafting CUDA/C++ diagnostic workloads and software infrastructure required for new chip development, validation,... 
    Senior

    NVIDIA

    Durham, NC
    1 day ago
  • $117.2k - $313.7k

     ...Job Category Software Engineering Job Details...  ...Distributed Systems Software Engineer...  ...Public Cloud (Senior/Lead/Principal...  ...and hiring managers across the organization...  ...real-world data management...  ...Deliver cloud infrastructure automation tools...  ...multiple data centers Using and... 
    Senior

    Salesforce

    San Francisco, CA
    1 day ago
  • $10,200 - $11,100 per month

     ...partnering with a leading cloud and software provider to hire a Senior Systems Software Engineer (Cloud Infrastructure) who will play a key role...  ...experience with server and data center hardware testing and...  ...• Able to independently manage testing environments and drive... 
    Senior
    Contract work
    Local area
    Immediate start
    Shift work

    Team Red Dog

    Redmond, WA
    1 day ago
  •  ...Principal Infrastructure Engineer As a Principal Infrastructure...  ...JPMorgan Chase within the Data Center Networking group, you...  ...knowledge of software, applications, and technical systems and processes in infrastructure...  ...domains to influence, manage and implement... 

    Chase

    Jersey City, NJ
    3 days ago
  • $130k - $150k

     ...Accenture Infrastructure & Capital Projects You...  ...impact, not just manage projects, but...  ..., grids, transit systems, and public infrastructure...  ...managers, engineers, technologists, and...  ...You'll report to senior leadership on...  ...working in hyper-scale data centers is a plus... 
    Senior
    For subcontractor
    Work at office
    Local area
    Flexible hours

    Accenture Infrastructure & Capital Projects, LLC

    Abilene, TX
    2 days ago
  • $130k - $150k

     ...impact, not just manage projects, but change...  .... At Accenture Infrastructure & Capital...  ..., grids, transit systems, and public infrastructure...  ...managers, engineers, technologists, and...  ...You'll report to senior leadership on cost...  ...working in hyper-scale data centers is a plus... 
    Senior
    For subcontractor
    Live in
    Work at office
    Local area
    Flexible hours

    Accenture

    Abilene, TX
    7 days ago
  •  ...IDR is seeking a Data Center Infrastructure Management Engineer II to join one of our top clients for a remote opportunity. This role is involved in managing...  ...center environments, supporting high-availability systems and building automation across large-scale facilities... 
    Remote work

    IDR Healthcare

    Atlanta, GA
    3 days ago
  • $130k - $170k

     ...Accenture Infrastructure & Capital Projects You've never...  ...an impact, not just manage projects, but change...  ...factories, grids, transit systems, and public...  ...- project managers, engineers, technologists, and strategists...  ...working with data centers or similar mission-critical... 
    Senior
    Contract work
    Work at office
    Local area
    Flexible hours

    Accenture Infrastructure & Capital Projects, LLC

    Atlanta, GA
    10 hours ago
  •  ...Move to Skip to Content Link Senior IT Infrastructure Project Manager Founded in 1965, Herc...  ...projects may include hardware, software, network, security, cloud, or data center solutions. The IT...  ...computer science, information systems, engineering, or related field. PMP Certification... 
    Senior

    Herc Rentals

    Florida, NY
    4 days ago
  •  ...IDR is seeking a Data Center Infrastructure Management Engineer III to join one of our top clients for an opportunity in Atlanta, GA. This role offers...  ...data center industry, focusing on supporting critical systems and automation. Position Overview for the Data... 

    IDR Healthcare

    Atlanta, GA
    3 days ago
  • $185k - $210k

     ...Accenture Infrastructure & Capital Projects You...  ...impact, not just manage projects, but...  ..., grids, transit systems, and public infrastructure...  ...managers, engineers, technologists, and...  ...across data center construction projects...  ...or other relevant software $185,000 - $2... 
    Senior
    For subcontractor
    Work at office
    Local area
    Remote work
    Relocation
    Flexible hours

    Accenture Infrastructure & Capital Projects, LLC

    Staten Island, NY
    2 days ago
  •  ...The Data Center Infrastructure Management (DCIM) Engineer II is responsible for the management and administration of the Electric Power Monitoring Systems (EPMS) and Building Management Systems (BMS's). Primary responsibilities for strategy, engineering, design, and... 
    For contractors
    Work at office
    Flexible hours

    QTS Realty Trust , Inc.

    Atlanta, GA
    4 days ago
  •  ...Koniag Management Solutions, LLC (KMS), a Koniag Government Services (KGS) company, is hiring a Data Center Infrastructure Engineer. Position requires an active Top Secret/SCI clearance with ability...  ...across servers, storage systems, networking architecture, and power... 
    Local area
    Flexible hours

    Koniag

    Washington DC
    3 days ago
  • A global IT solutions provider seeks a Solutions Engineer for Enterprise to collaborate with account teams, design tailored infrastructure solutions, and manage client relationships. The role emphasizes teamwork, technology opportunities, and requires 3-5 years of related... 
    Senior
    Remote work

    SHI International

    United States
    3 days ago
  • $150k - $170k

     ...Accenture Infrastructure & Capital Projects You've never...  ...an impact, not just manage projects, but change...  ...factories, grids, transit systems, and public...  ...- project managers, engineers, technologists, and strategists...  ...protected across all data center projects. You'll... 
    Senior
    Work at office
    Local area
    Remote work
    Flexible hours
    Shift work

    Accenture Infrastructure & Capital Projects, LLC

    Indianapolis, IN
    3 days ago
  •  ...A leading software development platform in the United States is seeking a Senior Software Engineer to enhance its physical infrastructure frameworks. This role involves designing and building systems for scalable data centers while collaborating with cross-functional teams... 
    Senior
    Remote work

    GitHub

    United States
    4 days ago
  •  ...A leading Managed Service Provider is seeking a Sr Program Project Manager to oversee large-scale data center migration projects. This fully remote position requires a strong background in IT program management with a minimum of 10 years of experience. The ideal candidate... 
    Senior
    Remote work

    IT Associates

    New York, NY
    4 days ago
  • $130k - $180k

     ...Accenture Infrastructure & Capital Projects...  ...impact, not just manage projects, but change...  ...grids, transit systems, and public...  ...managers, engineers, technologists,...  ...Ensure cost data, change orders,...  ...Accenture office or center. Here's what...  ...decision support to senior stakeholders.... 
    Senior
    Contract work
    For contractors
    Work at office
    Local area
    Remote work
    Flexible hours

    Accenture Infrastructure & Capital Projects, LLC

    Kansas City, MO
    3 days ago
  • $140k - $165k

     ...an impact, not just manage projects, but change...  ...built. At Accenture Infrastructure & Capital Projects, you...  ...factories, grids, transit systems, and public...  ...- project managers, engineers, technologists, and strategists...  ...Civils Projects (Data Center) 6+ years of minimum... 
    Senior
    Contract work
    Local area
    Flexible hours

    Accenture

    New York, NY
    4 days ago
  • $150k - $170k

     ...an impact, not just manage projects, but change...  ...built. At Accenture Infrastructure & Capital Projects, you...  ...factories, grids, transit systems, and public...  ...- project managers, engineers, technologists, and strategists...  ...protected across all data center projects. You'll... 
    Senior
    Live in
    Work at office
    Local area
    Remote work
    Flexible hours
    Shift work

    Accenture

    Indianapolis, IN
    6 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Systems Software Engineer, Data Center Infrastructure Management - EngOps. Be the first to apply!