Senior Systems Software Engineer, Data Center Infrastructure Management - EngOps
$152k - $241.5k
NVIDIA
NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for phenomenal people like you to help us accelerate the next wave of artificial intelligence.
Join our team of innovative engineers who develop and maintain software facilitating GPU communication, driving groundbreaking solutions in High Performance Computing and Deep Learning. We are seeking a highly motivated EngOps Engineer (5+ years of experience) to join our advanced infrastructure software team. In this role, you will be responsible for maintaining high-performance, rack-scale management solutions for datacenter environments. You will work directly with our Infrastructure Service software development team to support deployment and debug of our hardware and Infrastructure Manager.
What you'll be doing:
- Take ownership of daily cluster failures and issues, troubleshooting them promptly to maintain optimal cluster availability and performance.
- Manage updates to the site controller management nodes.
- Manage the rollout and rollback of cluster software and firmware updates, ensuring smooth transitions and minimal disruptions.
- BS or MS in Computer Science, Computer Engineering, Electrical Engineering, or a related field, or equivalent experience.
- 5+ years of hands-on experience in deploying and administrating clusters, servers, switches, and related infrastructure.
- Experience with deployment and configuration of operating systems, computer networks, and high-performance applications.
- Proven ability to work effectively with developers and test engineers across different teams and time zones.
- Experience deploying services in Kubernetes.
- Datacenter or computer architecture experience is required: you should understand server, rack, and network topologies, as well as hardware/firmware/software interactions.
- Background with hardware management protocols (Redfish, IPMI, BMC) and firmware update automation.
- Experience configuring and debugging complex data center networks.
- Experience developing scripts to automate recovery actions for management controllers and datacenter systems.
- Direct experience with industry standard alerting tools and emergency response practices. Experience with observability tools such as Grafana.
- Hands-on experience with GPU-focused hardware and software, such as DGX systems and Compute Clusters.
- Proficiency in designing large-scale networking technologies and handling the associated challenges. Experience with OpenStack and Foreman.
NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

