Senior Network Reliability Engineer - DGX Cloud
$136k - $224.25k
Dormont Manufacturing Co
NVIDIA is looking for a Senior Network Reliability Engineer to support and maintain our cloud and datacenter network infrastructures. This network serves the needs across the whole software stack for NVIDIA, from Graphics Drivers to Autonomous Vehicles and Artificial Intelligence. In this role, the Senior Network Operations Engineer will remediate critical alerts within defined SLAs, triage production-impacting network incidents, and interact with internal customers on network-related issues. They will also be responsible for engaging with external vendors to remediate hardware and software issues, and will participate in project-related work such as network device upgrades and capacity augmentations. An ideal candidate will possess a wide range of skills, including alert monitoring and resolution in large-scale networks and CSP environments, outstanding troubleshooting skills, an understanding of L3 underlay networks, and network protocol knowledge in large multi-vendor infrastructures.

What you will be doing:
- Engage in 24/7 global shift rotations to provide remote support for network repairs and changes while collaborating across teams and updating customers on status and ticket information.
- Drive operational improvements in change management and daily operations by following procedures.
- Manage and operate large-scale IP network technologies and infrastructures.
- Utilize your skills in Peering and Datacenter interconnect technologies: PNI, Transit, Exchange, Passive DWDM, Wave circuits.
- Monitor and support the network health of on-premises and cloud infrastructures.
- Collaborate and develop workflow enhancements while documenting best practices.

What we need to see:
- Deep knowledge and experience of TCP/IP, BGP, OSPF, MPLS, IS-IS, VxLAN, EVPN, QoS, GRE, IPsec, DNS, and MACsec.
- 5+ years of experience in network operations.
- Skill in network troubleshooting techniques and demonstrated creative problem-solving abilities.
- Strong track record of alert response within defined SLAs and of incident management.
- Experience with one or more of the following CSP environments: AWS, Azure, GCP, OCI.
- Familiarity with Arista, Fortinet and Juniper.
- Hands-on experience contributing to tooling and automation for provisioning, monitoring, and managing complex network infrastructures.
- Bachelor's degree in Computer Science, related technical field, or equivalent experience.
- Excellent verbal and written communication skills.

Ways To Stand Out From The Crowd:
- Solid understanding of Mellanox/Cumulus OS and Infiniband technology.
- Skilled in Unix/Linux system administration, with the ability to write and understand Python/Shell scripts to improve efficiency in hyperscale environments.
- Familiarity with leveraging tools such as Netbox/Nautobot, Prometheus, Grafana, Panoptes to monitor and manage a global network.
- Passionate about innovating and investing in groundbreaking technologies.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 136,000 USD - 224,250 USD for Level 3, and 168,000 USD - 264,500 USD for Level 4. You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 17, 2026. This posting is for an existing vacancy.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.


