Senior Network Reliability Engineer - DGX Cloud

$136k - $224.25k

Dormont Manufacturing Co

NVIDIA is looking for a Senior Network Reliability Engineer to support and maintain our cloud and datacenter network infrastructures. This network serves the needs across the whole software stack for NVIDIA, from Graphics Drivers to Autonomous Vehicles and Artificial Intelligence. In this role, the Senior Network Operations Engineer will remediate critical alerts within defined SLAs, triage production-impacting network incidents, and interact with internal customers on network-related issues. They will also be responsible for engaging with external vendors to remediate hardware and software issues, and will participate in project-related work such as network device upgrades and capacity augmentations. An ideal candidate will possess a wide range of skills, including alert monitoring and resolution in large-scale networks and CSP environments, outstanding troubleshooting skills, understanding of L3 underlay networks, and network protocol knowledge in large multi-vendor infrastructures.

What you will be doing:
  • Engage in 24/7 global shift rotations to provide remote support for network repairs and changes while collaborating across teams and updating customers on status and ticket information.
  • Drive operational improvements in change management and daily operations by following procedures.
  • Manage and operate large-scale IP network technologies and infrastructures.
  • Utilize your skills in peering and datacenter interconnect technologies: PNI, Transit, Exchange, Passive DWDM, Wave circuits.
  • Monitor and support the network health of on-premises and cloud infrastructures.
  • Collaborate and develop workflow enhancements while documenting best practices.

What we need to see:
  • Deep knowledge and experience of TCP/IP, BGP, OSPF, MPLS, IS-IS, VXLAN, EVPN, QoS, GRE, IPsec, DNS, and MACsec.
  • 5+ years of experience in network operations.
  • Skill in network troubleshooting techniques and demonstrated creative problem-solving abilities.
  • Strong track record of alert response within defined SLAs and of incident management.
  • Experience with one or more of the following CSP environments: AWS, Azure, GCP, OCI.
  • Familiarity with Arista, Fortinet and Juniper.
  • Hands-on experience contributing to tooling and automation for provisioning, monitoring, and managing complex network infrastructures.
  • Bachelor's degree in Computer Science, related technical field, or equivalent experience.
  • Excellent verbal and written communication skills.

Ways to stand out from the crowd:
  • Solid understanding of Mellanox/Cumulus OS and InfiniBand technology.
  • Skill in Unix/Linux system administration, with the ability to write and understand Python/Shell scripts to improve efficiency in hyperscale environments.
  • Familiarity with leveraging tools such as Netbox/Nautobot, Prometheus, Grafana, Panoptes to monitor and manage a global network.
  • Passion for innovating and investing in groundbreaking technologies.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 136,000 USD - 224,250 USD for Level 3, and 168,000 USD - 264,500 USD for Level 4. You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 17, 2026. This posting is for an existing vacancy.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Vacancy posted 1 day ago