Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

DGX Cloud Senior Reliability Engineer (Remote)

NVIDIA

Santa Clara, CA
  • Remote job

NVIDIA Corporation in Santa Clara is seeking a Senior Reliability Engineer, DGX Cloud, to build and enhance reliability strategies for large-scale systems. You will lead efforts to implement SLO programs, improve operational practices, and ensure system resilience. The ideal candidate has over 10 years of experience, strong software skills, and expertise in chaos engineering or reliability fields. This role offers excellent compensation and benefits, fostering a diverse and inclusive work environment. #J-18808-Ljbffr NVIDIA Corporation

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the DGX Cloud Senior Reliability Engineer (Remote) in Santa Clara, CA vacancy
  •  ...Passionate about building world-class reliability systems, the full-time Senior Reliability Engineer will develop and implement an organization-wide reliability strategy for DGX Cloud, focusing on operational excellence and incident response in a 24/7 environment. Key... 
    Remote work
    Cloud
    Senior
    Full time

    Virtual Vocations Inc

    United States
    1 day ago
  • $168k - $270.25k

    ## Senior Reliability Engineer, DGX CloudApplylocations: US, CA, Santa Clara: US, Remotetime type: Full timeposted on: Posted Todayjob requisition...  ...systems? Join NVIDIA as a Sr. Reliability Engineer, DGX Cloud, and be a pivotal part of a team that redefines operational... 
    Cloud
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  •  ...Joining a high-performing team remotely, the full-time Senior Site Reliability Engineer will own the reliability and automation of critical AI infrastructure,...  ...+ years of experience in automating and supporting cloud infrastructure (AWS) and network environments Proven... 
    Remote work
    Cloud
    Senior
    Full time

    Virtual Vocations Inc

    United States
    8 hours ago
  •  ...initiatives while building a public cloud platform from scratch? Would...  ...? Join our IaaS Site Reliability Engineering (SRE) team. We design,...  ...your office. We are a 100% remote-first team. We will support...  ...SRE team: SRE I → SRE II → Senior → Senior II → Principal → Senior... 
    Remote work
    Cloud
    Senior
    Work at office

    Akamai

    New York, NY
    1 day ago
  •  ...technology firm focused on AI infrastructure is seeking a Senior Site Reliability Engineer in Austin, TX. The role involves ensuring the reliability...  ...Computer Science and 5 years of relevant experience with cloud providers and system monitoring. Telecommuting is... 
    Remote job
    Cloud
    Senior

    trustwise Inc.

    Austin, TX
    1 day ago
  • Akamai Technologies GmbH is looking for a Senior Site Reliability Engineer in Cambridge, MA. This role involves designing and operating critical services...  ...that support the reliability and performance of Akamai Cloud infrastructure. Ideal candidates should have at least 5... 
    Remote job
    Cloud
    Senior

    Akamai Technologies GmbH

    Cambridge, MA
    1 day ago
  •  ...The role involves scaling our platform and infrastructure while enhancing reliability and the overall developer experience. Ideal candidates will have strong expertise in distributed systems, cloud-native architectures, and Unix/Linux environments. We offer a warm work... 
    Remote job
    Cloud
    Senior

    BuildBuddy

    Palo Alto, CA
    3 days ago
  • $65 - $75 per hour

     ...no C2C, no exceptions Fully remote Key Responsibilities: Process...  ...monitoring solutions for a hybrid cloud environment (cloud, on-prem,...  ...tools. Description: As an Engineer 2, you will collaborate with...  ...across the IT organization. Seniority level Mid-Senior level Employment... 
    Remote work
    Cloud
    Senior
    Contract work

    SBS Creatix

    New York, NY
    1 day ago
  • $150k - $170k

     ...Senior Site Reliability Engineer – Zip Co Join to apply for the Senior Site Reliability Engineer role at Zip Co At Zip, we build cloud‑native software applications that serve millions of customers and...  ...engineering team. We offer a remote‑first opportunity for US‑based employees... 
    Remote work
    Cloud
    Senior
    Casual work
    Work at office
    Flexible hours

    ZIP

    New York, NY
    3 days ago
  • Jobgether is seeking a Senior Site Reliability Engineer to ensure the reliability, scalability, and security of systems supporting a next-generation platform. The role is fully remote and requires expertise in Kubernetes and AWS, along with responsibilities such as designing... 
    Remote job
    Cloud
    Senior
    Flexible hours

    Jobgether

    New York, NY
    3 days ago
  • $170k - $290k

     ...Senior Site Reliability Engineer Luma's mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality...  ...Luma's infrastructure across on-prem and multi-vendor clouds (AWS & OCI), serving as the bridge between hardware... 
    Remote work
    Cloud
    Senior
    Work experience placement

    Luma AI

    United States
    1 day ago
  •  ...and trends from the DevOps World. As a Senior Site Reliability Engineer, you will play a pivotal role in...  ...the reliability and performance of our cloud-based infrastructure. Your primary responsibilities...  ...challenging and rewarding role in a remote setting, we encourage you to apply.... 
    Remote work
    Cloud
    Senior
    Flexible hours

    DevOpsChat

    New York, NY
    1 day ago
  • $141k - $208k

     ...Recognized on the 2025 Forbes Cloud 100 list, ClickHouse is one...  ...providing our customers with reliable and secure services so we are...  ...our central Site Reliability Engineering team. You will be responsible...  ...interpersonal skills. LI-Remote The typical starting salary... 
    Remote work
    Cloud
    Senior
    Local area
    Home office
    Flexible hours

    GrabJobs

    United States
    4 hours ago
  • ISN, based in Dallas, is seeking an Advanced Site Reliability Administrator to ensure uptime and reliability of cloud environments. This role involves managing Azure-...  ...OS management. After 90 days, there will be remote work options with monthly in-person engagements.... 
    Remote work
    Cloud
    Senior

    ISN

    Dallas, TX
    2 days ago
  • $125k - $165k

     .... About the Role We're looking for a Senior Site Reliability Engineer who genuinely enjoys the craft. Someone...  ..., scale, and operate resilient, cloud‑native infrastructure in AWS with a strong...  ...Nine paid holidays & Unlimited PTO Remote working arrangements Please note the... 
    Remote work
    Cloud
    Senior
    Temporary work

    DexCare

    New York, NY
    3 days ago
  •  ...work. We are currently looking for a Senior Site Reliability Engineer to join our SRE team in the Platform...  ...and Postgres and runs as a scalable cloud service in AWS, supporting millions of...  ...endpoints. Location We are flexible on remote work from home for candidates located... 
    Remote work
    Cloud
    Senior
    Permanent employment
    Work from home
    Flexible hours

    NinjaOne

    Austin, TX
    2 days ago
  • ULTA Beauty is seeking a Sr IT Engineer - Cloud Site Reliability in Bolingbrook, IL. The role involves designing, developing, and ensuring the sustainability...  ...offers full benefits, including health and dental insurance, and is open to remote work. #J-18808-Ljbffr ULTA Beauty
    Remote work
    Cloud
    Senior

    ULTA Beauty

    Bolingbrook, IL
    5 days ago
  •  ...Site Reliability Engineers are responsible for ensuring the availability, reliability, scalability,...  ...The role combines software engineering, cloud engineering, automation, and production...  ...site position located in Springfield, MO. Remote work is not an option for this position... 
    Remote work
    Cloud
    Senior
    Local area
    Flexible hours
    Shift work

    O'Reilly Technology Services, Inc.

    Pierce, ID
    3 days ago
  • Join Flutter Entertainment in a challenging role focused on building cloud infrastructure and improving system reliability. As part of our Site Reliability Engineering team, you will design and execute strategies that enhance our cloud services using modern technologies... 
    Remote job
    Cloud
    Senior

    Flutter Entertainment

    New York, NY
    2 days ago
  •  ...Site Reliability Engineer CodeRabbit is an innovative research and development company focused on building extraordinarily productive human...  ..., implement, and maintain scalable infrastructure on Google Cloud Platform to support CodeRabbit's growing user base and... 
    Remote work
    Cloud
    Senior

    CodeRabbit

    United States
    4 days ago
  •  ...Senior Site Reliability Engineer – Azure Cloud Join to apply for the Senior Site Reliability Engineer role at Concord Technologies Concord Technologies is...  ...experience engineering solutions in Azure Cloud, as part of a remote role, with occasional travel to headquarters in... 
    Remote work
    Cloud
    Senior
    Full time
    Local area
    Immediate start
    Flexible hours

    Concord Technologies

    New York, NY
    1 day ago
  • $125.04k - $187.56k

     ...Technology and more. Overview The Site Reliability Engineer (SRE) III is responsible for ensuring the...  ...available, fault-tolerant systems in a cloud-native environment. The SRE III...  ...person days at our Chicago office and 2 remote days. Applicants must be currently authorized... 
    Remote work
    Cloud
    Senior
    Full time
    Work at office
    Flexible hours

    ViziRecruiter

    Quincy, MA
    3 days ago
  • $150k - $200k

     ...Join to apply for the Senior Site Reliability Engineer role at Gradle Inc. Develocity is a first‑of‑its‑kind...  ..., and optimize local developer and remote CI feedback loops. Our software is used...  ...You’ll work on our internally‑built Cloud Application Platform, Kubernetes on AWS... 
    Remote work
    Cloud
    Senior
    Full time
    Local area
    Work from home

    Gradle Inc.

    New York, NY
    1 day ago
  •  ...Evanston, IL. This will be a fully remote role, however it is required...  ...Our client is seeking a Senior SRE with proven industry experience...  ...to join our remote-based Engineering team. Our teams are...  ...of working within the public cloud - Azure, AWS or GCP. Hands... 
    Remote work
    Cloud
    Senior

    Insight Global

    Boca Raton, FL
    2 days ago
  •  ...the position? We are looking for a Senior Site Reliability Engineer who combines deep infrastructure expertise...  ...JVM tuning and optimization. Cloud platform expertise (AWS preferred; GCP...  ...Flexible vacation allowance A hybrid / remote working environment Startup... 
    Remote work
    Cloud
    Senior
    Flexible hours
    Night shift

    GrabJobs

    United States
    1 day ago
  • $160k - $210k

     ...our success. We are looking for a Senior Site Reliability engineer to work on expanding our global footprint...  ...begin to rapidly expand our hybrid cloud infrastructure. Past that, we are a...  ...days in office (Mon/Tue/Wed) and 2 days remote (Thursday/Friday). Responsibilities... 
    Remote work
    Cloud
    Senior
    Work at office
    Local area
    Immediate start

    GrabJobs

    United States
    8 hours ago
  •  ...Senior Site Reliability Engineer - Waltham, MA Dentsply Sirona is the world’s largest manufacturer of professional...  ...occurs. This role is partially remote, providing a mix of working remotely...  ...is a bonus. At least Google Associate Cloud Engineer certification , higher... 
    Remote work
    Cloud
    Senior
    Work at office
    Immediate start
    Worldwide

    Wellspect HealthCare

    Waltham, MA
    3 days ago
  • $148k - $235.75k

     ...where you will be working as a Senior SRE Engineer. The position will be part of a...  ...infrastructure. Maintain uptime, reliability and readiness of on-prem engineering cloud spread across multiple data...  ...tools for hardware provisioning, remote access, and troubleshooting. Knowledge... 
    Remote work
    Cloud
    Senior

    NVIDIA

    Santa Clara, CA
    1 day ago
  •  ...York, United States | Posted on 11/13/2025 Title: Senior Site Reliability Engineer (SRE) Location: Remote AboutJanuary AtJanuary, we’re transforming the lives...  ...graceful degradation). Strong experience with AWS cloud infrastructure and IaC tools (Terraform, CloudFormation... 
    Remote work
    Cloud
    Senior

    Govserviceshub

    New York, NY
    1 day ago
  •  ...motivated, diligent, and skillful Site Reliability Engineer to join the Cyber Security...  ...s lives. This position can be remote anywhere in the U.S. The Senior Site Reliability Engineer will be...  ...a hybrid of both on-premise and cloud-native environments. The ideal candidate... 
    Remote work
    Cloud
    Senior
    Temporary work

    PowerToFly

    Springfield, IL
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to DGX Cloud Senior Reliability Engineer (Remote). Be the first to apply!