DGX Cloud Senior Reliability Engineer (Remote)
NVIDIA
- Remote job
NVIDIA Corporation in Santa Clara is seeking a Senior Reliability Engineer, DGX Cloud, to build and enhance reliability strategies for large-scale systems. You will lead efforts to implement SLO programs, improve operational practices, and ensure system resilience. The ideal candidate has over 10 years of experience, strong software skills, and expertise in chaos engineering or reliability fields. This role offers excellent compensation and benefits, fostering a diverse and inclusive work environment. #J-18808-Ljbffr NVIDIA Corporation
- ...Passionate about building world-class reliability systems, the full-time Senior Reliability Engineer will develop and implement an organization-wide reliability strategy for DGX Cloud, focusing on operational excellence and incident response in a 24/7 environment. Key...Remote workCloudSeniorFull time
$168k - $270.25k
## Senior Reliability Engineer, DGX CloudApplylocations: US, CA, Santa Clara: US, Remotetime type: Full timeposted on: Posted Todayjob requisition... ...systems? Join NVIDIA as a Sr. Reliability Engineer, DGX Cloud, and be a pivotal part of a team that redefines operational...CloudSenior- ...Joining a high-performing team remotely, the full-time Senior Site Reliability Engineer will own the reliability and automation of critical AI infrastructure,... ...+ years of experience in automating and supporting cloud infrastructure (AWS) and network environments Proven...Remote workCloudSeniorFull time
- ...initiatives while building a public cloud platform from scratch? Would... ...? Join our IaaS Site Reliability Engineering (SRE) team. We design,... ...your office. We are a 100% remote-first team. We will support... ...SRE team: SRE I → SRE II → Senior → Senior II → Principal → Senior...Remote workCloudSeniorWork at office
- ...technology firm focused on AI infrastructure is seeking a Senior Site Reliability Engineer in Austin, TX. The role involves ensuring the reliability... ...Computer Science and 5 years of relevant experience with cloud providers and system monitoring. Telecommuting is...Remote jobCloudSenior
- Akamai Technologies GmbH is looking for a Senior Site Reliability Engineer in Cambridge, MA. This role involves designing and operating critical services... ...that support the reliability and performance of Akamai Cloud infrastructure. Ideal candidates should have at least 5...Remote jobCloudSenior
- ...The role involves scaling our platform and infrastructure while enhancing reliability and the overall developer experience. Ideal candidates will have strong expertise in distributed systems, cloud-native architectures, and Unix/Linux environments. We offer a warm work...Remote jobCloudSenior
$65 - $75 per hour
...no C2C, no exceptions Fully remote Key Responsibilities: Process... ...monitoring solutions for a hybrid cloud environment (cloud, on-prem,... ...tools. Description: As an Engineer 2, you will collaborate with... ...across the IT organization. Seniority level Mid-Senior level Employment...Remote workCloudSeniorContract work$150k - $170k
...Senior Site Reliability Engineer – Zip Co Join to apply for the Senior Site Reliability Engineer role at Zip Co At Zip, we build cloud‑native software applications that serve millions of customers and... ...engineering team. We offer a remote‑first opportunity for US‑based employees...Remote workCloudSeniorCasual workWork at officeFlexible hours- Jobgether is seeking a Senior Site Reliability Engineer to ensure the reliability, scalability, and security of systems supporting a next-generation platform. The role is fully remote and requires expertise in Kubernetes and AWS, along with responsibilities such as designing...Remote jobCloudSeniorFlexible hours
$170k - $290k
...Senior Site Reliability Engineer Luma's mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality... ...Luma's infrastructure across on-prem and multi-vendor clouds (AWS & OCI), serving as the bridge between hardware...Remote workCloudSeniorWork experience placement- ...and trends from the DevOps World. As a Senior Site Reliability Engineer, you will play a pivotal role in... ...the reliability and performance of our cloud-based infrastructure. Your primary responsibilities... ...challenging and rewarding role in a remote setting, we encourage you to apply....Remote workCloudSeniorFlexible hours
$141k - $208k
...Recognized on the 2025 Forbes Cloud 100 list, ClickHouse is one... ...providing our customers with reliable and secure services so we are... ...our central Site Reliability Engineering team. You will be responsible... ...interpersonal skills. LI-Remote The typical starting salary...Remote workCloudSeniorLocal areaHome officeFlexible hours- ISN, based in Dallas, is seeking an Advanced Site Reliability Administrator to ensure uptime and reliability of cloud environments. This role involves managing Azure-... ...OS management. After 90 days, there will be remote work options with monthly in-person engagements....Remote workCloudSenior
$125k - $165k
.... About the Role We're looking for a Senior Site Reliability Engineer who genuinely enjoys the craft. Someone... ..., scale, and operate resilient, cloud‑native infrastructure in AWS with a strong... ...Nine paid holidays & Unlimited PTO Remote working arrangements Please note the...Remote workCloudSeniorTemporary work- ...work. We are currently looking for a Senior Site Reliability Engineer to join our SRE team in the Platform... ...and Postgres and runs as a scalable cloud service in AWS, supporting millions of... ...endpoints. Location We are flexible on remote work from home for candidates located...Remote workCloudSeniorPermanent employmentWork from homeFlexible hours
- ULTA Beauty is seeking a Sr IT Engineer - Cloud Site Reliability in Bolingbrook, IL. The role involves designing, developing, and ensuring the sustainability... ...offers full benefits, including health and dental insurance, and is open to remote work. #J-18808-Ljbffr ULTA BeautyRemote workCloudSenior
- ...Site Reliability Engineers are responsible for ensuring the availability, reliability, scalability,... ...The role combines software engineering, cloud engineering, automation, and production... ...site position located in Springfield, MO. Remote work is not an option for this position...Remote workCloudSeniorLocal areaFlexible hoursShift work
- Join Flutter Entertainment in a challenging role focused on building cloud infrastructure and improving system reliability. As part of our Site Reliability Engineering team, you will design and execute strategies that enhance our cloud services using modern technologies...Remote jobCloudSenior
- ...Site Reliability Engineer CodeRabbit is an innovative research and development company focused on building extraordinarily productive human... ..., implement, and maintain scalable infrastructure on Google Cloud Platform to support CodeRabbit's growing user base and...Remote workCloudSenior
- ...Senior Site Reliability Engineer – Azure Cloud Join to apply for the Senior Site Reliability Engineer role at Concord Technologies Concord Technologies is... ...experience engineering solutions in Azure Cloud, as part of a remote role, with occasional travel to headquarters in...Remote workCloudSeniorFull timeLocal areaImmediate startFlexible hours
$125.04k - $187.56k
...Technology and more. Overview The Site Reliability Engineer (SRE) III is responsible for ensuring the... ...available, fault-tolerant systems in a cloud-native environment. The SRE III... ...person days at our Chicago office and 2 remote days. Applicants must be currently authorized...Remote workCloudSeniorFull timeWork at officeFlexible hours$150k - $200k
...Join to apply for the Senior Site Reliability Engineer role at Gradle Inc. Develocity is a first‑of‑its‑kind... ..., and optimize local developer and remote CI feedback loops. Our software is used... ...You’ll work on our internally‑built Cloud Application Platform, Kubernetes on AWS...Remote workCloudSeniorFull timeLocal areaWork from home- ...Evanston, IL. This will be a fully remote role, however it is required... ...Our client is seeking a Senior SRE with proven industry experience... ...to join our remote-based Engineering team. Our teams are... ...of working within the public cloud - Azure, AWS or GCP. Hands...Remote workCloudSenior
- ...the position? We are looking for a Senior Site Reliability Engineer who combines deep infrastructure expertise... ...JVM tuning and optimization. Cloud platform expertise (AWS preferred; GCP... ...Flexible vacation allowance A hybrid / remote working environment Startup...Remote workCloudSeniorFlexible hoursNight shift
$160k - $210k
...our success. We are looking for a Senior Site Reliability engineer to work on expanding our global footprint... ...begin to rapidly expand our hybrid cloud infrastructure. Past that, we are a... ...days in office (Mon/Tue/Wed) and 2 days remote (Thursday/Friday). Responsibilities...Remote workCloudSeniorWork at officeLocal areaImmediate start- ...Senior Site Reliability Engineer - Waltham, MA Dentsply Sirona is the world’s largest manufacturer of professional... ...occurs. This role is partially remote, providing a mix of working remotely... ...is a bonus. At least Google Associate Cloud Engineer certification , higher...Remote workCloudSeniorWork at officeImmediate startWorldwide
$148k - $235.75k
...where you will be working as a Senior SRE Engineer. The position will be part of a... ...infrastructure. Maintain uptime, reliability and readiness of on-prem engineering cloud spread across multiple data... ...tools for hardware provisioning, remote access, and troubleshooting. Knowledge...Remote workCloudSenior- ...York, United States | Posted on 11/13/2025 Title: Senior Site Reliability Engineer (SRE) Location: Remote AboutJanuary AtJanuary, we’re transforming the lives... ...graceful degradation). Strong experience with AWS cloud infrastructure and IaC tools (Terraform, CloudFormation...Remote workCloudSenior
- ...motivated, diligent, and skillful Site Reliability Engineer to join the Cyber Security... ...s lives. This position can be remote anywhere in the U.S. The Senior Site Reliability Engineer will be... ...a hybrid of both on-premise and cloud-native environments. The ideal candidate...Remote workCloudSeniorTemporary work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to DGX Cloud Senior Reliability Engineer (Remote). Be the first to apply!
- cloud developer Santa Clara, CA
- senior principal cloud computing engineer Santa Clara, CA
- aws cloud infrastructure engineer Santa Clara, CA
- principal cloud computing engineer Santa Clara, CA
- informatica cloud developer Santa Clara, CA
- software engineer - cloud services Santa Clara, CA
- cloud security engineer Santa Clara, CA
- cloud architect Santa Clara, CA
- big data cloud engineer Santa Clara, CA
- aws cloud architect Santa Clara, CA

