Senior Site Reliability Engineer, GeForce NOW
$168k - $270.25kNVIDIA
NVIDIA is looking for a Senior Site Reliability Engineer (SRE) to join its GeForce Now (GFN) team. SRE at NVIDIA ensures that our internal and external-facing GPU cloud gaming services have reliability and uptime as promised to the users and at the same time enables developers to make changes to the existing system through careful preparation and planning while keeping an eye on capacity, latency, and performance. As SRE, you are responsible for the big picture of how our systems relate to each other; we use a breadth of tools and approaches to tackle complex problems. The person in this position will be responsible for Service Response and Workflows and will drive tools/service development to maintain and improve service SLOs. We partner with Service Owners to drive reliability of the service. What you will be doing: Working on building tools to improve the SRE Observability. Be part of the Kubernetes migration journey with VMI setup and problem solving. Rapidly debug and triage incidents and user-reported issues Taking ownership of automating, scripting, and tooling of new/existing scripts to help the team achieve 100% automation of daily tasks Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity management and launch reviews. Be part of an on call rotation to support production systems What we need to see: MS or BS in Computer Science/Engineering or a related field or equivalent experience. 8+ year’s Site reliability engineering experience working on large scale distributed micro services in a production environment with a real passion for automation and tooling. Very strong Kubernetes background and ability to understand Kubernetes with complex and highly available VMI setup on K8's. Lead significant production improvements including change management, post-mortem reviews, workflow processes, design and deliver software automation in various languages. Confirmed strengths in problem-solving and root causing issues, while continuously seeking ways to drive optimization, efficiency and the bottom line. Previous experience with Datadog, Prometheus, Alertmanager, or similar monitoring systems. Experience managing multi-region cloud deployments on hyperscalers like AWS, GCP, or Azure. Experience designing and managing deployment pipelines using tools such as GitHub Actions, GitLab CI, or ArgoCD. Excellent communication, presentation, social, and analytical skills; the ability to communicate complex interaction concepts clearly and persuasively across different audiences and varying levels of the organization. Production-grade coding proficiency in languages like Go, Python, or robust Bash scripting. Production on-call experience is a must. Should have served in a primary production on-call rotation, responding to and mitigating high-severity infrastructure alerts and service degradations. Ways to stand out from the crowd: Experience working with automated anomaly detection, log clustering tools, or LLM-assisted debugging platforms. Comfortable using AI on a day-to-day basis as an SRE Prior experience as an SRE or Service Engineer is a huge plus. With a competitive salary package and benefits, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Are you a creative and autonomous Site Reliability Engineer who loves challenges? The GFN Service is an exciting service in the newly growing game streaming industry. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 168,000 USD - 270,250 USD. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until June 4, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. NVIDIA pioneered accelerated computing. Today, our AI infrastructure powers global intelligence, transforming every industry. Learn more about NVIDIA.
$168k - $270.25k
NVIDIA is looking for a Senior Site Reliability Engineer (SRE) to join its GeForce Now (GFN) team. SRE at NVIDIA ensures that our internal and external-facing GPU cloud gaming services have reliability and uptime as promised to the users and at the same time enables developers...SeniorFull time$224k - $356.5k
...looking for a talented and ambitious system software engineer to join the team working on NVIDIA GeForce Now Platform ( The main focus of the team is client... ...platforms. Partner closely with operations for the reliability and scalability of our features. Build features...SeniorLocal area$168k - $270.25k
...GeForce NOW is Nvidia’s Cloud Gaming service, streaming games at the highest quality... ..., see We are looking for a Senior System Software Engineer who sees the big picture of Cloud Computing... ...-time context. Ensuring System Reliability: Implement rigorous testing,...SeniorLocal area$224k - $356.5k
...team and see how you can make a lasting impact on the world. We are looking for a passionate member to join our Engineering Team in GeForce NOW as a Senior Systems Software Engineer. In this role, you will play a significant part in crafting and guiding the future of...SeniorRemote work$224k - $356.5k
...GeForce NOW is NVIDIA's Cloud Gaming service, streaming games at the highest quality to any and every user, regardless of their... ...play! For more details, see . We are looking for a Senior System Software Engineer for Cloud who sees the big picture of Cloud Computing...SeniorLocal area- GeForce NOW is NVIDIA's Cloud Gaming service, streaming games at the highest quality to any and every user, regardless of... ...development teams to continuously drive improvements in service reliability, using strong SW engineer processes, superb tools, and influential strategies....SeniorLocal area
$184k - $287.5k
...lasting impact on the world. You will be a part of NVIDIA GeForce NOW cloud team that allows users to play high-quality PC games on... ...high-quality gaming experience even at high resolutions. As an engineer on the GeForce NOW team, you will have the unique opportunity...$184k - $287.5k
...GeForce NOW (GFN) provides high-performance gaming to millions, regardless of their hardware... ..., unstructured datasets into precise engineering actions. We are looking for a validated... ...the "how" as much as the "why." As a Senior Data Scientist for VOC, you will be...Senior$256k - $414k
Overview GeForce NOW is the global leader in cloud gaming, dedicated... ...scale. We are looking for a Senior Manager to lead the design,... ...high‑throughput, and highly reliable interconnects across data centers... ...Science or a related engineering field (or equivalent experience...SeniorLocal area$200k - $322k
...continuous innovation and exceptional talent. We are now leveraging the immense potential of AI to usher in the... ...make a profound global impact. NVIDIA is seeking a Senior Manager of Site Reliability Engineering to lead and reshape how IT operations function at scale...Senior$140k - $205k
...Senior Technology Site Reliability Engineer Cooley is seeking a Senior Site Reliability Engineer to join the Infrastructure & Development Operationsteam. Position summary: The Senior Technology Site Reliability Engineer("SRE") is responsible for ensuring the reliability...SeniorFull timeTemporary workWork at officeFlexible hoursWeekend work- ...The TeamPlatform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational... ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper). As...SeniorWork at officeLocal areaRemote workWorldwideFlexible hours
$210.6k - $305.1k
...Qualifications: ~ You have led a distributed team of 5+ engineers, can demonstrate strong technical vision for your team, and ensure... ..., and basic life insurance. Please see the Cisco careers site to discover more benefits and perks. Employees may be eligible...SeniorFull timeTemporary workLocal areaFlexible hours- OutSystems, Inc. is looking for a Site Reliability Engineer to join their team in San Francisco, CA. The ideal candidate will lead the onboarding of services and teams to reliability tenets while establishing SLOs and SLAs. Proficiency in Python and experience with Kubernetes...SeniorFlexible hours
$232k - $263k
..., Okta, Cylance, and Carbon Black. Now, we're transforming how SaaS is secured... ...future of SaaS security! Sr. Staff Site Reliability Engineer As a Sr. Staff SRE at Obsidian ,... ...related roles ~3+ years operating at a senior or technical leadership level (Staff...SeniorWork from homeFlexible hours- Poshmark, Inc. is seeking a talented Site Reliability Engineer to ensure the health and performance of our web-scale systems. You will collaborate with development teams and focus on automating and monitoring systems for high reliability. The ideal candidate has 5 years...Senior
$227.2k - $324.5k
...About the Role: Site Reliability Engineering (SRE) at Tubi is not a traditional operations team. We are a software engineering organization... ...automation. We are seeking an experienced and visionary Senior SRE Manager to lead and grow our newly built Site Reliability...SeniorFull timeContract workTemporary workLocal areaFlexible hours$184k - $287.5k
...acceleration. Its highly optimized capture and encode pipeline powers GeForce NOW. All video playback from browsers and applications runs... .... What we need to see: ~ Bachelors in Electrical Engineering or Computer Science (or equivalent experience). Master’s degree...SeniorWork experience placement$184k - $287.5k
...We are looking for a Senior Software Engineer who sees the big picture of cloud computing and loves building cloud infrastructure. You will... ...most demanding GPU‑powered services in the world, including GeForce NOW and NVIDIA’s GPU cloud offerings. Your work will directly...SeniorWorldwide$140k - $185k
...alongside clinicians to make that possible. We’re a team of doctors, engineers, designers, researchers, and creatives building tools that... ...in on-call and incident response: Improve operational reliability: Own parts of the production environment: Strengthen observability...SeniorWork at officeWorldwide- ...Responsibilities Lead and onboard services and teams to the reliability tenets. Establish and maintain Service Level Objectives (... ...Science or equivalent. 6+ years of experience in Site Reliability Engineering, managing infrastructure and services at scale. History of...Senior
$224k - $356.5k
...NVIDIA's GeForce NOW, the next-generation gaming service powered by NVIDIA GPUs in the cloud, transforms a Mac, any PC, or... ...Just click and play! Visit us at We are looking for a Senior Systems Software engineer to join a team of highly skilled and motivated engineers...Senior$210k - $270k
...deeply thoughtful, driven, and collaborative teammates, read on. Your Impact on our Mission: Zocdoc is looking for a Senior Site Reliability Engineer to help develop, monitor, and maintain our distributed production systems. You’ll be challenged with building frameworks...SeniorFlexible hours$152k - $241.5k
...intelligence. Job Overview We’re looking for a Senior SRE to join our Compute Farm team and... ...host lifecycle management, fleet reliability/auto‑healing, E2E observability or data‑... ...Python, Go, Perl, or Ruby. Mentored other engineers and influenced technical direction through...Senior$144k - $191k
...capabilities by partnering closely with specialist engineering, operations, and production teams.... ...production methods—delivering safe, reliable propulsion systems that support a wide... ...mission environment. About The Job As a Site Reliability Engineer you will be responsible...SeniorFull timeWork experience placementRelocation package- A leading biotechnology firm in South San Francisco is seeking a Site Reliability Engineer to architect and implement Infrastructure as Code (IaC) solutions that enhance cloud-based platform solutions for Machine Learning and HPC workloads. The ideal candidate has extensive...Senior3 days per week
$126k - $204.5k
...As part of this role, you will collaborate closely with our engineering teams to develop innovative solutions that provide clear and... ...team to influence the operability of the product and ensure the reliability and availability of our services. Qualifications Required...Senior- Senior Site Reliability Engineer - AI Infrastructure Location: Global Remote / San Francisco · Full-Time About Andromeda Andromeda Cluster was founded by Nat Friedman and Daniel Gross to give early-stage startups access to the kind of scaled AI infrastructure once reserved...SeniorFull timeRemote work
- Zocdoc, located in Silicon Valley, CA, is seeking a Senior Site Reliability Engineer to monitor and maintain cloud-based systems ensuring uptime for millions of patients. You'll work with cutting-edge technology in a diverse and collaborative environment. This role requires...Senior
$166.6k - $250.6k
Hong Kong Study Skills Research Institute is looking for a Senior Site Reliability Engineer in Culver City, California. This role entails building and supporting customer-facing services leveraging SRE best practices, ensuring system reliability, and collaborating with...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Site Reliability Engineer, GeForce NOW. Be the first to apply!
- senior game producer California
- senior manager process engineering California
- senior manufacturing engineer California
- senior manager clinical operations California
- senior lead project manager California
- senior manager quality engineering California
- senior device engineer California
- senior work from home California
- senior program manager California
- senior network engineer remote California

