
Senior Site Reliability Engineer, GeForce NOW

$168k - $270.25k
Full-time

NVIDIA

NVIDIA is looking for a Senior Site Reliability Engineer (SRE) to join its GeForce NOW (GFN) team. SRE at NVIDIA ensures that our internal and external-facing GPU cloud gaming services deliver the reliability and uptime promised to users, while enabling developers to change the existing system through careful preparation and planning, with an eye on capacity, latency, and performance. As an SRE, you are responsible for the big picture of how our systems relate to each other; we use a breadth of tools and approaches to tackle complex problems. The person in this position will be responsible for Service Response and Workflows and will drive tools/service development to maintain and improve service SLOs. We partner with Service Owners to drive reliability of the service.

What you will be doing:
  • Building tools to improve SRE observability.
  • Taking part in the Kubernetes migration journey, including VMI setup and problem solving.
  • Rapidly debugging and triaging incidents and user-reported issues.
  • Taking ownership of automating, scripting, and tooling of new/existing scripts to help the team achieve 100% automation of daily tasks.
  • Supporting services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity management, and launch reviews.
  • Taking part in an on-call rotation to support production systems.

What we need to see:
  • MS or BS in Computer Science/Engineering or a related field, or equivalent experience.
  • 8+ years of site reliability engineering experience working on large-scale distributed microservices in a production environment, with a real passion for automation and tooling.
  • Very strong Kubernetes background and the ability to understand Kubernetes with complex and highly available VMI setups on K8s.
  • Ability to lead significant production improvements, including change management, post-mortem reviews, and workflow processes, and to design and deliver software automation in various languages.
  • Confirmed strengths in problem-solving and root-causing issues, while continuously seeking ways to drive optimization, efficiency, and the bottom line.
  • Previous experience with Datadog, Prometheus, Alertmanager, or similar monitoring systems.
  • Experience managing multi-region cloud deployments on hyperscalers like AWS, GCP, or Azure.
  • Experience designing and managing deployment pipelines using tools such as GitHub Actions, GitLab CI, or ArgoCD.
  • Excellent communication, presentation, social, and analytical skills; the ability to communicate complex interaction concepts clearly and persuasively across different audiences and varying levels of the organization.
  • Production-grade coding proficiency in languages like Go, Python, or robust Bash scripting.
  • Production on-call experience is a must. You should have served in a primary production on-call rotation, responding to and mitigating high-severity infrastructure alerts and service degradations.

Ways to stand out from the crowd:
  • Experience working with automated anomaly detection, log clustering tools, or LLM-assisted debugging platforms.
  • Comfortable using AI on a day-to-day basis as an SRE.
  • Prior experience as an SRE or Service Engineer is a huge plus.

With a competitive salary package and benefits, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Are you a creative and autonomous Site Reliability Engineer who loves challenges? The GFN Service is an exciting service in the growing game streaming industry.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 168,000 USD - 270,250 USD. You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 4, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and is proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

NVIDIA pioneered accelerated computing. Today, our AI infrastructure powers global intelligence, transforming every industry.

Vacancy posted 2 days ago
Similar jobs that could be interesting for you
Based on the Senior Site Reliability Engineer, GeForce NOW in California vacancy
  • $224k - $356.5k

     ...looking for a talented and ambitious system software engineer to join the team working on NVIDIA GeForce Now Platform ( The main focus of the team is client...  ...platforms. Partner closely with operations for the reliability and scalability of our features. Build features... 
    Senior
    Local area

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $168k - $270.25k

     ...GeForce NOW is Nvidia’s Cloud Gaming service, streaming games at the highest quality...  ..., see We are looking for a Senior System Software Engineer who sees the big picture of Cloud Computing...  ...-time context. Ensuring System Reliability: Implement rigorous testing,... 
    Senior
    Local area

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $224k - $356.5k

     ...team and see how you can make a lasting impact on the world. We are looking for a passionate member to join our Engineering Team in GeForce NOW as a Senior Systems Software Engineer. In this role, you will play a significant part in crafting and guiding the future of... 
    Senior
    Remote work

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $224k - $356.5k

     ...GeForce NOW is NVIDIA's Cloud Gaming service, streaming games at the highest quality to any and every user, regardless of their...  ...play! For more details, see . We are looking for a Senior System Software Engineer for Cloud who sees the big picture of Cloud Computing... 
    Senior
    Local area

    NVIDIA

    Santa Clara, CA
    17 hours ago
  • GeForce NOW is NVIDIA's Cloud Gaming service, streaming games at the highest quality to any and every user, regardless of...  ...development teams to continuously drive improvements in service reliability, using strong SW engineer processes, superb tools, and influential strategies.... 
    Senior
    Local area

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

     ...lasting impact on the world. You will be a part of NVIDIA GeForce NOW cloud team that allows users to play high-quality PC games on...  ...high-quality gaming experience even at high resolutions. As an engineer on the GeForce NOW team, you will have the unique opportunity... 

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $184k - $287.5k

     ...GeForce NOW (GFN) provides high-performance gaming to millions, regardless of their hardware...  ..., unstructured datasets into precise engineering actions. We are looking for a validated...  ...the "how" as much as the "why." As a Senior Data Scientist for VOC, you will be... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $256k - $414k

    Overview GeForce NOW is the global leader in cloud gaming, dedicated...  ...scale. We are looking for a Senior Manager to lead the design,...  ...high‑throughput, and highly reliable interconnects across data centers...  ...Science or a related engineering field (or equivalent experience... 
    Senior
    Local area

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $200k - $322k

     ...continuous innovation and exceptional talent. We are now leveraging the immense potential of AI to usher in the...  ...make a profound global impact. NVIDIA is seeking a Senior Manager of Site Reliability Engineering to lead and reshape how IT operations function at scale... 
    Senior

    NVIDIA

    Santa Clara, CA
    17 hours ago
  • $140k - $205k

     ...Senior Technology Site Reliability Engineer Cooley is seeking a Senior Site Reliability Engineer to join the Infrastructure & Development Operations team. Position summary: The Senior Technology Site Reliability Engineer ("SRE") is responsible for ensuring the reliability... 
    Senior
    Full time
    Temporary work
    Work at office
    Flexible hours
    Weekend work

    Cooley

    Los Angeles, CA
    3 days ago
  •  ...The Team: Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational...  ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper). As... 
    Senior
    Work at office
    Local area
    Remote work
    Worldwide
    Flexible hours

    MongoDB

    San Francisco, CA
    3 days ago
  • $210.6k - $305.1k

     ...Qualifications: ~ You have led a distributed team of 5+ engineers, can demonstrate strong technical vision for your team, and ensure...  ..., and basic life insurance. Please see the Cisco careers site to discover more benefits and perks. Employees may be eligible... 
    Senior
    Full time
    Temporary work
    Local area
    Flexible hours

    Cisco

    San Francisco, CA
    3 days ago
  • OutSystems, Inc. is looking for a Site Reliability Engineer to join their team in San Francisco, CA. The ideal candidate will lead the onboarding of services and teams to reliability tenets while establishing SLOs and SLAs. Proficiency in Python and experience with Kubernetes... 
    Senior
    Flexible hours

    OutSystems, Inc.

    San Francisco, CA
    4 days ago
  • $232k - $263k

     ..., Okta, Cylance, and Carbon Black. Now, we're transforming how SaaS is secured...  ...future of SaaS security! Sr. Staff Site Reliability Engineer As a Sr. Staff SRE at Obsidian ,...  ...related roles ~3+ years operating at a senior or technical leadership level (Staff... 
    Senior
    Work from home
    Flexible hours

    Obsidian Security

    Palo Alto, CA
    3 days ago
  • Poshmark, Inc. is seeking a talented Site Reliability Engineer to ensure the health and performance of our web-scale systems. You will collaborate with development teams and focus on automating and monitoring systems for high reliability. The ideal candidate has 5 years... 
    Senior

    Poshmark, Inc.

    Redwood City, CA
    4 days ago
  • $227.2k - $324.5k

     ...About the Role: Site Reliability Engineering (SRE) at Tubi is not a traditional operations team. We are a software engineering organization...  ...automation. We are seeking an experienced and visionary Senior SRE Manager to lead and grow our newly built Site Reliability... 
    Senior
    Full time
    Contract work
    Temporary work
    Local area
    Flexible hours

    Tubi

    San Francisco, CA
    2 days ago
  • $184k - $287.5k

     ...acceleration. Its highly optimized capture and encode pipeline powers GeForce NOW. All video playback from browsers and applications runs...  .... What we need to see: ~ Bachelors in Electrical Engineering or Computer Science (or equivalent experience). Master’s degree... 
    Senior
    Work experience placement

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $184k - $287.5k

     ...We are looking for a Senior Software Engineer who sees the big picture of cloud computing and loves building cloud infrastructure. You will...  ...most demanding GPU‑powered services in the world, including GeForce NOW and NVIDIA’s GPU cloud offerings. Your work will directly... 
    Senior
    Worldwide

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $140k - $185k

     ...alongside clinicians to make that possible. We’re a team of doctors, engineers, designers, researchers, and creatives building tools that...  ...in on-call and incident response: Improve operational reliability: Own parts of the production environment: Strengthen observability... 
    Senior
    Work at office
    Worldwide

    Dormont Manufacturing Co

    San Francisco, CA
    4 days ago
  •  ...Responsibilities Lead and onboard services and teams to the reliability tenets. Establish and maintain Service Level Objectives (...  ...Science or equivalent. 6+ years of experience in Site Reliability Engineering, managing infrastructure and services at scale. History of... 
    Senior

    OutSystems, Inc.

    San Francisco, CA
    4 days ago
  • $224k - $356.5k

     ...NVIDIA's GeForce NOW, the next-generation gaming service powered by NVIDIA GPUs in the cloud, transforms a Mac, any PC, or...  ...Just click and play! Visit us at We are looking for a Senior Systems Software engineer to join a team of highly skilled and motivated engineers... 
    Senior

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $210k - $270k

     ...deeply thoughtful, driven, and collaborative teammates, read on. Your Impact on our Mission: Zocdoc is looking for a Senior Site Reliability Engineer to help develop, monitor, and maintain our distributed production systems. You’ll be challenged with building frameworks... 
    Senior
    Flexible hours

    Dormont Manufacturing Co

    Palo Alto, CA
    4 days ago
  • $152k - $241.5k

     ...intelligence. Job Overview We’re looking for a Senior SRE to join our Compute Farm team and...  ...host lifecycle management, fleet reliability/auto‑healing, E2E observability or data‑...  ...Python, Go, Perl, or Ruby. Mentored other engineers and influenced technical direction through... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $144k - $191k

     ...capabilities by partnering closely with specialist engineering, operations, and production teams....  ...production methods—delivering safe, reliable propulsion systems that support a wide...  ...mission environment. About The Job As a Site Reliability Engineer you will be responsible... 
    Senior
    Full time
    Work experience placement
    Relocation package

    Slope

    Costa Mesa, CA
    4 days ago
  • A leading biotechnology firm in South San Francisco is seeking a Site Reliability Engineer to architect and implement Infrastructure as Code (IaC) solutions that enhance cloud-based platform solutions for Machine Learning and HPC workloads. The ideal candidate has extensive... 
    Senior
    3 days per week

    Genentech

    South San Francisco, CA
    4 days ago
  • $126k - $204.5k

     ...As part of this role, you will collaborate closely with our engineering teams to develop innovative solutions that provide clear and...  ...team to influence the operability of the product and ensure the reliability and availability of our services. Qualifications Required... 
    Senior

    Palo Alto Networks, Inc.

    Santa Clara, CA
    4 days ago
  • Senior Site Reliability Engineer - AI Infrastructure Location: Global Remote / San Francisco · Full-Time About Andromeda Andromeda Cluster was founded by Nat Friedman and Daniel Gross to give early-stage startups access to the kind of scaled AI infrastructure once reserved... 
    Senior
    Full time
    Remote work

    Andromeda

    San Francisco, CA
    4 days ago
  • Zocdoc, located in Silicon Valley, CA, is seeking a Senior Site Reliability Engineer to monitor and maintain cloud-based systems ensuring uptime for millions of patients. You'll work with cutting-edge technology in a diverse and collaborative environment. This role requires... 
    Senior

    Dormont Manufacturing Co

    Palo Alto, CA
    4 days ago
  • $166.6k - $250.6k

    Hong Kong Study Skills Research Institute is looking for a Senior Site Reliability Engineer in Culver City, California. This role entails building and supporting customer-facing services leveraging SRE best practices, ensuring system reliability, and collaborating with... 
    Senior

    Hong Kong Study Skills Research Institute

    Culver City, CA
    4 days ago
