
Director of Cloud Reliability & SRE Leadership

NVIDIA

NVIDIA Corporation is seeking a Director of Site Reliability and Software Engineering for its DGX Cloud team in Santa Clara, CA. This leadership role demands 12+ years in engineering management and 5+ years in a leadership position. The director will manage teams, define strategies, and drive innovative projects in NVIDIA's leading cloud computing environment. The salary range is competitive, and total compensation includes equity options. Applications are accepted until at least May 8, 2026.

Vacancy posted 4 days ago
Similar jobs that could be interesting for you
Based on the Director of Cloud Reliability & SRE Leadership in Santa Clara, CA vacancy
  • $207k - $300k

    A leading technology company is seeking a Site Reliability Engineering Manager in Sunnyvale, CA. You will lead the SRE team, ensuring reliability and performance of cloud services, with a strong focus on Kubernetes and automation. The ideal candidate has extensive experience... 
    Suggested
    Full time

    Google Inc.

    Sunnyvale, CA
    1 day ago
  • $227k - $320k

    Technical Program Manager, Google Cloud Platform Reliability. Google, Sunnyvale, CA, USA. Bachelor's degree in a technical...  ...years of experience in program management or engineering leadership. Experience with site reliability engineering, developer... 
    Suggested
    Full time
    Local area

    Google Inc.

    Sunnyvale, CA
    1 day ago
  •  ...Career You will lead the GTM team for Cortex Cloud, directly and visibly contributing to...  ...cloud security business. This is a leadership role with broad scope and team‑management...  ...transformative mindset and demonstrates reliable judgment. Your Impact Strategy Shape... 
    Suggested
    Base plus commission
    Work from home

    Palo Alto Networks, Inc.

    Santa Clara, CA
    4 days ago
  • $207k - $300k

    Software Engineering Manager II, Site Reliability Engineering. Google, Sunnyvale...  ...the job Site Reliability Engineering (SRE) combines software and systems engineering...  ...products globally, providing technical leadership to key projects and empowering and developing... 
    Suggested
    Full time

    Google Inc.

    Sunnyvale, CA
    11 hours ago
  •  ...production infrastructure across multiple clouds. You will deploy and maintain Kubernetes...  ...automating CI/CD pipelines, ensuring system reliability, and collaborating with various teams....  ...has 8+ years of experience in DevOps or SRE, with a strong security mindset. This is... 
    Suggested
    Work at office
    2 days per week

    Koitecc Solutions

    Sunnyvale, CA
    11 hours ago
  • $207k - $300k

    Site Reliability Engineering Manager, Google Distributed Cloud. Google, Sunnyvale, CA, USA. Bachelor’s degree in Computer Science, a related field, or equivalent practical...  ...(GCE). About the job Site Reliability Engineering (SRE) combines software and systems engineering to build... 
    Full time

    Google Inc.

    Sunnyvale, CA
    1 day ago
  • $172.1k - $305.6k

     ...secure, end-to-end solutions. The Service Reliability Engineering (SRE) team is responsible for service...  ...projects and features for the Private Cloud Compute software that powers Apple...  ...with only consultative direction from leadership. Develop a broad organizational perspective... 
    Relocation

    Apple Inc.

    Cupertino, CA
    4 days ago
  • $135.6k - $180k

     ...class infrastructure performance and reliability. This high-impact leadership role is responsible for driving...  ...(RCA/CAR) Implement and maintain SRE practices including SLOs, error budgets...  ...automation using Terraform, Kubernetes, and cloud platforms (GCP/AWS) Develop... 

    Vistance Networks, Inc.

    Sunnyvale, CA
    1 day ago
  • $184.12k - $275.45k

    General Motors is looking for a Staff Engineer in Sunnyvale to join the Hybrid Services & Reliability (HSR) team. Responsibilities include leading SLOs for cloud services, automating server provisioning, and ensuring system reliability. Ideal candidates will have extensive... 
    Work at office

    General Motors

    Sunnyvale, CA
    4 days ago
  • Rubrik, Inc. seeks a Staff Site Reliability Engineer in Palo Alto, California to lead reliability...  ...automation tools, and cross-functional leadership. The ideal candidate will have 8-12+...  ...engineering, with a strong background in cloud systems and operational excellence. Key... 

    Rubrik, Inc.

    Palo Alto, CA
    2 days ago
  • $207k - $300k

     ...Successful candidates will have extensive experience in software development, with a strong background in people management and project leadership. The base salary range for this position is between $207,000 and $300,000, plus bonuses and benefits.

    Google Inc.

    Sunnyvale, CA
    3 days ago
  •  ...operations team focused on delivering world-class infrastructure reliability. The role requires mentoring engineers, managing incident...  ...extensive experience in technical operations, a background in cloud platforms, and expertise in infrastructure automation. This position... 

    Vistance Networks

    Sunnyvale, CA
    3 days ago
  • About the Team The Reliability Platform role is a key pillar of DoorDash...  ...pragmatic perspective of an SRE, and deliver solutions with...  ...Manage the team’s budget for Cloud Provider Infra and 3rd party...  ...Consistency: Your influence and leadership is seeking an outcome that... 
    Hourly pay
    Work at office
    Local area
    Remote work
    Flexible hours

    DoorDash USA

    Sunnyvale, CA
    1 day ago
  • $202k - $253k

    About the Role The Director, Cloud GTM Finance is the strategic finance business partner to Crusoe's Cloud GTM organization, including Sales...  ...head of Cloud Finance and working hand-in-hand with the GTM leadership team. This is a builder role. You will shape the GTM... 
    Contract work
    Temporary work

    jobr.pro

    Sunnyvale, CA
    4 days ago
  • $256k - $414k

    GeForce NOW is the global leader in cloud gaming, dedicated to making high‑end play accessible...  ...low‑latency, high‑throughput, and highly reliable interconnects across data centers and...  ...AI platform teams, hardware vendors, and SRE groups to influence technology direction... 
    Local area

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $135.6k - $180k

     ...to enhance infrastructure performance. Candidates should have over 8 years in technical operations, expertise in SRE, and proficiency in automation and cloud platforms. Competitive salary between $135,600 and $180,000, along with a comprehensive benefits package, is... 

    Vistance Networks, Inc.

    Sunnyvale, CA
    3 days ago
  • Palo Alto Networks, Inc. is seeking a seasoned GTM expert to lead the GTM team for Cortex Cloud. This leadership role involves shaping the cloud security business and managing a team to achieve specific GTM objectives. Candidates should have over 12 years of relevant experience... 

    Palo Alto Networks, Inc.

    Santa Clara, CA
    4 days ago
  • $320k

    Director, Site Reliability and Software Engineering - DGX Cloud. Locations: US, CA...  ...levels of technical and organizational leadership is critical. Operating with scale and speed, our... 

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • Google in Sunnyvale is looking for a Technical Program Manager to lead complex, multidisciplinary projects and enhance the lifecycle of AI solutions. You will collaborate closely with stakeholders to manage project schedules, identify risks, and drive product launch strategies...

    Google

    Sunnyvale, CA
    3 days ago
  • $132k - $190k

     ...project requirements and schedules. Excellent leadership, investigative, quantitative reasoning,...  ...NPI Program teams within Google Cloud, Platforms Engineering, and external partners...  ..., material availability, Quality/Reliability, Cost/Assembly/Test/BOM/Capacity Ramp readiness... 

    Google

    Sunnyvale, CA
    2 hours ago
  •  ...development teams working on Cortex, focusing on delivering scalable, reliable cloud-native services. The role involves mentoring engineers and...  ...extensive software development experience and a strong technical leadership background.
    Remote job

    Palo Alto Networks, Inc.

    Santa Clara, CA
    3 days ago
  •  ...architecture, platform strategy, and technical leadership for enterprise-scale pricing management...  ...engineering teams Improve system reliability, operational excellence, and platform...  ...on background with Java/Spring Boot and cloud-native systems preferred Strong understanding... 
    Contract work
    Work at office

    GSPANN Technologies, Inc

    Sunnyvale, CA
    3 days ago
  • jobr.pro is seeking a Director, Cloud GTM Finance to partner with their Cloud GTM organization in Sunnyvale, California. This strategic...  ...especially within high-growth technology companies, alongside strong leadership skills and expertise in revenue mechanics and sales... 

    jobr.pro

    Sunnyvale, CA
    4 days ago
  • $192k - $267k

     ...experience. 10 years of experience with cloud native architecture in a customer-facing...  ...prototyping, or workshops with customers. Leadership experience (e.g., people management,...  ...Our products are developed for security, reliability and scalability, running the full stack... 
    Full time
    Temporary work

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • $200k - $250k

     ...Communicate progress, risks, and tradeoffs clearly to engineering leadership and cross-functional stakeholders. Help set the cultural...  ...that meaningfully improves team velocity or product reliability. Establish team rituals and a working culture that reflects... 
    Flexible hours

    Radar

    Sunnyvale, CA
    4 days ago
  • $262.7k - $371.7k

    Infoblox is looking for a Senior Director of Software Engineering to provide technical leadership for development teams, enhancing their flagship networking product. The ideal candidate will manage agile teams, ensuring high-quality delivery and mentoring future leaders... 

    Infoblox

    Santa Clara, CA
    11 hours ago
  • $244.8k

     ...Responsibilities Team Introduction: Our Site Reliability Engineering (SRE) team blends software and systems...  ...efficiency. We provide a dependable cloud environment that powers our global...  ...(SRE) who can provide deep technical leadership, drive architectural improvements,... 
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    1 day ago
  • $200k - $322k

     ...collaboration with major enterprise customers, cloud service providers, and Tier‑1 partners....  ...feasibility. Provide cross‑functional leadership and coordination, driving execution...  ...Strong grasp of production scaling and reliability of inference pipelines or GPU infrastructure... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $184k - $287.5k

     ...’s motivated by great technology—and outstanding people. The Reliability Test Manager owns the planning, execution, and management of...  ...Supervision of lab engineering and technician personnel. Strategic Leadership: Provide direction for reliability test programs aligned with... 

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $156k - $190k

    Crusoe Energy Systems in Sunnyvale, CA, is seeking a Staff Cloud Support Engineer to provide technical leadership in cloud infrastructure. You will lead incident responses, design reliability architecture, and mentor team members. The ideal candidate will have over 8 years... 

    Crusoe Energy Systems

    Sunnyvale, CA
    4 days ago
