Sr. Staff Technical Program Manager - Reliability

$191.4k - $252.72k

Databricks Inc.

At Databricks, we are passionate about empowering data teams to tackle the world's most complex challenges — from bringing the next mode of transportation to reality to accelerating the development of medical breakthroughs. We achieve this by building and operating the world's best data and AI infrastructure platform, enabling our customers to leverage deep data insights and enhance their business.

We are seeking an exceptional Senior Staff Technical Program Manager (TPM) for Reliability to lead the strategy, execution, and continuous improvement of our most critical Reliability initiatives across infrastructure and product engineering teams at Databricks. As Databricks scales to support thousands of customers and the world’s most data-intensive workloads, Reliability is foundational to our mission. In this role, you will lead cross-company programs that significantly enhance the reliability, performance, and operational excellence of our multi-cloud infrastructure.

This is a high-visibility, high-impact leadership role. You will partner closely with our most senior engineering leaders, including Reliability Program executive sponsors, senior TLs, and Engineering Managers, to define Reliability strategy, set long-term goals, and execute multi‑quarter programs. The goal is to build the most reliable cloud platform on the planet, one on which our customers can run their mission‑critical workloads.

To be successful, you must possess a deep understanding of large‑scale distributed systems, cloud infrastructure, and engineering principles that drive operational excellence. You will leverage your background to anticipate risks, shape technical direction, and deliver complex programs across product, engineering, SRE, and cloud partner teams.

You will have the opportunity to:

Lead Reliability Strategy + Multi‑Quarter Roadmaps

  • Partner with senior engineering leadership to define the long‑term Reliability roadmap, influence technical direction, and ensure alignment across teams.
  • Ensure clarity and alignment on priorities across engineering teams, including Platform Engineering, Compute Fleet Management, SRE, Security, and Cloud Partnerships.

Drive Execution of Critical Reliability Programs

  • Own program execution end‑to‑end: planning, risk management, dependency mapping, trade‑off decisions, status reporting, and delivery.
  • Identify gaps in process or architecture and work with TLs to proactively drive organizational or technical improvements.

Partner Deeply with Engineering & Influence Technical Direction

  • Use your background in infrastructure, distributed systems, or SRE to help teams make sound design and prioritization decisions.
  • Facilitate alignment between cross‑functional teams to ensure programs are technically grounded and execution‑ready.
  • Bring systems thinking to diagnose reliability bottlenecks and drive improvements to scalability, fault tolerance, automation, and operational tooling.

Elevate Reliability Culture Across the Organization

  • Drive adoption of reliability best practices across engineering teams, including error budgets, incident reviews, design‑for‑resilience patterns, and operational readiness.
  • Define and implement program governance, repeatable processes, metrics, and documentation to scale reliability efforts across teams.
  • Evangelize reliability expectations and engineer‑empowering processes that reduce operational load and improve incident preparedness.

What we look for:

Required Experience & Qualifications

  • 10+ years of experience managing and delivering large‑scale technical programs in cloud infrastructure, distributed systems, SRE, or platform engineering environments.
  • Experience developing infrastructure at two or more hyperscale cloud providers (e.g., AWS, Azure, GCP), with knowledge of cloud primitives, multi‑AZ/region architecture, and control plane/data plane patterns.
  • Demonstrated success leading Reliability Programs at scale, including availability, failover, operational excellence, incident reduction, or dependency hardening.
  • Strong understanding of infrastructure, distributed systems, or SRE practices; previous engineering or SRE experience is highly preferred.
  • Experience partnering directly with senior engineering leadership to define strategy and drive large, multi‑team initiatives.
  • Ability to translate ambiguous goals into actionable program plans with clear milestones, KPIs, and success metrics.
  • Demonstrated ability to manage complex cross‑organizational dependencies, technical risks, and multi‑quarter timelines.
  • Experience delivering programs across multiple clouds and/or large‑scale cloud‑native services.
  • Experience building and scaling engineering processes, operational frameworks, and stakeholder alignment mechanisms.

Preferred Qualifications

  • Background in distributed systems engineering, SRE, platform infrastructure, or cloud services.
  • Experience with large‑scale compute fleets, container orchestration, autoscaling, or control‑plane architecture.
  • Familiarity with reliability methodologies such as SLOs, error budgets, chaos engineering, failure mode analysis, and incident management frameworks.
  • Expertise using Jira or equivalent tools for program tracking and execution.
  • Bachelor’s degree in Computer Science, Engineering, or related technical field; advanced degree preferred.

Local Pay Range

$191,400 — $252,720 USD

About Databricks

Databricks is the data and AI company. More than 10,000 organizations worldwide — including Comcast, Condé Nast, Grammarly, and over 50% of the Fortune 500 — rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark™, Delta Lake and MLflow. To learn more, follow Databricks on Twitter, LinkedIn and Facebook.

Benefits

At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visit

Our Commitment to Diversity and Inclusion

At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio‑economic status, veteran status, and other protected characteristics.

Compliance

If access to export‑controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.


Vacancy posted 2 hours ago