Sr. Staff Site Reliability Engineer

$232k - $263k

Obsidian Security

Job Description

Obsidian Security is the leading SaaS security platform, trusted by global enterprises like Snowflake, T-Mobile, and Algolia. We protect 200+ organizations across North America, Europe, the Middle East, Southeast Asia, Australia, and New Zealand, including many of the world's largest Fortune 1000 and Global 2000 companies.

Founded in 2017 and backed by top investors like Greylock, Obsidian was built to close a critical gap: securing SaaS apps where business happens—Microsoft 365, Salesforce, and hundreds more. The company does this by offering a complete SaaS security platform to reduce risk, detect and respond to threats, and prevent breaches at the source. Obsidian was built by leaders who redefined endpoint and identity security at CrowdStrike, Okta, Cylance, and Carbon Black. Now, they're transforming how SaaS is secured.

With AI driving rapid SaaS growth and complexity, agentic AI tools gain privileged access to sensitive data through integrations, creating new risks most security tools miss. Obsidian uniquely detects anomalous OAuth token activity and manages integration risks. Major announcements are on the horizon. Recognizing that SaaS security needs to evolve, Obsidian enables growing organizations to start with a lightweight, prevention-focused browser extension and expand coverage over time.

With global momentum, a growing partner ecosystem including SentinelOne, Databricks, and Google Cloud, and a major fundraise ahead, Obsidian is scaling rapidly toward long-term growth and IPO readiness.

Sr. Staff Site Reliability Engineer

As a Sr. Staff SRE at Obsidian , you will define and drive the company-wide reliability vision for a complex, multi-tenant SaaS platform serving enterprise and financial customers. You will operate as a strategic partner to DevOps and Platform Engineering leadership, shaping a unified reliability strategy that scales across the organization.

Your core mandate: ensure Obsidian detects, diagnoses, and communicates system issues before customers are impacted—consistently and predictably.

This is a hands-on technical role that involves architecting and leading the implementation of systems that handle real-world complexity, including upstream SaaS dependencies, sparse and noisy signals, and mission-critical enterprise workloads.

Key Responsibilities

Reliability Strategy & Architecture - Define and lead long-term reliability strategy across services. Establish end-to-end system visibility frameworks and guide architecture for observability, detection, and resilience.
Cross-Org Leadership - Partner across teams to embed reliability, standardize SLI/SLOs, and serve as a technical escalation expert.
Detection & Observability - Build intelligent detection systems (anomaly detection, connector health models) and enable self-service observability.
Incident Management - Define and evolve a tiered incident communication strategy , improve response practices, and lead postmortems to strengthen reliability and customer trust.
Execution - Contribute hands-on to system design, monitoring, and debugging across distributed systems and data pipelines.

Required Qualifications

5+ years in SRE, Production Engineering, or related roles
3+ years operating at a senior or technical leadership level (Staff or equivalent scope)
Deep expertise in:

AWS and/or GCP
Kubernetes and Helm
Observability stacks (Prometheus, Grafana, or equivalent)
CI/CD systems (GitLab CI/CD, ArgoCD, etc.)

Proven experience designing and scaling reliability systems for multi-tenant SaaS platforms
Strong debugging and systems thinking across distributed microservices and legacy systems
Demonstrated ability to lead initiatives that improve incident detection, response, and system resilience
Hands-on engineering approach with a track record of building—not just configuring—reliability systems

Preferred Qualifications

Experience in B2B SaaS serving enterprise or financial customers
Familiarity with third-party SaaS connector architectures and ingestion patterns
Experience building anomaly detection or intelligent alerting systems
Experience designing customer-facing status pages and incident communication frameworks

Why This Role

Drive org-wide reliability strategy
Own and build new detection & observability systems
Tackle complex distributed systems challenges
Safeguard critical infrastructure for financial customers

What Success Looks Like

Issues caught and resolved before customer impact
Reliability is measurable and continuously improving
Teams self-serve observability with scalable tools
Clear, proactive incident communication builds trust
Reliability becomes a competitive advantage

Employee Benefits

Our competitive benefits packages are designed to support our employees' well-being, both at work and at home. Our US based employees enjoy:

Competitive compensation with equity and 401k
Comprehensive healthcare with dental and vision coverage
Flexible paid time off and paid holiday time off
12 weeks of new parent or family leave
Personal and professional development resources

For more details on our US benefits, or for information on our international benefits, please see here.

Pay Transparancy

Please note that the base pay range is a guideline and for candidates who receive an offer, the base pay will vary based on factors such as work location, as well as the knowledge, skills and experience of the candidate. In addition to a competitive base salary, this position is eligible for equity awards and may be eligible for sales commission or incentive compensation based on the role or function within the company.

At Obsidian, we are proud to be an equal-opportunity employer. We value diversity and hire for talent, passion, and compassion. In compliance with federal law, all persons hired will be required to submit satisfactory proof of identity and legal authorization. If you have a need that requires accommodation, please contact View email address on ziprecruiter.com

Information collected and processed as part of any job applications you choose to submit is subject to Obsidian's Applicant Privacy Policy.

Base Salary Range

$232,000—$263,000 USD

Apply

Vacancy posted 11 days ago

Similar jobs that could be interesting for youBased on the Sr. Staff Site Reliability Engineer in Palo Alto, CA vacancy

Sr. Site Reliability Engineer, Vehicle Software, Build Infrastructure
$140k - $312k
What to Expect Tesla's continued success depends on engineers being able to develop, debug, and deploy software quickly. Our build, tools... ...documents, with an eye toward identifying performance and reliability bottlenecks Provide SRE expertise and implementing standard methodologies...
Senior
Hourly pay
Full time
Temporary work
Immediate start
Flexible hours
Tesla Motors, Inc.
Palo Alto, CA
1 day ago
Senior Director of Site Reliability Engineering
...in transformative projects. Together, let's push boundaries and achieve unparalleled success. As a Senior Director of Site Reliability Engineering at JPMorgan Chase within the I nfrastructure Platforms and Foundational Services (IPFS) team , you are deemed as...
Senior
J.P. Morgan
Palo Alto, CA
15 days ago
Sr Staff Android Framework Developer
$167.2k - $316.6k
...architecture, workflows, and technical specifications. Qualifications Bachelor’s or Master’s degree in Computer Science, Software Engineering, or equivalent combination of relevant education and experience. 8+ years of software development experience, particularly in...
Senior
Immediate start
Visa sponsorship
Flexible hours
Ford Motor Company
Palo Alto, CA
1 day ago
Senior Site Reliability Engineer - Remote & Scalable Impact
...join our small team focused on growth and productivity. The role involves scaling our platform and infrastructure while enhancing reliability and the overall developer experience. Ideal candidates will have strong expertise in distributed systems, cloud-native...
Senior
Remote job
BuildBuddy
Palo Alto, CA
3 days ago
Sr Staff Design Verification
$224k - $257k
...center infrastructure, enabling the next giant leaps in human progress. The company invented the world’s first 3D-stacked photonics engine, Passage™, capable of connecting thousands to millions of processors at the speed of light in extreme-scale data centers for the...
Senior
Full time
Temporary work
Flexible hours
Lightmatter
Mountain View, CA
1 day ago
Senior Site Reliability Engineer- Palo Alto, the US
Senior Site Reliability Engineer (Payments Infrastructure) Kody is seeking a Senior Site Reliability Engineer to ensure the reliability, availability, scalability, and operational excellence of our global payment platform. You will own production observability, incident...
Senior
Kody
Palo Alto, CA
1 day ago
Sr Site Reliability Engineer (Prisma Access)
$120k - $200k
Sr Site Reliability Engineer (Prisma Access) 2 days ago Be among the first 25 applicants Job Description This role requires US Citizenship. Your Career Palo Alto Networks runs a large infrastructure and is one of the biggest GCP customers. As a Principal SRE, you'll...
Senior
Rotating shift
Palo Alto Networks
Santa Clara, CA
1 day ago
Senior Site Reliability Engineer
$179.2k - $268.8k
...sensors and compute systems, test operations, systems and safety engineering - all dedicated to redefining the relationship between people and their vehicles for millions of customers. As a Site Reliability Engineer on the team, you will be responsible for helping to...
Senior
Permanent employment
Full time
Work at office
Immediate start
Visa sponsorship
Latitude AI
Palo Alto, CA
1 day ago
Sr. Staff Hardware Systems Architect - Infotainment/Skunkworks
$213k - $266.3k
...as reference platforms for future bring‑up, system integration, or system‑level validation. Collaborate with system performance engineers, hardware design and software teams to create comprehensive validation plans that surface key system‑level performance metrics, locate...
Senior
Full time
Local area
Rivian and Volkswagen Group Technologies
Palo Alto, CA
1 day ago
Senior Site Reliability Engineer - Cloud, Kubernetes & Infra
Tesla is looking for a seasoned Site Reliability Engineer (SRE) to join our growing team focused on next-generation diagnostics software. This role requires strong expertise in Linux internals, cloud-native apps, and containerization technologies such as Kubernetes and...
Senior
Tesla
Palo Alto, CA
1 day ago
Senior Staff Site Reliability Engineer, AViD, YouTube Ads
$262k - $365k
Senior Staff Site Reliability Engineer, AViD, YouTube Ads Mountain View, CA, USA; London, UK Note: By applying to this position you will have an opportunity to share your preferred working location from the following: Mountain View, CA, USA; London, UK . Advanced Experience...
Senior
Google
Mountain View, CA
1 day ago
Sr Staff Site Reliability Engineer Veza
$165.5k - $289.6k
Sr Staff Site Reliability Engineer - Veza Full-time Employee Type: Regular Region: AMS - North America and Canada Work Persona: Flexible or Remote Veza is the pioneer in identity security, purpose-built to answer the fundamental question enterprises face: who can...
Senior
Full time
Work at office
Remote work
Flexible hours
ServiceNow
Santa Clara, CA
1 day ago
Senior PaaS Site Reliability Engineer (Cloud Ops)
$115.8k - $160k
Tencent is seeking a skilled professional to manage and optimize PaaS products in North America. This role involves monitoring product stability, resolving technical issues, and applying tools like CI/CD to enhance operational efficiency. Candidates should have a Bachelor...
Senior
Tencent
Palo Alto, CA
1 day ago
Sr. Site Reliability Engineer
...that keep the world running. Location: 5 on-site days a week in Sunnyvale, CA Headquarters. Our Team's Vision: Our Engineering team is shaping the future of cybersecurity... ...are looking for an experienced Senior Site Reliability Engineer (SRE) with a strong background in...
Senior
Full time
Work experience placement
Immediate start
Illumio
Sunnyvale, CA
2 days ago
Sr Cloud Site Reliability Engineer, IS&T Ai & Data Platforms
$170.7k - $300.2k
...business value. We do all this with an outstanding group of software engineers, data scientists, SRE/MLOps engineers and managers. We are... ...and providing insight for the Infrastructure service reliability and availability through extensible services & platforms. Design...
Senior
Relocation
Apple Inc.
Cupertino, CA
1 day ago
Staff OR Sr. Staff Product Manager - Agentic AI (TurboTax)
$188.5k - $255k
...agentic AI features and experiences, partnering deeply with AI/ML engineers, data scientists, designers, and tax experts to deliver high‑... ...comparisons across categories of ethnicity and gender. Staff PM: - SF Bay Area, CA: $188,500 - $255,000 - San Diego, CA: $...
Senior
Work at office
3 days per week
Intuit Inc.
Mountain View, CA
4 days ago
Site Reliability Engineer - PaaS
$115.8k - $160k
Responsibilities Monitor and maintain Tencent Cloud's PaaS products in the North American region to ensure stability and reliability, resolving technical issues and mitigating risks to keep services operating smoothly in various technical scenarios. Utilize tools or platforms...
Relocation package
Tencent
Palo Alto, CA
1 day ago
Site Reliability Engineer (SRE)
$100k - $200k
OPPO US Research Center is seeking a skilled and proactive Site Reliability Engineer (SRE) to join our team. In this role, you will be responsible for ensuring the stability, scalability, and performance of our application systems. The ideal candidate is passionate about...
Full time
OPPO
Palo Alto, CA
2 days ago
Staff/Sr. Staff Electrical Test Engineer
$170k - $220k
...center infrastructure, enabling the next giant leaps in human progress. The company invented the world’s first 3D-stacked photonics engine, Passage™, capable of connecting thousands to millions of processors at the speed of light in extreme‑scale data centers for the...
Senior
Full time
Temporary work
Flexible hours
Lightmatter
Mountain View, CA
1 day ago
Senior Staff PM, Small Business Health Growth
$205.5k - $278k
Intuit is seeking a Senior Staff PM to develop the Small Business Health product, focused on providing insights to small business customers. You'll own the vision and strategy, set key engagement metrics, and ensure alignment across multiple teams. With a competitive compensation...
Senior
Intuit
Mountain View, CA
1 day ago
Site Reliability Engineer - W2 Role
Role: Site Reliability Engineer (SRE) Location: Palo Alto, CA (Onsite from Day 1) Job Type: Contract (W2) Key Skills Programming: Proficiency in languages like Python, Java, or Go. System Administration: Strong understanding of Linux/Unix systems. Cloud Infrastructure...
Contract work
Saransh Inc
Palo Alto, CA
1 day ago
Site Reliability Engineer, Diagnostics
$140k - $312k
...design documents, with an eye toward identifying performance and reliability bottlenecks Productionalize new services and features, as... ...What You'll Bring Degree in Computer Science, Electrical Engineering, Automotive Engineering, or related field, or equivalent experience...
Hourly pay
Temporary work
Immediate start
Flexible hours
Tesla
Palo Alto, CA
1 day ago
Site Reliability Engineer
$165k - $190k
...DevOps / SRE Team The DevOps/SRE team at Obsidian ensures that engineering excellence translates into stable, scalable, and high‑... ...security platform Address complex challenges around scalability, reliability, observability, and cost efficiency Collaborate with Engineering...
Work from home
Flexible hours
Obsidian-Security
Palo Alto, CA
1 day ago
Site Reliability Engineer
...technologies. Our mission is to double America’s compute capacity without building new data centers. We are seeking a skilled Site Reliability Engineer to join our growing team. The ideal candidate will help ensure the reliability, scalability, and performance of our hybrid...
Work at office
Weekend work
FLUIX
Palo Alto, CA
3 days ago
Senior Staff TPM: Platform Strategy & Execution
Coupang in Mountain View, CA is seeking a Senior Staff Technical Program Manager to lead company-wide, technically complex programs... ...drive strategy, architecture, and data-driven decisions across engineering and business teams to deliver critical platform capabilities. The...
Senior
Coupang
Mountain View, CA
1 day ago
Senior Staff Tech Lead, YouTube Shorts Discovery & ML
Google is seeking a Software Engineer in Mountain View, CA, to enhance YouTube Shorts discovery through large-scale recommendation systems. You will lead projects that optimize user engagement and drive growth. The role requires significant experience in software development...
Senior
Google
Mountain View, CA
1 day ago
Senior Staff ML Tech Lead for Shorts Discovery
$262k - $365k
Google is seeking an experienced Software Engineer specializing in AI/ML in Mountain View, CA. You will lead project teams and develop large-scale recommendation models to grow the YouTube Shorts ecosystem, working in a dynamic, innovative environment. The role necessitates...
Senior
Google
Mountain View, CA
4 days ago
Senior Staff Platform PM: Data Integrations & Ecosystem
Intuit is looking for a Senior Staff Platform Product Manager to lead the strategic vision for its data integration platform, enhancing connectivity across QuickBooks, TurboTax, Credit Karma, and Mailchimp. This role involves onboarding data providers and driving revenue...
Senior
Intuit
Mountain View, CA
1 day ago
Senior Staff Manufacturing Eng, High-Volume Assembly & Test
$91.6k - $316.8k
Tesla Motors, Inc. is looking for a Sr. Staff Manufacturing Engineer based in Palo Alto, CA. This role involves leading a team focused on high-volume production while optimizing manufacturing processes and developing automation solutions. Candidates should have a Bachelor...
Senior
Tesla Motors, Inc.
Palo Alto, CA
1 day ago
Senior Staff PM — Agentic AI for Tax
Intuit Inc. is seeking a seasoned Product Manager to lead the strategy for Agentic AI in tax preparation. This role is pivotal in designing AI agents capable of complex reasoning, enhancing user trust and accuracy. Candidates should have 6+ years in software product management...
Senior
Intuit Inc.
Mountain View, CA
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Sr. Staff Site Reliability Engineer. Be the first to apply!