Sr. Staff Site Reliability Engineer
$232k - $263kObsidian Security
Job Description
Job Description
Obsidian Security is the leading SaaS security platform, trusted by global enterprises like Snowflake, T-Mobile, and Algolia. We protect 200+ organizations across North America, Europe, the Middle East, Southeast Asia, Australia, and New Zealand, including many of the world's largest Fortune 1000 and Global 2000 companies.
Founded in 2017 and backed by top investors like Greylock, Obsidian was built to close a critical gap: securing SaaS apps where business happens—Microsoft 365, Salesforce, and hundreds more. The company does this by offering a complete SaaS security platform to reduce risk, detect and respond to threats, and prevent breaches at the source. Obsidian was built by leaders who redefined endpoint and identity security at CrowdStrike, Okta, Cylance, and Carbon Black. Now, they're transforming how SaaS is secured.
With AI driving rapid SaaS growth and complexity, agentic AI tools gain privileged access to sensitive data through integrations, creating new risks most security tools miss. Obsidian uniquely detects anomalous OAuth token activity and manages integration risks. Major announcements are on the horizon. Recognizing that SaaS security needs to evolve, Obsidian enables growing organizations to start with a lightweight, prevention-focused browser extension and expand coverage over time.
With global momentum, a growing partner ecosystem including SentinelOne, Databricks, and Google Cloud, and a major fundraise ahead, Obsidian is scaling rapidly toward long-term growth and IPO readiness.
Sr. Staff Site Reliability Engineer
As a Sr. Staff SRE at Obsidian , you will define and drive the company-wide reliability vision for a complex, multi-tenant SaaS platform serving enterprise and financial customers. You will operate as a strategic partner to DevOps and Platform Engineering leadership, shaping a unified reliability strategy that scales across the organization.
Your core mandate: ensure Obsidian detects, diagnoses, and communicates system issues before customers are impacted—consistently and predictably.
This is a hands-on technical role that involves architecting and leading the implementation of systems that handle real-world complexity, including upstream SaaS dependencies, sparse and noisy signals, and mission-critical enterprise workloads.
Key Responsibilities
- Reliability Strategy & Architecture - Define and lead long-term reliability strategy across services. Establish end-to-end system visibility frameworks and guide architecture for observability, detection, and resilience.
- Cross-Org Leadership - Partner across teams to embed reliability, standardize SLI/SLOs, and serve as a technical escalation expert.
- Detection & Observability - Build intelligent detection systems (anomaly detection, connector health models) and enable self-service observability.
- Incident Management - Define and evolve a tiered incident communication strategy , improve response practices, and lead postmortems to strengthen reliability and customer trust.
- Execution - Contribute hands-on to system design, monitoring, and debugging across distributed systems and data pipelines.
Required Qualifications
- 5+ years in SRE, Production Engineering, or related roles
- 3+ years operating at a senior or technical leadership level (Staff or equivalent scope)
- Deep expertise in:
- AWS and/or GCP
- Kubernetes and Helm
- Observability stacks (Prometheus, Grafana, or equivalent)
- CI/CD systems (GitLab CI/CD, ArgoCD, etc.)
- Proven experience designing and scaling reliability systems for multi-tenant SaaS platforms
- Strong debugging and systems thinking across distributed microservices and legacy systems
- Demonstrated ability to lead initiatives that improve incident detection, response, and system resilience
- Hands-on engineering approach with a track record of building—not just configuring—reliability systems
Preferred Qualifications
- Experience in B2B SaaS serving enterprise or financial customers
- Familiarity with third-party SaaS connector architectures and ingestion patterns
- Experience building anomaly detection or intelligent alerting systems
- Experience designing customer-facing status pages and incident communication frameworks
Why This Role
- Drive org-wide reliability strategy
- Own and build new detection & observability systems
- Tackle complex distributed systems challenges
- Safeguard critical infrastructure for financial customers
What Success Looks Like
- Issues caught and resolved before customer impact
- Reliability is measurable and continuously improving
- Teams self-serve observability with scalable tools
- Clear, proactive incident communication builds trust
- Reliability becomes a competitive advantage
Employee Benefits
Our competitive benefits packages are designed to support our employees' well-being, both at work and at home. Our US based employees enjoy:
- Competitive compensation with equity and 401k
- Comprehensive healthcare with dental and vision coverage
- Flexible paid time off and paid holiday time off
- 12 weeks of new parent or family leave
- Personal and professional development resources
For more details on our US benefits, or for information on our international benefits, please see here.
Pay Transparancy
Please note that the base pay range is a guideline and for candidates who receive an offer, the base pay will vary based on factors such as work location, as well as the knowledge, skills and experience of the candidate. In addition to a competitive base salary, this position is eligible for equity awards and may be eligible for sales commission or incentive compensation based on the role or function within the company.
At Obsidian, we are proud to be an equal-opportunity employer. We value diversity and hire for talent, passion, and compassion. In compliance with federal law, all persons hired will be required to submit satisfactory proof of identity and legal authorization. If you have a need that requires accommodation, please contact View email address on ziprecruiter.com
Information collected and processed as part of any job applications you choose to submit is subject to Obsidian's Applicant Privacy Policy.
Base Salary Range
$232,000—$263,000 USD
$167.2k - $316.6k
...components of the Android framework, enhancing the performance, reliability, and security of our IVI platform. The ideal candidate will... ...Bachelor's or Master's degree in Computer Science, Software Engineering, or equivalent combination of relevant education and experience...SeniorImmediate startVisa sponsorshipFlexible hours$270k - $340k
...agendas and dive deep into low-level implementation details with engineering partners. Role Summary As a Principal Research... ...teams, ensuring that research ideas translate into performant, reliable infrastructure. Partner with product and engineering to translate...SeniorLocal areaWorldwide$210k - $270k
Zocdoc is seeking a Senior Site Reliability Engineer to develop and maintain distributed production systems. The ideal candidate will have over 5 years of experience in site reliability or production engineering, particularly in cloud environments like AWS. Responsibilities...Senior- ...join our small team focused on growth and productivity. The role involves scaling our platform and infrastructure while enhancing reliability and the overall developer experience. Ideal candidates will have strong expertise in distributed systems, cloud-native...SeniorRemote job
- ...system-level validation. Collaborate with system performance engineers, hardware design and software teams to create comprehensive... ...available for full-time employees, check out our Global Benefits Site. External candidates can apply for this role through the Rivian...SeniorFull timeContract workLocal area
- The Role We're looking for a Senior Site Reliability Engineer to own the reliability, scalability, and operational excellence of the production systems that power Nectar's platform. We run high-volume data ingestion pipelines and real-time AI agents on top of a fast-growing...Senior
- ...that keep the world running. Location: 5 on-site days a week in Sunnyvale, CA Headquarters. Our Team's Vision: Our Engineering team is shaping the future of cybersecurity... ...are looking for an experienced Senior Site Reliability Engineer (SRE) with a strong background in...SeniorWork experience placement
$140k - $220k
About the Job You’ll own reliability and operational excellence for Pylon’s production systems. This means designing and implementing... ...scale as we grow. You’ll build tooling that makes the entire engineering team more effective, establish on‑call rotations and runbooks...Senior- ..., and the challenges of building in a high-growth startup, we’d love to talk. This is more than a job—it’s a journey. Site Reliability Engineers (SREs) are responsible for the overall performance and reliability of ASAPP's infrastructure and products. The team owns...SeniorRemote work
$210k - $270k
Your Impact on our Mission: Zocdoc is looking for a Senior Site Reliability Engineer to help develop, monitor, and maintain our distributed production systems. You’ll be challenged with building frameworks and processes for ensuring uptime for our patients and providers...SeniorFlexible hours$180k - $200k
...Holmdel, NJ. Join us and be part of a team that's shaping the future of payments—one experience at a time. As our Site Reliability Engineer, you will design, build, and maintain the systems and infrastructure that power our applications, ensuring their...SeniorFor contractorsWork at officeWork from homeFlexible hours- ...Infrastructure Footprint: Global production infrastructure across AWS, South America, and Europe Role Overview Seeking a Senior Site Reliability Engineer / DevOps Engineer to design, scale, and operate highly available global infrastructure supporting production systems...Senior
$180k - $260k
...facilitating effortless integration into customers’ logistics operations. About the role We are seeking an experienced Senior/Staff Site Reliability Engineer to support the operation, monitoring, and scaling of our growing fleet of autonomous vehicles. In this role, you will...SeniorOdd jobWork at officeRemote work$176.2k - $233.53k
...projects and functions. Responsibilities Mentor and teach our engineers by providing thoughtful critique and guidance grounded in... ...systems An awareness of design for manufacturability, service, reliability and sustainability An eagerness to work collaboratively and cross...SeniorFull timeContract workTemporary workPart timeLocal areaShift work- A leading tech recruiting firm is seeking a Site Reliability Engineer to manage and optimize cloud infrastructure primarily using GCP or AWS. The role involves maintaining high availability through Kubernetes clusters and improving CI/CD pipelines with Terraform. Ideal...Senior
$120.3k - $194.53k
...drives great outcomes. Job Summary Palo Alto Networks runs a large hybrid infrastructure across multiple public clouds. As a Site Reliability Engineer on the Internet Security Platform team, you will be part of a team supporting Advanced DNS Security services. This...SeniorFull timeWork at officeVisa sponsorshipWork visa$232.9k - $335.81k
## Principal Site Reliability EngineerApplylocations: USA - CA - Palo Altotime type: Full timeposted... ...for a Principal Site Reliability Engineer to join our Platform Engineering team —... ...Platform Engineering, with demonstrated Staff- or Principal-scope impact and a track...Permanent employment$281k - $356k
...miles of driving data from a diverse set of sensors, enabling engineers like you to (1) develop methods for efficiently and continuously... ...offboard hardware. In this hybrid role you will report to a Sr Staff Technical Lead Manager. You will: * Own object detection...SeniorFull timeTemporary workRemote work- ...REEVO ENGINEERING BUILDER - JD We are seeking a Senior Software Engineer who thrives as a T-shaped individual—bringing deep technical expertise in software engineering while also possessing a broad range of skills that allow them to creatively tackle diverse challenges...Senior
$180k
...knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who... ...Cybersecurity / SRE team is focused on ensuring the security and reliability of X Money. This role will primarily focus on the X Money platform...Permanent employmentTemporary workRelocation- ...Job Description Job Description Site Reliability Engineer Onsite- Bay Area, CA Skills Relevant Skills and Experience What You’ll Do (Day-to-Day) Own and manage our cloud infrastructure (GCP or AWS, on-prem). Build, maintain, and optimize Kubernetes...
$163k - $261k
Who we are Aurora's mission is to deliver the benefits of self-driving technology safely, quickly, and broadly. The Aurora Driver will create a new era in mobility and logistics, one that will bring a safer, more efficient, and more accessible future to everyone...SeniorWork at officeLocal area3 days per week- A global data and AI company is seeking a Senior Staff Technical Program Manager to lead Reliability initiatives within product engineering teams. This role requires over 10 years of experience in managing cloud infrastructure programs and driving improvements in reliability...SeniorLocal area
- REEVO ENGINEERING BUILDER - JD We are seeking a Senior Software Engineer who thrives as a T-shaped individual—bringing deep technical expertise in software engineering while also possessing a broad range of skills that allow them to creatively tackle diverse challenges....Senior
$180k - $360k
...knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who... ...Cybersecurity / SRE team is focused on ensuring the security and reliability of X Money. This role will primarily focus on the X Money platform...Temporary workRelocation$300k - $334k
Google Inc. is seeking a Senior Staff Technical Program Manager for the GeminiApp team in Mountain View, CA. This role involves translating... ...inception to delivery, and cultivating partnerships across engineering teams. A minimum of 10 years of experience in technical program...Senior$86.33k - $191.9k
...guardrails to make going fast also going safely. Identifying reliability anti-patterns and solving them systemically . You dive deep into... ...of AI‑assisted developer tools and platforms to increase engineering productivity, enforce code quality standards, and enable real...Local areaFlexible hours- ...technologies. Our mission is to double America’s compute capacity without building new data centers. We are seeking a skilled Site Reliability Engineer to join our growing team. The ideal candidate will help ensure the reliability, scalability, and performance of our hybrid...Work at officeWeekend work
$144k - $216k
...Date posted 05/25/2026 Category Engineering Hire Type Employee Job ID 17502 Base Salary Range $144000-$216000 Remote... ..., take ownership once work is scoped, and focus on delivering reliable code, strong test coverage, and solid performance. At Synopsys...SeniorRemote work$262k - $365k
Google is seeking an experienced Software Engineer specializing in AI/ML in Mountain View, CA. You will lead project teams and develop large-scale recommendation models to grow the YouTube Shorts ecosystem, working in a dynamic, innovative environment. The role necessitates...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Sr. Staff Site Reliability Engineer. Be the first to apply!
- staff data engineer Palo Alto, CA
- assistant engineer Palo Alto, CA
- staff engineer Palo Alto, CA
- software engineer staff Palo Alto, CA
- senior staff systems engineer Palo Alto, CA
- senior staff engineer Palo Alto, CA
- technology administrator Palo Alto, CA
- engineering aide Palo Alto, CA
- site reliability engineer sre Palo Alto, CA
- site reliability engineer Palo Alto, CA


