Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Site Reliability Engineer

$182k - $249k

Okta, Inc.

Secure Every Identity, from AI to HumanIdentity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.We are seeking an experienced Staff Site Reliability Engineer to join our Infrastructure Platform AGILE SRE team. This role focuses on providing cross-functional support, enabling teams to build critical infrastructure at the same time strengthening our internal tooling and operational capabilities. You will work closely with other Infrastructure Operations teams to diagnose, troubleshoot, and resolve complex infrastructure challenges by building tooling and designing clever solutions.Key ResponsibilitiesInvestigate and resolve infrastructure issues reported by internal teamsProvide technical guidance and support across multiple technical domainsContribute to runbooks, documentation, and knowledge sharingMentor junior team members on SRE best practices and troubleshooting methodologiesIdentify and implement improvements to monitoring, alerting, and incident response processesRequired Qualifications7+ years of Site Reliability Engineering or equivalent systems administration experienceProficiency with Kubernetes and container orchestrationStrong Linux/Unix systems administration backgroundGood understanding of CI/CD and deployment strategiesGood grasp of networking conceptsExperience with infrastructure as code, infrastructure troubleshooting and general architectureExcellent communication and documentation skillsPreferred technologies/languages (Priority Areas)Kubernetes, Terraform, Golang, PythonExperience working across multiple teams in a cross-functional capacityFamiliarity with compliance and change management processesWhy Join UsWork on critical infrastructure supporting multiple teamsOpportunity to grow expertise in modern infrastructure toolingCollaborative environment with strong knowledge-sharing cultureImpact on infrastructure reliability and team efficiency#LI-Hybrid#LI-LSS1requisition ID- P4489_3418534The annual base salary range for this position for candidates located in the San Francisco Bay area is between: $182,000—$249,000 USDBelow is the annual base salary range for candidates located in California (excluding San Francisco Bay Area), Colorado, Illinois, New York and Washington. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location. In addition, Okta offers equity (where applicable), bonus, and benefits, including health, dental and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies. To learn more about our Total Rewards program please visit: annual base salary range for this position for candidates located in California (excluding San Francisco Bay Area), Colorado, Illinois, New York, and Washington is between:$162,000—$223,000 USDThe Okta ExperienceSupporting Your Well-BeingDriving Social ImpactDeveloping Talent and Fostering Connection + CommunityWe are intentional about connection. Our global community, spanning over 20 offices worldwide, is united by a drive to innovate. Your journey begins with an immersive, in-person onboarding experience designed to accelerate your impact and connect you to our mission and team from day one.Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and convictions records, consistent with applicable laws.If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding please use this Form to request an accommodation.Notice for New York City Applicants & Employees: Okta may use Automated Employment Decision Tools (AEDT), as defined by New York City Local Law 144, that use artificial intelligence, machine learning, or other automated processes to assist in our recruitment and hiring process. In accordance with NYC Local Law 144, if you are an applicant or employee residing in New York City, pleaseOkta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Personnel and Job Candidate Privacy Notice at foundation for secure connections between people and technologyOkta is the leading independent provider of identity for the enterprise. The Okta Identity Cloud enables organizations to securely connect the right people to the right technologies at the right time. With over 7,000 pre-built integrations to applications and infrastructure providers, Okta customers can easily and securely use the best technologies for their business. More than 19,300 organizations, including JetBlue, Nordstrom, Slack, T-Mobile, Takeda, Teach for America, and Twilio, trust Okta to help protect the identities of their workforces and customers. #J-18808-Ljbffr

Vacancy posted 12 hours ago
Similar jobs that could be interesting for youBased on the Staff Site Reliability Engineer in San Francisco, CA vacancy
  • $140k - $205k

     ...Senior Technology Site Reliability Engineer Cooley is seeking a Senior Site Reliability Engineer to join the Infrastructure & Development Operationsteam. Position summary: The Senior Technology Site Reliability Engineer("SRE") is responsible for ensuring the reliability... 
    Suggested
    Full time
    Temporary work
    Work at office
    Flexible hours
    Weekend work

    Cooley

    San Francisco, CA
    3 days ago
  •  ...The TeamPlatform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational...  ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper). As... 
    Suggested
    Work at office
    Local area
    Remote work
    Worldwide
    Flexible hours

    MongoDB

    San Francisco, CA
    3 days ago
  • $140k - $220k

     ...About the Job You’ll own reliability and operational excellence for Pylon's production systems. This means designing and implementing...  ...scale as we grow. You'll build tooling that makes the entire engineering team more effective, establish on-call rotations and runbooks... 
    Suggested

    Pylon

    San Francisco, CA
    1 day ago
  •  ...alongside clinicians to make that possible. We’re a team of doctors, engineers, designers, researchers, and creatives building tools that...  ...for leading incidents end-to-end. Improve operational reliability: Identify recurring issues and reliability risks, and drive fixes... 
    Suggested
    Work at office
    Worldwide

    Heidi Health Ltd

    San Francisco, CA
    1 day ago
  • $125k - $165k

     ...Position: Site Reliability Engineer Location: San Francisco, CA Job Id: 434 # of Openings: 1 TELCOR Inc, a leading innovator in laboratory software, is looking for a Site Reliability Engineer to join our TELCOR AI Systems team! Do you have strong experience in cloud infrastructure... 
    Suggested
    Temporary work
    Work at office
    Visa sponsorship
    Work visa
    Relocation package
    Flexible hours

    TELCOR

    San Francisco, CA
    14 hours ago
  •  ...customer acquisition, and Connor was a machine learning research engineer at Scale AI. The rest of our team comes from companies like...  ...of-the-art AI. As a Senior SRE, you'll tackle the scaling and reliability challenges that come with adding terabytes of data monthly and... 

    Unify

    San Francisco, CA
    12 hours ago
  •  ...and experiment constantly as we find the right paths in an AI-native landscape. The Role: You'll be the infrastructure and reliability engineer on the Data Replication team - a full-stack product team running over 3 million sync jobs a week powering thousands of data... 
    Local area

    Airbyte

    San Francisco, CA
    1 day ago
  • $163k - $203k

     ...will be a senior technical contributor on the SRE team, responsible for the reliability, scalability, and security of Prosper’s Cloud Platform portfolio. This is as much of a platform engineering role as it is SRE role — you will maintain the applications that run on our... 
    Work experience placement
    Work at office
    Local area
    Remote work
    Flexible hours
    2 days per week

    Prosper.com

    San Francisco, CA
    3 days ago
  • $50 per hour

     ...years of professional SRE experience 5+ years of experience contributing to architecture and design (architecture, design patterns, reliability and scaling) of new and current systems Bachelor's Degree in Computer Science or related field, or 8+ years relevant work... 
    Temporary work
    Work experience placement

    Epoch Biodesign

    San Francisco, CA
    13 hours ago
  • $166.9k - $225.9k

     ...SRE team operates as both a central engineering function and an embedded reliability practice. You'll be part of a close...  ...with product engineering leads and staff engineers to define SLOs and SLIs...  ...ll bring 6+ years of experience in Site Reliability Engineering, Cloud Engineering... 
    Flexible hours

    Drata

    San Francisco, CA
    13 hours ago
  •  ...values and enthusiasm for building a great culture and product, you will find a home at Fieldguide. About the Role As a Senior Site Reliability Engineer (SRE) at Fieldguide, you will be responsible for ensuring the reliability, scalability, and observability of our... 
    Remote work
    Work from home
    Flexible hours

    Fieldguide.ai

    San Francisco, CA
    12 hours ago
  •  ...The role We're looking for a world-class Site Reliability Engineer to ensure the reliability, performance, and scalability of our AI infrastructure platform. You’ll be building and operating the core systems that power agentic AI at scale. Your mission: keep our ultra-... 

    Blaxel

    San Francisco, CA
    1 day ago
  • $165k - $225k

     ...and the SDF team is expanding to support the rapidly growing and changing Stellar ecosystem. SDF is looking for a Senior Site Reliability Engineer to help build and operate the foundation that powers our engineering teams. You’ll ensure the reliability and scalability... 
    Temporary work
    Work at office
    Local area
    Worldwide
    Flexible hours

    Stellar Services

    San Francisco, CA
    13 hours ago
  •  ...millions of daily users while enabling our engineering teams to ship fast. You'll own the...  ...building automation and tooling that improves reliability and partnering with engineering to...  ...services What you'll bring ~5+ years in Site Reliability Engineering, DevOps, or... 
    Work at office
    Work from home

    Gamma

    San Francisco, CA
    13 hours ago
  •  ...TELCOR Inc is looking for a Site Reliability Engineer to ensure the reliability, scalability, and performance of our AI products' systems. The role involves designing and operating resilient systems in cloud and containerized environments while managing production infrastructure... 
    Remote work

    TELCOR

    San Francisco, CA
    13 hours ago
  •  ...work from home day is currently Tuesday. Engineering at Lambda is responsible for building...  ...observability adoptable and improve product reliability. Lead members of other engineering teams...  ...in Go Have 5+ years of experience in Site Reliability Engineering practices Possess... 
    Work at office
    Local area
    Work from home

    Lambda

    San Francisco, CA
    13 hours ago
  •  ...co‑founders with PhDs in AI, Math, and Computer Science — is poised to redefine computing. About the Role We're seeking a Site Reliability Engineer to ensure Hyperbolic's GPU marketplace and AI infrastructure operate with exceptional reliability, performance, and security... 

    deCircle

    San Francisco, CA
    1 day ago
  • # Senior Site Reliability EngineerHybrid - San Francisco**Our Mission & Values:** At Drata, we...  ...'s SRE team operates as both a central engineering function and an embedded reliability practice...  ...with product engineering leads and staff engineers to define SLOs and SLIs for... 
    Work at office
    Immediate start
    Worldwide
    Monday to Friday
    Flexible hours

    Careers at Drata

    San Francisco, CA
    12 hours ago
  •  ...For more information, please read ourSenior Site Reliability Engineer page is loaded## Senior Site Reliability Engineerlocations: US - San Francisco Bay Areatime type: Full timeposted on: Posted Yesterdayjob requisition id: R1478**There are NO limits to your career: come... 
    Immediate start
    Remote work
    Worldwide

    OutSystems

    San Francisco, CA
    13 hours ago
  •  ...advanced algorithms that significantly outperforms individual engineers. We combine language models with human ingenuity to push the...  ...and quality. The Role We are seeking an experienced Site Reliability Engineer to join our Platform Engineering team in the Bay Area... 

    CodeRabbit

    San Francisco, CA
    13 hours ago
  •  ...Job Description Job Description Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas...  ...evangelize cloud best practices while building a culture of reliability and observability Engage in and improve the end to end lifecycle... 

    Forhyre

    San Francisco, CA
    18 days ago
  •  ...Job Description Velia Multiservices is proud to partner with a fast-growing, early-stage startup to identify a top-tier Site Reliability Engineer who will play a critical role in scaling and strengthening a high-performance platform used by enterprise clients such as... 

    Velia multiservices

    San Francisco, CA
    18 days ago
  • $163k - $203k

     ...will be a senior technical contributor on the SRE team, responsible for the reliability, scalability, and security of Prosper’s Cloud Platform portfolio. This is as much of a platform engineering role as it is SRE role — you will maintain the applications that run on our... 
    Work experience placement
    Work at office
    Local area
    Remote work
    Flexible hours
    2 days per week

    Prosper

    San Francisco, CA
    13 days ago
  • $150k

     ...Job Description Job Description About The Role We are seeking an experienced Site Reliability Engineer (SRE) with a strong focus on DevSecOps to join our growing engineering team. In this role, you will oversee and maintain the reliability, security posture, and... 

    VantageScore

    San Francisco, CA
    22 days ago
  • $220k - $235k

     ...We are seeking a strategic, high-output Staff/Senior Staff SRE to define the future of our cloud platform and champion engineering excellence across Ironclad. In this role...  ...leadership and strategic direction for the Site Reliability Engineering team and our broader Cloud... 
    Full time
    Work at office

    Jobr

    San Francisco, CA
    13 hours ago
  • $210.6k - $305.1k

     ...Qualifications: ~ You have led a distributed team of 5+ engineers, can demonstrate strong technical vision for your team, and ensure...  ..., and basic life insurance. Please see the Cisco careers site to discover more benefits and perks. Employees may be eligible... 
    Full time
    Temporary work
    Local area
    Flexible hours

    Cisco

    San Francisco, CA
    3 days ago
  •  ...Site Reliability Engineer - AI Infrastructure Location: Global Remote / San Francisco · Full-Time About Andromeda Andromeda Cluster was founded by Nat Friedman and Daniel Gross to give early-stage startups access to the kind of scaled AI infrastructure once reserved only... 
    Full time
    Remote work

    Andromeda Cluster

    San Francisco, CA
    13 hours ago
  • $125k - $195k

     ...a small team of exceptional, hands-on engineers to make this happen. Mechanical, electrical...  ...We are seeking an Infrastructure & Site Reliability Engineer to design, build, deploy, and...  ...exceptional early-career engineers to senior and staff-level builders. There isn’t a single... 
    Work at office
    Visa sponsorship
    Night shift

    Atomicsemi

    San Francisco, CA
    13 hours ago
  • $138k - $179k

     ...write up and follow up tasks to close any gaps identified. We partner with a wide variety of other teams from infrastructure and engineering, to QA and business teams, so strong collaborative instincts and clear communication skills are a key part of our toolset. As well... 
    Flexible hours

    MSCI

    San Francisco, CA
    13 hours ago
  •  ...Senior Site Reliability Engineer - AI Infrastructure Location: Global Remote / San Francisco • Full-Time About Andromeda Andromeda Cluster was founded by Nat Friedman and Daniel Gross to give early‑stage startups access to the kind of scaled AI infrastructure once reserved... 
    Full time
    Remote work

    Cortes 23

    San Francisco, CA
    13 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Site Reliability Engineer. Be the first to apply!