Manager, Site Reliability Engineering

$204k - $306k

Full-time

Okta

San Francisco, California

Secure Every Identity, from AI to Human

Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence. This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.

This position requires 2 days a week in our San Francisco office.

The IDaaS Site Reliability Engineering Group

Okta authenticates, authorizes, and provisions millions of users a day. The service is hosted on Amazon Web Services (AWS) across multiple availability zones and geographically separated regions. The service is designed for high throughput and 99.999% availability. We're looking for a technical leader to help us continue to scale the service with great people and reliable, cost-effective, and efficient infrastructure, processes, and tooling.

As the Manager of Infrastructure Platform and Shared Services, you will oversee multiple teams focused on Edge networking, the K8s platform, CI/CD, Observability, and the automation platform and tooling.

What you’ll be doing

- Manage a team of SREs supporting various workloads and teams that support our IDaaS platform.
- Drive the microservice journey, DevOps maturity, and workload reliability in tandem with architects and teams across the organization.
- Accelerate the velocity of SRE and product engineering by developing powerful tooling, intuitive self-service capabilities, and robust self-healing patterns.
- Lead, mentor, and grow a high-performing team of engineers and managers across platform, infrastructure, and shared services domains.
- Perform engineering design evaluations and ensure the completion of projects within resource, budget, and scheduling constraints.
- Improve SDLC processes for Cloud infrastructure as code, including the maturity of CI/CD pipelines and change and release management.
- Manage service and business expectations and prioritize resource allocation.
- Maintain a deep knowledge of industry best practices, evolving trends, and technologies.

What you’ll bring to the role

- 3+ years of experience in technical leadership and people management.
- Extensive experience using Agile and DevOps methodologies to build product infrastructure and shared services at scale.
- Experience running large-scale infrastructure platforms supporting a SaaS/Cloud service in a public Cloud, preferably AWS. Experience supporting a multi-Cloud environment is a plus.
- Strong expertise in cloud-native architectures, containerization (Kubernetes), IaC (Terraform), and CI/CD pipelines.
- Strong background and hands-on experience in software development, PaaS, and automation.
- Deep experience building and operating observability platforms and monitoring tools (Grafana, Splunk, APM, etc.) in a large-scale environment.
- Effective verbal and written communication and interpersonal skills.
- Computer Science degree, related degree, or equivalent experience.

Additional requirements

This position requires the ability to access federal environments and/or have access to protected federal data. As a condition of employment for this position, the successful candidate must be able to submit documentation establishing U.S. Person status (e.g., a U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee; 22 CFR 120.15) upon hire.

P24518_3462184

Below is the annual base salary range for candidates located in the San Francisco Bay Area. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location. In addition, Okta offers equity (where applicable), bonus, and benefits, including health, dental, and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies. To learn more about our Total Rewards program please visit:

The annual base salary range for this position for candidates located in the San Francisco Bay Area is between:

$204,000—$306,000 USD

The Okta Experience

- Supporting Your Well-Being
- Driving Social Impact
- Developing Talent and Fostering Connection + Community

We are intentional about connection. Our global community, spanning over 20 offices worldwide, is united by a drive to innovate. Your journey begins with an immersive, in-person onboarding experience designed to accelerate your impact and connect you to our mission and team from day one.

Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and conviction records, consistent with applicable laws.

If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding please use this Form to request an accommodation.

Notice for New York City Applicants & Employees: Okta may use Automated Employment Decision Tools (AEDT), as defined by New York City Local Law 144, that use artificial intelligence, machine learning, or other automated processes to assist in our recruitment and hiring process. In accordance with NYC Local Law 144, if you are an applicant or employee residing in New York City, please click here to view our full NYC AEDT Notice.


Vacancy posted 23 hours ago

