Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Site Reliability Engineer - Kubernetes

$194k - $267k

Okta

Secure Every Identity, from AI to Human

Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.

This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.

Workforce Identity Cloud

Okta Workforce Identity Cloud (WIC) provides easy, secure access for your workforce so you can focus on other strategic priorities—like reducing costs, and doing more for your customers.

If you like to be challenged and have a passion for solving large-scale automation, testing, and tuning problems, we would love to hear from you. The ideal candidate is someone who exemplifies the ethics of, “If you have to do something more than once, automate it” and who can rapidly self-educate on new concepts and tools.

Position Overview:

The Site Reliability Engineer (SRE) will play a key role in building and managing Kubernetes platforms that support cloud-native applications and services. This position focuses on architecting and managing reliable, scalable, and secure Kubernetes-based platforms on AWS, ensuring high availability and performance while optimizing costs and automation. The ideal candidate will have hands-on experience with AWS infrastructure, Kubernetes platform creation, Helm charts, Karpenter scaling, and Istio service mesh.

Key Responsibilities:

  • Kubernetes Platform Creation: Design, implement, and maintain highly available, scalable, and fault-tolerant Kubernetes platforms. Ensure clusters are optimized for production workloads, providing high resilience and operational efficiency.
  • AWS Infrastructure Management: Build, manage, and optimize AWS cloud infrastructure, including EKS,ECS, S3, VPCs, RDS, IAM, and more. Implement best practices for cost management, scaling, and security within AWS.
  • Helm Management: Utilize Helm to automate and streamline the deployment of applications and services to Kubernetes clusters. Create, maintain, and manage Helm charts for production-ready deployments.
  • Karpenter Implementation: Implement and manage Karpenter to dynamically scale Kubernetes clusters in response to workload demands. 
  • Istio Service Mesh Management: Configure and manage Istio to provide service-to-service communication, security, and observability within the Kubernetes clusters. Enable fine-grained traffic management, service discovery, and policy enforcement.
  • Platform Automation & Scaling: Automate the deployment, scaling, and management of infrastructure and applications. Work with CI/CD pipelines to ensure a seamless flow from development to production with minimal downtime.
  • Incident Management & Troubleshooting: Respond to incidents, troubleshoot, and resolve system issues related to performance, availability, and security in a timely and effective manner.
  • Security & Compliance: Design and implement secure cloud infrastructure with appropriate access controls, network security, and compliance frameworks.
  • Documentation & Knowledge Sharing: Create and maintain detailed documentation for Kubernetes platform setup, operational procedures, and best practices. Promote knowledge sharing across teams.

Required Qualifications:

  • 4+ years of experience with Kubernetes/Helm;
  • 4+ years of Experience with Terraform.
  • 5+ years of Experience with AWS
  • Experience with multi-region cloud environments.
  • Proven experience with AWS (EC2, RDS, S3, CloudFormation, IAM, etc.) and solid understanding of cloud-native architectures.
  • Strong expertise in Kubernetes platform creation, management, and optimisation (e.g., setting up highly available clusters, networking, and storage).
  • Hands-on experience with Helm for Kubernetes application deployment and management.
  • Practical experience with Karpenter for dynamic scaling of Kubernetes clusters and optimising resource usage.
    Expertise in managing and securing Istio for service mesh, including traffic management, security, and observability features.
  • Proficiency in CI/CD pipelines and automation tools (e.g., Jenkins, GitLab, CircleCI, Terraform, Ansible, Spinnaker).
    Strong scripting and automation skills in Python, Bash, or Go for infrastructure management and platform automation.
  • Experience with monitoring, logging, and alerting tools such as Prometheus, Grafana, CloudWatch, and ELK Stack.

Preferred Qualifications:

  • Understanding of security best practices for cloud platforms and Kubernetes (e.g., role-based access control (RBAC), encryption, and compliance frameworks).
  • Familiarity with Docker and containerization principles.
  • Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent professional experience).
  • Certifications (Preferred): CKA (Certified Kubernetes Administrator), CKAD (Certified Kubernetes Application Developer), or AWS Certified DevOps Engineer are highly desirable.

Additional requirements:

  • This position requires the ability to access federal environments and/or have access to protected federal data.  As a condition of employment for this position, the successful candidate must be able to submit documentation establishing U.S. Person status (e.g. a U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee. 22 CFR 120.15) upon hire.
  • Requires in-person onboarding and travel to our San Francisco, CA HQ office or our Chicago office during the first week of employment.

#LI-Hybrid

#LI-LSS1

requisition ID- (P16373_3396241)

The annual base salary range for this position for candidates located in the San Francisco Bay area is between: $194,000—$267,000 USD

Below is the annual base salary range for candidates located in California (excluding San Francisco Bay Area), Colorado, Illinois, New York and Washington. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location. In addition, Okta offers equity (where applicable), bonus, and benefits, including health, dental and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies. To learn more about our Total Rewards program please visit: .   

The annual base salary range for this position for candidates located in California (excluding San Francisco Bay Area), Colorado, Illinois, New York, and Washington is between: $174,000—$214,000 USD


The Okta Experience

We are intentional about connection. Our global community, spanning over 20 offices worldwide, is united by a drive to innovate. Your journey begins with an immersive, in-person onboarding experience designed to accelerate your impact and connect you to our mission and team from day one.

Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and convictions records, consistent with applicable laws.

If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding please  use this Form to request an accommodation.

Notice for New York City Applicants & Employees: Okta may use Automated Employment Decision Tools (AEDT), as defined by New York City Local Law 144, that use artificial intelligence, machine learning, or other automated processes to assist in our recruitment and hiring process. In accordance with NYC Local Law 144, if you are an applicant or employee residing in New York City, please  click here to view our full NYC AEDT Notice.

Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Personnel and Job Candidate Privacy Notice at  .
Vacancy posted more than 2 months ago
Similar jobs that could be interesting for youBased on the Staff Site Reliability Engineer - Kubernetes in Bellevue, WA vacancy
  • $160k - $220k

     ...developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SR. KUBERNETES PLATFORM SITE RELIABILITY ENGINEER (STARLINK) At SpaceX we’re leveraging our experience in building rockets and spacecraft to deploy Starlink, the... 
    Suggested
    Permanent employment
    Temporary work
    Work at office
    Worldwide
    Monday to Friday
    Weekend work

    Latent AI

    Redmond, WA
    1 day ago
  •  ...containerization technologies such as Docker and orchestration tools like Kubernetes. Experience with infrastructure as code tools such as...  ...related to cloud platforms (AWS Certified DevOps Engineer, Azure DevOps Engineer Expert, etc.) is a plus. Experience with... 
    Suggested

    TechDigital Group

    Bellevue, WA
    2 days ago
  • $163.62k - $212.71k

     ...operating mainly in AWS with multiple Kubernetes clusters and thousands of servers. We...  ...platforms, and processes that improve our engineering teams' productivity and streamline the...  ...seasoned and strategic Lead/Principal Site Reliability Engineer to drive the reliability,... 
    Suggested
    Full time
    Part time
    Work experience placement
    Work at office
    Local area
    Immediate start
    Remote work
    Work from home
    Flexible hours
    Shift work
    3 days per week
    1 day per week

    iSpot

    Bellevue, WA
    14 days ago
  •  ...Staff Platform Engineer, Kubernetes Bellevue Office, Sunset Corporate Campus Armada is the hyperscaler for the edge, delivering modular AI...  ...edge locations and cloud infrastructure, ensuring the reliability of our distributed computing platform. Location. This... 
    Suggested
    Work at office
    Local area
    Flexible hours

    Armada

    Bellevue, WA
    4 days ago
  • $120k - $176k

     ...Software Engineer, Kubernetes Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built...  ..., you will play a key part in ensuring the availability, reliability, and scalability of one of the industry's largest Kubernetes... 
    Suggested
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Bellevue, WA
    4 days ago
  •  ...building the AI Cloud of the future. We are seeking a Staff Engineer to help our development of our Managed Kubernetes platform. Think GKE, but purpose-built for AI...  ...Native infrastructure to build systems that are reliable, performant, and elegantly simple for our customers... 
    Work at office
    Local area
    Immediate start
    Work from home
    Flexible hours

    Lambda Corporation

    Bellevue, WA
    7 hours ago
  • IBM Computing is seeking a Staff Software Engineer for the Secure Compute team, located in Bellevue, WA. You will lead the development of a cloud-native compute platform built on Kubernetes, supporting both trusted and untrusted workloads at scale. The ideal candidate... 

    IBM Computing

    Bellevue, WA
    5 days ago
  • A leading financial institution is seeking a Site Reliability Engineer III in Seattle, WA. This role involves designing and managing cloud infrastructure...  ...Reliability Engineering or DevOps, with strong AWS and Kubernetes expertise. The position offers competitive compensation... 

    JPMorgan Chase & Co.

    Seattle, WA
    5 days ago
  •  ...Job Title: DevOps Engineer - Kubernetes & Cloud Location: Bellevue WA 98006 - Onsite from Day 1 Long Term Contract Must-Have Skills...  ...YAML templates for various workloads. Enhance system reliability, scalability, and observability using tools like... 
    Long term contract
    Contract work

    AceStack LLC

    Bellevue, WA
    4 days ago
  •  ...in Bellevue is seeking an experienced technical leader to mentor a collaborative engineering team. The ideal candidate will have over 10 years of experience, deep expertise in Kubernetes, and a passion for distributed systems. This role combines technical leadership with... 

    Union.ai

    Bellevue, WA
    2 days ago
  • $160k - $210k

     ...achieving remarkable growth in a rapidly evolving industry. Now, we're growing! The Role We are looking for a senior site reliability engineer to work on expanding our global footprint of datacenters and improve service management across Cognitiv. Our immediate... 
    Work at office
    Immediate start
    Remote work
    Work from home

    Cognitiv

    Bellevue, WA
    14 days ago
  •  ...DevOps Engineer/ Site Reliability Engineer We are seeking a skilled DevOps Engineer with SRE capabilities to join our team in Seattle, WA....  ...registries, kaniko, devcontainers). Manage and optimize Kubernetes environments from the application development side (CKAD certification... 

    Staffing the Universe

    Seattle, WA
    4 days ago
  • $135k - $200k

    A leading technology firm based in Seattle seeks a Software Engineer to author strategies for containerization within its software platforms...  ...in software development and a strong background with Kubernetes and API design. This position also encompasses building scalable... 

    Palantir

    Seattle, WA
    2 days ago
  •  ...Enterprise Kubernetes Senior Platform Engineer We are an innovative performance apparel company for yoga, running, training, and other athletic pursuits. Setting the bar in technical fabrics and functional design, we create transformational products and experiences... 

    Samprasoft

    Seattle, WA
    7 hours ago
  • $177.57k - $248.59k

    Site Reliability Engineering - Sr. Software Development Engineer Implement and manage the infrastructure for rapid development and deployment of...  ...Origin and beyond. At Blue Origin we rely heavily on Kubernetes for rapid software development and deployment. A candidate... 
    Permanent employment
    Temporary work
    Local area

    jobs.frontdoordefense.com - Jobboard

    Seattle, WA
    3 days ago
  • Role Overview We are seeking a high-caliber Site Reliability Engineer (SRE) to join our Forward Engineering team. You will be the guardian of...  ...Scalability: Architect and manage auto-scaling strategies for Kubernetes (GKE) to handle fluctuating workloads during model... 
    Local area

    Tiger Analytics

    Seattle, WA
    4 days ago
  • $122.3k - $158.5k

     ...Canada Kirkland Washington United States of America Senior Site Reliability Engineer (SRE) - SPEAR Electronic Arts is looking for a Senior Site...  ...both Linux and Windows Deep experience with Kubernetes and cloud infrastructure (AWS preferred) Strong experience... 
    Full time

    Electronic Arts

    Kirkland, WA
    5 days ago
  • APPIT Software Solutions is hiring a Senior Site Reliability Engineer (SRE) in Seattle, USA . Lead site reliability engineering efforts for large...  ...metrics, logs, traces, and profiling at scale Advanced Kubernetes operations experience including multi-cluster management... 
    Flexible hours

    Appit LLC

    Seattle, WA
    5 days ago
  • $143.7k - $194.4k

     ...are looking for a Software Development Engineer to join the EKS KCP Scalability team and...  ...workloads on the planet - experience a reliable, performant control plane. This is not...  ...will work across the full stack: from the Kubernetes API server process and upstream... 
    Internship
    Flexible hours
    Shift work

    Amazon

    Seattle, WA
    7 hours ago
  •  ...and is responsible for the reliability, performance, security, and...  ...databases invisible: product engineers should be able to provision,...  ...What you'll do As a Senior/Staff Software Engineer on the Database...  ...-as-code (Terraform), Kubernetes, and Docker. Demonstrated track... 
    Worldwide

    Airwallex-

    Seattle, WA
    3 days ago
  • $70 - $80 per hour

     ...leading organization in the technology sector, is seeking a Sr. Site Reliability Engineer to join their team. As a Sr. Site Reliability Engineer,...  ...Azure resource provisioning Deep practical knowledge of Kubernetes internals: scheduling, networking, RBAC, resource... 
    Weekly pay
    Temporary work
    Flexible hours
    3 days per week

    ManpowerGroup Global, Inc.

    Seattle, WA
    1 day ago
  • $135k - $154k

     ...mission to Protect Life. As the APX platform engineering organization works on our CloudNet team...  ...and maintain the high quality and reliability that our customers demand. You will...  ...Experience operating components or services in Kubernetes clusters at scale. Experience in one... 

    Accreditation Council for Graduate Medical Education

    Seattle, WA
    20 hours ago
  •  ...industry player is seeking a skilled support engineer to enhance system stability and...  ...systems, and support applications running on Kubernetes and cloud platforms. Your expertise...  ...impact on operational excellence and system reliability. #J-18808-Ljbffr TechDigital Group

    TechDigital Group

    Bellevue, WA
    3 days ago
  • $139k - $242k

     ...Senior Software Engineer, Server Fleet Infrastructure Livingston...  ...leveraging technologies like gRPC and Kubernetes CRDs/Controllers/Operators....  ...to the company's delivery of reliable and efficient infrastructure....  ...problems of scale for multi-site deployment and management of... 
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Bellevue, WA
    3 days ago
  • Overview We are looking for a skilled Site Reliability Engineer (SRE) with solid experience in AWS cloud infrastructure to join our growing engineering...  ...as code using Terraform or CloudFormation. Manage Kubernetes (EKS) clusters and containerized workloads. Develop... 

    TechDigital Group

    Bellevue, WA
    4 days ago
  •  ...world's most complex and mission-critical systems. As a Site Reliability Engineer III - DevOps Engineer at JPMorgan Chase within the Commercial...  ..., manage, and scale containerized applications using Kubernetes (EKS) and ECS. Develop and maintain infrastructure as code... 

    JPMorgan Chase & Co.

    Seattle, WA
    5 days ago
  • About the job We are looking for a senior site reliability engineer to join the Cloud FinOps team at Hopper. We manage a large infrastructure...  ...providers, preferably Google Cloud SQL knowledge Containers, Kubernetes, and related tooling like Kustomize and Helm Service Mesh,... 
    Remote job
    Work from home
    Sleeping nights

    Hopper

    Seattle, WA
    5 days ago
  • IBM Computing is seeking a Staff Software Engineer for its Secure Compute Platform team. This role involves defining the technical direction...  ...public clouds. You will leverage your extensive experience in Kubernetes and Go, while mentoring engineers and ensuring operational... 

    IBM Computing

    Seattle, WA
    5 days ago
  • $106.61k - $284.28k

     ...Alliance for Career Enhancement is seeking a seasoned Cloud Platform Engineer to lead the maintenance and evolution of our high-availability cloud infrastructure. This role requires expertise in Kubernetes and Azure, along with experience in Terraform and Ansible for... 

    Hispanic Alliance for Career Enhancement

    Seattle, WA
    5 days ago
  • $194k - $267k

     ...to you. We are seeking a highly technical Observability Site Reliability Engineer with a specialty in Google Cloud, to own and expand our...  ...TCP/IP, DNS, Load Balancing), and container orchestration (Kubernetes/GKE). Problem Solving: A data-driven approach to debugging... 
    Permanent employment
    Full time
    Work at office
    Local area
    Flexible hours

    Okta

    Bellevue, WA
    more than 2 months ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Site Reliability Engineer - Kubernetes. Be the first to apply!