Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Site Reliability Engineer - Kubernetes

$194k - $267k

Okta

Secure Every Identity, from AI to Human

Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.

This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.

Workforce Identity Cloud

Okta Workforce Identity Cloud (WIC) provides easy, secure access for your workforce so you can focus on other strategic priorities—like reducing costs, and doing more for your customers.

If you like to be challenged and have a passion for solving large-scale automation, testing, and tuning problems, we would love to hear from you. The ideal candidate is someone who exemplifies the ethics of, “If you have to do something more than once, automate it” and who can rapidly self-educate on new concepts and tools.

Position Overview:

The Site Reliability Engineer (SRE) will play a key role in building and managing Kubernetes platforms that support cloud-native applications and services. This position focuses on architecting and managing reliable, scalable, and secure Kubernetes-based platforms on AWS, ensuring high availability and performance while optimizing costs and automation. The ideal candidate will have hands-on experience with AWS infrastructure, Kubernetes platform creation, Helm charts, Karpenter scaling, and Istio service mesh.

Key Responsibilities:

  • Kubernetes Platform Creation: Design, implement, and maintain highly available, scalable, and fault-tolerant Kubernetes platforms. Ensure clusters are optimized for production workloads, providing high resilience and operational efficiency.
  • AWS Infrastructure Management: Build, manage, and optimize AWS cloud infrastructure, including EKS,ECS, S3, VPCs, RDS, IAM, and more. Implement best practices for cost management, scaling, and security within AWS.
  • Helm Management: Utilize Helm to automate and streamline the deployment of applications and services to Kubernetes clusters. Create, maintain, and manage Helm charts for production-ready deployments.
  • Karpenter Implementation: Implement and manage Karpenter to dynamically scale Kubernetes clusters in response to workload demands. 
  • Istio Service Mesh Management: Configure and manage Istio to provide service-to-service communication, security, and observability within the Kubernetes clusters. Enable fine-grained traffic management, service discovery, and policy enforcement.
  • Platform Automation & Scaling: Automate the deployment, scaling, and management of infrastructure and applications. Work with CI/CD pipelines to ensure a seamless flow from development to production with minimal downtime.
  • Incident Management & Troubleshooting: Respond to incidents, troubleshoot, and resolve system issues related to performance, availability, and security in a timely and effective manner.
  • Security & Compliance: Design and implement secure cloud infrastructure with appropriate access controls, network security, and compliance frameworks.
  • Documentation & Knowledge Sharing: Create and maintain detailed documentation for Kubernetes platform setup, operational procedures, and best practices. Promote knowledge sharing across teams.

Required Qualifications:

  • 4+ years of experience with Kubernetes/Helm;
  • 4+ years of Experience with Terraform.
  • 5+ years of Experience with AWS
  • Experience with multi-region cloud environments.
  • Proven experience with AWS (EC2, RDS, S3, CloudFormation, IAM, etc.) and solid understanding of cloud-native architectures.
  • Strong expertise in Kubernetes platform creation, management, and optimisation (e.g., setting up highly available clusters, networking, and storage).
  • Hands-on experience with Helm for Kubernetes application deployment and management.
  • Practical experience with Karpenter for dynamic scaling of Kubernetes clusters and optimising resource usage.
    Expertise in managing and securing Istio for service mesh, including traffic management, security, and observability features.
  • Proficiency in CI/CD pipelines and automation tools (e.g., Jenkins, GitLab, CircleCI, Terraform, Ansible, Spinnaker).
    Strong scripting and automation skills in Python, Bash, or Go for infrastructure management and platform automation.
  • Experience with monitoring, logging, and alerting tools such as Prometheus, Grafana, CloudWatch, and ELK Stack.

Preferred Qualifications:

  • Understanding of security best practices for cloud platforms and Kubernetes (e.g., role-based access control (RBAC), encryption, and compliance frameworks).
  • Familiarity with Docker and containerization principles.
  • Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent professional experience).
  • Certifications (Preferred): CKA (Certified Kubernetes Administrator), CKAD (Certified Kubernetes Application Developer), or AWS Certified DevOps Engineer are highly desirable.

Additional requirements:

  • This position requires the ability to access federal environments and/or have access to protected federal data.  As a condition of employment for this position, the successful candidate must be able to submit documentation establishing U.S. Person status (e.g. a U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee. 22 CFR 120.15) upon hire.
  • Requires in-person onboarding and travel to our San Francisco, CA HQ office or our Chicago office during the first week of employment.

#LI-Hybrid

#LI-LSS1

requisition ID- (P16373_3396241)

The annual base salary range for this position for candidates located in the San Francisco Bay area is between: $194,000—$267,000 USD

Below is the annual base salary range for candidates located in California (excluding San Francisco Bay Area), Colorado, Illinois, New York and Washington. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location. In addition, Okta offers equity (where applicable), bonus, and benefits, including health, dental and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies. To learn more about our Total Rewards program please visit: .   

The annual base salary range for this position for candidates located in California (excluding San Francisco Bay Area), Colorado, Illinois, New York, and Washington is between: $174,000—$214,000 USD


The Okta Experience

We are intentional about connection. Our global community, spanning over 20 offices worldwide, is united by a drive to innovate. Your journey begins with an immersive, in-person onboarding experience designed to accelerate your impact and connect you to our mission and team from day one.

Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and convictions records, consistent with applicable laws.

If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding please  use this Form to request an accommodation.

Notice for New York City Applicants & Employees: Okta may use Automated Employment Decision Tools (AEDT), as defined by New York City Local Law 144, that use artificial intelligence, machine learning, or other automated processes to assist in our recruitment and hiring process. In accordance with NYC Local Law 144, if you are an applicant or employee residing in New York City, please  click here to view our full NYC AEDT Notice.

Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Personnel and Job Candidate Privacy Notice at  .
Vacancy posted more than 2 months ago
Similar jobs that could be interesting for youBased on the Staff Site Reliability Engineer - Kubernetes in Bellevue, WA vacancy
  •  ...containerization technologies such as Docker and orchestration tools like Kubernetes. Experience with infrastructure as code tools such as...  ...related to cloud platforms (AWS Certified DevOps Engineer, Azure DevOps Engineer Expert, etc.) is a plus. Experience with... 
    Suggested

    TechDigital Group

    Bellevue, WA
    4 days ago
  • IBM Computing seeks a Senior Software Engineer to join the Compute Platform team. You will be a key leader in developing cloud-native...  ...solutions that power diverse workloads. With deep expertise in Kubernetes and Go, you will drive technical initiatives and mentor team... 
    Suggested

    IBM Computing

    Bellevue, WA
    4 days ago
  •  ...Energy Management Services, LLC is seeking a Software Integration & Deployment Engineer to support the deployment of GridOS Plan solutions. The role requires hands-on expertise with Kubernetes, Linux systems, and experience in software deployment and systems integration.... 
    Suggested

    PS0178 GE Energy Management Services, LLC

    Bellevue, WA
    4 days ago
  • $157.6k - $197k

     ...experienced, and detail-oriented Senior Platform Engineer to join our growing Edge team. You...  ..., optimization, and operation of our Kubernetes-based platform supporting our Galleon...  ...cloud infrastructure, ensuring the reliability of our distributed computing platform.... 
    Suggested
    Full time
    Work at office
    Local area
    Flexible hours

    Armada

    Bellevue, WA
    4 days ago
  •  ...APPIT Software Solutions is hiring a Senior Site Reliability Engineer (SRE) in Seattle, USA . Lead site reliability engineering efforts for large...  ...metrics, logs, traces, and profiling at scale Advanced Kubernetes operations experience including multi-cluster management and... 
    Suggested
    Flexible hours

    Appit LLC

    Seattle, WA
    2 days ago
  •  ...Role Overview We are seeking a high-caliber Site Reliability Engineer (SRE) to join our Forward Engineering team. You will be the guardian of...  ...Scalability: Architect and manage auto-scaling strategies for Kubernetes (GKE) to handle fluctuating workloads during model... 
    Local area

    Tiger Analytics

    Seattle, WA
    2 days ago
  •  ...industry player is seeking a skilled support engineer to enhance system stability and...  ...systems, and support applications running on Kubernetes and cloud platforms. Your expertise...  ...impact on operational excellence and system reliability. #J-18808-Ljbffr TechDigital Group

    TechDigital Group

    Bellevue, WA
    4 days ago
  • Ll Oefentherapie seeks a Senior IC5 Software Engineer in Seattle to lead the Oracle Kubernetes Engine team. You will tackle complex, large-scale cloud products while enhancing system reliability and operational excellence. The ideal candidate should possess advanced Kubernetes... 

    Ll Oefentherapie

    Seattle, WA
    6 days ago
  • Boeing is seeking a Senior Cloud Platform Kubernetes Specialist in Seattle, WA. In this role, you will develop a cloud-agnostic Kubernetes platform, manage CI/CD pipelines, and ensure system performance using tools like Prometheus and Grafana. The ideal candidate should... 

    Boeing

    Seattle, WA
    4 days ago
  • $139.5k - $258.1k

     ...Software and Services Apple Services Engineering team is one of the most exciting examples...  ...Service Infrastructure team, as a Site Reliability Engineer, to help support and scale cloud...  ...on open-source software, such as Kubernetes, Cassandra, Zookeeper, Kafka, Redis, etc... 
    Relocation

    Apple Inc.

    Seattle, WA
    4 days ago
  • $134.25k - $214.8k

     ...where you matter. Your Impact Are you an engineer who gets excited about the challenge...  ...the Observability team within Axon's Site Reliability organization - a focused team...  ...+ Alloy), expanding use cases beyond Kubernetes event logs and reducing organizational... 
    Work experience placement
    Work at office
    Remote work

    Koitecc Solutions

    Seattle, WA
    2 days ago
  • $139.5k - $258.1k

     ...States Software and Services The Apple Service Engineering - Data Streaming SRE team is looking for Site Reliability Engineers with experience developing processes,...  ...across bare metal, and virtualized (EC2), Kubernetes platforms Prepare alert handling procedures, run... 
    Relocation

    Apple Inc.

    Seattle, WA
    5 days ago
  •  ...apply. The Role As a Senior Platform Engineer, you are a champion for DevOps and SRE...  ...You Will Be Doing Improving production reliability and system resilience within an SRE scoped...  ...team, not just yourself Strong Kubernetes and broader ecosystem fundamentals Cloud... 
    Flexible hours

    Megaport

    Seattle, WA
    6 days ago
  • $184k - $287.5k

    ## Senior Software Engineer - Accelerated Kubernetes Runtime TeamApplylocations: US, WA, Remote: US, Remotetime type: Full timeposted on: Posted 3...  ...beyond, ensuring that AI researchers and developers have reliable, secure, and performant infrastructure at their... 
    Remote work

    NVIDIA Corporation

    Seattle, WA
    2 days ago
  • Edera in Seattle is looking for a Staff Forward Deployed Engineer who will be integral to customer relationships. You will work closely with clients...  ...to maximize their use of Edera's platform, managing Kubernetes platforms and architecting secure solutions. The ideal candidate... 

    Edera

    Seattle, WA
    4 days ago
  • $184k - $287.5k

     ...workloads. We are a group of forward‑thinking engineers tackling some of the globe’s toughest...  ...deep expertise in distributed systems, Kubernetes, containers, and systems performance and...  ...plane and related projects to enable reliable operation at hyperscale cluster sizes, doing... 
    Full time
    Remote work

    NVIDIA

    Seattle, WA
    3 days ago
  • $147k - $202k

     ...Position Overview: We are seeking a highly technical Staff Observability Site Reliability Engineer with a specialty in Splunk to own and evolve our...  ...DNS, Load Balancing), and container orchestration (Kubernetes/EKS). Problem Solving: A data-driven approach to... 
    Permanent employment
    Work at office
    Local area
    Worldwide
    Flexible hours

    Okta

    Bellevue, WA
    a month ago
  • $163.62k - $212.71k

     ...operating mainly in AWS with multiple Kubernetes clusters and thousands of servers. We...  ...platforms, and processes that improve our engineering teams' productivity and streamline the...  ...seasoned and strategic Lead/Principal Site Reliability Engineer to drive the reliability,... 
    Full time
    Part time
    Work experience placement
    Work at office
    Local area
    Immediate start
    Remote work
    Work from home
    Flexible hours
    Shift work
    3 days per week
    1 day per week

    iSpot

    Bellevue, WA
    10 days ago
  • $140.6k - $173.1k

    Staff Software Engineer - Java | Kafka | Kubernetes page is loaded## Staff Software Engineer - Java | Kafka | Kuberneteslocations: New Jersey - Remote Office...  ...*** Design, develop, and implement scalable and reliable software solutions using Kafka, ElasticSearch, and other... 
    Work at office
    Remote work

    WEX

    Seattle, WA
    5 days ago
  • $139.5k - $258.1k

    A leading technology company in Seattle is seeking a Senior Software Engineer to work on Kubernetes clusters. You will partner with teams across the organization to enhance container orchestration features and improve service performance. The ideal candidate has over 5... 

    Apple Inc.

    Seattle, WA
    2 days ago
  • $140k - $200k

     ...– Speechify has no office. These include frontend and backend engineers, AI research scientists, and others from Amazon, Microsoft, and...  ...: Proficiency in deploying high availability applications on Kubernetes What We Offer A dynamic environment where your contributions... 
    Work at office

    Speechify

    Bellevue, WA
    4 days ago
  • United States Digital Space LLC seeks a Staff Software Engineer to design and develop cutting-edge data streaming platforms in Bellevue, WA. This role emphasizes collaboration with product and ML teams, focusing on creating scalable event-processing systems. The ideal candidate... 

    United States Digital Space LLC

    Bellevue, WA
    6 days ago
  •  ...Software Engineer At T-Mobile, we invest in YOU! Our Total Rewards Package ensures that employees get the same big love we give our...  ..., CSS SQL/NoSQL Database Devops - Gitlab CI, Docker, Kubernetes, AWS Strong knowledge of Agentic AI Strong knowledge of... 
    Full time
    Temporary work
    Part time
    Work experience placement
    Local area
    Flexible hours

    T Mobile US

    Bellevue, WA
    6 days ago
  • $91k - $162k

     ...maintainability. We’re looking for both front‑end and back‑end engineers, while the ideal profile is full‑stack. Key Responsibilities...  ...AWS, GCP, or Azure), infrastructure‑as‑code tools (Terraform), Kubernetes tooling (Helm), and observability stacks (Prometheus and Grafana... 
    Local area

    Broadcom Corporation

    Bellevue, WA
    2 days ago
  • Pay-i is seeking a skilled DevOps Engineer based in Seattle to design and manage cloud infrastructures across AWS, Azure, and GCP. This...  .... The ideal candidate has strong experience with Terraform, Kubernetes, and Azure DevOps. Join an innovative team and contribute to enterprise... 

    PassFort

    Seattle, WA
    5 days ago
  •  ...accelerate data analytics for Data Lakehouses—independent of query engine or table format. Born from research at the Barcelona...  ...frameworks. Cloud‑based infrastructure spanning multiple providers. Kubernetes for orchestration, Terraform for infrastructure management, and... 
    Remote work
    Worldwide

    Qbeast Analytics Inc.

    Bellevue, WA
    2 days ago
  • $164.9k - $239.2k

    Senior Cloud Platform Kubernetes Specialist Location: Seattle, WA; Kent, WA; Tukwila, WA; Hazelwood, MO; Mesa, AZ; El Segundo, CA; Seal...  ...incident response and post‑mortem analysis to improve system reliability and performance. Stay updated with the latest trends and... 
    Flexible hours

    Boeing

    Seattle, WA
    4 days ago
  • $182k - $242k

     ...Senior Software Engineer, Applied AI CoreWeave is The Essential Cloud for AI™. Built...  ...frontends to backend services running on Kubernetes - while integrating AI/LLM capabilities...  ...gRPC) with a focus on performance and reliability ~ Experience developing and deploying... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Bellevue, WA
    4 days ago
  • $165k - $242k

     ...Senior Software Engineer, IAM New York, NY, Sunnyvale, CA, Bellevue, WA CoreWeave is...  ...members Preferred Experience with Kubernetes and a conceptual understanding of its major components. Experience building reliable and scalable platform services that... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Bellevue, WA
    2 days ago
  • $109k - $204k

     ...Software Engineer II, Developer Experience Livingon, NJ / New York, NY / Sunnyvale, CA...  ...retrieval, and version management that reliably serve engineering teams at scale. Identify...  ...how they're built. ~ Familiarity with Kubernetes basics, such as how workloads are... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Bellevue, WA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Site Reliability Engineer - Kubernetes. Be the first to apply!