Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Site Reliability Engineer - Kubernetes

$194k - $267k

Okta

Secure Every Identity, from AI to Human

Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.

This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.

Workforce Identity Cloud

Okta Workforce Identity Cloud (WIC) provides easy, secure access for your workforce so you can focus on other strategic priorities—like reducing costs, and doing more for your customers.

If you like to be challenged and have a passion for solving large-scale automation, testing, and tuning problems, we would love to hear from you. The ideal candidate is someone who exemplifies the ethics of, “If you have to do something more than once, automate it” and who can rapidly self-educate on new concepts and tools.

Position Overview:

The Site Reliability Engineer (SRE) will play a key role in building and managing Kubernetes platforms that support cloud-native applications and services. This position focuses on architecting and managing reliable, scalable, and secure Kubernetes-based platforms on AWS, ensuring high availability and performance while optimizing costs and automation. The ideal candidate will have hands-on experience with AWS infrastructure, Kubernetes platform creation, Helm charts, Karpenter scaling, and Istio service mesh.

Key Responsibilities:

  • Kubernetes Platform Creation: Design, implement, and maintain highly available, scalable, and fault-tolerant Kubernetes platforms. Ensure clusters are optimized for production workloads, providing high resilience and operational efficiency.
  • AWS Infrastructure Management: Build, manage, and optimize AWS cloud infrastructure, including EKS,ECS, S3, VPCs, RDS, IAM, and more. Implement best practices for cost management, scaling, and security within AWS.
  • Helm Management: Utilize Helm to automate and streamline the deployment of applications and services to Kubernetes clusters. Create, maintain, and manage Helm charts for production-ready deployments.
  • Karpenter Implementation: Implement and manage Karpenter to dynamically scale Kubernetes clusters in response to workload demands. 
  • Istio Service Mesh Management: Configure and manage Istio to provide service-to-service communication, security, and observability within the Kubernetes clusters. Enable fine-grained traffic management, service discovery, and policy enforcement.
  • Platform Automation & Scaling: Automate the deployment, scaling, and management of infrastructure and applications. Work with CI/CD pipelines to ensure a seamless flow from development to production with minimal downtime.
  • Incident Management & Troubleshooting: Respond to incidents, troubleshoot, and resolve system issues related to performance, availability, and security in a timely and effective manner.
  • Security & Compliance: Design and implement secure cloud infrastructure with appropriate access controls, network security, and compliance frameworks.
  • Documentation & Knowledge Sharing: Create and maintain detailed documentation for Kubernetes platform setup, operational procedures, and best practices. Promote knowledge sharing across teams.

Required Qualifications:

  • 4+ years of experience with Kubernetes/Helm;
  • 4+ years of Experience with Terraform.
  • 5+ years of Experience with AWS
  • Experience with multi-region cloud environments.
  • Proven experience with AWS (EC2, RDS, S3, CloudFormation, IAM, etc.) and solid understanding of cloud-native architectures.
  • Strong expertise in Kubernetes platform creation, management, and optimisation (e.g., setting up highly available clusters, networking, and storage).
  • Hands-on experience with Helm for Kubernetes application deployment and management.
  • Practical experience with Karpenter for dynamic scaling of Kubernetes clusters and optimising resource usage.
    Expertise in managing and securing Istio for service mesh, including traffic management, security, and observability features.
  • Proficiency in CI/CD pipelines and automation tools (e.g., Jenkins, GitLab, CircleCI, Terraform, Ansible, Spinnaker).
    Strong scripting and automation skills in Python, Bash, or Go for infrastructure management and platform automation.
  • Experience with monitoring, logging, and alerting tools such as Prometheus, Grafana, CloudWatch, and ELK Stack.

Preferred Qualifications:

  • Understanding of security best practices for cloud platforms and Kubernetes (e.g., role-based access control (RBAC), encryption, and compliance frameworks).
  • Familiarity with Docker and containerization principles.
  • Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent professional experience).
  • Certifications (Preferred): CKA (Certified Kubernetes Administrator), CKAD (Certified Kubernetes Application Developer), or AWS Certified DevOps Engineer are highly desirable.

Additional requirements:

  • This position requires the ability to access federal environments and/or have access to protected federal data.  As a condition of employment for this position, the successful candidate must be able to submit documentation establishing U.S. Person status (e.g. a U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee. 22 CFR 120.15) upon hire.
  • Requires in-person onboarding and travel to our San Francisco, CA HQ office or our Chicago office during the first week of employment.

#LI-Hybrid

#LI-LSS1

requisition ID- (P16373_3396241)

The annual base salary range for this position for candidates located in the San Francisco Bay area is between: $194,000—$267,000 USD

Below is the annual base salary range for candidates located in California (excluding San Francisco Bay Area), Colorado, Illinois, New York and Washington. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location. In addition, Okta offers equity (where applicable), bonus, and benefits, including health, dental and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies. To learn more about our Total Rewards program please visit: .   

The annual base salary range for this position for candidates located in California (excluding San Francisco Bay Area), Colorado, Illinois, New York, and Washington is between: $174,000—$214,000 USD


The Okta Experience

We are intentional about connection. Our global community, spanning over 20 offices worldwide, is united by a drive to innovate. Your journey begins with an immersive, in-person onboarding experience designed to accelerate your impact and connect you to our mission and team from day one.

Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and convictions records, consistent with applicable laws.

If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding please  use this Form to request an accommodation.

Notice for New York City Applicants & Employees: Okta may use Automated Employment Decision Tools (AEDT), as defined by New York City Local Law 144, that use artificial intelligence, machine learning, or other automated processes to assist in our recruitment and hiring process. In accordance with NYC Local Law 144, if you are an applicant or employee residing in New York City, please  click here to view our full NYC AEDT Notice.

Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Personnel and Job Candidate Privacy Notice at  .
Vacancy posted more than 2 months ago
Similar jobs that could be interesting for youBased on the Staff Site Reliability Engineer - Kubernetes in Bellevue, WA vacancy
  • $125k - $150k

    Site Reliability Engineer, Kubernetes Platform (Starshield) Redmond, WA SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies... 
    Suggested
    Permanent employment
    Temporary work
    Work at office
    Immediate start
    Monday to Friday
    Weekend work

    SPACE EXPLORATION TECHNOLOGIES CORP

    Redmond, WA
    5 days ago
  • $194k - $267k

     ...and who can rapidly self-educate on new concepts and tools. Position Overview: The Site Reliability Engineer (SRE) will play a key role in building and managing Kubernetes platforms that support cloud-native applications and services. This position focuses on architecting... 
    Suggested
    Permanent employment
    Work at office
    Local area
    Worldwide
    Flexible hours

    Okta

    Bellevue, WA
    more than 2 months ago
  •  ...containerization technologies such as Docker and orchestration tools like Kubernetes. Experience with infrastructure as code tools such as...  ...related to cloud platforms (AWS Certified DevOps Engineer, Azure DevOps Engineer Expert, etc.) is a plus. Experience with... 
    Suggested

    TechDigital Group

    Bellevue, WA
    4 days ago
  • IBM Computing seeks a Senior Software Engineer to join the Compute Platform team. You will be a key leader in developing cloud-native...  ...solutions that power diverse workloads. With deep expertise in Kubernetes and Go, you will drive technical initiatives and mentor team... 
    Suggested

    IBM Computing

    Bellevue, WA
    4 days ago
  • $163.62k - $212.71k

     ...operating mainly in AWS with multiple Kubernetes clusters and thousands of servers. We...  ...platforms, and processes that improve our engineering teams’ productivity and streamline the...  ...seasoned and strategic Lead/Principal Site Reliability Engineer to drive the reliability,... 
    Suggested
    Permanent employment
    Full time
    Part time
    Work experience placement
    Work at office
    Local area
    Immediate start
    Remote work
    Work from home
    Flexible hours
    Shift work
    3 days per week
    1 day per week

    iSpot.tv, Inc.

    Bellevue, WA
    4 days ago
  • Site Reliability Engineer Your role and responsibilities Manage deployments of Apptio services to AWS GovCloud. Monitor KPIs of services running...  ...Familiarity with container technologies (e.g., Kubernetes, Docker) Familiarity with bash, python, or other scripting... 
    Temporary work
    Remote work

    IBM

    Bellevue, WA
    5 days ago
  • $154.56k - $193.2k

     ...challenges on a global scale. We are actively seeking passionate AI Engineers with hands‑on expertise across a range of domains, including...  ...requirements. Background with container platforms such as Kubernetes. Strong analytical skills with a bias for action. Strong... 
    Work at office
    Remote work
    Flexible hours

    AI Chopping Block

    Bellevue, WA
    3 days ago
  •  ...building the AI Cloud of the future. We are seeking a Staff Engineer to help our development of our Managed Kubernetes platform. Think GKE, but purpose-built for AI...  ...Native infrastructure to build systems that are reliable, performant, and elegantly simple for our customers... 
    Work at office
    Local area
    Immediate start
    Work from home
    Flexible hours

    AI Chopping Block, Inc.

    Bellevue, WA
    5 days ago
  • $144k - $180k

     ...experienced, detail‑oriented Senior Platform Engineer to join our growing Edge team. You will...  ..., optimization, and operation of our Kubernetes‑based platform supporting our Galleon...  ...and cloud infrastructure, ensuring the reliability of our distributed computing platform.... 
    Work at office
    Local area
    Remote work
    Flexible hours

    Garuda Ventures

    Bellevue, WA
    6 days ago
  •  ...Energy Management Services, LLC is seeking a Software Integration & Deployment Engineer to support the deployment of GridOS Plan solutions. The role requires hands-on expertise with Kubernetes, Linux systems, and experience in software deployment and systems integration.... 

    PS0178 GE Energy Management Services, LLC

    Bellevue, WA
    4 days ago
  •  ...APPIT Software Solutions is hiring a Senior Site Reliability Engineer (SRE) in Seattle, USA . Lead site reliability engineering efforts for large...  ...metrics, logs, traces, and profiling at scale Advanced Kubernetes operations experience including multi-cluster management and... 
    Flexible hours

    Appit LLC

    Seattle, WA
    2 days ago
  •  ...Role Overview We are seeking a high-caliber Site Reliability Engineer (SRE) to join our Forward Engineering team. You will be the guardian of...  ...Scalability: Architect and manage auto-scaling strategies for Kubernetes (GKE) to handle fluctuating workloads during model... 
    Local area

    Tiger Analytics

    Seattle, WA
    2 days ago
  • Madrona Venture Labs is looking for a Senior Forward Deployed Engineer to work on their enterprise AI platform. This hybrid role is based...  ...deployment of AI agents. The ideal candidate should have deep Kubernetes expertise and a strong operator literacy. They will also need... 

    Madrona Venture Labs

    Bellevue, WA
    4 days ago
  •  ...industry player is seeking a skilled support engineer to enhance system stability and...  ...systems, and support applications running on Kubernetes and cloud platforms. Your expertise...  ...impact on operational excellence and system reliability. #J-18808-Ljbffr TechDigital Group

    TechDigital Group

    Bellevue, WA
    4 days ago
  • Ll Oefentherapie seeks a Senior IC5 Software Engineer in Seattle to lead the Oracle Kubernetes Engine team. You will tackle complex, large-scale cloud products while enhancing system reliability and operational excellence. The ideal candidate should possess advanced Kubernetes... 

    Ll Oefentherapie

    Seattle, WA
    6 days ago
  • $134.96k - $188.95k

    ## Ground System Site Reliability Engineer IIApplylocations: Greater Seattle Areatime type: Full timeposted on: Posted Yesterdayjob requisition...  ...cloud platforms such as AWS/Azure/GCP* Experience with Kubernetes and common service meshes like Istio/Cilium/Linkerd* Proficient... 
    Permanent employment
    Temporary work
    Local area

    Blue Origin LLC

    Seattle, WA
    3 days ago
  • NeuroNav is seeking a Senior Platform Engineer in Seattle, WA. This role involves building applications and managing infrastructure for our Kubernetes-based hosting platform in AWS, ensuring security and high availability. The ideal candidate has strong experience with... 

    NeuroNav

    Seattle, WA
    4 days ago
  • $184k - $356.5k

    NVIDIA Corporation is seeking a Senior Software Engineer for the Accelerated Kubernetes Runtime Team. You will design automation systems for seamless installation and management of runtime packages, optimizing them for NVIDIA's GPU architecture. Ideal candidates have extensive... 

    NVIDIA Corporation

    Seattle, WA
    2 days ago
  • $125k - $175k

    United States Digital Space LLC is seeking a Kubernetes Platform Infrastructure Engineer for their Starlink project. Your role involves developing automation...  .... You will need a strong background in Site Reliability Engineering and experience with tools like Terraform... 

    United States Digital Space LLC

    Redmond, WA
    2 days ago
  • Boeing is seeking a Senior Cloud Platform Kubernetes Specialist in Seattle, WA. In this role, you will develop a cloud-agnostic Kubernetes platform, manage CI/CD pipelines, and ensure system performance using tools like Prometheus and Grafana. The ideal candidate should... 

    Boeing

    Seattle, WA
    4 days ago
  • $177.57k - $248.59k

    Site Reliability Engineering - Sr. Software Development Engineer Implement and manage the infrastructure for rapid development and deployment of...  ...Origin and beyond. At Blue Origin we rely heavily on Kubernetes for rapid software development and deployment. A candidate... 
    Permanent employment
    Temporary work
    Local area

    jobs.frontdoordefense.com - Jobboard

    Seattle, WA
    4 days ago
  • $139.5k - $258.1k

     ...Software and Services Apple Services Engineering team is one of the most exciting examples...  ...Service Infrastructure team, as a Site Reliability Engineer, to help support and scale cloud...  ...on open-source software, such as Kubernetes, Cassandra, Zookeeper, Kafka, Redis, etc... 
    Relocation

    Apple Inc.

    Seattle, WA
    4 days ago
  • $134.25k - $214.8k

     ...where you matter. Your Impact Are you an engineer who gets excited about the challenge...  ...the Observability team within Axon's Site Reliability organization - a focused team...  ...+ Alloy), expanding use cases beyond Kubernetes event logs and reducing organizational... 
    Work experience placement
    Work at office
    Remote work

    Koitecc Solutions

    Seattle, WA
    2 days ago
  •  ...apply. The Role As a Senior Platform Engineer, you are a champion for DevOps and SRE...  ...You Will Be Doing Improving production reliability and system resilience within an SRE scoped...  ...team, not just yourself Strong Kubernetes and broader ecosystem fundamentals Cloud... 
    Flexible hours

    Megaport

    Seattle, WA
    6 days ago
  • $135k - $154k

     ...mission to Protect Life. As the APX platform engineering organization works on our CloudNet team...  ...and maintain the high quality and reliability that our customers demand. You will...  ...Experience operating components or services in Kubernetes clusters at scale. Experience in one... 

    Accreditation Council for Graduate Medical Education

    Seattle, WA
    2 days ago
  • $150k - $180k

     ...improve cloud infrastructure reliability, scalability, and...  ...platforms and tools that enable engineering teams to provision services...  ...across Azure and AWS, including Kubernetes platforms, networking, and identity...  ..., cloud infrastructure, or site reliability engineering.... 

    Axon Enterprise

    Seattle, WA
    5 days ago
  • $139.5k - $258.1k

     ...States Software and Services The Apple Service Engineering - Data Streaming SRE team is looking for Site Reliability Engineers with experience developing processes,...  ...across bare metal, and virtualized (EC2), Kubernetes platforms Prepare alert handling procedures, run... 
    Relocation

    Apple Inc.

    Seattle, WA
    5 days ago
  •  ...world's most complex and mission-critical systems. As a Site Reliability Engineer III - DevOps Engineer at JPMorgan Chase within the Commercial...  ..., manage, and scale containerized applications using Kubernetes (EKS) and ECS. Develop and maintain infrastructure as code... 

    Next Frontier Capital

    Seattle, WA
    2 days ago
  • US staffing Inc is seeking an Azure Platform Engineer to work onsite in Seattle, WA. The role involves designing and managing infrastructure...  ...on AI/ML platforms. Ideal candidates should have expertise in Kubernetes and containerization solutions, with a strong focus on best... 

    US staffing Inc

    Seattle, WA
    6 days ago
  • Edera in Seattle is looking for a Staff Forward Deployed Engineer who will be integral to customer relationships. You will work closely with clients...  ...to maximize their use of Edera's platform, managing Kubernetes platforms and architecting secure solutions. The ideal candidate... 

    Edera

    Seattle, WA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Site Reliability Engineer - Kubernetes. Be the first to apply!