Staff Site Reliability Engineer - Observability GCP
$194k - $267kOkta
Secure Every Identity, from AI to Human
Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence. This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.We are seeking a highly technical Observability Site Reliability Engineer with a specialty in Google Cloud, to own and expand our Observability ecosystem into GCP. In this role, you will move beyond simple monitoring to delivering a world class, comprehensive, scalable Observability Platform that enables our SRE teams and business partners. You will treat infrastructure as code —utilizing Terraform and strong coding proficiency in Go, Python, or Ruby —to automate the deployment of agents and collectors across complex distributed systems.
Key Responsibilities
- Automated Infrastructure: Design, build, and maintain scalable observability infrastructure using tools like Terraform.
- GCP Observabilty Engineering: Optimize the collection, processing, and storage of Observabilty data to ensure high reliability and low latency of our Splunk and Grafana services
- Incident Response: Participate in on-call rotations and lead post-incident reviews to drive systemic improvements and "observability-driven development."
- Automation: Eliminate "toil" by automating the deployment and scaling of observability agents and collectors.
Required Skills & Experience (The Essentials)
GKE: Minimum 5+ Experience scaling and managing observability in a Google Cloud platform. Visualization: Expertise in creating intuitive, actionable Splunk or Grafana dashboards that correlate data across multiple sources. SRE Mindset: Minimum 3+ years of experience in an SRE, DevOps, or Systems Engineering role with a focus on high-availability systems.
- Programming Proficiency: Strong coding skills in Python , Go for building internal tools and automating workflows.
- Distributed Systems: Deep understanding of Linux internals, networking (TCP/IP, DNS, Load Balancing), and container orchestration (Kubernetes/GKE).
- Problem Solving: A data-driven approach to debugging complex, cross-service performance bottlenecks.
Bonus Skills (The "Nice-to-Haves")
- Telemetry Standards: Hands-on experience with OpenTelemetry (OTel), Vector, or similar frameworks for instrumenting applications.
- Grafana Loki: Experience in migrating Splunk to Grafana Loki
Other Cloud Platforms: Experience managing observability native tools within AWS.
Additional requirements:
- This position requires the ability to access federal environments and/or have access to protected federal data. As a condition of employment for this position, the successful candidate must be able to submit documentation establishing U.S. Person status (e.g. a U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee. 22 CFR 120.15) upon hire.
#LI-MM
#LI-Hybrid
P24517_3387022
Below is the annual base salary range for candidates located in San Francisco Bay Area. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location. In addition, Okta offers equity (where applicable), bonus, and benefits, including health, dental and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies. To learn more about our Total Rewards program please visit: .
The annual base salary range for this position for candidates located in the San Francisco Bay area is between: $194,000—$267,000 USDThe Okta Experience
We are intentional about connection. Our global community, spanning over 20 offices worldwide, is united by a drive to innovate. Your journey begins with an immersive, in-person onboarding experience designed to accelerate your impact and connect you to our mission and team from day one.
Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and convictions records, consistent with applicable laws. If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding please use this Form to request an accommodation. Notice for New York City Applicants & Employees: Okta may use Automated Employment Decision Tools (AEDT), as defined by New York City Local Law 144, that use artificial intelligence, machine learning, or other automated processes to assist in our recruitment and hiring process. In accordance with NYC Local Law 144, if you are an applicant or employee residing in New York City, please click here to view our full NYC AEDT Notice.$99.6k - $223.4k
...to-end system design for scalability, reliability, and observability. Stay hands-on with coding,... ...debugging, and production delivery. Drive engineering excellence through code reviews and... ...Cloud experience (OCI, AWS, Azure, or GCP). ~ AI/ML or AIOps production...SuggestedFull timeTemporary workRemote workFlexible hours- A leading insurance company is seeking a Senior Engineer to drive innovation in building high-performance, low-maintenance platforms... .... The role requires deep technical expertise in open-source observability and experience with distributed systems, Docker, and Kubernetes...Suggested
$103.5k - $150k
...whole self. The Role and Team The Site Reliability Engineering organization at Medallia brings together... ...availability, and performance using observability and alerting platforms. Participate... ...infrastructure platforms such as AWS, OCI, or GCP. Demonstrated experience with Linux...SuggestedTemporary workWork experience placementLocal area3 days per week- ...This role requires regularly working on-site at customer locations in Arlington, VA... ...About The Role We are hiring a Site Reliability Engineer to join our Infrastructure & Security... ...by building the tools, processes, and observability that make "fast recovery" a reality....SuggestedRelocationRelocation package
$166k - $220k
ABOUT THE JOB As a site reliability engineer in Platform Discovery, you will solve a wide variety of problems involving networking, autonomy, systems... ..., Ansible). Experience with cloud platforms (Azure, AWS, GCP). Proficiency in containerization (Docker) and container orchestration...SuggestedFull timeWork experience placementRelocation package$45 per hour
...Description: Overview: • Seeking an experienced Observability and Monitoring Engineer to build and mature our enterprise-wide monitoring, logging... ...guidelines. • You will define SLOs/SLIs and reliability KPIs for critical services. • You will partner with scrum...Hourly payPermanent employment$100k - $215k
GEICO is seeking a Senior Engineer in Bethesda, Maryland to enhance their cloud platforms through innovative design and deployment. The role focuses on improving performance, automation, and observability within OpenStack-based environments. Ideal candidates will have...Flexible hours$126k - $248k
...you will partner with SRE leaders and engineers to scale the platform that underpins all... ...program execution, strengthen production reliability practices, and coordinate cross-... ...with Kubernetes, cloud networking, or observability stacks (metrics, logs, tracing, alerting...Local areaRemote workWorldwideFlexible hours- ...GCP Cloud Engineer Location: Washington, DC (Hybrid - 4 days/week; 1 day telework) Clearance: Active Secret Clearance required... ..., and improve system performance to ensure uptime, reliability, and SLA compliance . Implement federal cloud compliance...Full timeRemote work
$105k - $215k
GEICO is looking for a Senior Software Engineer to build the next-generation Release Platform and DevOps Tooling at their Bethesda, MD location. You will enhance software delivery workflows and mentor junior engineers, contributing to a collaborative environment. This...- ...Senior GCP Cloud Engineer LightFeather is seeking a Senior GCP Cloud Engineer who will play a critical role in designing, implementing, and maintaining cloud infrastructure solutions within Google Cloud Platform (GCP). This role requires expertise in infrastructure...Full timeContract workLocal area
$44k - $154k
...Management: Design, implement, and maintain scalable, reliable, and secure cloud infrastructure using GCP services. Automate cloud infrastructure... ...Manage and optimize GCP resources such as Compute Engine, Kubernetes Engine, Cloud Functions, and BigQuery to...Full time$113k - $188k
Dovel Technologies, Inc is looking for a highly skilled Senior DevOps / Cloud Engineer to support AWS workloads and establish GCP capabilities. This role requires deep expertise in cloud infrastructure and automation tools like Ansible and Python. Responsibilities include...- A technology consulting firm is seeking a Senior Google Cloud Engineer to enhance cloud capabilities in secure environments. This role requires strong experience in Google Cloud Platform (GCP) and Infrastructure-as-Code (IaC). The ideal candidate has significant experience...
- ...your table fresh from our open-scratch kitchen. Our knowledgeable staff can help you pair each dish with the perfect glass. Because food... .... Maintain adequate levels of ice for drink preparation. Observe all state and federal laws regarding the service of food and alcohol...Seasonal workLocal areaImmediate startFlexible hoursShift work
- ...capabilities, ensuring platform reliability, and enabling security... ...blended with platform engineering capabilities to mature... ...-time monitoring for observability. • Partner with... ...tools and platforms (GCP, AWS, Azure) •... ...which case we request on-site presence up to 4 days...Immediate startRemote workFlexible hours
$105k - $215k
...seeking an experienced Senior Software Engineer to play a pivotal role in building the... ...that enable policy-driven, automated, reliable and observable software delivery across a large... ...multi-cloud environments (Azure, AWS, GCP) with a focus on containerized deployments...Hourly payWork experience placementLocal areaFlexible hours- ...building secure infrastructure solutions and establishing best practices for cloud resource management. The role requires deep expertise in GCP, automation with Terraform, and strong communication skills to interact with various stakeholders. This position is remote, but...Remote job
$79.2k - $178.1k
...Summary Oracle Health Platform Engineering builds core platform... ...strengthen platform security and reliability. Responsibilities Key... ...practices (testing, CI/CD, observability, security). • Diagnose and... ...(OCI, AWS, Azure, or GCP), including cloud-native development...Temporary workFlexible hours$105k - $215k
...seeking an experienced Senior Engineer with a passion for building... ...Senior Engineer works with our Sr Staff Engineer and other Sr.... ...expertise in the Open-Source Observability, Data platform domain. Position... ...of experience with AWS, GCP, Azure, or hybrid data center...Hourly payWork experience placementLocal areaFlexible hours- ...Business Data Analyst who will utilize strong analytical skills on data and BI projects. The position requires hands-on experience with GCP for cloud migrations and data modernization initiatives. Candidates should be skilled in data warehousing and have familiarity with...
- ...seeking an experienced Senior Engineer with a passion for building... ...Senior Engineer works with our Sr Staff Engineer and other Sr.... ...expertise in the Open-Source Observability, Data platform domain. Position... ...years of experience with AWS, GCP, Azure, or hybrid data center...Hourly payWork experience placementLocal areaFlexible hours
$225.1k - $264.5k
...: Remote Department Engineering Compensation: CA$225.1K... ...the Role: We are seeking a Staff Software Engineer to lead... ...teams-including Platform, Kafka, Observability, Developer Productivity,... ...identity , cloud IAM (AWS, GCP, Azure), and zero-trust architectures...Full timeRemote work- ...Title: DevOps Engineer Location: Washington, DC: 100% Onsite (Only Locals)... ...Experience working with AWS, Azure or GCP Should be comfortable working with... ...if needed. Previous experience with Observability, SRE. Establish expectations with stakeholders...H1bLocal area
$99.6k - $234.6k
...Summary Oracle Health Platform Engineering builds and operates shared... ...services that power secure, reliable product delivery at scale.... ..., deployment patterns, observability). • Mentor engineers through... ...strongly desired; AWS/Azure/GCP acceptable), including containerization...Temporary workVisa sponsorshipFlexible hours$67k - $136.8k
...The opportunity As an FSO DevOps Engineer Senior Analyst, you’ll be based in our... ...including CI/CD automation, infrastructure reliability, observability, and secure deployment patterns. You... ...of cloud platforms such as Azure, GCP or AWS. Azure DLT (Besu, Canton)...Summer holidayFlexible hours- ...Senior DevOps Engineer- Arlington, VA Job Description... ..., or ArgoCD. Drive observability — instrument... ...Kubernetes), and multi-region reliability. Write real code —... ...preferred; Azure and GCP familiarity a bonus).... ...reimbursement ~ On-site fitness center and/or...Full timeWork at officeWork from homeMonday to Thursday
- ...Staff Software Engineer - Log Management United States About Us Radiant... ...is mission-critical. Reliability and operational excellence... ...while continuously improving observability and operational excellence... ...expertise Experience with AWS, GCP, or Azure (S3, GCS, Data Lake...
$100k - $160k
...change) Position Title: Lead Software Engineer - DevSecOps & Modernization Clearance... ...Cloud certifications (AWS / Azure / GCP) Architecture certifications (TOGAF... ...Artifactory/Xray, ACR Datadog for monitoring/observability ServiceNow, Jira, Subject7, Postman...Work experience placementLocal areaImmediate start- ...Job Advertisement – DevOps Engineer Company Overview: We are... ...company focused on delivering reliable, scalable, and secure cloud‑native... .../CD pipelines, deployments, observability, and cloud infrastructure... ...hands-on with AWS, Azure, or GCP. - Expertise in CI/CD tools...Flexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Site Reliability Engineer - Observability GCP. Be the first to apply!
- staff data engineer Washington DC
- assistant engineer Washington DC
- staff engineer Washington DC
- software engineer staff Washington DC
- senior staff systems engineer Washington DC
- senior staff engineer Washington DC
- technology administrator Washington DC
- engineering aide Washington DC
- site reliability engineer sre Washington DC
- site reliability engineer Washington DC


