GCP Kubernetes SRE
Prophecy Technologies
Role Overview: This role is for a highly skilled Site Reliability Engineer with strong expertise in Kubernetes and Google Cloud Platform (GCP), specifically GKE. The position requires a deep understanding of infrastructure as code (IaC) using Terraform, Helm, and GitHub Actions, alongside proficiency in Python, Ansible, and Node.js. The engineer will be crucial in maintaining and enhancing observability stacks with Prometheus and Grafana, ensuring robust Linux systems and networking fundamentals, and contributing to automation and CI/CD pipelines. A significant aspect of the role involves applying AI/ML concepts and AIOps practices to improve system reliability and incident management. Key Responsibilities:
- Manage incidents, provide on-call support, and perform production triage to ensure system stability.
- Develop and maintain automation scripts and CI/CD pipelines for efficient software delivery and infrastructure management.
- Implement and manage infrastructure using IaC principles with Terraform, Helm, and GitHub Actions.
- Monitor system performance and health using Prometheus and Grafana observability tools.
- Apply AI/ML concepts and AIOps practices, including model lifecycle management, monitoring, and AI-driven alerting, to enhance operational efficiency.
- Support and operate ML/AI platforms or pipelines (MLOps) and integrate AI-driven automation into monitoring and incident response.
- Strong experience with Kubernetes and GCP (GKE).
- Strong experience in IaC (Terraform), Helm, and GitHub Actions.
- Proficiency in Python, Ansible, Node.js.
- Strong experience with Prometheus and Grafana observability stack.
- Solid understanding of Linux systems and networking fundamentals.
- Experience in incident management, on-call support, and production triage.
- Hands-on experience with automation and CI/CD pipelines.
- Strong understanding of AI/ML concepts and AIOps practices (model lifecycle, monitoring, or AI-driven alerting).
- 10+ years of experience in Site Reliability Engineering or a related field.
- Google Cloud Architect Certification (Preferred).
- Certified Kubernetes Administrator (CKA) (Preferred).
- Experience in Java/J2EE, Spring Boot.
- Experience supporting or operating ML/AI platforms or pipelines (MLOps).
- Exposure to AIOps tools, anomaly detection, or predictive analytics systems.
- Experience with large-scale distributed systems and microservices architecture.
- Experience with GPU-based workloads or ML infrastructure on GCP.
- Knowledge of Kubeflow, Vertex AI, or ML pipelines.
- Experience integrating AI-driven automation into monitoring and incident response.
Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the GCP Kubernetes SRE in Scottsdale, AZ vacancy
- ...Job Title On call support Required Skills Strong experience in Kubernetes, Cloud (GCP/Azure/AWS). GCP is preferred. Ability to triage incidents (good debussing skills, Monitoring tools knowledge such as Splunk / Prometheus-Grafana / AppDynamics / Dynatrace...SuggestedFlexible hours
- ...Middlewares like Apigee, Vordel, Data power and Nodejs Applicant should expertise in configuring, supporting and manage Rancher, Kubernetes and Docker Containerization. Ability to work on-call production support and Managing in a 24/7 support environment...Suggested
$118.45k - $236.9k
...Software Engineering, Platform Engineering, or SRE. ~7+ years of experience with observability... ...cloud-native and containerized platforms (Docker, Kubernetes, Argo CD). ~7+ years working with public cloud platforms (AWS, GCP, or Azure). ~5+ years designing and scaling...SuggestedHourly payFull timeTemporary workLocal area- ...Hi, GCP Cloud Architect Scottsdale ARIZONA Qualifications Over 8 years of overall IT experience with... ...deployment and orchestration of container technologies (e.g., Docker, Kubernetes, GKE) Experience migrating and refactoring solutions to...Suggested
$60k - $80k
...to our clients with a sense of urgency because we know maintenance issues impact the bottom line. The Site Reliability Engineer (SRE) Analyst II is the organization's Second line of support and brings strong problem-solving skills to the SRE team, utilizing a mature...SuggestedPermanent employmentFull timeContract workLive inWork at officeRelocation- ...Data Engineering with an emphasis on Data Warehousing and Data Analytics Very strong hands on Python expertise Strong Data and GCP Vertex AI knowledge 6 years of experience with one of the leading public clouds 6 years of experience in design and build of salable...
- ...We are looking for a hands-on GCP Data Engineer with strong SAP Finance (FI) knowledge to support the buildout of a Finance Data Platform on Google Cloud (BigQuery). The consultant will play a critical role in integrating SAP S/4HANA or ECC FI data into GCP, enabling financial...
- ...Supporting the analytics platform across GCP (BigQuery) and Tableau development . Focus is on data pipeline stability, reporting creation and maintenance, and light migration support from legacy systems (Hive/Presto and Tableau). Work is SLA-based , following defined...
$157.49k - $174.71k
...Deploy and manage applications using Kubernetes and container orchestration with infrastructure... ...-managed Kubernetes (AWS EKS, Azure AKS, GCP EKS) Preferred Qualifications:... ...Understanding of Site Reliability Engineering (SRE) principles Experience working in DoD...Flexible hours$180k - $260k
...architect and deliver cloud-native solutions on AWS, Azure, or GCP ~ Extensive experience with RESTful API design and microservices... ...on experience with containerization and orchestration (Docker, Kubernetes), CI/CD pipelines, and infrastructure-as-code ~ Agile...Full timeRemote workVisa sponsorship$86.4k - $199.5k
...Familiarity with building or integrating autonomous agents for DevOps/SRE use cases Cloud & Multi-Cloud Ecosystems Strong... ...Advanced competency in CI/CD pipelines (Jenkins, Kubernetes) Infrastructure as Code (Terraform) Observability tools (...Temporary work$176.5k - $262.35k
...our work integrates elements of Site Reliability Engineering (SRE) and DevOps projects as well. On the DevOps front, we oversee... ...experience with containers & container orchestration: Docker, Kubernetes ~ Strong communication skills with the ability to understand...Work at officeImmediate startFlexible hours- ...and rapid recovery from incidents. ~ Working knowledge on Vertex AI, Gen AI and Bigquery ~ Google Cloud Platform (GCP) Containerization, Kubernetes ~ Infrastructure as Code (Terraform), CI/CD (GitHub Actions), and Helm ~ Automation and scripting using Python, Ansible...
- ...APM Solutions to monitor application performance & infrastructure and aide in troubleshooting Experience on GCP, Microservices, Rancher, Docker, Kubernetes and web APIs Strong understanding of Relational and Non-SQL Databases such as Oracle, MySQL, Cassandra etc...
- ...Rust). ~2+ years with cloud transitions & containerization (GCP, AWS, Rancher, OpenShift, CloudFormation). Core Technical... ...Clickhouse, Time-series DBs. Container platforms: GKE / RKE / AKE, Kubernetes. Cloud observability: OTEL (tracing, monitoring, incident...
$170k - $220k
...technical initiatives end-to-end ~ A strong aversion to manual, repetitive toil (per Google SRE principles) Preferred Extras Golang in production systems Kubernetes and container orchestration Terraform and cloud infrastructure (AWS preferred)...Work at officeVisa sponsorship- ...and automation (Python, Bash, Terraform, Ansible, or similar). ~ Proficiency with cloud platforms (AWS, GCP, or Azure) and containerization (Docker, Kubernetes). ~ Familiarity with networking, security operations, and incident response workflows. Soft Skills...Relocation
- ...implementing REST services with Java, preferably including Spring Boot ~ Some experience in deploying to a cloud platform (Kubernetes, PCF, GCP, Azure, AWS, etc.) ~ Some experience with TDD, in both frontend and backend technologies ~ Good written and verbal...Contract work
- ...of experience with Python with working knowledge on Notebooks 2 years of experience with Kafka PubSub Docker Kubernetes 2 years hands on experience on GCP Cloud data implementation projects Dataflow DataProc Cloud Composer Big Query Cloud Storage GKE Airflow etc...
- ...• Preferred / Nice-to-Have Skills Experience with cloud platforms (AWS, Azure, GCP) Knowledge of microservices architecture Familiarity with Docker/Kubernetes Experience with testing frameworks (Jest, Mocha) CI/CD pipeline knowledge Experience...
- ...and experimentation environments. Collaborate with product, SRE, security, data, and operations teams in an agile delivery model... ...supporting containerized and cloud-native platforms (Kubernetes/EKS/AKS, managed PaaS services). Experience enabling data, analytics...Work at officeLocal area
$108k - $140k
...product managers, designers, security, compliance, and SRE teams to deliver features that meet the regulatory... ...management. ~ Cloud experience on AWS, Azure, or GCP, including containers (Docker), orchestration (Kubernetes or ECS). ~ Working knowledge of relational and at...Temporary workWork at office- ...renewal, rotation, and revocation. Build, maintain, and secure Kubernetes clusters hosting PKI and secret management services.... ...secret, and Kubernetes infrastructure issues. Collaborate with SRE, platform, and security teams to integrate PKI and secrets management...Work at officeRemote work
- ...infrastructure. ~ Strong knowledge of cloud services (AWS, Azure, or GCP) and cloud networking concepts. ~ Experience with VMware... ...workflows. Exposure to container technologies (Docker, Kubernetes) is a plus. Company Overview 74Software, affiliated with...WorldwideFlexible hours
- ...experience with REST services, preferably using Java and Spring. Some amount of experience deploying to cloud platforms (Kubernetes, PCF, GCP, Azure, AWS, etc.). Some amount of experience with TDD, both backend and frontend. Good communication skills, both written...For contractors
- ...communication skills. Preferred Qualifications: Experience with cloud platforms such as AWS, Azure, or GCP. Exposure to Docker, Kubernetes , or micro-frontend architecture. Knowledge of backend technologies like Node.js , .NET , or Java...
- ...the design and implementation of core infrastructure, including Kubernetes (EKS), CI/CD pipelines, and infrastructure as code. Deliver... ...the end-to-end design and continuous improvement of DevOps and SRE practices. Own CI/CD systems, cloud-native infrastructure, and...Immediate startWorldwideAfternoon shift
- ...years of experience implementing REST services with Java and Spring Boot Some experience in deploying to a cloud platform (Kubernetes, PCF, GCP, Azure, AWS, etc.) Some experience with TDD, in both frontend and backend technologies Good written and verbal...
- ...modeling Hands-on experience with cloud platforms (AWS, Azure, or GCP)-specifically choosing and configuring components for AI-native... ...Experience with containerization and orchestration (Docker, Kubernetes, ECS) Understanding of observability and monitoring for AI...Work experience placementLive inWork at officeLocal area
$92.7k - $185.4k
...creating responsive UIs with SCSS, Material Design, and AG Grid ~2+ years of experience in deploying to a cloud platform (Kubernetes, Azure, GCP, etc.) Preferred Qualifications: Experience writing unit tests and integration tests using Jest and Cucumber...Full timeLocal area
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to GCP Kubernetes SRE. Be the first to apply!

