Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Systems Engineer - Cloud Ops

AutoZone

Job Description

As a Systems Engineer on the Cloud Operations team, you will be responsible for deploying, managing, and optimizing our cloud-based infrastructure on Google Cloud Platform (GCP). You will work with technologies such as Terraform, Kubernetes (GKE), GitOps/ArgoCD, CI/CD pipelines, and observability tools to ensure reliable, secure, and scalable platform operations.

You will also contribute to our AI/ML platform initiatives, supporting infrastructure for LLM-based applications and AI-powered automation tools that enhance developer productivity and operational efficiency.

You will collaborate with development teams, SREs, and platform architects to ensure seamless deployment and delivery of applications while maintaining the highest standards of reliability, security, and performance.

Responsibilities

Cloud Infrastructure, Automation & Operations:

  • Design, build, and maintain cloud infrastructure using Terraform to automate provisioning, scaling, and lifecycle management of resources on GCP

  • Develop and maintain CI/CD pipelines using GitLab CI to automate build, test, and deployment workflows. Implement and maintain GitOps practices using ArgoCD for declarative, version-controlled application deployment

  • Monitor system performance using observability tools (Dynatrace, Cloud Monitoring, Prometheus/Grafana) and troubleshoot production issues

  • Participate in on-call rotation to provide 24/7 support for critical infrastructure incidents

  • Perform root cause analysis on incidents and implement preventive measures. Document runbooks, architecture decisions, and operational procedures

Kubernetes Platform Management:

  • Deploy, configure, and manage containerized applications on Google Kubernetes Engine (GKE), including GKE Autopilot and Standard clustersManage cluster lifecycle including upgrades, node pool configurations, and capacity planning

  • Troubleshoot pod failures, CrashLoopBackOff, OOMKilled events, and container resource issues

  • Configure and optimize resource requests/limits, Horizontal Pod Autoscaler (HPA), and Vertical Pod Autoscaler (VPA)

  • Manage Kubernetes networking including Services, Ingress controllers, Network Policies, and DNS configurations. Implement and manage service mesh (Istio) for traffic management, observability, and security

  • Manage secrets and configurations using Kubernetes Secrets, ConfigMaps, and external secret management tools. Implement pod security standards, RBAC policies, and workload identity configurations

AI/ML Platform & Automation:

  • Support infrastructure for AI/ML workloads including LLM-based applications and model serving platforms

  • Deploy and manage AI-powered developer tools such as coding assistants (Claude Code, GitHub Copilot) and agentic AI systems. Explore and implement AI-assisted incident response and automated remediation workflows

  • Build and maintain infrastructure for Retrieval-Augmented Generation (RAG) pipelines and vector databases

  • Configure GPU-enabled node pools and optimize resource allocation for AI/ML workloads

  • Implement MCP (Model Context Protocol) servers and AI agent integrations for operational automation

  • Stay current with emerging AI technologies and evaluate their applicability for infrastructure automation

Qualifications

Kubernetes Expertise (Essential):

  • 3+ years hands-on experience with Kubernetes in production environments

  • Deep understanding of Kubernetes architecture: API server, etcd, scheduler, controller manager, kubelet

  • Experience with GKE (Standard and Autopilot modes), including cluster creation, upgrades, and maintenance

  • Proficiency in troubleshooting workloads: analyzing pod logs, events, describe outputs, and container states

  • Strong understanding of resource management: requests, limits, QoS classes, and resource quotas

  • Experience with Kubernetes networking: Services (ClusterIP, NodePort, LoadBalancer), Ingress, Network Policies

  • Knowledge of Kubernetes storage: PersistentVolumes, PersistentVolumeClaims, StorageClasses, dynamic provisioning

  • Experience with Helm charts for application packaging and deployment

  • Familiarity with Kubernetes security: RBAC, Pod Security Standards, Secrets management, Workload Identity

  • Understanding of Kubernetes observability: metrics-server, kubectl top, container resource monitoring

  • Experience debugging common issues: ImagePullBackOff, CrashLoopBackOff, OOMKilled, Evicted pods, pending pods

Cloud & Infrastructure:

  • 3+ years of experience with Google Cloud Platform (GCP) services including GKE, Cloud Run, Cloud SQL, Memorystore, Pub/Sub, and Cloud Logging

  • Strong experience with Terraform for infrastructure as code (IaC)

  • Understanding of cloud networking: VPCs, subnets, firewall rules, Cloud NAT, Private Service Connect

CI/CD & GitOps:

  • Proficiency with GitLab CI/CD pipelines

  • Experience with ArgoCD or similar GitOps tools

  • Understanding of Helm charts and Kustomize for Kubernetes manifest management

Observability & Troubleshooting:

  • Experience with monitoring and APM tools (Dynatrace, Datadog, Prometheus, Grafana)

  • Ability to analyze logs, metrics, and traces to diagnose production issues

  • Familiarity with JVM troubleshooting (heap dumps, thread analysis, GC tuning, connection pool issues)

AI/ML Knowledge:

  • Basic understanding of LLM concepts, prompt engineering, and AI model deployment

  • Familiarity with AI coding assistants and their integration into development workflows

  • Interest in agentic AI systems and autonomous automation tools

  • Exposure to vector databases (Pinecone, Weaviate, pgvector) and RAG architectures is a plus

Systems & Networking:

  • Strong Linux administration skills

  • Understanding of networking concepts (DNS, load balancing, firewalls, TCP/IP)

  • Experience with service mesh (Istio) is a plus

General:

  • Excellent problem-solving and analytical skills

  • Strong written and verbal communication

  • Ability to work effectively in a collaborative, cross-functional environment

  • Experience working in an Agile/DevOps culture

  • Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent experience)

About Autozone

Since opening our first store in 1979, AutoZone has grown into a leading retailer and distributor of automotive parts and accessories across the Americas. Our customer-first mindset and commitment to Going the Extra Mile define who we are, for both our customers and AutoZoners. Working at AutoZone means being part of a team that values dedication, teamwork, and growth. Whether you're helping customers or building your career, we provide tools and support to help you succeed and drive your future.

Benefits at AutoZone

AutoZone offers thoughtful benefits programs with one-on-one benefits guidance designed to improve AutoZoners’ physical, mental and financial well-being.

All AutoZoners (Full-Time and Part-Time):

  • Competitive pay

  • Unrivaled company culture

  • Medical, dental and vision plans

  • Exclusive discounts and perks, including an AutoZone in-store discount

  • 401(k) with company match and Stock Purchase Plan

  • AutoZoners Living Well Program for free mental health support

  • Opportunities for career growth

Additional Benefits for Full-Time AutoZoners:

  • Paid time off

  • Life, and short- and long-term disability insurance options

  • Health Savings and Flexible Spending Accounts with wellness rewards

  • Tuition reimbursement

Minimum age requirements may apply. Eligibility and waiting period requirements may apply; benefits for AutoZoners in Puerto Rico, Hawaii, or the U.S. Virgin Islands may differ. Learn more about all that AutoZone has to offer at Careers.AutoZone.com.

We proudly support Veterans, Active-duty Service Members, Reservists, National Guard and Military Families. Your experience is highly valued, and we encourage you to apply to join our team.

Online Application:

An online application is required. Click the Apply button to complete your application. For step-by-step instructions on how to apply visit careers.autozone.com/candidateresources.

AutoZone, and its subsidiary, ALLDATA are equal opportunity employers. All applicants will be considered for employment without attention to age, race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status, or any other legally protected categories. ​

Job Identification 105932

Job Schedule Full time

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Systems Engineer - Cloud Ops in Memphis, TN vacancy
  •  ...Uses fundamental engineering concepts, practices, and procedures to analyze operational performance, improvement opportunities, and implement technology systems related to facility and material handling in the field operations. Essential Functions Analyzes performance... 
    Suggested
    Work at office
    Remote work

    FedEx

    Memphis, TN
    1 day ago
  •  ...of three main corporate divisions: Howard Power Solutions, Howard Transportation, and Howard Technology Solutions. Vision Systems Engineer (AI Specialist) We are seeking a highly skilled Vision Systems Engineer to lead the integration and optimization of... 
    Suggested

    Howard Industries

    Olive Branch, MS
    1 day ago
  •  ...Job Description AutoZone is seeking an experienced Systems Engineer to join our expanding eCommerce B2B team. These efforts specifically...  ...dashboards to support the application. Experience with Google Cloud Platform is required for deploying and managing cloud-based... 
    Suggested
    Full time
    Temporary work
    Part time
    Flexible hours

    AutoZone

    Memphis, TN
    6 days ago
  •  ...desired, sustainable business outcomes and assure the integrity and continuity of our systems. This position is responsible for establishing and maintain AutoZone’s Quality Engineering (QE) best practices, writing test scenarios and executing tests, both manually and... 
    Suggested
    Full time
    Temporary work
    Part time
    Work experience placement
    Flexible hours

    AutoZone

    Memphis, TN
    1 day ago
  •  ...Job Description AutoZone's Site Reliability Engineering (SRE) team is seeking a Systems Engineer with a focus on SRE Enablement. This position is responsible...  ..., which includes a primary emphasis on the Google Cloud Platform (GCP), as well as on-premises servers and... 
    Suggested
    Full time
    Temporary work
    Part time
    Flexible hours

    AutoZone

    Memphis, TN
    2 days ago
  • $120k - $140k

     ...Vaco is actively seeking an AWS Cloud Reliability Engineer for a direct-hire role supporting a nationally respected, technically focused, and...  ...and cutover runbooks including detailed recovery procedures, system dependencies, stakeholder communications, validation checkpoints... 
    H1b
    Work at office
    Local area
    Immediate start
    Visa sponsorship
    Work visa

    Vaco

    Memphis, TN
    10 days ago
  •  ...create the premier integrated provider of Systems & Components, and Technical Services for...  ...experienced Senior Systems & Infrastructure Engineer to join our Information Technology team...  ..., firewall replacements, and cloud integration projects. • Assist IT leadership... 
    Local area
    Remote work

    Hyperion Solutions Corporation

    Memphis, TN
    3 days ago
  •  ...Responsibilities: - Provides subject matter proficiency supporting system testing activities - Applies analytical skills to support...  ...testing - Tests web-based applications and RESTful interfaces in cloud-hosted environments - Performs additional technical and... 
    Minimum wage
    Full time
    Contract work
    Temporary work
    For contractors
    Work experience placement
    Remote work

    Maximus

    Memphis, TN
    2 days ago
  •  ...identify software defects in electrified and internal combustion engine (ICE) powertrains, ensuring the delivery of robust, high-...  ...vehicle environments and Hardware-in-the-Loop (HIL) simulation systems. The engineer will execute DVP&R test procedures, perform initial... 
    Full time
    Immediate start

    Stellantis

    Southaven, MS
    2 days ago
  •  ...business outcomes and assure the integrity and continuity of our systems. This position is responsible for the development, maintenance...  ...design and maintenance; security operations. Store engineering, coding based on design provided and roll-out implementation.... 
    Full time
    Temporary work
    Part time
    Flexible hours

    AutoZone

    Memphis, TN
    2 days ago
  •  ...Job Description Job Description Overview A leading engineering and manufacturing organization is seeking a Systems Engineer Manufacturing I to support the design and development of custom mechanical systems within an engineer-to-order environment. This role focuses... 
    Work at office

    Nextech

    Memphis, TN
    3 days ago
  • $140k - $155k

    Title: Automation Systems Director Location: Montreal, Memphis, TN, Dundalk, MD or Hattiesburg, MS work site 25-50% travel to plants...  ..., and operational efficiency. Partnering closely with Engineering, IT, and plant operations, this leader plays a key role in advancing... 
    Full time
    Contract work
    For contractors
    Work experience placement

    ConsultNet

    Memphis, TN
    2 days ago
  •  ...Overview System Manager- Infection Prevention Job Code 20986 Job Family QCS Job Summary The System Infection Prevention Manager is responsible for oversight and maintenance of all Infection Prevention-related software programs and modules... 
    Work at office

    Baptist Memorial Healthcare Corporation

    Memphis, TN
    4 days ago
  •  ...goals and measurements. Implements educational offerings and leads process improvement projects/activities. Works in partnership with system and entity leaders, physician constituents, clinical leaders, and interdisciplinary care teams, and other key stakeholders to... 
    Temporary work

    Baptist Memorial Healthcare Corporation

    Memphis, TN
    2 days ago
  • $39.07 per hour

     ...unless otherwise stated. Job Description: Maintenance Mechanic or Industrial Electrician position supports the package sorting system, including belt conveyors. In our industry, this position is also known as Plant Mechanic or Industrial Mechanic. This position is... 
    Weekly pay
    Permanent employment
    Full time
    Immediate start
    Day shift

    National Guard Employment Network

    Memphis, TN
    4 days ago
  •  ...Conectiv Supply Chain Solutions is seeking a Distribution Engineer to support warehouse and distribution operations by analyzing...  ...Support implementation and troubleshooting of warehouse management systems (WMS) and automation equipment. Conduct time studies, process... 
    Internship
    Work at office
    Local area
    Monday to Friday
    Flexible hours

    Conectiv Supply Chain Solutions, Inc.

    Memphis, TN
    15 days ago
  •  ...- Friday, 9am-5pm We are seeking an experienced Windows Server Engineer to join our Windows Application Management Services Team. In this...  ...& Experience - Minimum 8 years of experience as a Windows Systems Engineer - OR - - Bachelor's degree and a minimum of 4 years of... 
    Monday to Friday

    First Horizon Bank

    Hickory Hill, TN
    11 hours ago
  •  ...Monday-Friday, 9am-5pm Description: As a DevOps Software Platform Engineer focusing on the DevOps platform tools, you will be a crucial...  ...and Experience: Minimum 10 years of experience as a DevOps or Systems Engineer or a bachelor's degree and 6 years of experience. Kubernetes... 
    Monday to Friday

    First Horizon Bank

    Hickory Hill, TN
    11 hours ago
  •  ...Mercor is inviting applications for remote DevOps / Platform Engineer roles to join our expert network. As part of this open application...  ...Ideal candidates should have experience in CI/CD pipelines, cloud infrastructure, and containerization technologies and be able to... 
    Remote work
    Flexible hours

    Mercor Inc

    Memphis, TN
    2 days ago
  •  ...quality, and efficiency across the value chain. The Reliability Engineering Manager role is based in Memphis, TN (onsite) . Be part...  ...tools (Word, Excel, PowerPoint) and ability to learn site systems and databases. Ability to work onsite and engage directly... 
    Contract work
    Work at office

    International Flavors and Fragrances

    Memphis, TN
    1 day ago
  • $39.07 per hour

     ...unless otherwise stated. Job Description: Maintenance Mechanic or Industrial Electrician position supports the package sorting system, including belt conveyors. In our industry, this position is also known as Plant Mechanic or Industrial Mechanic. This position is... 
    Permanent employment
    Full time
    Day shift

    National Guard Employment Network

    Memphis, TN
    1 day ago
  • Requisition # 10003000_COMPANY_1.2 Job Title Information Systems Operations Manager Job Type Full-time Location Corporate - TN US Memphis, TN 38113 US (Primary) Category Information Systems Job Description PURPOSE OF... 
    Full time
    Temporary work

    Crew Training International

    Memphis, TN
    2 days ago
  • $109.85k - $184.61k

     ...Senior DevOps Engineer Job Locations US-NJ-Secaucus | US-FL-Jacksonville | US...  ...the design and implementation of hybrid clouds infrastructure, kubernetes clusters, and...  ...complex technical issues involving multiple systems and networks Conducts software rollouts... 
    Full time
    Temporary work
    Local area
    Flexible hours

    Yusen Logistics (Americas), Inc.

    Cordova, TN
    6 days ago
  •  ...~ We're seeking an IT Engineering Platforms Architect to lead the strategy, architecture, delivery, and sustained operations of our product...  ...architecture and roadmap; define reference architectures, system boundaries, integration patterns, security/role models, and SLAs... 

    Hunter Fan Company

    Cordova, TN
    1 day ago
  • $67k - $136.8k

     ...The opportunity As an FSO DevOps Engineer Senior Analyst, you’ll be based in our Service...  ....  You will deliver high qualify systems with focus on reliability and excellent customer...  ...a similar role. Strong knowledge of cloud platforms such as Azure, GCP or AWS. Azure... 
    Summer holiday
    Flexible hours

    EY

    Memphis, TN
    4 days ago
  •  ...Weekly Schedule: Monday- Friday, 9am-5pm The Site Reliability Engineer will be an active contributor responsible for configuring Dynatrace...  ...monitoring for business applications, infrastructure, cloud environments, and Azure tenants and subscriptions using Dynatrace... 
    Monday to Friday

    First Horizon Bank

    Hickory Hill, TN
    11 hours ago
  • $40 per hour

     ...the hospitality industry around the world! As a Senior Software Engineer, you will bring your technical skills to a hospitality company...  ...and extend the built‑in capabilities of the Content Management System. Debug, troubleshoot, and resolve production issues promptly... 
    Full time
    Work experience placement
    Work at office
    Worldwide
    Night shift

    Hilton

    Memphis, TN
    5 days ago
  •  ...candidates will have a Bachelor's degree in computer/electrical engineering and a minimum of three years of related experience. The role...  ...requires strong analytical skills, familiarity with manufacturing systems, and effective time management, in a fast-paced and diverse... 

    Hyve Solutions

    Olive Branch, MS
    2 days ago
  •  ...SQL query development. Experience with healthcare information systems. Demonstrated ability to use programming languages (SAS, R, Python...  ..., analytics, statistics, public health, business analytics, engineering or related field demonstrating strong analytical foundation... 

    Mississippi Baptist Health Systems

    Memphis, TN
    18 days ago
  • $86.32k - $154.96k

     ...Position Overview St. Jude is seeking an HPC Infrastructure DevOps Engineer II to join the High-Performance Computing Support (HPCS) team....  ...workflows. Work with downstream operational teams to ensure systems are configured, validated, monitored, patched, and maintained... 
    Remote work

    St. Jude Children's Research Hospital

    Memphis, TN
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Systems Engineer - Cloud Ops. Be the first to apply!