Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Kubernetes Platform Engineer - AI Infrastructure

$152.5k - $219.2k

Cisco

The application window is expected to close on: 06/12/2026

Job posting may be removed earlier if the position is filled or if a sufficient number of applications are received .

Kubernetes Platform Engineer – AI Infrastructure - hybrid (2013055)

***hybrid role - requires some work activity to be on-site in San Jose CA office

Meet the Team

Join our Platform Engineering team to design, build, and operate large-scale, on-prem Kubernetes infrastructure powering next-generation AI/ML platforms, including GPU-enabled environments for traditional models and LLMs. You will lead the technical direction of scalable, reliable systems, managing the Kubernetes control plane and extending platform capabilities through custom controllers and operators. You’ll architect ML platforms, implement Infrastructure as Code with Golang, and drive MLOps best practices. Partnering closely with data scientists and ML engineers, you’ll enable high-performance AI workloads while leveraging AIOps for automation and reliability. This role requires strong hands-on on-prem Kubernetes experience and offers opportunities to mentor engineers and influence platform strategy in a hybrid environment.

Your Impact / Responsibilities as a Kubernetes Platform Engineer , you will:

  • Design, build, and operate large-scale on-prem Kubernetes platforms (OpenShift/Anthos), with ownership of control plane, etcd, and cluster lifecycle.

  • Architect scalable, multi-tenant platform infrastructure as the foundation for AI/ML and GenAI workloads.

  • Enable and optimize AI/ML workloads, including GPU-based environments for training, inference, and model deployment.

  • Partner with data scientists and ML engineers to onboard and scale ML pipelines and workflows.

  • Build platform capabilities using Kubernetes controllers, operators, CRDs, and Golang/Python services.

  • Implement Infrastructure as Code, automation, and AIOps-driven self-healing using platform telemetry and observability.

  • Ensure reliability through performance tuning (scheduling, resource utilization) and participate in on-call support and incident response.

Minimum Qualifications

  • 5+ years of software engineering experience, including supporting AI/ML or GPU-based workloads on Kubernetes platforms

  • 3+ years operating Kubernetes in production with control plane ownership, preferably in on-prem or self-managed environments

  • Strong experience with etcd management (backup, restore, recovery) and Kubernetes cluster upgrades

  • Proficiency in Go with experience building Kubernetes controllers/operators, CRDs, and webhooks

  • Deep understanding of Kubernetes internals (API server, scheduler, controller loops, reconciliation patterns)

  • Proven ability to debug and operate large-scale distributed systems in production environments, including participation in on-call rotations

Preferred Qualifications

  • Experience with bare-metal or on-prem infrastructure at scale

  • Experience enabling or supporting GPU-based workloads in Kubernetes environments

  • Familiarity with AI/ML platforms, pipelines, or tooling (e.g., model training, inference, or orchestration)

  • Experience building internal developer platforms or platform-as-a-service (PaaS) capabilities

  • Exposure to AIOps, including automation, anomaly detection, or self-healing systems

  • Experience applying statistical or ML techniques to operational data for reliability, performance, or capacity planning

Why Cisco?

At Cisco, we’re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.

Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you’ll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.

We are Cisco, and our power starts with you.

Message to applicants applying to work in the U.S. and/or Canada:

The starting salary range posted for this position is $152,500.00 to $219,200.00 and reflects the projected salary range for new hires in this position in U.S. and/or Canada locations, not including incentive compensation*, equity, or benefits.

Individual pay is determined by the candidate's hiring location, market conditions, job-related skillset, experience, qualifications, education, certifications, and/or training. The full salary range for certain locations is listed below. For locations not listed below, the recruiter can share more details about compensation for the role in your location during the hiring process.

U.S. employees are offered benefits, subject to Cisco’s plan eligibility rules, which include medical, dental and vision insurance, a 401(k) plan with a Cisco matching contribution, paid parental leave, short and long-term disability coverage, and basic life insurance. Please see the Cisco careers site to discover more benefits and perks. Employees may be eligible to receive grants of Cisco restricted stock units, which vest following continued employment with Cisco for defined periods of time.

U.S. employees are eligible for paid time away as described below, subject to Cisco’s policies:

  • 10 paid holidays per full calendar year, plus 1 floating holiday for non-exempt employees

  • 1 paid day off for employee’s birthday, paid year-end holiday shutdown, and 4 paid days off for personal wellness determined by Cisco

  • Non-exempt employees** receive 16 days of paid vacation time per full calendar year, accrued at rate of 4.92 hours per pay period for full-time employees

  • Exempt employees participate in Cisco’s flexible vacation time off program, which has no defined limit on how much vacation time eligible employees may use (subject to availability and some business limitations)

  • 80 hours of sick time off provided on hire date and each January 1st thereafter, and up to 80 hours of unused sick time carried forward from one calendar year to the next

  • Additional paid time away may be requested to deal with critical or emergency issues for family members

  • Optional 10 paid days per full calendar year to volunteer

For non-sales roles, employees are also eligible to earn annual bonuses subject to Cisco’s policies.

Employees on sales plans earn performance-based incentive pay on top of their base salary, which is split between quota and non-quota components, subject to the applicable Cisco plan. For quota-based incentive pay, Cisco typically pays as follows:

  • .75% of incentive target for each 1% of revenue attainment up to 50% of quota;

  • 1.5% of incentive target for each 1% of attainment between 50% and 75%;

  • 1% of incentive target for each 1% of attainment between 75% and 100%; and

  • Once performance exceeds 100% attainment, incentive rates are at or above 1% for each 1% of attainment with no cap on incentive compensation.

For non-quota-based sales performance elements such as strategic sales objectives, Cisco may pay 0% up to 125% of target. Cisco sales plans do not have a minimum threshold of performance for sales incentive compensation to be paid.

The applicable full salary ranges for this position, by specific state, are listed below:

New York City Metro Area:

$152,500.00 - $252,000.00

Non-Metro New York state & Washington state:

$135,800.00 - $224,400.00

  • For quota-based sales roles on Cisco’s sales plan, the ranges provided in this posting include base pay and sales target incentive compensation combined.

** Employees in Illinois, whether exempt or non-exempt, will participate in a unique time off program to meet local requirements.

Cisco is an Affirmative Action and Equal Opportunity Employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, national origin, genetic information, age, disability, veteran status, or any other legally protected basis.

Cisco will consider for employment, on a case by case basis, qualified applicants with arrest and conviction records.

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Kubernetes Platform Engineer - AI Infrastructure in Durham, NC vacancy
  •  ...Senior Kubernetes Platform Engineer - AI/ML Infrastructure Join our Platform Engineering team to design, build, and operate large-scale, on-prem Kubernetes infrastructure powering next-generation AI/ML platforms, including GPU-enabled environments for both traditional... 
    Suggested

    Webex Events (formerly Socio)

    Durham, NC
    2 days ago
  • $113.05k - $168.3k

     ...Software Engineer We are seeking a Software Engineer...  ...scalable cloud-based platforms and services. This...  ...performance of cloud infrastructure. The engineer will collaborate...  ...platforms such as Kubernetes (including on-prem and...  ...systems. Utilize modern AI/ML and Generative AI... 
    Suggested

    NetApp

    Durham, NC
    4 days ago
  • $230k - $250k

     ...full-stack portfolio of AI-enabled, AI-ready, and...  ..., tablets), infrastructure (server, storage, edge...  ...scale out an Agentic AI platform that streamlines the creation...  ...collaborate with AI engineers to deploy and scale AI...  ...containerization and scaling using Kubernetes. ~ Proven experience... 
    Suggested
    Full time
    Work at office
    Local area
    Work from home
    3 days per week

    Lenovo

    Morrisville, NC
    3 days ago
  •  ...Solutions is seeking a Senior Software Engineer - Platform Performance & Resilience that plays a key...  ..., and cloud services. This role uses AI-enabled automation to validate and...  .... ~ Experience deploying services in Kubernetes-based cloud environments. ~ Strong debugging... 
    Suggested
    Work at office

    Toshiba Global Commerce Solutions

    Durham, NC
    2 days ago
  • $152k - $241.5k

     ...our global services platform. At NVIDIA, you’ll keep...  ...harness the power of AI to deliver groundbreaking...  ...fabrics. Use IaC(Infrastructure‑as‑Code) and config management...  ...using Slurm, LSF or Kubernetes clusters, including...  ...Ruby. Mentored other engineers and influenced technical... 
    Suggested
    Full time

    NVIDIA

    Durham, NC
    20 hours ago
  •  ...Principal Software Engineer - Credit Card Core Platforms Brazil, Belo Horizonte; Brazil, Campinas; Brazil...  ...transformation: leveraging Generative AI to automate complex operational...  ...solutions (cloud-based agents) to automate infrastructure maintenance and data migration.... 

    Nubank

    Durham, NC
    6 days ago
  •  ...Carolina is seeking an experienced Automation Engineer to support the Army Edge Computing Capability project...  ...designing automation frameworks, deploying Infrastructure as Code (IaC) pipelines, and troubleshooting Kubernetes clusters. Ideal candidates will have senior-... 

    IBM Computing

    Durham, NC
    4 days ago
  • $100.3k - $149.6k

     ...Entry Level Software Engineer in Cloud Storage Are you passionate...  ...Linux, AWS, Azure, GCP, and Kubernetes Experience with SQL and document...  ...CI) Familiarity with infrastructure as code (IaC) tools (e.g.,...  ...and tools Experience with AI/ML frameworks like PyTorch or... 
    Full time
    Internship

    NetApp

    Durham, NC
    1 day ago
  •  ...announcement. Applicants may not receive notifications of referral status until the full 6-month eligibility period has elapsed. Positions filled with applications from this pool will be placed into positions where AI duties occupy the majority of the work performed.... 

    Social Security Administration

    Durham, NC
    1 day ago
  •  ...Building the world's leading AI-powered, cloud-native...  ...looking for software engineers, at various levels, to...  ...network data plane platform. You will play a...  ...years of relevant cloud infrastructure/cloud networking experience...  ...Experience with Kubernetes Networking Experience... 
    Work experience placement

    IBM

    Durham, NC
    10 days ago
  •  ...We are looking for a DevOps Engineer to join the Technology Development & Infrastructure team. In this role, you will be...  ...opportunity to contribute to high-impact platforms used by a large number of...  ...). Experience implementing AI-driven anomaly detection, alert... 

    HCL Global Systems

    Durham, NC
    2 days ago
  •  ...experienced Software Verification Engineer to join our Air team - the...  ...of the Air simulation platform by verifying that features are...  ...protocols. Hands‑on experience with Kubernetes or other large‑scale...  ...until April4,2026. NVIDIA uses AI tools in its recruiting processes... 

    NVIDIA Gruppe

    Durham, NC
    2 days ago
  •  ...practices across software development, platform engineering, infrastructure, and security, with a strong...  ...(e.g., AWS/Azure, CISSP, Security+, Kubernetes, or DevOps certifications). ITIL...  ...We may use artificial intelligence (AI) tools to support parts of the hiring... 
    For contractors
    Local area

    Computer World Services

    Morrisville, NC
    9 days ago
  •  ...skilled and motivated Full-Stack Engineer to design, build, and...  ...cost efficiency. Healthcare Platform Development Work...  ...Collaborate on CI/CD pipelines and infrastructure supporting cloud-based...  ...infrastructure tools (Docker, Kubernetes, Terraform). When applying... 
    Remote work

    Keebler Health

    Durham, NC
    2 days ago
  •  ...Description & Requirements Maximus is currently seeking a Cloud Platform Engineer. This is a remote position. Maximus is a trusted...  ...enterprise or federal settings. - Proven experience with Infrastructure as Code (e.g., ARM templates, Bicep, Terraform) for... 
    Minimum wage
    Full time
    Contract work
    Temporary work
    Work experience placement
    Remote work

    Maximus

    Durham, NC
    4 hours ago
  •  ...full-stack portfolio of AI-enabled, AI-ready, and...  ..., tablets), infrastructure (server, storage, edge...  ...OpenStack-based IaaS platform. This role will focus...  ...- support for hybrid Kubernetes stack (e.g., Magnum, OpenShift...  ...high-level guidance to engineering teams. ~ Collaborate... 
    Full time
    Work at office
    Local area
    Work from home
    3 days per week

    Lenovo

    Morrisville, NC
    1 day ago
  • $137k - $200.5k

     ...stable cloud service on the WebEx platform, while developing and following...  ...with Site Reliability Engineering (SRE) practices, including system...  ...re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating... 
    Permanent employment
    Full time
    Temporary work
    Local area
    Flexible hours

    Cisco

    Durham, NC
    3 days ago
  •  ...Position: Asset Mgmt - Senior DevOps Engineer Location: Durham, NC / Merrimack,...  ...not looking for a purely Operations, Infrastructure, SRE, or Architect background. Required...  ...and orchestrators like Docker & Kubernetes. ~ Understanding of different build... 
    Relocation
    Monday to Friday

    Anveta

    Durham, NC
    5 days ago
  •  ...reliable operation of Avalara's cloud platforms. We focus on engineering guardrails, developing event-driven...  ...solutions and guardrails using infrastructure as code and cloud-native services...  ...security automation Avalara is an AI-first Company AI is embedded in our... 

    Avalara

    Durham, NC
    4 days ago
  •  ...Cloud Security Engineer/Architect Tier One Technologies has an...  ...virtualized environments. AI/ML Security Governance (Adversarial...  ...ensuring that non-compliant infrastructure is automatically remediated...  ...: Experience securing Kubernetes (EKS/AKS/GKE) and Docker environments... 
    Permanent employment
    Contract work
    Immediate start

    A.C.Coy Company

    Morrisville, NC
    4 days ago
  •  ...industries, helping them shape their hybrid cloud and AI journeys. With support from our strategic partners,...  ...Your role and responsibilities The Azure Security Engineer will support a large team of infrastructure, security and application team during migration of on... 
    Worldwide

    IBM

    Durham, NC
    3 days ago
  •  ...DevOps Engineer / Linux and AWS / Onsite in Durham, NC A profitable...  ...their SaaS based accounting platform, used by several enterprise...  ...as AWS and Terraform for infrastructure as code, plus GitlabCI and...  ...actions for CI/CD pipelines, Kubernetes for orchestration and do some... 
    Permanent employment
    Full time

    Motion Recruitment

    Durham, NC
    2 days ago
  •  ...best practices in Resiliency Engineering, Automation, Observability and...  ...supports solutions based on cloud platforms AWS/Azure and container orchestration Kubernetes.* Onboards /Evaluates New...  ...cloud-based applications and infrastructure solutions, using DevOps or SRE... 

    Soteria Reinsurance Ltd.

    Durham, NC
    4 days ago
  •  ...Specific Essential Duties and Responsibilities: - Provide Tier‑3 engineering support for Microsoft 365 GCC, Exchange Online, hybrid Exchange Server, and SharePoint Online environments, ensuring platform availability, performance, and security. - Manage, monitor,... 
    Minimum wage
    Full time
    Contract work
    Temporary work
    Work experience placement

    Maximus

    Durham, NC
    2 days ago
  •  ...JOB SUMMARY As a Senior Cloud Engineer, you will utilize your strong Cloud experience to develop solutions in support...  ...- Experience with Python, Golang, Node JS, Docker, Kubernetes. - Experience with Infrastructure as Code (CloudFormation, Terraform). - Experience... 

    Compunnel

    Durham, NC
    1 day ago
  • $43.4 per hour

     ...Cloud Engineer (Durham, NC area) W2 Only Required Skills: ~5+ years...  ...pipeline creation ~ Strong understanding of Infrastructure as Code with Terraform, CloudFormation...  ...with Python, Golang, Node.js, Docker, Kubernetes ~ Ability to build secure, scalable cloud... 
    Work experience placement

    Yoh

    Durham, NC
    2 days ago
  •  ...As Senior Cloud Engineer, you will utilize your strong Cloud experience to develop...  ...providing cloud native composite solutions, platforms, frameworks, and security...  ...with Python, Golang, Node JS, Docker, Kubernetes, Infrastructure as Code, CloudFormation, Terraform.... 

    Fidelity TalentSource

    Durham, NC
    3 days ago
  •  ...Senior Software DevOps Engineer Equity Technology is seeking a Senior Software DevOps Engineer to support DevOps patterns and practices...  ...of our CI/CD pipelines, increasing our footprint on a cloud infrastructure. You will innovate by helping define and implement our cloud... 
    Shift work

    Fidelity Investments

    Durham, NC
    16 hours ago
  •  ...The Opportunity: The Senior Cloud Engineer is responsible for architecture, design...  ...implementation, and management of the AWS cloud infrastructure for ACA Group ("ACA")'s software and...  ..., reliability, and security across our platforms. You'll contribute to building... 
    Work experience placement
    H1b
    Work at office
    Visa sponsorship
    2 days per week

    ACA Group

    Durham, NC
    5 days ago
  • IBM Computing is seeking early-career Software Developers to join their product engineering teams in Durham, North Carolina. In this role, you will contribute to designing, building, and delivering modern cloud-ready software as part of an agile team. The ideal candidate... 

    IBM Computing

    Durham, NC
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Kubernetes Platform Engineer - AI Infrastructure. Be the first to apply!