Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Kubernetes Platform Engineer - AI Infrastructure

$152.5k - $219.2k

Cisco Systems

The application window is expected to close on: 06/12/2026

Job posting may be removed earlier if the position is filled or if a sufficient number of applications are received .

Kubernetes Platform Engineer – AI Infrastructure - hybrid (2013055)

***hybrid role - requires some work activity to be on-site in San Jose CA office

Meet the Team

Join our Platform Engineering team to design, build, and operate large-scale, on-prem Kubernetes infrastructure powering next-generation AI/ML platforms, including GPU-enabled environments for traditional models and LLMs. You will lead the technical direction of scalable, reliable systems, managing the Kubernetes control plane and extending platform capabilities through custom controllers and operators. You’ll architect ML platforms, implement Infrastructure as Code with Golang, and drive MLOps best practices. Partnering closely with data scientists and ML engineers, you’ll enable high-performance AI workloads while leveraging AIOps for automation and reliability. This role requires strong hands-on on-prem Kubernetes experience and offers opportunities to mentor engineers and influence platform strategy in a hybrid environment.

Your Impact / Responsibilities as a Kubernetes Platform Engineer , you will:

  • Design, build, and operate large-scale on-prem Kubernetes platforms (OpenShift/Anthos), with ownership of control plane, etcd, and cluster lifecycle.

  • Architect scalable, multi-tenant platform infrastructure as the foundation for AI/ML and GenAI workloads.

  • Enable and optimize AI/ML workloads, including GPU-based environments for training, inference, and model deployment.

  • Partner with data scientists and ML engineers to onboard and scale ML pipelines and workflows.

  • Build platform capabilities using Kubernetes controllers, operators, CRDs, and Golang/Python services.

  • Implement Infrastructure as Code, automation, and AIOps-driven self-healing using platform telemetry and observability.

  • Ensure reliability through performance tuning (scheduling, resource utilization) and participate in on-call support and incident response.

Minimum Qualifications

  • 5+ years of software engineering experience, including supporting AI/ML or GPU-based workloads on Kubernetes platforms

  • 3+ years operating Kubernetes in production with control plane ownership, preferably in on-prem or self-managed environments

  • Strong experience with etcd management (backup, restore, recovery) and Kubernetes cluster upgrades

  • Proficiency in Go with experience building Kubernetes controllers/operators, CRDs, and webhooks

  • Deep understanding of Kubernetes internals (API server, scheduler, controller loops, reconciliation patterns)

  • Proven ability to debug and operate large-scale distributed systems in production environments, including participation in on-call rotations

Preferred Qualifications

  • Experience with bare-metal or on-prem infrastructure at scale

  • Experience enabling or supporting GPU-based workloads in Kubernetes environments

  • Familiarity with AI/ML platforms, pipelines, or tooling (e.g., model training, inference, or orchestration)

  • Experience building internal developer platforms or platform-as-a-service (PaaS) capabilities

  • Exposure to AIOps, including automation, anomaly detection, or self-healing systems

  • Experience applying statistical or ML techniques to operational data for reliability, performance, or capacity planning

Why Cisco?

At Cisco, we’re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.

Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you’ll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.

We are Cisco, and our power starts with you.

Message to applicants applying to work in the U.S. and/or Canada:

The starting salary range posted for this position is $152,500.00 to $219,200.00 and reflects the projected salary range for new hires in this position in U.S. and/or Canada locations, not including incentive compensation*, equity, or benefits.

Individual pay is determined by the candidate's hiring location, market conditions, job-related skillset, experience, qualifications, education, certifications, and/or training. The full salary range for certain locations is listed below. For locations not listed below, the recruiter can share more details about compensation for the role in your location during the hiring process.

U.S. employees are offered benefits, subject to Cisco’s plan eligibility rules, which include medical, dental and vision insurance, a 401(k) plan with a Cisco matching contribution, paid parental leave, short and long-term disability coverage, and basic life insurance. Please see the Cisco careers site to discover more benefits and perks. Employees may be eligible to receive grants of Cisco restricted stock units, which vest following continued employment with Cisco for defined periods of time.

U.S. employees are eligible for paid time away as described below, subject to Cisco’s policies:

  • 10 paid holidays per full calendar year, plus 1 floating holiday for non-exempt employees

  • 1 paid day off for employee’s birthday, paid year-end holiday shutdown, and 4 paid days off for personal wellness determined by Cisco

  • Non-exempt employees** receive 16 days of paid vacation time per full calendar year, accrued at rate of 4.92 hours per pay period for full-time employees

  • Exempt employees participate in Cisco’s flexible vacation time off program, which has no defined limit on how much vacation time eligible employees may use (subject to availability and some business limitations)

  • 80 hours of sick time off provided on hire date and each January 1st thereafter, and up to 80 hours of unused sick time carried forward from one calendar year to the next

  • Additional paid time away may be requested to deal with critical or emergency issues for family members

  • Optional 10 paid days per full calendar year to volunteer

For non-sales roles, employees are also eligible to earn annual bonuses subject to Cisco’s policies.

Employees on sales plans earn performance-based incentive pay on top of their base salary, which is split between quota and non-quota components, subject to the applicable Cisco plan. For quota-based incentive pay, Cisco typically pays as follows:

  • .75% of incentive target for each 1% of revenue attainment up to 50% of quota;

  • 1.5% of incentive target for each 1% of attainment between 50% and 75%;

  • 1% of incentive target for each 1% of attainment between 75% and 100%; and

  • Once performance exceeds 100% attainment, incentive rates are at or above 1% for each 1% of attainment with no cap on incentive compensation.

For non-quota-based sales performance elements such as strategic sales objectives, Cisco may pay 0% up to 125% of target. Cisco sales plans do not have a minimum threshold of performance for sales incentive compensation to be paid.

The applicable full salary ranges for this position, by specific state, are listed below:

New York City Metro Area:

$152,500.00 - $252,000.00

Non-Metro New York state & Washington state:

$135,800.00 - $224,400.00

  • For quota-based sales roles on Cisco’s sales plan, the ranges provided in this posting include base pay and sales target incentive compensation combined.

** Employees in Illinois, whether exempt or non-exempt, will participate in a unique time off program to meet local requirements.

Cisco is an Affirmative Action and Equal Opportunity Employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, national origin, genetic information, age, disability, veteran status, or any other legally protected basis.

Cisco will consider for employment, on a case by case basis, qualified applicants with arrest and conviction records.

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Kubernetes Platform Engineer - AI Infrastructure in San Jose, CA vacancy
  •  ...Platform Engineer (AI/LLM Infrastructure) Date: May 21, 2026 Location: Santa Clara, CA, US Company: NTT DATA Services NTT DATA Services...  ...infrastructure solutions. Architect and manage production-grade Kubernetes environments (AKS/EKS), including cluster operations... 
    Suggested
    Work at office
    Remote work
    Flexible hours

    Sierra Systems, An Ntt Data Company

    Santa Clara, CA
    4 days ago
  • $100k

     ...Infrastructure And Platform Engineer, Metal Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency...  ...focuses on designing and operating Kubernetes-based platforms on on-prem data centers... 
    Suggested

    Tenstorrent

    Santa Clara, CA
    2 days ago
  •  ...Platform Engineer (AI/LLM Infrastructure) Day to Day Job Duties: Lead the design, implementation, and operation of scalable infrastructure...  ...infrastructure solutions Architect and manage production-grade Kubernetes environments (AKS/EKS), including cluster operations... 
    Suggested
    3 days per week

    United Software Group

    Santa Clara, CA
    4 days ago
  •  ...and Inclusion. We weave AI into the fabric of...  ..., Secure Cloud and AI infrastructure is the foundation of our...  ...world-class Principal Engineer (Sr Manager-equivalent...  ...Cloud Infrastructure and Platform Engineering (CIPE)...  ...orchestration technologies (e.g Kubernetes) and CI/CD. ● Deep... 
    Suggested
    Full time
    Work at office
    3 days per week

    Palo Alto Networks

    Santa Clara, CA
    2 days ago
  • $152.5k - $219.2k

    Cisco Systems, Inc. is seeking a Kubernetes Platform Engineer in San Jose, CA to design and manage large-scale on-prem Kubernetes infrastructures. This hybrid role requires strong hands-on...  ...experience with Kubernetes, supporting AI/ML workloads, and implementing Infrastructure... 
    Suggested

    Cisco Systems, Inc.

    San Jose, CA
    8 hours ago
  •  ...next-generation computing experiences-from AI and data centers, to PCs, gaming and...  ...We are looking for a systems-minded engineer who lives at the intersection of large-scale...  ...focuses on post-training and inference infrastructure, with particular emphasis on P/D disaggregation... 

    Advanced Micro Devices , Inc.

    San Jose, CA
    18 hours ago
  •  ...generation of cloud-native ML infrastructure? We’re looking for a...  ...deep expertise in Kubernetes, Crossplane, Golang/...  ...design and scale the platforms that power Apple’s...  ...Description The AI, Search & Knowledge Platform...  ...across ML engineering, SRE, and platform teams... 

    Apple

    Cupertino, CA
    4 days ago
  • $147k - $237.5k

     ...Integrity, and Inclusion. We weave AI into the fabric of everything we do...  ...seeking a highly skilled and experienced Platform & Infrastructure Engineer to join our core infrastructure...  ...manage highly available and scalable Kubernetes platforms as a service for internal... 
    Full time
    Work at office

    Palo Alto Networks

    Santa Clara, CA
    8 days ago
  • $180k - $250k

     ...we build multi‑agent AI systems that automate...  ...business workflows across platforms like SAP, Salesforce,...  ...the Team The Cloud Infrastructure team builds and operates...  ...container orchestration (Kubernetes), networking, and...  ...Infrastructure / DevOps Engineer, you’ll design and own... 
    Contract work
    Work at office

    Cerebras

    San Jose, CA
    2 days ago
  •  ...This is a job that Jill, our AI Recruiter, is recruiting...  ...speak to Jack. Job Title: Kubernetes DevOps Engineer Company Description:...  ...Seed-stage MIT-founded AI infrastructure startup Job Description:...  ...the core architecture of a platform aiming to make distributed... 

    Jack and Jill AI

    San Jose, CA
    4 days ago
  • $85 - $90 per hour

     ...Senior Platform Devops Engineer- Architecture Immediate need for a talented...  ...our production environment infrastructure. Investigate and fix stability...  ...: ~ Key Skills: Kubernetes, Docker, Architecture in production...  ...agree to receive calls, AI-generated calls, text... 
    Contract work
    Local area
    Immediate start

    Pyramid Consulting

    Milpitas, CA
    2 days ago
  • $60 - $65 per hour

    Primary Skills: Kubernetes(Expert), CI/CD(Proficient), Cloud(AWS...  ...a proficient Search Platform DevOps Engineer to enhance our capacity within...  ...operations, deploying scalable infrastructures, and ensuring system...  ...end-to-end management of AI and ML services, enhancing... 
    Hourly pay
    Contract work

    Akraya

    San Jose, CA
    4 days ago
  •  ...components used in consumer electronics goods. Job Title: AI Infrastructure / Platform Engineer Duration: 6 Months Location: San Jose CA Job...  ...DevOps Engineering. Deep hands-on experience with Kubernetes and container orchestration at scale. Proven... 
    Full time
    Temporary work

    TekWissen LLC

    San Jose, CA
    20 days ago
  • $102.5k - $187.9k

    Ernst & Young Advisory Services Sdn Bhd is seeking a Kubernetes DevOps Engineer in San Jose, responsible for designing and managing containerized...  ...using Kubernetes and Azure, ensuring scalable and secure infrastructure. The ideal candidate has a bachelor's or master's in a... 

    Ernst & Young Advisory Services Sdn Bhd

    San Jose, CA
    1 day ago
  • $146.7k

     ...We are seeking a Principal Kubernetes DevOps Engineer who combines deep technical...  ...Cloud and Colocation (Colo) infrastructure that powers seamless...  ...systems to web, team chat and AI to uncover architectural or...  ...build the best collaboration platform for the enterprise, and today... 
    Casual work
    Work at office
    Remote work
    Worldwide

    Zoom

    San Jose, CA
    2 days ago
  • $102.5k - $187.9k

     ...advanced technologies in AI, automation, and data analytics...  ...key responsibilities As a Kubernetes DevOps Engineer, you are responsible for...  ...and orchestration platforms (Kubernetes) to ensure scalable...  ...scalable, secure, and automated infrastructure. Key responsibilities... 
    Work at office
    Flexible hours

    Ernst & Young Advisory Services Sdn Bhd

    San Jose, CA
    1 day ago
  •  ...Lead Cloud DevOps Platform Engineer Marlborough, MA, United States...  ...developers to provision compliant infrastructure (databases, clusters,...  ...Expert: Deep expertise in Kubernetes (designing and troubleshooting...  ...to refrain from using AI tools, such as generative AI... 
    Remote work

    Hologic

    Santa Clara, CA
    4 days ago
  • $165k - $242k

     ...Senior Platform Engineer II, Compute Services Livingston, NJ / New York...  ...is The Essential Cloud for AI™. Built for pioneers by...  ...CoreWeave combines superior infrastructure performance with deep technical...  ...Engineer to join our Kubernetes Infrastructure team. This role... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    2 days ago
  • $147.4k - $220.9k

     ...Platform Software Engineer, Infrastructure Services Every time someone downloads an iOS update, streams a show on Apple TV+, or use maps for direction...  ...Experience deploying and managing applications on Kubernetes using tools such as Helm, Pulumi, or Flux. Experience... 
    Worldwide
    Relocation

    Apple

    Sunnyvale, CA
    4 days ago
  • $158.4k - $294.1k

     ...Veeam is the Data and AI Trust Company, specializing in...  ...We are looking for a Senior Platform Engineer to join the Workload team within...  ...own critical observability infrastructure, drive incident response...  ...KQL, Query DSL ~ Azure Kubernetes Service (AKS), Azure... 
    Base plus commission
    Local area
    Worldwide

    Veeam Software

    San Jose, CA
    1 day ago
  •  ...Senior Cloud Platform Engineer San Jose, California, United States The era of pervasive AI has arrived. In this era, organizations...  ...implementing and supporting AI infrastructure in new regions, such as...  ...technologies (Docker, Kubernetes). ~ Deep understanding of... 
    Full time
    Temporary work
    Local area
    Immediate start
    Flexible hours
    Shift work

    SambaNova Systems

    San Jose, CA
    3 days ago
  •  ...artificial intelligence. Our no-code platform empowers every business team to harness the power of production-grade AI agents, without the need for specialized...  ...compliance. Job title Software Engineer - Platform Infrastructure Position overview We are looking... 
    Flexible hours

    Brevian.ai

    Sunnyvale, CA
    3 days ago
  • $170k - $215k

     ...Performance Computing (HPC) infrastructure to support large-...  ...) and downstream platforms, including managing Interface...  ...a cross-functional engineering multiplier: Leverage...  ...modern LLMs and AI based coding tools....  ...environments (Docker, Kubernetes). Archer is proud... 
    Local area
    Flexible hours

    Archer

    San Jose, CA
    4 days ago
  •  ...Overview: TENEX is an AI-native, automation-first, built...  ...upside. As a Head of Platform Engineering at TENEX, you will be a...  ...maintain our foundational infrastructure, data pipelines, and engineering...  ...containerization (Docker, Kubernetes) in a production environment... 

    Tenex.AI Inc

    San Jose, CA
    3 days ago
  •  ...Staff Platform Software Engineer It started with a simple idea: what if surgery...  ...unit and help drive AI developer enablement initiatives...  ...for services and infrastructure Implement automated testing...  ...and orchestration (Docker, Kubernetes) Understanding of authentication... 
    Local area
    Worldwide
    Flexible hours

    Intuitive

    Sunnyvale, CA
    18 hours ago
  • $156.8k - $229.7k

     .... As we ramp up investments in AI-driven product features, the Platform Engineering team is the engine room for this...  ...and scalable Cloud Infrastructure. You'll be architecting the entire...  ...microservices using GKE (Google Kubernetes Engine), Cloud SQL, and Cloud Build... 
    Full time

    GFiber

    Sunnyvale, CA
    7 days ago
  •  ...seeking a Senior Software Development Engineer to lead the design, implementation...  ...excellence of our cloud infrastructure. This role emphasizes platform engineering, where you will design...  ...experience in AWS, containerization, and Kubernetes, with a focus on robust platform... 

    Traveltechessentialist

    San Jose, CA
    8 hours ago
  • $199k - $278.5k

     ...looking for a Senior Software Development Engineer to lead the design, implementation, and operational excellence of cloud infrastructure. This role requires expertise in AWS and Kubernetes to build scalable, reliable platforms. With 8+ years of experience, candidates should... 

    PowerToFly

    San Jose, CA
    2 days ago
  • $199k - $278.5k

     ...is seeking a Senior Software Development Engineer in San Jose to lead the design and implementation of cloud infrastructure and platform capabilities. The role involves...  ...runtime platform using technologies like Kubernetes and CI/CD. Candidates must have strong cloud... 

    Expedia, Inc.

    San Jose, CA
    4 days ago
  • $125.2k - $181.6k

     ...About the team We’re a small Platform Engineering team responsible for the...  ...manage and optimize our cloud infrastructure, and drive operational...  ...secure, and scale cloud and Kubernetes environments. Implement and...  ...the fundamentals of Agentic AI as it applies to Platform Engineering... 
    Remote work

    QuantumScape Battery, Inc.

    San Jose, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Kubernetes Platform Engineer - AI Infrastructure. Be the first to apply!