Kubernetes Platform Engineer - AI Infrastructure
$152.5k - $219.2kCisco
The application window is expected to close on: 06/12/2026
Job posting may be removed earlier if the position is filled or if a sufficient number of applications are received .
Kubernetes Platform Engineer – AI Infrastructure - hybrid (2013055)
***hybrid role - requires some work activity to be on-site in San Jose CA office
Meet the Team
Join our Platform Engineering team to design, build, and operate large-scale, on-prem Kubernetes infrastructure powering next-generation AI/ML platforms, including GPU-enabled environments for traditional models and LLMs. You will lead the technical direction of scalable, reliable systems, managing the Kubernetes control plane and extending platform capabilities through custom controllers and operators. You’ll architect ML platforms, implement Infrastructure as Code with Golang, and drive MLOps best practices. Partnering closely with data scientists and ML engineers, you’ll enable high-performance AI workloads while leveraging AIOps for automation and reliability. This role requires strong hands-on on-prem Kubernetes experience and offers opportunities to mentor engineers and influence platform strategy in a hybrid environment.
Your Impact / Responsibilities as a Kubernetes Platform Engineer , you will:
Design, build, and operate large-scale on-prem Kubernetes platforms (OpenShift/Anthos), with ownership of control plane, etcd, and cluster lifecycle.
Architect scalable, multi-tenant platform infrastructure as the foundation for AI/ML and GenAI workloads.
Enable and optimize AI/ML workloads, including GPU-based environments for training, inference, and model deployment.
Partner with data scientists and ML engineers to onboard and scale ML pipelines and workflows.
Build platform capabilities using Kubernetes controllers, operators, CRDs, and Golang/Python services.
Implement Infrastructure as Code, automation, and AIOps-driven self-healing using platform telemetry and observability.
Ensure reliability through performance tuning (scheduling, resource utilization) and participate in on-call support and incident response.
Minimum Qualifications
5+ years of software engineering experience, including supporting AI/ML or GPU-based workloads on Kubernetes platforms
3+ years operating Kubernetes in production with control plane ownership, preferably in on-prem or self-managed environments
Strong experience with etcd management (backup, restore, recovery) and Kubernetes cluster upgrades
Proficiency in Go with experience building Kubernetes controllers/operators, CRDs, and webhooks
Deep understanding of Kubernetes internals (API server, scheduler, controller loops, reconciliation patterns)
Proven ability to debug and operate large-scale distributed systems in production environments, including participation in on-call rotations
Preferred Qualifications
Experience with bare-metal or on-prem infrastructure at scale
Experience enabling or supporting GPU-based workloads in Kubernetes environments
Familiarity with AI/ML platforms, pipelines, or tooling (e.g., model training, inference, or orchestration)
Experience building internal developer platforms or platform-as-a-service (PaaS) capabilities
Exposure to AIOps, including automation, anomaly detection, or self-healing systems
Experience applying statistical or ML techniques to operational data for reliability, performance, or capacity planning
Why Cisco?
At Cisco, we’re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.
Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you’ll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.
We are Cisco, and our power starts with you.
Message to applicants applying to work in the U.S. and/or Canada:
The starting salary range posted for this position is $152,500.00 to $219,200.00 and reflects the projected salary range for new hires in this position in U.S. and/or Canada locations, not including incentive compensation*, equity, or benefits.
Individual pay is determined by the candidate's hiring location, market conditions, job-related skillset, experience, qualifications, education, certifications, and/or training. The full salary range for certain locations is listed below. For locations not listed below, the recruiter can share more details about compensation for the role in your location during the hiring process.
U.S. employees are offered benefits, subject to Cisco’s plan eligibility rules, which include medical, dental and vision insurance, a 401(k) plan with a Cisco matching contribution, paid parental leave, short and long-term disability coverage, and basic life insurance. Please see the Cisco careers site to discover more benefits and perks. Employees may be eligible to receive grants of Cisco restricted stock units, which vest following continued employment with Cisco for defined periods of time.
U.S. employees are eligible for paid time away as described below, subject to Cisco’s policies:
10 paid holidays per full calendar year, plus 1 floating holiday for non-exempt employees
1 paid day off for employee’s birthday, paid year-end holiday shutdown, and 4 paid days off for personal wellness determined by Cisco
Non-exempt employees** receive 16 days of paid vacation time per full calendar year, accrued at rate of 4.92 hours per pay period for full-time employees
Exempt employees participate in Cisco’s flexible vacation time off program, which has no defined limit on how much vacation time eligible employees may use (subject to availability and some business limitations)
80 hours of sick time off provided on hire date and each January 1st thereafter, and up to 80 hours of unused sick time carried forward from one calendar year to the next
Additional paid time away may be requested to deal with critical or emergency issues for family members
Optional 10 paid days per full calendar year to volunteer
For non-sales roles, employees are also eligible to earn annual bonuses subject to Cisco’s policies.
Employees on sales plans earn performance-based incentive pay on top of their base salary, which is split between quota and non-quota components, subject to the applicable Cisco plan. For quota-based incentive pay, Cisco typically pays as follows:
.75% of incentive target for each 1% of revenue attainment up to 50% of quota;
1.5% of incentive target for each 1% of attainment between 50% and 75%;
1% of incentive target for each 1% of attainment between 75% and 100%; and
Once performance exceeds 100% attainment, incentive rates are at or above 1% for each 1% of attainment with no cap on incentive compensation.
For non-quota-based sales performance elements such as strategic sales objectives, Cisco may pay 0% up to 125% of target. Cisco sales plans do not have a minimum threshold of performance for sales incentive compensation to be paid.
The applicable full salary ranges for this position, by specific state, are listed below:
New York City Metro Area:
$152,500.00 - $252,000.00
Non-Metro New York state & Washington state:
$135,800.00 - $224,400.00
- For quota-based sales roles on Cisco’s sales plan, the ranges provided in this posting include base pay and sales target incentive compensation combined.
** Employees in Illinois, whether exempt or non-exempt, will participate in a unique time off program to meet local requirements.
Cisco is an Affirmative Action and Equal Opportunity Employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, national origin, genetic information, age, disability, veteran status, or any other legally protected basis.
Cisco will consider for employment, on a case by case basis, qualified applicants with arrest and conviction records.
- ...Senior Kubernetes Platform Engineer - AI/ML Infrastructure Join our Platform Engineering team to design, build, and operate large-scale, on-prem Kubernetes infrastructure powering next-generation AI/ML platforms, including GPU-enabled environments for both traditional...Suggested
$113.05k - $168.3k
...Software Engineer We are seeking a Software Engineer... ...scalable cloud-based platforms and services. This... ...performance of cloud infrastructure. The engineer will collaborate... ...platforms such as Kubernetes (including on-prem and... ...systems. Utilize modern AI/ML and Generative AI...Suggested$230k - $250k
...full-stack portfolio of AI-enabled, AI-ready, and... ..., tablets), infrastructure (server, storage, edge... ...scale out an Agentic AI platform that streamlines the creation... ...collaborate with AI engineers to deploy and scale AI... ...containerization and scaling using Kubernetes. ~ Proven experience...SuggestedFull timeWork at officeLocal areaWork from home3 days per week- ...Solutions is seeking a Senior Software Engineer - Platform Performance & Resilience that plays a key... ..., and cloud services. This role uses AI-enabled automation to validate and... .... ~ Experience deploying services in Kubernetes-based cloud environments. ~ Strong debugging...SuggestedWork at office
$152k - $241.5k
...our global services platform. At NVIDIA, you’ll keep... ...harness the power of AI to deliver groundbreaking... ...fabrics. Use IaC(Infrastructure‑as‑Code) and config management... ...using Slurm, LSF or Kubernetes clusters, including... ...Ruby. Mentored other engineers and influenced technical...SuggestedFull time- ...Principal Software Engineer - Credit Card Core Platforms Brazil, Belo Horizonte; Brazil, Campinas; Brazil... ...transformation: leveraging Generative AI to automate complex operational... ...solutions (cloud-based agents) to automate infrastructure maintenance and data migration....
- ...Carolina is seeking an experienced Automation Engineer to support the Army Edge Computing Capability project... ...designing automation frameworks, deploying Infrastructure as Code (IaC) pipelines, and troubleshooting Kubernetes clusters. Ideal candidates will have senior-...
$100.3k - $149.6k
...Entry Level Software Engineer in Cloud Storage Are you passionate... ...Linux, AWS, Azure, GCP, and Kubernetes Experience with SQL and document... ...CI) Familiarity with infrastructure as code (IaC) tools (e.g.,... ...and tools Experience with AI/ML frameworks like PyTorch or...Full timeInternship- ...announcement. Applicants may not receive notifications of referral status until the full 6-month eligibility period has elapsed. Positions filled with applications from this pool will be placed into positions where AI duties occupy the majority of the work performed....
- ...Building the world's leading AI-powered, cloud-native... ...looking for software engineers, at various levels, to... ...network data plane platform. You will play a... ...years of relevant cloud infrastructure/cloud networking experience... ...Experience with Kubernetes Networking Experience...Work experience placement
- ...We are looking for a DevOps Engineer to join the Technology Development & Infrastructure team. In this role, you will be... ...opportunity to contribute to high-impact platforms used by a large number of... ...). Experience implementing AI-driven anomaly detection, alert...
- ...experienced Software Verification Engineer to join our Air team - the... ...of the Air simulation platform by verifying that features are... ...protocols. Hands‑on experience with Kubernetes or other large‑scale... ...until April4,2026. NVIDIA uses AI tools in its recruiting processes...
- ...practices across software development, platform engineering, infrastructure, and security, with a strong... ...(e.g., AWS/Azure, CISSP, Security+, Kubernetes, or DevOps certifications). ITIL... ...We may use artificial intelligence (AI) tools to support parts of the hiring...For contractorsLocal area
- ...skilled and motivated Full-Stack Engineer to design, build, and... ...cost efficiency. Healthcare Platform Development Work... ...Collaborate on CI/CD pipelines and infrastructure supporting cloud-based... ...infrastructure tools (Docker, Kubernetes, Terraform). When applying...Remote work
- ...Description & Requirements Maximus is currently seeking a Cloud Platform Engineer. This is a remote position. Maximus is a trusted... ...enterprise or federal settings. - Proven experience with Infrastructure as Code (e.g., ARM templates, Bicep, Terraform) for...Minimum wageFull timeContract workTemporary workWork experience placementRemote work
- ...full-stack portfolio of AI-enabled, AI-ready, and... ..., tablets), infrastructure (server, storage, edge... ...OpenStack-based IaaS platform. This role will focus... ...- support for hybrid Kubernetes stack (e.g., Magnum, OpenShift... ...high-level guidance to engineering teams. ~ Collaborate...Full timeWork at officeLocal areaWork from home3 days per week
$137k - $200.5k
...stable cloud service on the WebEx platform, while developing and following... ...with Site Reliability Engineering (SRE) practices, including system... ...re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating...Permanent employmentFull timeTemporary workLocal areaFlexible hours- ...Position: Asset Mgmt - Senior DevOps Engineer Location: Durham, NC / Merrimack,... ...not looking for a purely Operations, Infrastructure, SRE, or Architect background. Required... ...and orchestrators like Docker & Kubernetes. ~ Understanding of different build...RelocationMonday to Friday
- ...reliable operation of Avalara's cloud platforms. We focus on engineering guardrails, developing event-driven... ...solutions and guardrails using infrastructure as code and cloud-native services... ...security automation Avalara is an AI-first Company AI is embedded in our...
- ...Cloud Security Engineer/Architect Tier One Technologies has an... ...virtualized environments. AI/ML Security Governance (Adversarial... ...ensuring that non-compliant infrastructure is automatically remediated... ...: Experience securing Kubernetes (EKS/AKS/GKE) and Docker environments...Permanent employmentContract workImmediate start
- ...industries, helping them shape their hybrid cloud and AI journeys. With support from our strategic partners,... ...Your role and responsibilities The Azure Security Engineer will support a large team of infrastructure, security and application team during migration of on...Worldwide
- ...DevOps Engineer / Linux and AWS / Onsite in Durham, NC A profitable... ...their SaaS based accounting platform, used by several enterprise... ...as AWS and Terraform for infrastructure as code, plus GitlabCI and... ...actions for CI/CD pipelines, Kubernetes for orchestration and do some...Permanent employmentFull time
- ...best practices in Resiliency Engineering, Automation, Observability and... ...supports solutions based on cloud platforms AWS/Azure and container orchestration Kubernetes.* Onboards /Evaluates New... ...cloud-based applications and infrastructure solutions, using DevOps or SRE...
- ...Specific Essential Duties and Responsibilities: - Provide Tier‑3 engineering support for Microsoft 365 GCC, Exchange Online, hybrid Exchange Server, and SharePoint Online environments, ensuring platform availability, performance, and security. - Manage, monitor,...Minimum wageFull timeContract workTemporary workWork experience placement
- ...JOB SUMMARY As a Senior Cloud Engineer, you will utilize your strong Cloud experience to develop solutions in support... ...- Experience with Python, Golang, Node JS, Docker, Kubernetes. - Experience with Infrastructure as Code (CloudFormation, Terraform). - Experience...
$43.4 per hour
...Cloud Engineer (Durham, NC area) W2 Only Required Skills: ~5+ years... ...pipeline creation ~ Strong understanding of Infrastructure as Code with Terraform, CloudFormation... ...with Python, Golang, Node.js, Docker, Kubernetes ~ Ability to build secure, scalable cloud...Work experience placement- ...As Senior Cloud Engineer, you will utilize your strong Cloud experience to develop... ...providing cloud native composite solutions, platforms, frameworks, and security... ...with Python, Golang, Node JS, Docker, Kubernetes, Infrastructure as Code, CloudFormation, Terraform....
- ...Senior Software DevOps Engineer Equity Technology is seeking a Senior Software DevOps Engineer to support DevOps patterns and practices... ...of our CI/CD pipelines, increasing our footprint on a cloud infrastructure. You will innovate by helping define and implement our cloud...Shift work
- ...The Opportunity: The Senior Cloud Engineer is responsible for architecture, design... ...implementation, and management of the AWS cloud infrastructure for ACA Group ("ACA")'s software and... ..., reliability, and security across our platforms. You'll contribute to building...Work experience placementH1bWork at officeVisa sponsorship2 days per week
- IBM Computing is seeking early-career Software Developers to join their product engineering teams in Durham, North Carolina. In this role, you will contribute to designing, building, and delivering modern cloud-ready software as part of an agile team. The ideal candidate...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Kubernetes Platform Engineer - AI Infrastructure. Be the first to apply!


