Kubernetes Platforms System Engineer
Cadre5
Location: Knoxville, TN
Job Id: 525
# of Openings: 1

Founded in 1999 in the beautiful Smoky Mountains of East Tennessee, Cadre5 provides innovative technical solutions to our customers locally and nationally. Our Cadre5 Lab Partners division has partnered with The National Center for Computational Sciences (NCCS) at Oak Ridge National Laboratory (ORNL) to recruit a Kubernetes Platforms System Engineer. In this role, you will work in the Infrastructure team within the HPC Infrastructure and Networking group to support all activities of our supercomputer center.

The National Center for Computational Sciences (NCCS) at Oak Ridge National Lab (ORNL), which hosts several of the world’s most powerful computer systems, is seeking highly qualified individuals to play a key role in improving the security, performance, and reliability of the NCCS computing infrastructure, which supports multiple highly ranked Top500 supercomputers, including the world’s first exaflop system, Frontier.

ORNL delivers scientific discoveries and technical breakthroughs needed to realize solutions in energy and national security and provides economic benefit to the nation. This premier research institution, located near Knoxville in Oak Ridge, TN, addresses national needs through impactful research and world-leading research centers.

Please note: The first step in the interview process requires candidates to join a Microsoft Teams meeting with the video turned on.

This is a full-time, permanent position that can telecommute. Occasional travel to the Oak Ridge facility may be required.

Why Cadre5?
- Working with highly talented team members
- Excellent medical insurance, including employer-paid benefits

The Team:
As a Kubernetes Platform Engineer for the HPC Platform teams, you will work to support all activities of our supercomputer center.
Our primary platform is the OLCF Slate Service, built on Kubernetes and RKE2, which provides a container orchestration service for running critical operation applications and user-managed persistent applications that run alongside the OLCF Supercomputer systems and other OLCF managed HPC clusters.

Job Responsibilities:
- Work with the team to define and implement best practices and standards within the organization
- Keep the Kubernetes platform reliable, available, and fast
- Architect solutions that improve the reliability, scalability, performance, and efficiency of our services
- Respond to, investigate, and fix service issues all the way from bare metal through the OS to the application code
- Coordinate with vendors to resolve hardware and software problems
- Participate in an on-call rotation providing 24-hour, 7-day support and off-hours maintenance windows
- Work with users to help them use Kubernetes

Basic Qualifications:
- Bachelor’s degree in a scientific field and a minimum of 5-8 years of relevant experience. An equivalent combination of education and experience will be considered.
- Experience with Kubernetes as a cluster administrator for on-premises deployments
- Excellent interpersonal and communication skills, and the ability to work as part of a team
- Strong working knowledge of Linux systems fundamentals and networked computing environment concepts
- Experience with code reviews, code quality, CI/CD tooling, GitOps, and SCM (e.g. GitLab)
- Ability to identify requirements and to define, plan, and implement requisite solutions for small and medium projects
- Ability to develop and maintain programs and scripts that aid in the operation and automation of tasks using various shell and scripting languages (primarily bash, Python, and Go)
- Experience with on-call rotation
- The ability to obtain and maintain a Department of Energy "Q" clearance is required. This requires US Citizenship.

Preferred Qualifications:
- Bachelor’s degree in a scientific field and 8-10 years of relevant experience
- Subject matter expert in Kubernetes as a cluster administrator for bare metal, on-premises deployments
- Excellent interpersonal and communication skills, including the ability to communicate effectively with other teams and organizational leadership and to convey technical details to a non-technical or semi-technical audience
- Ability to identify requirements and to define, plan, and implement requisite solutions for large, organizationally impactful projects
- Self-driven, with the ability to work in a dynamic, loosely structured research & development environment
- Experience with RKE2 (nice to haves: Red Hat OpenShift and Talos)
- Experience with multi-cluster management tools for Kubernetes (e.g. Fleet) and container security tools (NeuVector, SCC, pod admission control)
- Experience managing image registries such as Quay or Harbor
- Experience using tools such as Prometheus, Nagios, and Grafana to monitor systems and metrics and to create dashboards
- Experience designing and implementing highly available systems/services
- Experience with Infrastructure-as-Code tooling such as Terraform, Helm, and Puppet
- Experience implementing systems-level security technologies (e.g. SELinux, Seccomp, Linux capabilities), and experience with DevSecOps and general security best practices
- Experience with AIOps and MLOps tooling (e.g. KServe, Kubeflow, vLLM, NVIDIA Enterprise AI, AMD Silo AI, ClearML, MLFlow)
- Experience using HPC hardware for Kubernetes (e.g. RDMA, DPUs, InfiniBand, many-core CPUs)
- Experience with declarative CI/CD tools such as ArgoCD
- Experience with workflow engines such as Apache Airflow or Argo Workflows
- Experience with infrastructure automation
- Cloud engineering experience with at least one cloud service provider
- Experience with reusable, automated workflows such as PagerDuty playbooks

Cadre5 offers excellent pay and benefits, including full medical, dental, and vision coverage, a 401K match, 15 days PTO, and 10 holidays.

Cadre5 is an equal opportunity employer. All qualified applicants, including individuals with disabilities and protected veterans, are encouraged to apply. Cadre5 is an E-Verify Employer.


