
Kubernetes Platforms System Engineer

Cadre5

Location: Knoxville, TN
Job Id: 525
# of Openings: 1

Founded in 1999 in the beautiful Smoky Mountains of East Tennessee, Cadre5 provides innovative technical solutions to our customers locally and nationally. Our Cadre5 Lab Partners division has partnered with the National Center for Computational Sciences (NCCS) at Oak Ridge National Laboratory (ORNL) to recruit a Kubernetes Platforms System Engineer. You will work in the Infrastructure team within the HPC Infrastructure and Networking group to support all activities of our supercomputer center.

NCCS, which hosts several of the world’s most powerful computer systems, is seeking highly qualified individuals to play a key role in improving the security, performance, and reliability of the NCCS computing infrastructure. That infrastructure supports multiple highly ranked Top500 supercomputers, including the world’s first exaflop system, Frontier.

ORNL delivers the scientific discoveries and technical breakthroughs needed to realize solutions in energy and national security, and provides economic benefit to the nation. This premier research institution, located near Knoxville in Oak Ridge, TN, addresses national needs through impactful research and world-leading research centers.

Please note: The first step in the interview process requires candidates to join a Microsoft Teams meeting with the video turned on.

This is a full-time, permanent position with the option to telecommute. Occasional travel to the Oak Ridge facility may be required.

Why Cadre5?
  • Working with highly talented team members
  • Excellent medical insurance, including employer-paid benefits

The Team: As a Kubernetes Platform Engineer for the HPC Platform teams, you will work to support all activities of our supercomputer center.
Our primary platform is the OLCF Slate Service, built on Kubernetes and RKE2. It provides a container orchestration service for critical operational applications and user-managed persistent applications that run alongside the OLCF supercomputer systems and other OLCF-managed HPC clusters.

Job Responsibilities:
  • Work with the team to define and implement best practices and standards within the organization
  • Keep the Kubernetes platform reliable, available, and fast
  • Architect solutions that improve the reliability, scalability, performance, and efficiency of our services
  • Respond to, investigate, and fix service issues all the way from bare metal through the OS to the application code
  • Coordinate with vendors to resolve hardware and software problems
  • Participate in an on-call rotation providing 24-hour, 7-day support and off-hours maintenance windows
  • Work with users to help them use Kubernetes

Basic Qualifications:
  • Bachelor’s degree in a scientific field and a minimum of 5-8 years of relevant experience. An equivalent combination of education and experience will be considered.
  • Experience with Kubernetes as a cluster administrator for on-premises deployments
  • Excellent interpersonal and communication skills, and the ability to work as part of a team
  • Strong working knowledge of Linux systems fundamentals and networked computing environment concepts
  • Experience with code reviews, code quality, CI/CD tooling, GitOps, and SCM (e.g. GitLab)
  • Ability to identify requirements and to define, plan, and implement requisite solutions for small and medium projects
  • Ability to develop and maintain programs and scripts that aid in the operation and automation of tasks using various shell and scripting languages (primarily bash, Python, and Go)
  • Experience with on-call rotation
  • The ability to obtain and maintain a Department of Energy "Q" clearance is required. This requires US citizenship.
Preferred Qualifications:
  • Bachelor’s degree in a scientific field and 8-10 years of relevant experience
  • Subject matter expert in Kubernetes as a cluster administrator for bare metal, on-premises deployments
  • Excellent interpersonal and communication skills: able to communicate effectively with other teams and organizational leadership, and to convey technical details to a non- or semi-technical audience
  • Ability to identify requirements and to define, plan, and implement requisite solutions for large, organizationally impactful projects
  • Self-driven, with the ability to work in a dynamic, loosely structured research & development environment
  • Experience with RKE2 (nice to have: Red Hat OpenShift and Talos)
  • Experience with multi-cluster management tools for Kubernetes (e.g. Fleet) and container security tools (NeuVector, SCC, pod admission control)
  • Experience managing image registries such as Quay or Harbor
  • Experience using tools such as Prometheus, Nagios, and Grafana to monitor systems and metrics and to create dashboards
  • Experience designing and implementing highly available systems and services
  • Experience with Infrastructure-as-Code tooling such as Terraform, Helm, and Puppet
  • Experience implementing systems-level security technologies (e.g. SELinux, Seccomp, Linux capabilities), experience with DevSecOps, and general security best practices
  • Experience with AIOps and MLOps tooling (e.g. KServe, Kubeflow, vLLM, NVIDIA Enterprise AI, AMD Silo AI, ClearML, MLflow)
  • Experience using HPC hardware for Kubernetes (e.g. RDMA, DPUs, InfiniBand, many-core CPUs)
  • Experience with declarative CI/CD tools such as ArgoCD
  • Experience with workflow engines such as Apache Airflow or Argo Workflows
  • Experience with infrastructure automation
  • Cloud engineering experience with at least one cloud service provider
  • Experience with reusable, automated workflows such as PagerDuty playbooks

Cadre5 offers excellent pay and benefits, including full medical, dental, and vision coverage, a 401(k) match, 15 days of PTO, and 10 holidays.

Cadre5 is an equal opportunity employer. All qualified applicants, including individuals with disabilities and protected veterans, are encouraged to apply. Cadre5 is an E-Verify Employer.

Vacancy posted 2 days ago