HPC Linux Storage Engineer

Oak Ridge National Laboratory

Requisition Id 15790

Overview

Oak Ridge National Laboratory (ORNL), home to some of the world's most powerful supercomputers, is seeking highly skilled professionals to support large-scale storage systems, high-speed parallel file systems, and archival solutions critical to advancing scientific discovery and innovation. As part of ORNL's leadership-class computing ecosystem, you will play a vital role in designing, deploying, optimizing, and maintaining infrastructure that powers cutting-edge research across diverse scientific domains.

This evergreen posting represents multiple opportunities across ORNL's high-performance computing (HPC) environment, supporting scalable, reliable, and secure computing and storage capabilities. Applications are reviewed on an ongoing basis as new positions become available to meet the dynamic needs of our world-class computing facility.

Job Duties and Responsibilities May Include:

  • Design and Management of Infrastructure: Architect, deploy, and manage large-scale storage systems and HPC platforms to support research, scientific, and enterprise workloads. Develop and implement solutions for structured, unstructured, and archival data storage, focusing on scalability, reliability, and performance.
  • Systems Analysis and Development: Apply systems analysis techniques to consult with users/customers, determine functional requirements, and design, test, or optimize storage and computational solutions tailored to their needs. Develop, document, and modify solutions, including system prototypes and automated workflows, to enhance operational efficiency.
  • Performance, Optimization, and Troubleshooting: Ensure the performance, availability, scalability, and security of diverse infrastructure environments. Diagnose and resolve complex operational challenges quickly and effectively, applying advanced performance optimization techniques for a wide range of workloads.
  • Collaboration and Best Practices: Work closely with stakeholders from research, technical, and operational teams to understand workflows, identify opportunities for improvement, and deliver effective solutions. Define, implement, and enforce best practices, standards, and procedures across projects and teams.
  • Automation and Innovation: Automate system configuration, provisioning, monitoring, and maintenance to reduce manual efforts and downtime. Evaluate emerging technologies and tools to continuously improve system capabilities, adapt to changing needs, and plan for future advancements.
  • Support and Maintenance: Support critical infrastructure through participation in a 24/7 on-call rotation and off-hours maintenance windows. Resolve hardware and software issues in coordination with vendors, ensuring minimal impact on operations.

Basic Qualifications

  • Bachelor's degree in computer science, engineering, information technology, or a related field; and at least 5 years of professional experience managing Linux/UNIX systems in heterogeneous environments. An equivalent combination of education and experience will be considered.
  • Demonstrated experience with high-performance computing (HPC) storage systems and enterprise storage platforms (e.g., Lustre, GPFS, BeeGFS, or WEKA).
  • Proficiency in scripting languages (e.g., Python, Bash, Perl) and configuration management/automation tools (e.g., Ansible, Puppet, Git).
  • Strong communication, collaboration, and problem-solving skills with the ability to design and implement solutions independently.

Preferred Qualifications

  • Active DOE Q, DoD Top Secret, or TS/SCI clearance.
  • Hands-on experience with HPC cluster technologies, including job schedulers (e.g., Slurm) and system deployment tools (e.g., Warewulf, PXE boot, Bright Cluster Manager).
  • Expertise in high-performance parallel file systems, tape library systems, and storage networking technologies (e.g., RAID, ZFS, NVMe-oF, InfiniBand).
  • Familiarity with performance monitoring tools (e.g., Grafana, Nagios), benchmarking systems, and I/O optimization techniques.
  • Experience with virtualization and containerization platforms (e.g., VMware, KVM, Podman, Apptainer).
  • Background in open source development, including submitting patches upstream, and building custom Linux packages (e.g., RPM for RHEL).
  • Demonstrated ability to troubleshoot and optimize high-performance storage, compute, and networking systems in HPC environments.
  • Experience documenting technical processes and contributing to complex technical projects in government, scientific, or highly technical settings.

Hybrid Eligibility

These positions are located in Oak Ridge, Tennessee and require onsite presence. We offer a flexible work environment that supports both the organization and the employee. A hybrid/onsite working arrangement may be available with this position, which provides flexibility to work periodically from your home, while reporting onsite to the Oak Ridge, Tennessee location on a weekly and regular basis.

Special Requirement

This position requires the ability to obtain and maintain clearance from the Department of Energy. As such, this position is a Workplace Substance Abuse Program (WSAP) testing-designated position. WSAP positions require passing a pre-placement drug test and participation in an ongoing random drug testing program.

About ORNL

As a U.S. Department of Energy (DOE) Office of Science national laboratory, ORNL has an impressive 80-year legacy of addressing the nation's most pressing challenges. Our team is made up of over 7,000 dedicated and innovative individuals! Our goal is to create an environment where a variety of perspectives and backgrounds are valued, ensuring ORNL is known as a top choice for employment. These principles are essential for supporting our broader mission to drive scientific breakthroughs and translate them into solutions for energy, environmental, and security challenges facing the nation.

Why Join Us

  • Work on the world's most powerful supercomputers, including Frontier, the first system to achieve exascale performance.
  • Enable breakthrough science in fields like fusion energy, climate modeling, AI, and national security.
  • Collaborate with diverse teams of scientists, engineers, and technologists from across the DOE complex and academia.
  • Grow your career in a mission-driven, innovation-focused environment with access to professional development and leadership opportunities.
  • Enjoy life in East Tennessee, with a thriving research community, scenic outdoor recreation, and a high quality of life.

This position will remain open for a minimum of 5 days, after which it will close when a qualified candidate is identified and/or hired.

We accept Word (.doc, .docx), Adobe (unsecured .pdf), Rich Text Format (.rtf), and HTML (.htm, .html) up to 5MB in size. Resumes from third party vendors will not be accepted; these resumes will be deleted and the candidates submitted will not be considered for employment.

ORNL is an equal opportunity employer. All qualified applicants, including individuals with disabilities and protected veterans, are encouraged to apply. UT-Battelle is an E-Verify employer.

Vacancy posted 3 days ago