Senior HPC Systems and Storage Engineer
$30 per hourUC San Diego
UCSD Layoff from Career Appointment : Apply by 4/13/26 for consideration with preference for rehire. All layoff applicants should contact their Employment Advisor.
Reassignment Applicants : Eligible Reassignment clients should contact their Disability Counselor for assistance.
Job posting will remain open until a suitable candidate has been identified.
DESCRIPTIONDEPARTMENT OVERVIEW:
The Mission of the San Diego Supercomputer Center is to translate innovation into practice. SDSC adopts and partners on innovations in industry and academia in the areas of software, hardware, computational and data sciences, and related areas, and translates them into cyberinfrastructure that solves practical problems across any and all scientific domains and societal endeavors. Cyberinfrastructure refers to an accessible, integrated network of high-performance computing, data, and networking resources and expertise, focused on accelerating scientific inquiry and discovery. With more than 250 employees and $30-50M of revenue a year, SDSC is a global leader in the design, development, and operations of cyberinfrastructure.
SDSC supports hundreds of multidisciplinary programs spanning a wide variety of domains, from earth sciences and biology to astrophysics, bioinformatics, and health IT. SDSC presently operates multiple large HPC systems ranging from a 120k x86 CPU core general purpose system to a system explicitly designed for Artificial Intelligence and Machine Learning, and a nationally distributed system open for all of academia to integrate with. SDSC offers research data services across the entire vertical stack from universally scalable storage to consulting services on FAIR, Big Data, and AI. SDSC offers a rich set of cloud services both on-premise, in the commercial cloud, and as hybrid services across both.
SDSC has three geographic scopes, a national scope supporting cyberinfrastructure for the entire US research and education community, a California scope with a special focus on convergence research that addresses the three dominant threats to CA: Drought, Fire, Earthquakes, and a campus scope focusing on advancing the global impact of SDSC by advancing the research objectives of the UC San Diego faculty, researchers, and students.
SDSC impacts researchers at scales from 1,000's to Millions. SDSC annually trains thousands of researchers in cyberinfrastructure tools and software, and supports thousands of individual researchers via Unix accounts on its large HPC systems. SDSC was a leader developing the Science Gateway concept, and continues to be a global leader in its evolution. SDSC operates multiple major such gateways with user communities ranging from the tens of thousands to the millions. SDSC's educational programs includes online courses that have been attended by more than a million students.
SDSC is committed to democratizing access to cyberinfrastructure across all of its geographic scopes. SDSC strives towards a culture that supports our employees to be their best, achieve their goals, and enjoy their lives, both professionally and personally.
SDSC's High-Performance Systems Group is responsible for and operates SDSC's high-performance computing clusters and related systems. The group operates large-scale compute and storage systems funded by the National Science Foundation (current ACCESS resources and previously via the XSEDE and TeraGrid programs), the UCSD campus (e.g., the Triton Shared Compute Cluster) and other entities; these systems support users from campus andnational communities across a broad range of scientific disciplines. The Group is part of SDSC's Data-Enabled Scientific Computing (DESC) Division.
The Data Enabled Scientific Computing (DESC) division within SDSC designs and jointly proposes with other SDSC researchers, supercomputing systems in response to tens of millions of dollars call-for-proposals from the National Science Foundation (NSF), various government organizations and UC entities; it responds to calls for proposals for cyberinfrastructure (CI) related research, solutions and support. DESC manages, operates and troubleshoots issues with advanced, leading edge, complex, multi-petaflop and multi-petabyte data intensive supercomputer systems, file systems (Lustre, Ceph etc.), interconnects (such as InfiniBand, NVLink, Slingshot, ethernet etc.) and CI projects housed at SDSC. Research leaders within DESC submit high performance computing (HPC), high throughput computing (HTC), AI, CI, data science, computational science, science gateways and scientific software research proposals and acquire funding from NSF, National Institutes of Health (NIH), Department of Energy (DOE), Department of Defense (DOD) and industry. DESC carries out supercomputing, CI, data science, computational science and scientific software research and development projects. This division provides consulting and user support to researchers and users from academia as well as collaborates with them and industrial users. DESC provides advanced computational science, CI and scientific software support for the national and UC user communities as a part of projects/machines such as the Expanse machine (a five-year ~$34-million project supported by the NSF and enables tens of thousands of users to use HPC, HTC and GPUs), Voyager machine (a five-year, ~$12-million project supported by the NSF and enables researchers to experiment with and use AI-focused hardware for scientific applications), PNRP project ( a five-year , ~$12-million project supported by the NSF and enables distributed computing with resources of GPUs, FPGAs and CPUs), Cosmos machine (a five-year ~$12-million project supported by the NSF and democratizes access to accelerated computing), the Triton Shared Compute Cluster (TSCC - which is a UCSD condo cluster for UCSD and external researchers and includes NIST 800 171 compliant computing), and the CloudBank project ( a five-year, ~$25-million project supported by the NSF to enable usage of commercial cloud resources). Various other funded CI research and development, and domain science (e.g. biochemistry, bioinformatics, cosmology etc.) and AI/ML projects are directed by DESC researchers. DESC staff and researchers are involved with the NSF funded Advanced Cyberinfrastructure Coordination Ecosystem: Services & Support (ACCESS) program. DESC is involved in various HPC/HTC/AI training, workshop, outreach, workforce development and K-12 student programs and associated NSF funded projects. This division stays current with HPC, HTC, accelerators, CI, computational science and scientific software research and technology trends and engages with supercomputer vendors (e.g. Dell, Supermicro, Intel, NVIDIA, AMD, IBM, Hewlett Packard Enterprise, Data Direct Network, Aeon Computing, Arista etc.) to remain current with future technologies utilized in supercomputer designs
POSTION OVERVIEWThe Senior HPC Systems and Storage Engineer will apply advanced systems and software integration concepts, and location or institutional objectives, to resolve highly complex issues where analysis of systems and software requires an in-depth evaluation of variable factors to resolve and implement medium to large projects of broad scope and complexity. They will regularly resolve highly complex business processes, system functionality, implementation issues and system and software integration issues where analysis of situations or data requires an in-depth evaluation of variable factors and select tools, methods, techniques and evaluation criteria to obtain results. They will also give technical presentations to associated team, other technical units and management as well as evaluate new technologies including performing moderate to complex cost / benefit analyses. They may lead a team of systems / infrastructure professionals.
As part of SDSC's High Performance Systems group, the Senior HPC Systems and Storage Engineer is responsible for designing, deploying, and operating SDSC HPC compute clusters and their associated storage systems, and for maintaining their performance, reliability, and availability at the national, state, and campus level. This role requires in-depth knowledge of HPC cluster architecture, Linux systems administration, and the integration of compute, storage, and network systems.
The incumbent contributes to the design, deployment, and operation of high-performance HPC systems and storage environments, including parallel file systems operating at scale (tens to hundreds of servers supporting thousands of clients) across high-speed networks such as Ethernet and InfiniBand. In collaboration with senior technical leadership, contributes to the architecture and implementation of scalable solutions at the cluster, data center, and campus levels.
They will also plan and execute system lifecycles, including deployment, upgrades, and decommissioning of HPC systems and storage services as well as contribute to technical planning and effort estimation for new deployments, proposals, and recharge-based services.
Additionally, the incumbent will evaluate and recommend improvements to tools and workflows, and participates in the selection and integration of new technologies and work with vendors and SDSC staff to benchmark and evaluate storage systems and cluster platforms, and maintains current knowledge of emerging technologies.
The incumbent will also develop advanced processes and scripts for system analysis, testing, and automation to improve operational efficiency, scalability, and reliability across compute, storage, and network systems and lead efforts to integrate monitoring and alerting, improving incident detection, response, and user communication, and coordinates across compute and storage platforms to ensure graceful handling of service degradation.
The incumbent will additionally oversee collaboration with SDSC security teams to implement best practices for system deployment, identity management, and software updates, promoting consistent security and maintenance across the environment as well as oversee development and maintenance of related documentation.
For more information, please visit:
QUALIFICATIONS-
Bachelor's degree in related area and / or equivalent experience / training.
Proven experience administering and supporting large-scale HPC clusters or other distributed POSIX (Linux) systems, including advanced knowledge of Linux system administration, primarily Red Hat and its derivatives (e.g., Rocky Linux).
-
Proven experience designing, deploying, and operating large-scale (petabyte-class) high-performance parallel and distributed file systems (e.g., Lustre, Ceph, BeeGFS, GPFS), as well as enterprise and local file systems (e.g., NFS, ZFS, ext4, XFS) in Linux-based environments, including troubleshooting and performance tuning.
Demonstrated experience with scripting and automation using languages such as Bash and Python; use of configuration management tools (e.g., Ansible, CFEngine); and version control systems (e.g., Git) to manage and maintain system configurations and infrastructure.
-
Advanced knowledge of HPC middleware stack including cluster management tools, job schedulers and resources managers. Examples include: Slurm, PBS, HPCM, and Bright Cluster Manager.
Demonstrated knowledge of TCP/IP networking, including sockets, VLANs, and firewalls.
-
Job offer is contingent upon satisfactory clearance based on Background Check results.
Occasional evenings and weekends may be required.
On-call rotation may be required.
Pay Transparency Act
Annual Full Pay Range: Unclassified - No data available (will be prorated if the appointment percentage is less than 100%)
Hourly Equivalent: Unclassified - No data available
Factors in determining the appropriate compensation for a role include experience, skills, knowledge, abilities, education, licensure and certifications, and other business and organizational needs. The Hiring Pay Scale referenced in the job posting is the budgeted salary or hourly range that the University reasonably expects to pay for this position. The Annual Full Pay Range may be broader than what the University anticipates to pay for this position, based on internal equity, budget, and collective bargaining agreements (when applicable).
$93k - $140k
...A leading sales enablement company based in San Diego is seeking a Finance Systems Administrator. This role requires expertise in NetSuite development and administration, along with a strong background in financial processes. The ideal candidate will manage scripts, workflows...Senior- ...For our Technical and Engineering Services Support bid concerning Airborne Networking and Advance Development (AN&AD), METI is currently looking for a Senior Systems Engineer. Please note that this position is contingent upon contract award. Duties Perform...SeniorContract work
- ...Systems Engineer (Senior) Department: PEOC41ESS Employment Type: Contract / Temp Location: PEOC41ESS-NAVWAR-SEAPORT-CA Description Systems Engineer (Senior) Overview The Senior Systems Engineer provides technical leadership and expertise in systems...SeniorContract workTemporary work
- ...Senior Systems Engineer Mount Indie seeking a Senior Systems Engineer to lead mission definition, architecture clarity, and integration risk reduction across complicated tactical communication systems. This role elevates systems engineering maturity and ensures predictable...Senior
$120k - $150k
...Simms Fishing Products LLC is looking for a Senior Firmware Engineer to design and develop real-time systems for cutting-edge camera-based sports equipment. Based in San Diego and transitioning to Carlsbad, the role involves collaboration with engineers, debugging of...Senior$150k - $180k
...A defense solutions company in San Diego is looking for a Senior Software/Systems Engineer. The ideal candidate will manage the engineering and development cycle for resilience in software and hardware solutions, requiring at least 8 years of experience, active TS/SCI...Senior$150k - $180k
...A technology solutions firm in California seeks a Senior Software / Systems Engineer to manage software and systems development for mission applications. Candidates must have over 8 years of experience, including systems engineering, DOD C2 familiarity, and proficiency...Senior$150k - $180k
...Precise Systems, Inc. is looking for a Senior Software / Systems Engineer to contribute significantly to mission applications supporting the Naval Operational Architecture. The position requires strong leadership skills and a dedication to rapid prototype development within...Senior- ...A leading technology solutions provider is seeking a Software Engineer to improve processes and procedures in the software development... ...strong understanding of testing procedures, actively participate in system testing, and provide timely status reports. This position...SeniorContract work
- ...Senior Systems Engineer Sentar is proud to be an employee-owned company, fostering a culture of empowerment, collaboration, and innovation. Sentar is dedicated to developing the critical talent that the connected world demands to create solutions to address the convergence...SeniorContract workTemporary workFor contractorsFlexible hours
$140k - $180k
...Job Type Full-time Description DEVSECOPS / Model-Based Systems Engineer CompQsoft is seeking a highly qualified DevSecOps, Developer, and Model-Based Systems Engineer to support advanced Navy and DoD programs requiring secure automation, modern software...SeniorFull timeShift work$140k - $160k
...them most. We are seeking a highly skilled and hands-on Senior Systems Engineer to design, implement, and support enterprise IT infrastructure... ...infrastructure (Windows/Linux servers, virtualization, storage) Manage hybrid environments spanning on-prem and cloud (...Senior$5,000 per month
...Senior Systems Engineer Imagine One Technology & Management, Ltd. is seeking several personnel to serve in the role of Senior Systems Engineer. These positions are contingent upon award of the associated work and will be performed in San Diego, California. The Senior...Senior$130k - $175k
...Senior RFID Systems Engineer San Diego, CA About Us E-commerce got real-time data infrastructure decades ago. Physical stores still have not. RADAR is changing that. RADAR is building the data infrastructure layer for the physical world, starting with retail....SeniorFlexible hours- ...Senior Mission Engineer Why choose between doing meaningful work and having a fulfilling life? At MITRE, you can have both. That's because... ...Department Summary MITRE Technology and Engineering's Systems Engineering Division delivers innovative, multidisciplinary...SeniorWork experience placementLocal area
$77.5k - $140.9k
...world. Job Title: CyberSecurity SIEM Engineer (Senior SDC) About the job At EY, you’... ...proliferation of social media, extensive data storage demands, stringent privacy laws, and... ...understanding of complex information systems, leveraging your expertise in the...SeniorWork experience placementSummer holidayFlexible hours$110k - $175k
...NKI is experiencing rapid growth, and we're looking for a full-time Systems Engineer to join our team and support this exciting expansion. At NKI, we're committed to investing in our employees' growth and development, helping them achieve their career goals. To learn...SeniorFull time- ...customer and market requirements into robust mechatronic and robotic system concepts that balance performance, reliability, usability, and... ...Act as a technical authority and mentor for interdisciplinary engineering teams during the development of complex mechatronic and...Senior
- ...Senior Systems Engineer San Diego, California We are seeking a hands-on senior engineer to help continue the development, care, feeding... ...troubleshooting enterprise container platform issues across compute, storage, configuration, access, and networking layers....SeniorContract workWork experience placementLocal area
$150k - $180k
...core competencies in Information Assurance, Cybersecurity and Systems Engineering. With offices on both the East and West coasts, an inviting... ..., opportunity abounds for the right individual! Senior CDS Systems Engineer - 26-015 - San Diego, CA AUSGAR Technologies...SeniorFull timeFor contractorsWork at officeImmediate startRemote work$120k - $160k
...Job Description Description SAIC is seeking a highly skilled and motivated Senior Systems Engineer to join our team in support of critical Department of Defense (DoD) Command, Control, Communications, Computers, and Intelligence (C4I) programs. The ideal...SeniorFor contractorsLocal area- ...A global consulting firm is seeking a Senior AI Native Engineer to revolutionize AI applications in business. This role involves researching and implementing scalable AI systems, delivering innovative solutions, and collaborating with a dynamic team. Candidates should...Senior
$120.8k - $210.3k
...scalable wind (onshore and offshore), solar, storage (battery and pumped storage hydro),... ...cases, geographic location. The Senior Project Engineer is responsible for leading and... ...and coordinate preliminary collection system, substation and transmission design, as...SeniorFor contractorsWork at officeLocal areaFlexible hours$120k - $150k
...Senior Systems Engineer (FT) ("Ingeniero Senior de Sistemas") Salary Range $120,000.00 - $150,000.00 Salary/year Level Experienced Position... ...maintain core infrastructure platforms, including virtualization, storage, identity, networking, and security systems. Design,...SeniorHourly payFull timeCasual workShift workNight shiftRotating shiftWeekend workAfternoon shift$135k
...Systems Engineer Location: San Diego, CA at our wonderful G2 Ops office and customer site Work Setting: In person, some remote opportunity... ...clearly to government stakeholders, including senior leaders The ideal candidate will have: Demonstrated...SeniorFull timeTemporary workWork at officeLocal areaFlexible hours$130k - $150k
...The Marlin Alliance, Inc. is seeking a Senior Software & Systems Engineer in San Diego, California. This role focuses on technical development and process optimization within a government environment. Candidates need extensive experience with C#, SQL, and technical documentation...Senior$110k - $170k
...protecting service members and civilians with intelligent systems. Its products include the V-BAT and X-BAT aircraft,... ...to larger Group 5 aircraft. We are seeking a Senior Autonomy Integration & Test Engineer who thrives as a multidisciplinary builder and...SeniorFull timeTemporary workPart timeWorldwide$125k - $140k
...Title: Senior SATCOM Systems Engineer Belong. Connect. Grow. with KBR! KBR’s National Security Solutions team provides high-end engineering and advanced technology solutions to our customers in the intelligence and national security communities. In this position...SeniorTemporary workLocal areaRelocation packageFlexible hours$128.9k - $219.1k
...started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast... ...Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster...SeniorPermanent employmentWork experience placementWork at officeRemote workFlexible hours- ...Overview Planned Systems International (PSI) is an enterprise IT services company that designs, builds, secures, and operates... ...Federal Government organizations. PSI is seeking an AMOC Senior Systems Engineer to provide full-time, on-site technical leadership...SeniorFull timeContract workTemporary workLocal areaImmediate startFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior HPC Systems and Storage Engineer. Be the first to apply!
- healthcare systems engineer Nacogdoches, TX
- operating system engineer Nacogdoches, TX
- sr systems engineer Nacogdoches, TX
- senior staff systems engineer Nacogdoches, TX
- senior linux systems engineer Nacogdoches, TX
- computer system validation engineer Nacogdoches, TX
- software system engineer Nacogdoches, TX
- operations support system engineer Nacogdoches, TX
- mission system engineer Nacogdoches, TX
- senior windows systems engineer Nacogdoches, TX

