HPC Storage Systems Group Leader
$203.5k - $248.74k
Berkeley Lab
Storage Systems Group Lead
The National Energy Research Scientific Computing Center (NERSC) is inviting applications for the position of Storage Systems Group (SSG) Lead. NERSC's mission is to accelerate scientific discovery through high performance computing and data analysis for the Department of Energy's (DOE) Office of Science programs. NERSC is searching for a knowledgeable and inspired group leader for the Storage Systems Group who will be responsible for developing NERSC's storage strategy based on NERSC's systems roadmap, science workflows and user needs. They will provide vision and guidance to design, operate and simplify the storage environment for NERSC's 11,000+ users.
The SSG is responsible for NERSC's storage portfolio, including large scale high capacity parallel file systems and archival storage systems with an eye towards balancing performance, stability, and usability for NERSC's users who operate in a wide variety of DOE mission areas and scientific domains. The SSG Lead provides technical leadership to a group of highly skilled storage engineers who collaborate with other teams at NERSC to deliver innovative solutions to complex problems and a technical vision for the future of NERSC storage platforms.
The NERSC storage environment that SSG is responsible for today is composed of multiple tiers:
- The NERSC hierarchical storage management system (presently High Performance Storage System (HPSS)) stores more than 450 PB of data for the scientific community and puts NERSC in the top 10 largest HPSS deployments globally.
- NERSC provides a large-scale parallel community file system (presently Storage Scale) with more than 150 PB of online storage to the user community on an RDMA over Converged Ethernet (RoCE) fabric.
- Home and common storage mounted via Storage Scale on several thousand nodes across NERSC.
In addition to the current environment, SSG will be responsible for the scratch and new quality-of-service storage systems in NERSC's latest GPU-based supercomputer, named Doudna, to be operationalized in 2027. Doudna will deliver a tenfold increase in computing power to NERSC users along with new capabilities. The new Doudna environment will support larger and higher-resolution data sets coming from new sensors, detectors, sequencers and telescopes from the scientific community, and these data sets will need to be managed, shared and stored.
The Storage Systems Group Lead is responsible for understanding existing and emerging requirements, and deploying storage solutions in collaboration with other NERSC teams to support NERSC's broad user base of today and tomorrow. In doing so, the SSG Lead will drive the development and implementation of a holistic storage strategy to support changing scientific workflows and new technologies as part of Doudna and future NERSC system roadmaps. To accomplish this, the SSG Lead will be responsible for investigating new storage technologies and engaging with the vendor community on future roadmaps. The SSG Lead will work with the Data Center Department Head to provide guidance and priorities for the group based on NERSC's strategic plan and its goals.
You will:
- Develop NERSC's storage strategy based on NERSC's systems roadmap, science workflows and user needs.
- Lead a team that procures, installs, manages, supports and monitors NERSC's large scale storage systems, including providing 24x7 support.
- Ensure NERSC's storage systems meet the needs of NERSC's 11,000+ users by providing high-performing, available, and usable systems.
- Work independently and as part of the Storage Systems Group to diagnose and fix storage problems, help analyze storage system issues, and develop and implement workarounds and/or patches for software bugs.
- Provide effective line management to a group of approximately 10 Computer Systems Engineers by hiring excellent staff and working closely with SSG staff members. Ensure staff are meeting goals, provide both positive and constructive feedback to staff and ensure all staff have career growth opportunities.
- Provide technical leadership for implementation and deployment efforts for storage system improvements that enhance task automation, reliability, stability, usability, performance, and security.
- Continuously evaluate new storage technologies and make recommendations on future storage strategy and directions for the center, including both parallel and hierarchical storage, that would create new capabilities and enhance storage and HPC system performance and usability.
- Work closely with other teams at NERSC to enable large-scale simulation, data analysis and AI applications to run on NERSC supercomputing and storage systems.
- Provide budgetary input and oversight for NERSC's storage systems.
- Lead or collaborate on efforts with other Department of Energy (DOE) Labs on future storage technologies, multi-lab storage efforts and other related topics.
- Present at conferences and talks to promote NERSC to other national labs and HPC sites.
- Create and develop a vision and strategy for the group and be a key part of NERSC's management team.
We are looking for:
- Bachelor's degree in Computer Science, Engineering, Applied Mathematics, Computational Science (or related fields) and current applicable systems support and engineering experience, plus a minimum of 3 years of experience managing a complex computer systems, storage or networking unit.
- Experience with storage technologies in a Linux environment, such as InfiniBand, RoCE, SAN/NAS, NFS, pNFS, hierarchical storage management systems (such as HPSS), Lustre, Storage Scale, VAST, and object stores.
- Prior experience with HPC applications, workflows and computational and storage systems.
- Experience in managing and supporting a 24/7 IT environment.
- Ability to mentor staff to increase their knowledge and skills.
- Deep and broad knowledge of storage technologies such as parallel filesystems (e.g. Storage Scale), hierarchical storage management (e.g. HPSS), distributed storage systems (e.g. VAST), and storage networking (e.g. InfiniBand or RoCE).
- Demonstrated ability to work independently as well as collaboratively in large projects, and contribute to an active intellectual environment.
- Ability to gather requirements from the scientific user community and turn requirements into system characteristics.
- Strong technical and collaboration skills needed to create and deploy innovative ways of allowing our diverse user base to effectively utilize the unique resources that NERSC provides.
- Understanding of how to balance technical solutions with user needs, and the initiative, tact and good judgment to develop solutions to problems.
- Excellent written and verbal communication skills.
Desired skills/knowledge:
- A Master's or PhD degree in related fields.
- Knowledge of object storage and non-volatile storage technologies.
- Experience administering and deploying storage systems of tens of petabytes (or greater) scale in an HPC environment.
We're here for the same mission, to bring science solutions to the world. Join our team and YOU will play a supporting role in our goal to address global challenges! Have a high level of impact and work for an organization associated with 17 Nobel Prizes!
Why join Berkeley Lab?
- Exceptional health and retirement benefits, including pension or 401K-style plans
- Opportunities to grow in your career - check out our Tuition Assistance Program
- A culture where you'll belong - we are invested in our teams!
- In addition to accruing vacation and sick time, we also have a Winter Holiday Shutdown every year.
- Parental bonding leave (for both mothers and fathers)
- Pet insurance
Additional information:
- Application date: Priority consideration will be given to candidates who apply by December 15, 2025. Applications will be accepted until the job posting is removed.
- Appointment type: This is a (full-time/part-time) career appointment, exempt (monthly paid) from overtime pay.
- Salary range: The expected salary for this position is $203,496 - $248,736, which fits into the full salary range of $180,876 - $305,268 depending upon the candidate's skills, knowledge, and abilities. This includes education, certifications, and years of experience.
- Background check: This position is subject to a background check. Any convictions will be evaluated to determine if they directly relate to the responsibilities and requirements of the position. Having a conviction history will not automatically disqualify an applicant from being considered for employment.
- Work modality: This position requires substantial on-site presence.


