HPC AI Systems Administrator Lead
Hewlett Packard Enterprise Development LP
HPC AI Systems Administrator Lead
This role has been designed as "Onsite" with an expectation that you will primarily work from an HPE office.
Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today's complex world. Our culture thrives on finding new and better ways to accelerate what's next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE.
The Data Center Administration team is seeking a Senior System Administrator to provide advanced system administration and lab operations support for hardware, network, and software environments used by HPE HPC & AI Performance Engineering teams. These environments support internal product development, performance engineering, ISV validation, and customer-facing sales and benchmarking activities. This role serves as a senior technical contributor and lab expert, providing design guidance, operational leadership, and escalation-level troubleshooting across complex HPC and AI lab environments. The position partners closely with engineering teams, infrastructure support groups, and external partners to ensure lab stability, availability, and effective use of resources. The Senior System Administrator contributes to continuous improvement of lab processes, policies, and standards, prioritizes lab requests, mentors junior staff, and supports future lab expansion and facility transitions.
Essential Job Functions and Duties
- Image, configure, and upgrade servers with Linux operating systems, including firmware updates and switch configuration to support lab environments.
- Configure and manage multiple root slots hosting varied operating system images in support of HPC cluster provisioning, validation, and testing workflows.
- Provide design guidance and operational support for virtualized lab infrastructure, including virtual server administration and the design of highly available, fault-tolerant environments.
- Provide design guidance for lab storage solutions, including installation, configuration, and performance management of high-performance storage systems (e.g., Lustre) to support sales, benchmarking, and partner activities.
- Provide guidance for hardware and software installation and configuration, including advanced hardware diagnostics and coordination with infrastructure support teams to resolve power, CPU, and GPU issues.
- Collaborate with AI benchmarking, R&D, and performance engineering teams to design and operate lab environments that meet internal, partner, and customer requirements.
- Design lab layouts, networks, and operational policies that meet functional needs while adhering to cybersecurity and asset protection standards.
- Prioritize and coordinate lab work activities to ensure timely delivery of high-impact requests and effective utilization of lab resources.
- Make recommendations on lab resource usage, capacity planning, and future expansion to support evolving business and engineering needs.
- Oversee and support lab transitions, including facility moves and infrastructure refresh activities.
- Install, configure, and support job scheduling and resource management tools to maximize lab utilization.
- Serve as a technical mentor to junior system administrators and lab staff, providing guidance on best practices, troubleshooting, and operational standards.
- Communicate lab successes, risks, failures, and issues to management in a timely and professional manner.
- Work effectively with remote administrators, vendors, and partners when specialized expertise or additional support is required.
Job-Specific Competencies
- Communication – Communicates clearly and effectively in both written and verbal forms; collaborates well with diverse technical teams.
- Creativity / Innovation – Applies creative problem-solving approaches and contributes to continuous improvement of lab processes and capabilities.
- Customer Service – Demonstrates a service-oriented mindset when supporting internal teams, partners, and stakeholders.
- Job Knowledge – Maintains deep technical knowledge of Linux systems, lab operations, and HPC/AI infrastructure.
- Problem Solving / Analysis – Breaks down complex technical issues, identifies root causes, and develops effective solutions.
- Quality – Demonstrates attention to detail, accuracy, and reliability.
- Technical Skills – Strong expertise in Linux system administration with working knowledge of networking, storage, virtualization, and hardware platforms.
Education and Experience
- Bachelor's degree in Computer Science, MIS, or a related technical field required mainly System Administration.
- Minimum of 8–10 years of Linux system administration experience required, preferably in HPC, AI, or lab-based environments.
- Candidates with strong Linux or network administration backgrounds and demonstrated interest in advanced lab system administration will also be considered.
- This role works as part of a team of system administrators and lab staff and reports to the Data Center Administration Manager.
- ...HPC AI Systems Administrator This role has been designed as "Onsite" with an expectation that you will primarily work from an HPE office. Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect...SuggestedPermanent employmentWork experience placementWork at officeLocal areaImmediate startRemote work
- ...generation. The ideal candidate has over 8 years of experience in HPC software development using C++, along with leadership skills and... ...competitive salary range and the opportunity to be part of cutting-edge AI and data center technologies. #J-18808-Ljbffr NVIDIA GruppeSuggested
- ...Infrastructure Storage Strategy team to provide groundbreaking fast storage solutions. You'll design distributed storage services for HPC and work on enhancing performance and cost-effectiveness in our cloud infrastructure. The position requires a Bachelor’s degree in...Suggested
$159.5k - $271.2k
...hands without us. KLA invents systems and solutions for the manufacturing... ...together with the world's leading technology providers to... ...workloads (real-time processing, AI/DL pipelines, high-throughput... ...deploying storage solutions in HPC or high-performance environments...SuggestedMinimum wageWork experience placementFlexible hours$129k - $161.27k
...Position Title: HPC System Administrator Position Type: Regular Hiring Range: $129,000 - $161,265 /annually; Compensation will... ...strong technical and organizational knowledge to plan and lead projects and working groups. 6. Service Delivery Work...SuggestedWork at officeLocal areaRemote workFlexible hoursShift work- NVIDIA Corporation in Santa Clara seeks a technical leader for the development of next-generation AI supercomputing systems. The role involves leading a team, defining system designs, and collaborating with various departments. Ideal candidates should have over 8 years...Remote job
$272k - $431.25k
NVIDIA is seeking a Developer Relations Manager in Santa Clara to lead strategic engagement in the U.S. Federal software ecosystem. The... ...possess over 15 years of experience, including 7 years specifically in AI/ML platforms. Responsibilities include developing technical...- Joseph J. Albanese, Inc. is looking for a highly skilled Senior Systems Administrator to manage and enhance our hybrid infrastructure in Santa... ...both office employees and distributed field teams. You will lead system upgrades, support compliance with security frameworks...Work at office
$120k
...seeking an extraordinary Senior AWS DevOp and System Administrator to join our team at eTrigue. Are you... ...your profession? Have you deployed AI? If so, we are looking for you! As a Senior... ...their channel partners. The eTrigue Lead Accelerator solution is turnkey, including...Full timeContract workPart timeH1bFlexible hours- ...System Administrator Ingrasys is a global leader in advanced technology and manufacturing, specializing in cutting-edge cloud infrastructure and AI computing solutions. As a subsidiary of Foxconn Technology Group, we're at the forefront of driving industry innovation...Work at office
- ...FEATURED OPPORTUNITY Our client is seeking a senior IAM Systems Administrator to support enterprise identity and access management... ...subject to INSPYR Solutions' Privacy Policy and INSPYR Solutions' AI and Automated Employment Decision Tool Policy: . By submitting...Local areaFlexible hours
- Advanced Micro Devices is looking for a Program Leader for AI platforms in Santa Clara, California or remote. This role involves driving... ...cross-functional initiatives to deliver next-generation AI and HPC solutions, working closely with customers, engineers, and...Remote job
- NVIDIA Gruppe, based in Santa Clara, is looking for a passionate Technical Product Manager to lead the GPUDirect Storage and cuFile products. This pivotal role involves collaboration with architects and engineers to define product roadmaps, engaging with customers to gather...
$100k - $170k
...for augmented reality (AR) and AI-powered smart glasses. Our... ...nanofabrication and epitaxy to system design - united by a vision to... ...motivated and hands-on Cloud Systems Administrator to join our growing team in... ..., and alerting solutions Lead infrastructure change management...Work at officeRemote work$120k - $190k
...the global adoption of safe, AI-driven machines. Founded in 20... ...the Vehicle OS, Self-Driving System, and toolchain to help customers... ...for an experienced GTM Systems Administrator to join our Revenue Strategy &... ...solutions that streamline lead routing, opportunity management...Full timeFor contractorsFor subcontractorCasual workWork at officeRemote workDay shift$129k - $161.27k
...institution in Santa Clara seeks a skilled IT professional to enhance HPC capabilities through training, develop infrastructure solutions,... .... Ideal candidates will have experience with Linux and Windows systems, and SAN storage environments. The role offers a salary range of...$70 - $78 per hour
...Global is looking for a talented Data Center System Admin to sit onsite with one of our... ...related field. Proven experience as a System Administrator or similar role in a high‑tech... ...Automation experience with python. Use of AI in work to make processes more efficient....- ...join our CSP Engagements team, focusing on system software for datacenter products such as... ...for GB200 and next‑gen platforms. Lead hardware bring‑up activities, BSP development... ..., and applications focusing on AI/ML and HPC workloads. Perform advanced system debugging...
- ...AI Program Management Lead This role has been designed as 'Hybrid' with an expectation that you will work on average 2 days per week from an HPE office. Who We Are: Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and...Work at officeShift work2 days per week
- ...Job Description Job Description An AI SaaS Client is looking for a hands-on Growth Lead to drive the launch and growth of an AI-powered SaaS product targeting SMBs. This is a highly execution-focused role where you’ll own the go-to-market strategy, user acquisition...
$171.8k - $277.93k
...Disruption, Collaboration, Execution, Integrity, and Inclusion. We weave AI into the fabric of everything we do and use it to augment the... ...critical business challenges to drive and support our industry-leading growth. Your efforts will directly affect the overall strategy...Full timeWork experience placementWork at officeVisa sponsorshipWork visa- ...AI Program Management Lead This role has been designed as 'Hybrid' with an expectation that you will work on average 2 days per week from an HPE office. Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help...Work experience placementWork at officeShift work2 days per week
- ...PMO Transformation Project Lead (IT Portfolio) Overview Seeking a hands-on PMO Transformation Project Lead to support an enterprise IT organization undergoing an AI-driven transformation. This role will assess the current PMO, define a future-state operating...Immediate start
$157k - $271.4k
...efficiency by synchronizing devices, data and AI driven insights in one place to simplify... ...We are recruiting for a Principal AI Lead within the Polyphonic® Applied AI and ML... ...with significant ownership of production systems (or equivalent experience). ~ Deep expertise...Local areaImmediate start$124.8k - $187.2k
...global scale, come make a difference at Fiserv. Job Title SDET System Integration Tester for Android We're Clover, the largest cloud... ...across diverse network environments (3G/Wi-Fi) while leveraging AI-assisted tools to accelerate defect detection and test coverage....Worldwide- A leading technology company in Santa Clara is seeking a Manager of Solutions... ...a team dedicated to NVIDIA-powered AI Factories, advising partners on AI/HPC projects, and ensuring proper infrastructure... ..., along with expertise in HPC systems and microservices architecture. #J-...Remote job
$118k - $179.04k
...Clara, California, United States of America Job Description: Johnson & Johnson RAD (Robotics and Digital) is seeking for a AI/ML Lead/ Surgical Robotics- OTTAVA, for our Santa Clara, Location. At Johnson & Johnson, we believe health is everything. Our strength...Temporary workLocal areaImmediate start$150k - $220k
...technology company in Sunnyvale, California is seeking an IT Systems Administrator/Desktop Support Engineer to manage and support the IT environment... ...00 annually, and benefits include comprehensive medical coverage and a flexible vacation policy. #J-18808-Ljbffr Lyte AI Inc.Flexible hours- ...Request ID: 21100-1 Job Title: IT - Lead II - Data Science Start/End Dates: 1/13/2026 - 7/12/2026 Location - Santa Clara,... ...Must Have Skills Skill 1 - Design, develop, and deploy Custom AI agents capable of autonomous decision-making and task execution...
$140.5k - $193k
...that literally connect our world - like AI and IoT. If you want to push the boundaries... ...while learning every day in a supportive leading global company. Visit our Careers website... ...of expertise in Costing and Pricing Systems, including CPQ platforms (SAP CPQ, Pricefx...Full timeContract workRelocation
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to HPC AI Systems Administrator Lead. Be the first to apply!
- IT infrastructure administrator San Jose, CA
- microsoft systems administrator San Jose, CA
- systems administrator San Jose, CA
- application system administrator San Jose, CA
- system admin San Jose, CA
- enterprise administrator San Jose, CA
- IT administrator San Jose, CA
- server administrator San Jose, CA
- computer systems administrator San Jose, CA
- remote systems administrator San Jose, CA


