Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Director, HPC Infrastructure Engineering

$230.2k - $316.6k

Dormont Manufacturing Company

Company Description Guardant Health is a leading precision oncology company focused on guarding wellness and giving every person more time free from cancer. Founded in 2012, Guardant is transforming patient care and accelerating new cancer therapies by providing critical insights into what drives disease through its advanced blood and tissue tests, real-world data and AI analytics. Guardant tests help improve outcomes across all stages of care, including screening to find cancer early, monitoring for recurrence in early-stage cancer, and treatment selection for patients with advanced cancer. For more information, visit guardanthealth.com and follow the company on LinkedIn , X (Twitter) and Facebook . Guardant Health’s High-Performance Computing team (HPC) builds and operates the computational technology infrastructure backbone of the company. This includes scalable data storage that holds petabytes of genomics data, high performance compute clusters running a custom bioinformatics pipeline in production and R&D environments, and the softwareinfrastructurethat hosts an ecosystem of services for internal data processing and external data integration.To facilitateGuardantHealth’s fast growth in the next few years, the HPC team is seeking a strong technical engineering leader who can help maintain and grow the HPC infrastructure during its expansion, while partnering with other engineering functions (Corporate IT, SQA and DevOps/SRE) as well as the R&D user community and Lab Operations. This is a hands-on technical leadership position that will leverage your expertise in HPC environments, as well as your experience leading and managing a team. Role: Director, HPC Infrastructure Engineering Location: Preference is given to candidates located in the San Francisco Bay Area with the ability to work onsite in Redwood City and Palo Alto; however, the role offers partial remote flexibility. Onsite presence is required during rotational coverage, scheduled maintenance windows, and cluster deployment activities . In this role, you will primarily lead an engineering team to: Oversee and manage the HPC environment — compute, storage, network, physical infrastructure, and software — serving multiple Production and Development clusters Integrate HPC systems with on-prem and cloud-based systems and data sources as required Administer multiple HPC clusters and associated cluster file systems Research, design, and implement next-generation HPC solutions Diagnose and resolve production system stack issues, leveraging software utilities down to the source code level (e.g., shell scripts, Python, etc.) Maintain and monitor infrastructure and facilities to ensure operational stability Drive continuous improvement initiatives to enhance reliability and performance as workloads and data volumes scale Ensure control, integrity, and accessibility across systems and applications serving multiple concurrent users Provide operational oversight for systems at remote and international locations Collaborate with offsite consultants to sustain and optimize infrastructure performance Partner with vendors to procure, troubleshoot, upgrade, repair, and replace systems as needed Foster a culture of continuous engineering improvement through design and architecture review, mentoring, feedback, and development and monitoring of key performance metrics Hire, coach, and mentor individuals; build a strong cross-functional organization Partner with a diverse customer base to understand requirements, priorities, and processes Propose and implement new projects or recommend system improvements Observe Quality standards appropriate for an FDA governed and CLIA/CAP compliant diagnostic laboratory Manage budgets to balance refresh of obsolete equipment and software, scaling to support company growth, utilizing fixed headcount and contractor/consulting resources Participate in a 24/7 on-call rotation Required: B.S. in Computer Science or related technical field or equivalent experience 10 years’ experience with high performance computing platforms, preferably organizations handling large volumes of sequenced genomic data, within a commercial enterprise Experience with software-defined Infrastructure and cloud computing - Google Cloud Platform, Amazon Web Service (AWS) etc GPUs and Petabyte scale Storage platforms management experience Design, deployment, support and troubleshooting experience, in a complex computing environment HPC Engineering team management experience (either directly or in a matrixed environment) 4+ years of networking experience with certification of CCNA or better 4+ years of Linux/Unix system administration, knowledge of Unix network protocols, TCP/IP, coreinfrastructuretechnologies and virtualization 2+ years of large-scale data storage and compute clusters (HPC)infrastructure 2+ years working in and with on-premise and cloud-based (AWS, Google, IBM and Azure) data-centers 2+ years of building software release and ops processes and automation toolset 2+ years providing documentation of system administration Preferred: Proficiency with Arista and compatible networking , up to and including 400 Gb/s links Hands-on administration of IBM’s General Parallel File System Operational oversight of Slurm scheduler Working knowledge of cloud bursting technologies Familiarity with wide area file systems Practical expertise in Docker and container technologies Working experience with Kubernetes Operation of infrastructure compliant with HIPAA and SOX standards Success Profile: Excels in agile, high-velocity technical environments. Demonstrates self-leadership and a commitment to advancing both individual and team expertise. Combines engineering rigor with pragmatic adaptability. Successfully manages operational SLAs while leading initiatives critical to business growth. Hybrid Work Model: This section is applicable to onsite employees who are eligible for hybrid work location as specified by management and related policies. Guardant has defined days for in-person/onsite collaboration and work-from-home days for individual-focused time. All U.S. employees who live within 50 miles of a Guardant facility will be required to be onsite on Mondays, Tuesdays, and Thursdays. We have found aligning our scheduled in-office days allows our teams to do the best work and creates the focused thinking time our innovative work requires. At Guardant, our work model has created flexibility for better work-life balance while keeping teams connected to advance our science for our patients. The annualized base salary ranges for the primary location and any additional locations are listed below. This range does not include benefits or, if applicable, bonus, commission, or equity. Each candidate’s compensation offer will be based on multiple factors including, but not limited to, geography, experience, education, job-related skills, job duties, and business need.Primary Location: Palo Alto, CAPrimary Location Base Pay Range: $230,200 - $316,600Other US Location(s) Base Pay Range: $195,700 - $269,100If the role is performed in Colorado, the pay range for this job is: $207,200 - $284,950 Employee may be required to lift routine office supplies and use office equipment.Majority of the work is performed in a desk/office environment; however, there may be exposure to high noise levels, fumes, and biohazard material in the laboratory environment.Ability to sit for extended periods of time. Guardant Health is committed to providing reasonable accommodations in our hiring processes for candidates with disabilities, long-term conditions, mental health conditions, or sincerely held religious beliefs. If you need support, please reach out to View email address on click.appcast.io A background screening including criminal history is required for this role. GH will consider qualified applicants with criminal arrest or conviction histories in a manner consistent with applicable law including but not limited to the LA County Fair Chance Policies and the Fair Chance Act (Gov. Code Section 12952). Guardant Health is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or protected veteran status and will not be discriminated against on the basis of disability. All your information will be kept confidential according to EEO guidelines. To learn more about the information collected when you apply for a position at Guardant Health, Inc. and how it is used, please review our Privacy Notice for Job Applicants . Please visit our career page at: #J-18808-Ljbffr

Vacancy posted 23 hours ago
Similar jobs that could be interesting for youBased on the Director, HPC Infrastructure Engineering in Palo Alto, CA vacancy
  • $230.2k - $316.6k

     ...Guardant Health is seeking a Director of HPC Infrastructure Engineering to lead their engineering team in Palo Alto, CA. The role oversees HPC environments, ensuring operational stability and performance. Candidates are preferred from the Bay Area but partial remote work... 
    Suggested
    Remote work

    Dormont Manufacturing Company

    Palo Alto, CA
    23 hours ago
  • $230.2k - $316.6k

     ...High-Performance Computing team (HPC) builds and operates the computational technology infrastructure backbone of the company.This...  ...is seeking a strong technical engineering leader who can help maintain...  ...leading and managing a team.Role: Director, HPC Infrastructure... 
    Suggested
    For contractors
    Work experience placement
    Work at office
    Remote work
    Work from home

    Guardant Health

    Palo Alto, CA
    22 hours ago
  •  ...A leading precision oncology company is looking for a Director of HPC Infrastructure Engineering to oversee the company's computing environment and storage systems. The role involves managing a technical team, driving improvements to enhance performance, and ensuring... 
    Suggested

    Guardant Health

    Palo Alto, CA
    22 hours ago
  • $149.4k - $205.4k

     ...Staff HPC Infrastructure Engineer page is loaded## Staff HPC Infrastructure Engineerlocations: Palo Alto, CAtime type: Full timeposted on: Posted Yesterdayjob requisition id: R-100428**Company Description**Guardant Health is a leading precision oncology company focused... 
    Suggested
    Work at office
    Remote work
    Work from home
    Flexible hours

    Guardant Health

    Palo Alto, CA
    22 hours ago
  • $173k - $237.95k

     ...Guardant Health is seeking a technical lead for managing HPC infrastructure in Palo Alto, CA. Ideal candidates will have strong backgrounds in TCP/IP networking and Linux administration. The role encompasses the integration of complex systems and improving computational... 
    Suggested
    Work at office

    Dormont Manufacturing Company

    Palo Alto, CA
    22 hours ago
  •  ...Network Engineer - AI/HPC Memphis, TN; Palo Alto, CA About XAI XAI's mission is to create AI systems that can accurately understand...  ...that will allow us to seamlessly build-out new GPU infrastructure with little to no engineering assistance. There will be... 

    Xai

    Palo Alto, CA
    4 days ago
  •  ...Product Manager - HPC & Software (Infrastructure & Storage) Hybrid (2 days on-site) Locations: San Francisco, Bay Area, Houston, Colorado, Minnesota...  ..., and datacenter efficiency reporting. Partner with engineering to align software architecture and release sequencing... 
    Work experience placement

    Salt

    Santa Clara, CA
    21 hours ago
  • $163k - $237k

    Technical Program Manager III, Infrastructure Engineering, YouTube Location: Mountain View, CA, USA Apply Requirements Bachelor's degree in a technical field, or equivalent practical experience. 5 years of experience in program management. Experience building, automating... 
    Full time

    Google Inc.

    Mountain View, CA
    21 hours ago
  • $181k - $297k

     ...will be based in Mountain View, CA. We are seeking an HPC Network Engineer to design, deploy, and operate high-performance, low-latency...  ...benchmarking and profiling tools. Experience with infrastructure automation or configuration management tools. Demonstrated... 
    For contractors
    Work at office
    Flexible hours

    LinkedIn

    Mountain View, CA
    1 day ago
  • $120k - $250k

    What MatX Is Building We're a small engineering team designing a custom chip. The work is...  ...engineers depend on every day. The infrastructure that supports all of this — CI/CD, compute...  ...batch compute or job schedulers — HPC, Slurm, Nomad, Kubernetes batch, or similar... 
    Full time
    Work experience placement
    Work at office
    Local area
    Remote work
    Monday to Friday
    Flexible hours
    Day shift
    3 days per week

    MatX

    Mountain View, CA
    3 days ago
  •  ...Senior Account Director – Hyperscalers (AI Infrastructure) Mountain View, California, United States About...  ...infrastructure, data center planning, capacity engineering, and compute procurement teams Act...  ...infrastructure, data centers, HPC, or GPU compute environments... 
    Work at office

    Glint Tech Solutions LLC

    Mountain View, CA
    5 days ago
  • $220.32k - $311.04k

     ...adoption metrics to inform strategic plans. - Collaborate with engineering teams to represent customer and market needs in product...  ...for diverse needs across general-purpose compute, web services, HPC, and AI-accelerated systems. Our charter encompasses defining business... 
    Local area
    Immediate start
    Shift work

    Intel

    Santa Clara, CA
    1 day ago
  •  ...Federal InuTeq is seeking a highly capable Infrastructure Manager in Mountain View, CA to support...  ..., and reporting metrics Partner with HPC operation team, security, and...  ...request processes Strategic Planning & Engineering Oversight Provide technical and strategic... 

    ASRC Federal Holding Company

    Mountain View, CA
    22 hours ago
  •  ...AV efforts.We’re proud to serve as the infrastructure platform for teams developing autonomous...  ...are seeking a Senior ML Infrastructure engineer to help build and scale robust Compute platforms...  ...with high performance computing (HPC). Experience working with or designing interfaces... 

    General Motors

    Mountain View, CA
    22 hours ago
  •  ...A cutting-edge robotics company based in California is looking for an experienced Machine Learning Infrastructure Engineer. This role involves designing scalable ML training platforms, optimizing high-performance computing systems, and ensuring robust job scheduling and... 

    Dyna Robotics

    Redwood City, CA
    22 hours ago
  • $129k - $161.27k

    An academic institution in Santa Clara seeks a skilled IT professional to enhance HPC capabilities through training, develop infrastructure solutions, and ensure operational resilience. The position requires strong technical expertise, project management skills, and the... 

    Santa Clara University

    Santa Clara, CA
    21 hours ago
  • $214.24k - $310.03k

     ...hardware-software integration, and test automation infrastructure. Serving as the solid-line leader for engineers, the Sr Manager fosters engagement, development,...  ...bootloaders, OS/RTOS services) Experience with ECU and HPC bring-up, silicon validation, and production... 
    Permanent employment
    Temporary work
    Shift work

    CARIAD, Inc.

    Mountain View, CA
    4 days ago
  • $200k - $268k

     ...LinkedIn office on select days, as determined by the business needs of the team. We're looking for an Engineering Manager to lead two high-impact infrastructure teams shaping the future of service connectivity at LinkedIn. This role spans LinkedIn's next-generation... 
    For contractors
    Work at office
    Flexible hours

    LinkedIn

    Mountain View, CA
    3 days ago
  • Google Inc. is looking for a Software Engineering Manager for Health and Home Infrastructure in Mountain View, CA. This role requires strong leadership to manage teams across multiple locations, focusing on optimizing software performance and reliability for user experiences... 

    Google Inc.

    Mountain View, CA
    2 days ago
  • $224k - $356.5k

     ...to lead our CPU marketing strategy in Santa Clara. This pivotal role will require collaboration with engineering teams to highlight NVIDIA’s leadership in AI and HPC through effective storytelling and asset creation. The ideal candidate will have over 7 years of experience... 

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $90k - $110k

     ...startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate...  ...are seeking a dedicated and detail-oriented Operations Engineer to join our HPC Networking Team. HPC Networking at CoreWeave is tasked... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    18 days ago
  • $160k - $230k

     ...observability platform built on the Snowflake Data Cloud and engineered for scale. We ingest and store logs, metrics, traces, and events...  ...to root cause and resolution significantly faster. The Infrastructure team at Observe by Snowflake is responsible for building, scaling... 
    Temporary work
    Flexible hours

    Snowflake Computing

    Menlo Park, CA
    2 days ago
  •  ...Overview Sycamore is building the infrastructure that makes autonomous AI agents reliable, secure, and enterprise‑ready. Responsibilities Own systems at the intersection of agent isolation, networking, storage, orchestration, and enterprise integration. Design and implement... 

    Sycamore Labs, Inc.

    Palo Alto, CA
    21 hours ago
  •  ...Company Description It all started when engineer Fred Luddy wrote code that automated a tedious task for his coworker, Phyllis...  ...product-specific search solutions to a composable, agent-native infrastructure foundation that agents and applications build on to locate,... 
    Full time
    Work at office
    Immediate start
    Remote work
    Flexible hours
    Shift work

    ServiceNow

    Mountain View, CA
    2 days ago
  • $75 - $85 per hour

     ...Our client, a leading organization in the technology and research sector, is seeking a Lab Infrastructure Engineer to join their team. As a Lab Infrastructure Engineer, you will be part of the Infrastructure Support Department supporting the Network Operations and Data... 
    Weekly pay
    Temporary work
    Flexible hours

    ManpowerGroup Global, Inc.

    Menlo Park, CA
    23 hours ago
  • $200k - $287.5k

     ...observability platform built on the Snowflake AI Data Cloud and engineered for scale. We ingest and store logs, metrics, traces, and...  ...ecosystem of one of the world's leading data platforms. The Infrastructure team at Observe by Snowflake is responsible for architecting,... 
    Immediate start
    Flexible hours

    Snowflake Computing

    Menlo Park, CA
    2 days ago
  • $120k - $300k

     ...Decisive Point is seeking a software engineer to design and build core libraries and improve developer infrastructure in Mountain View, CA. You will enhance the speed and reliability of tools used daily by engineers. The ideal candidate has a Bachelor's in Computer Science... 
    Full time

    Decisive Point

    Mountain View, CA
    22 hours ago
  •  ...Inference Infrastructure Engineer At Rhoda AI, we're building the next generation of generalist intelligent robots. We own the full robotics stack from high-performance hardware and robot systems to the infrastructure and state-of-the-art foundation world models that... 

    Rhoda ai

    Palo Alto, CA
    5 days ago
  •  ...Job Title: Lab Infrastructure Engineer Location: Menlo Park, California (100% Onsite) Note: Only GC & USC need to apply for this opportunity as per requirement. Role Summary This role owns end-to-end execution for building, expanding, and operating high-performance... 
    Afternoon shift

    VBeyond

    Atherton, CA
    3 days ago
  • $132.1k - $279.8k

     ...performance AI compute more accessible and affordable. When real-time AI is within reach, anything is possible. Build fast. Senior Infrastructure Engineer Mission At Groq, we’re building a custom cloud from the ground up — one data center at a time. Our Infrastructure Platform... 

    I did my part and supported the Regular Toilet

    Palo Alto, CA
    23 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Director, HPC Infrastructure Engineering. Be the first to apply!