Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior HPC Systems Engineer - GPU & AI Clusters

$146k - $194k

Anduril Industries

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century’s most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril’s family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and control center. As the world enters an era of strategic competition, Anduril is committed to bringing cutting-edge autonomy, AI, computer vision, sensor fusion, and networking technology to the military in months, not years.

ABOUT THE ROLE

Anduril is seeking a High Performance Computing (HPC) System Engineer to directly support our most sensitive programs. You will be a part of the team building and maintaining large scale HPC infrastructure. You will have the opportunity to work with and learn from some of the world’s best engineers and cybersecurity professionals as you help to implement cutting edge systems. You will work directly to support systems deployed across the globe in support of national security missions.

WHAT YOU'LL DO

Work in a fast-paced, customer-focused environment supporting high-profile operational and research requirements. Architect and deploy advanced GPU infrastructure, leading the design, deployment, and lifecycle management of cutting-edge NVIDIA hardware including H100, H200, and B200/B300 systems. Ability to rack, stack, cable, and configure physical servers and multi-node GPU systems from end to end. Configure HPC and AI environments, including job schedulers (e.g., Slurm), multi-user login environments, and cluster management software (e.g., Warewulf, NVIDIA Base Command, RunAI). Implement and fine-tune high-speed interconnects (e.g., NVLink, NVSwitch, InfiniBand/NDR) crucial for large-scale distributed training. Configure and manage large-scale, high-performance storage platforms in the multiple petabytes range, optimized for AI/ML data access patterns. Install, configure, and maintain the application stack on HPC clusters, including traditional simulation software (StarCCM+, Ansys, Matlab) and the core AI/ML software stack (NVIDIA drivers, CUDA, PyTorch, TensorFlow). Implement and manage GPU virtualization and sharing technologies, such as Multi-Instance GPU (MIG), to maximize resource utilization across diverse workloads. Troubleshoot complex, system-wide issues related to application performance, user access, compute nodes, storage, and job queueing services. Utilize NVIDIA Data Center GPU Manager (DCGM) and additional tools to proactively monitor GPU health and performance, diagnosing and resolving training bottlenecks in collaboration with ML engineers. Ensure the security and integrity of the server and cluster infrastructure through regular audits, patching, and proactive security measures. Collaborate closely with engineering and AI/ML research stakeholders to gather requirements and architect robust, scalable solutions. Manage the hardware lifecycle, from quoting and procuring hardware from vendors to creating and executing deployment schedules. Provide technical guidance, mentoring, and architectural leadership to other team members.

REQUIRED QUALIFICATIONS

7+ years of experience in designing, developing, and implementing large scale compute enterprise systems and solutions Strong Knowledge and experience with High Performance Computing concepts to include cluster architecture file system, and high-speed infiniBand/ethernet interconnections Proven expertise in one or more of the following, Red Hat Enterprise Linux, Ubuntu, HPC, GPU, Azure or AWS cloud services Strong understanding and experience with systems automation tools (Ansible, Salt, Puppet) Experience in HPC technologies such as parallel/distribution file systems (e.g., Lustre, GPFS, Pure, VAST) Working knowledge of HPC batch schedule software (e.g., PBSPro, SLURM) AWS/Azure experience building HPC clusters Ability to lift 50 lbs Eligible to obtain an maintain a US Top Secret Clearance US Salary Range $146,000 — $194,000 USD The salary range for this role is an estimate based on a wide range of compensation factors, inclusive of base salary only. Actual salary offer may vary based on (but not limited to) work experience, education and/or training, critical skills, and/or business considerations. Highly competitive equity grants are included in the majority of full time offers; and are considered part of Anduril's total compensation package. Additionally, Anduril offers top-tier benefits for full-time employees, including: Benefits At Anduril, we invest in our people. Our comprehensive, competitive benefits package (available at little to no cost to employees) ensures you’re supported in health, recovery, and whatever comes next. For more information, Explore Our Benefits . Protecting Yourself from Recruitment Scams Anduril is committed to maintaining the integrity of our Talent acquisition process and the security of our candidates. We've observed a rise in sophisticated phishing and fraudulent schemes where individuals impersonate Anduril representatives, luring job seekers with false interviews or job offers. These scammers often attempt to extract payment or sensitive personal information. To ensure your safety and help you navigate your job search with confidence, please keep the following critical points in mind: No Financial Requests: Anduril will never solicit payment or demand personal financial details (such as banking information, credit card numbers, or social security numbers) at any stage of our hiring process. Our legitimate recruitment is entirely free for candidates. Please always verify communications: Direct from Anduril: If you receive an email from one of our recruiters, it will only come from an @anduril.com address. Via Agency Partner: If contacted by a recruiting agency for an Anduril role, their email will clearly identify their agency. If you suspect any suspicious activity, please verify the agency's authenticity by reaching out to View email address on click.appcast.io . Exercise Caution with Unsolicited Outreach: If you receive any communication that appears suspicious, contains grammatical errors, or makes unusual requests, do not engage. Always confirm the sender's email domain is @anduril.com before providing any personal information or clicking on links. What to Do If You Suspect Fraud: Should you encounter any questionable or fraudulent outreach claiming to be from Anduril, please report it immediately to View email address on click.appcast.io . Your proactive caution is invaluable in protecting your personal information and upholding the security and trustworthiness of our recruitment efforts. Data Privacy To view Anduril's candidate data privacy policy, please visit . By submitting your application, you consent to Anduril Industries using a third-party service provider to conduct pre-employment risk, integrity, and due diligence screening and assessing potential risks as part of your application process. This third-party service provider provides risk-intelligence services that may include analysis of sanctions and watchlists, adverse media, public-record information, and other lawful open-source or commercial data sources. This third-party service provider does not act as a consumer reporting agency. Use of this provider helps to ensure compliance with applicable laws and protect technology, intellectual property, and organizational security. #J-18808-Ljbffr Anduril Industries

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Senior HPC Systems Engineer - GPU & AI Clusters in Costa Mesa, CA vacancy
  • $170k - $210k

     ...Senior Linux Systems Engineer, ECC - Active Clearance Required Costa Mesa, California...  ...by Lattice OS, an AI-powered operating system...  ...Strong working knowledge of clustered server infrastructure....  ...High Performance Compute (HPC) with CPU and GPU based clusters... 
    Senior
    Full time
    Work experience placement
    Immediate start

    anduril

    Costa Mesa, CA
    1 day ago
  • $166k - $220k

     ...industry, Anduril is changing how military systems are designed, built and sold. Anduril...  ...systems is powered by Lattice OS, an AI-powered operating system that turns...  ...ABOUT THE TEAM We're seeking a senior systems security engineer to will drive security engineering... 
    Senior
    Full time
    Work experience placement
    Immediate start

    Anduril Industries

    Costa Mesa, CA
    1 day ago
  • $166k - $220k

     ...industry, Anduril is changing how military systems are designed, built and sold. Anduril’s...  ...of systems is powered by Lattice OS, an AI-powered operating system that turns thousands...  ...as a lead provider of specialized engineering and products for Intelligence Community (... 
    Senior
    Full time
    Work experience placement
    Immediate start

    Anduril Industries

    Santa Ana, CA
    1 day ago
  • $166k - $220k

     ...industry, Anduril is changing how military systems are designed, built and sold. Anduril’s...  ...of systems is powered by Lattice OS, an AI‑powered operating system that turns thousands...  ...of assets. We are looking for systems engineers with experience in this field to help design... 
    Senior
    Full time
    Work experience placement
    Remote work
    Relocation package

    Anduril

    Costa Mesa, CA
    13 hours ago
  • $191k - $253k

     ...Anduril is changing how military systems are designed, built and sold....  ...is powered by Lattice OS, an AI-powered operating system that...  ...HIL/SIL) testing Guide DSP engineers in the execution of RADAR-DSP...  ...fields. Experience with CUDA or GPU accelerated frameworks like... 
    Senior
    Full time
    Work experience placement
    Immediate start

    Anduril Industries

    Costa Mesa, CA
    3 days ago
  • $191k - $253k

     ...industry, Anduril is changing how military systems are designed, built and sold. Anduril's...  ...systems is powered by Lattice OS, an AI-powered operating system that turns...  ...ABOUT THE ROLE Anduril is seeking a Senior Firmware Engineer to join our team based in Costa Mesa, CA... 
    Senior
    Full time
    Work experience placement
    Immediate start

    Anduril Industries

    Costa Mesa, CA
    4 days ago
  • $150k - $220k

     ...industry, Anduril is changing how military systems are designed, built and sold. Anduril's...  ...of systems is powered by Lattice OS, an AI-powered operating system that turns thousands...  ...looking for a highly motivated Network Engineer to join the Anduril Industries Network... 
    Senior
    Full time
    Work experience placement
    For subcontractor
    Immediate start
    Remote work

    Anduril Industries

    Costa Mesa, CA
    2 days ago
  • $124k - $280k

     ...Competency: Data, Analytics & AI Industry/Sector: Health...  ...people in data and analytics engineering focus on leveraging advanced technologies...  ...algorithms, models, and systems to enable intelligent decision...  ...for health systems. As a Senior Manager, you will serve as a strategic... 
    Senior
    Full time
    H1b

    PwC

    Irvine, CA
    1 day ago
  • $77k - $202k

     ...Competency: Data, Analytics & AI Industry/Sector: Not...  ...people in data and analytics engineering focus on leveraging advanced technologies...  ...algorithms, models, and systems to enable intelligent decision...  ...that meet business needs. As a Senior Associate, you analyze complex... 
    Senior
    Full time
    H1b

    PwC

    Irvine, CA
    13 hours ago
  • $124k - $280k

     ...Competency: Data, Analytics & AI Industry/Sector: Health...  ...people in data and analytics engineering focus on leveraging advanced technologies...  ...algorithms, models, and systems to enable intelligent decision...  ...system and health plans. As a Senior Manager, you will drive use... 
    Senior
    Full time
    H1b

    PwC

    Irvine, CA
    3 days ago
  • $166k - $220k

     ...Senior Mission Systems Engineer, Air Dominance & Strike, Active Clearance Costa Mesa, California, United States Anduril Industries is a defense...  ...Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams... 
    Senior
    Full time
    Work experience placement
    For subcontractor

    anduril

    Costa Mesa, CA
    2 days ago
  • $166k - $220k

     ...Senior Systems Engineer, Advanced Effects, Active Clearance Costa Mesa, California, United States Anduril Industries is a defense technology...  .... Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data... 
    Senior
    Full time
    Work experience placement

    anduril

    Costa Mesa, CA
    2 days ago
  • Anduril Industries, located in Costa Mesa, is looking for a Staff Fluids Systems Engineer to join the Omen Team. You will drive the design and development of fluid systems for our innovative autonomous air vehicle, ensuring integration with various subsystems. The role... 
    Senior

    Anduril-1

    Costa Mesa, CA
    13 hours ago
  • $166k - $222k

    A leading defense technology company in Costa Mesa is seeking a Senior Systems Engineer specialized in space systems. This role involves overseeing technical strategy and integration for complex spacecraft systems, conducting trade studies, and collaborating across disciplines... 
    Senior

    Anduril

    Costa Mesa, CA
    3 days ago
  • $166k - $220k

     ...Senior Systems Engineer, Omen Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities...  .... Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data... 
    Senior
    Full time
    Contract work
    Temporary work
    Work experience placement
    Remote work

    anduril

    Costa Mesa, CA
    2 days ago
  • $191k - $253k

    Anduril Industries is looking for a Staff Fluids Systems Engineer to join the Omen Team in Costa Mesa. This role involves owning the design and development of fluid systems on an advanced autonomous air vehicle. Candidates should have extensive experience in fluids systems... 
    Senior

    jobr.pro

    Costa Mesa, CA
    3 days ago
  • A leading defense technology firm in Costa Mesa, CA, is seeking a Senior Systems Engineer, Space to drive technical strategy for innovative satellite missions. You'll lead multi-disciplinary teams to deliver software and hardware solutions that enhance military space operations... 
    Senior

    Anduril Industries

    Costa Mesa, CA
    13 hours ago
  • $166k - $220k

     ...Senior Systems Engineer, Edge Compute and Communications Costa Mesa, California, United States Anduril Industries is a defense technology...  ...sold. Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data... 
    Senior
    Full time
    Contract work
    Work experience placement
    Immediate start

    anduril

    Costa Mesa, CA
    1 day ago
  • $132k - $198k

     ...Senior Systems Engineer Costa Mesa, California, United States Anduril Industries is a defense technology company with a mission to transform...  .... Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data... 
    Senior
    Full time
    Work experience placement
    Immediate start

    anduril

    Costa Mesa, CA
    2 days ago
  • $166k - $220k

     ...Anduril is changing how military systems are designed, built and sold....  ...is powered by Lattice OS, an AI-powered operating system that...  ...WE'RE HERE: The systems engineering team is looking for an experienced...  ...'LL DO We are hiring a Senior / Lead Systems Engineer to... 
    Senior
    Full time
    Work experience placement
    Immediate start

    Anduril Industries

    Costa Mesa, CA
    4 days ago
  • $166k - $222k

     ...industry, Anduril is changing how military systems are designed, built and sold. Anduril's...  ...systems is powered by Lattice OS, an AI-powered operating system that turns...  ...ABOUT THE JOB We are looking for a Senior Systems Engineer, Space to join our rapidly growing team... 
    Senior
    Full time
    Contract work
    Work experience placement
    Immediate start

    Anduril Industries

    Costa Mesa, CA
    3 days ago
  • $191k - $253k

     ...industry, Anduril is changing how military systems are designed, built and sold. Anduril’s...  ...of systems is powered by Lattice OS, an AI-powered operating system that turns thousands...  ...in close coordination with Electrical Engineering to develop hardware and software solutions... 
    Senior
    Full time
    Work experience placement
    Immediate start

    Anduril Industries

    Costa Mesa, CA
    13 hours ago
  • A defense technology company in Costa Mesa seeks a Systems Engineer to join the Omen team. The role involves executing Agile Systems Engineering approaches for the development of an autonomous air vehicle. Candidates should have a bachelor's degree in engineering and at... 
    Senior

    Anduril Industries

    Costa Mesa, CA
    13 hours ago
  • A leading defense technology company is seeking a Systems Engineer for its Omen team in Costa Mesa. In this role, you will directly shape the design and development of the Omen, an innovative autonomous air vehicle. Key responsibilities include executing Agile Systems... 
    Senior

    Slope

    Costa Mesa, CA
    3 days ago
  • $166k - $220k

     ...Senior Systems Engineer, Launched Effects Costa Mesa, California, United States Anduril Industries is a defense technology company with a...  ...sold. Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data... 
    Senior
    Full time
    Work experience placement
    Work at office
    Immediate start

    anduril

    Costa Mesa, CA
    2 days ago
  • A defense technology company located in California is seeking a Systems Safety Engineer to enhance the development of its Omen autonomous air vehicle. The position requires extensive experience in safety-critical systems and deep knowledge of relevant safety standards.... 
    Senior

    Slope

    Costa Mesa, CA
    13 hours ago
  •  ...Senior Generative AI Engineer Location: Irvine, CA Experience: 10 Duration: Long Term PLEASE MENTION THE CURRENT LOC,DL LOC,VISA STATUS Job Description: Experience in AI/ML development, with focus on OpenAI services, NLPs and LLMs. Ability to fine tune pre-trained... 
    Senior

    Argyle Infotech

    Irvine, CA
    2 days ago
  • $166k - $220k

     ...industry, Anduril is changing how military systems are designed, built and sold. Anduril's...  ...of systems is powered by Lattice OS, an AI-powered operating system that turns thousands...  ...are seeking an experienced System Safety Engineer to join our rapidly growing Space team.... 
    Senior
    Full time
    Work experience placement
    Immediate start

    Anduril Industries

    Costa Mesa, CA
    3 days ago
  • $191k - $253k

     ...industry, Anduril is changing how military systems are designed, built and sold. Anduril’s...  ...of systems is powered by Lattice OS, an AI-powered operating system that turns thousands...  ...in close coordination with Electrical Engineering to develop hardware and software solutions... 
    Senior

    jobr.pro

    Costa Mesa, CA
    13 hours ago
  • $166k - $220k

     ...Senior System Safety Engineer, Aircraft Systems Anduril Industries is a defense technology company with a mission to transform U.S. and allied military...  .... Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data... 
    Senior
    Full time
    Work experience placement

    anduril

    Costa Mesa, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior HPC Systems Engineer - GPU & AI Clusters. Be the first to apply!