Principal HPC Architect
$162.7k - $284.7kKLA-Belgium
- # Staff HPC EngineerApplylocations: Milpitas, CAtime type: Full timeposted on: Posted Yesterdayjob requisition id: 2635960**Company Overview**KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel displays. The innovative ideas and devices that are advancing humanity all begin with inspiration, research and development. KLA focuses more than average on innovation and we invest 15% of sales back into R&D. Our expert teams of physicists, engineers, data scientists and problem-solvers work together with the world’s leading technology providers to accelerate the delivery of tomorrow’s electronic devices. Life here is exciting and our teams thrive on tackling really hard problems. There is never a dull moment with us.**Group/Division**The Information Technology (IT) group at KLA is involved in every aspect of the global business. IT’s mission is to enable business growth and productivity by connecting people, process, and technology. It focuses not only on enhancing the technology that enables our business to thrive but also on how employees use and are empowered by technology. This integrated approach to customer service, creativity and technological excellence enables employee productivity, business analytics, and process excellence.**Job Description/Preferred Qualifications**The Staff HPC Engineer designs, builds, optimizes, and supports large scale compute environments used for scientific computing, AI/ML workloads, simulation, and data intensive research. This role blends systems engineering, performance tuning, cluster architecture, and hands on troubleshooting. The engineer partners with researchers, developers, and IT teams to deliver reliable, scalable, and high performance compute infrastructure.**Key Responsibilities:*** HPC Architecture & Engineering* Design and implement HPC clusters, including compute, storage, networking, and job‐scheduling components.* Evaluate and integrate new technologies (GPUs, accelerators, interconnects, filesystems).* Develop automation for cluster provisioning, configuration, and lifecycle management.* Architect solutions for large‐scale parallel workloads, AI/ML pipelines, and data‐intensive applications.**Performance Optimization**:* Profile and tune applications for CPU, GPU, memory, and I/O performance.* Optimize MPI, OpenMP, CUDA, and other parallel programming frameworks.* Benchmark hardware and software stacks to guide procurement and architecture decisions.**Operations & Reliability:*** Maintain and monitor HPC clusters, job schedulers (Slurm, PBS, LSF), and distributed filesystems (Lustre, GPFS, BeeGFS).* Troubleshoot complex system issues across compute, storage, and network layers.* Implement security best practices, patching, and compliance controls.* Ensure high availability and efficient resource utilization.**Automation & DevOps:*** Build and maintain CI/CD pipelines for HPC‐related software and infrastructure.* Use tools such as Ansible, Terraform, Kubernetes, or custom scripts to automate workflows.* Develop monitoring and observability solutions (Prometheus, Grafana, ELK, etc.).**Collaboration & Leadership:*** Work closely with researchers, data scientists, and engineering teams to support workload optimization.* Provide technical leadership, mentorship, and guidance to junior engineers.* Document architectures, procedures, and best practices.* Participate in capacity planning and long‐term HPC strategy.**Required Qualifications:*** Extensive experience with Linux systems engineering in large‐scale compute environments.* Solid understanding of distributed systems and cloud infrastructure* Deep knowledge of HPC schedulers (Slurm preferred), MPI stacks, and parallel computing models.* Strong understanding of high‐speed interconnects (InfiniBand, RoCE) and distributed storage systems.* Proficiency in scripting languages (Python, Go, Bash) and automation frameworks.* Experience with GPUs (NVIDIA CUDA, MIG, NVLink) and accelerator‐based computing.* Familiarity with containerization (Singularity/Apptainer, Docker) in HPC contexts.* Strong troubleshooting skills across hardware, OS, and application layers.* Understanding of networking fundamentals (TCP/IP, DNS, load balancing)* Background in high-availability and distributed systems at scale**Soft Skills:*** Excellent communication and cross‐functional collaboration.* Ability to translate research needs into technical solutions.* Strong ownership mindset and ability to lead complex initiatives.**Minimum Qualifications**Doctorate (Academic) Degree and related work experience of 8 years; Master's Level Degree and related work experience of 12 years; Bachelor's Level Degree and related work experience of 15 yearsBase Pay Range: $162,700.00 - $284,700.00 AnnuallyPrimary Location: USA-CA-Milpitas-KLAKLA’s total rewards package for employees may also include participation in performance incentive programs and eligibility for additional benefits including but not limited to: medical, dental, vision, life, and other voluntary benefits, 401(K) including company matching, employee stock purchase program (ESPP), student debt assistance, tuition reimbursement program, development and career growth opportunities and programs, financial planning benefits, wellness benefits including an employee assistance program (EAP), paid time off and paid company holidays, and family care and bonding leave.Interns are eligible for some of the benefits listed. Our pay ranges are determined by role, level, and location. The range displayed reflects the pay for this position in the primary location identified in this posting. Actual pay depends on several factors, including state minimum pay wage rates, location, job-related skills, experience, and relevant education level or training. We are committed to complying with all applicable federal and state minimum wage requirements where applicable. If applicable, your recruiter can share more about the specific pay range for your preferred location during the hiring process. KLA is proud to be an Equal Opportunity Employer. We will ensure that qualified individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us at View email address on click.appcast.io or at View phone number on click.appcast.io to request accommodation.Be aware of potentially fraudulent job postings or suspicious recruiting activity by persons that are currently posing as KLA employees. KLA never asks for any financial compensation to be considered for an interview, to become an employee, or for equipment. Further, KLA does not work with any recruiters or third parties who charge such fees either directly or on behalf of KLA. Please ensure that you have searched KLA’s Careers website for legitimate job postings. KLA follows a recruiting process that involves multiple interviews in person or on video conferencing with our hiring managers. If you are concerned that a communication, an interview, an offer of employment, or that an employee is not legitimate, please send an email to View email address on click.appcast.io to confirm the person you are communicating with is an employee. We take your privacy very seriously and confidentially handle your information.
- J-18808-Ljbffr KLA-Belgium
Vacancy posted 22 hours ago
Similar jobs that could be interesting for youBased on the Principal HPC Architect in Milpitas, CA vacancy
- KLA-Belgium in Milpitas seeks a Staff HPC Engineer to design and optimize large scale compute environments for scientific computing and AI workloads. The ideal candidate should have extensive experience in Linux systems engineering and a strong understanding of HPC technologies...Suggested
- ...Job Summary T he AI Interconnect Architect designs and engineers high-speed networking and communication systems for AI inference infrastructure, including servers, racks, and chips. This role focuses on delivering bandwidth, power efficiency, scalability, and optimized...Principal
- Cisco Systems, Inc. is looking for a Principal Engineer to join their Backend Services team in Milpitas, California. In this role, you will provide technical leadership to a dynamic engineering organization and work on innovative products that enhance workload security....Principal
- ...'s Data Center GPU organization is transforming the AI and HPC landscape. Our mission is to design and market exceptional... ...us. THE ROLE AMD is seeking a highly accomplished Principal Modeling Architect to join the Product Architecture and Workload Strategy team...PrincipalRemote work
- ...Principal ASIC Architect Sunnyvale, CA About our company - Tensordyne (formerly Recogni) AI is reshaping our world, performing cognitive... ...engineering and architectural planning of high performance compute (HPC) processors like GPUs, CPUs, TPUs etc., as well as HBM and...PrincipalRemote work
- ...supply chain has access to the Flash memory it needs to keep our world moving forward. Job Description An AI Interconnect Architect defines and engineers high-speed networking and communication systems for AI Inference infrastructure which include servers, racks...PrincipalTemporary workRemote workFlexible hoursShift work
$160k - $180k
Saviynt is looking for an Associate Principal SDET to drive the architecture of automation and quality engineering for our cloud-native Identity Security platform. This role will focus on scalable test frameworks and lead the adoption of AI-powered tools for software quality...Principal$184k - $356.5k
NVIDIA Gruppe is seeking an experienced engineer to lead GPU cluster design and support for AI and HPC deployments in Santa Clara, California. The ideal candidate will have over 8 years of experience with large-scale GPU infrastructure and a strong ability to communicate...$184k - $356.5k
NVIDIA Corporation is seeking a Senior CPU Performance Architect to join their CPU performance architecture team in Santa Clara, California... ...the development of CPU technology for applications in AI, HPC, and gaming. The ideal candidate will have 7+ years of experience...- NVIDIA Gruppe in Santa Clara is seeking a technical leader for the GPU AI/HPC Infrastructure team. You will design and implement cutting-edge GPU compute clusters, focusing on deep learning and high-performance computing. The ideal candidate will have at least 5+ years...
- NVIDIA Gruppe in Santa Clara is looking for a Senior HPC Architect to support the deployment of large-scale GPU compute clusters. You will provide engineering solutions for GPU computing products, ensuring technical relationships with teams and assisting in creative solutions...
- NVIDIA Corporation is seeking a Senior HPC Architect to enhance GPU compute clusters. This role involves designing solutions for operationalizing NVIDIA products and collaborating closely with engineering teams. Ideal candidates should have over 8 years of experience in...
$320k
NVIDIA Gruppe in Santa Clara is seeking a Distinguished Software Architect to lead the design of next-generation data center platforms. This role demands deep expertise in HPC and networking, aiming to improve GPU communication technologies. You will research and implement...$356.5k
NVIDIA Gruppe is seeking a Senior Software Architect in Santa Clara, California. This role involves co-designing next-generation data center... ...scalable communications software to enhance Deep Learning and HPC applications. Candidates should have extensive experience in C/C++...$184k
NVIDIA Gruppe is seeking a skilled software engineer to work on quantum computing and HPC algorithms. The ideal candidate has 8+ years of experience in C++ and Python programming, strong skills in GPU coding, and a PhD or MSc in a relevant field. This position is stationed...- Overview NVIDIA is a leading technology company focused on GPU innovation, AI, and high‑performance computing. We are seeking a Senior HPC Architect to support the deployment and scaling of large‑scale GPU compute clusters, enabling breakthroughs in artificial intelligence and...
- We are looking for an outstanding hands‑on architect/engineer for a Senior HPC architect role to support deployment and bringup of large‑scale GPU compute clusters. You will be a key player to enable the most exciting computing hardware and software and contribute to the...
$114k - $253k
...information, and systems to achieve their business objectives. The impact you'll make We are seeking a Senior HPC Storage Engineer to architect and manage the next generation of high-performance data platforms. You will be responsible for the full lifecycle-...Local areaRemote workFlexible hours2 days per week3 days per week1 day per week$272k - $431.25k
NVIDIA Corporation seeks a Principal AI and ML Infra Software Engineer in Santa Clara, California, to enhance the efficiency of AI/ML research... .... Ideal candidates should have extensive experience in HPC systems, programming, and a strong educational background in Computer...Principal$253.3k - $342.7k
...Job Overview: At Arm, the High-Speed I/O Architect defines and designs innovative on-chip interconnect architectures-coherent and non-coherent-for scalable SoC platforms. You will work across markets including mobile, automotive, datacenter, networking, and IoT, contributing...PrincipalWork at officeLocal areaShift work$150k - $200k
...Principal AI/ML Architect Job in USA 2025 (USD 150,000 to 200,000) Are you ready to elevate your career in artificial intelligence and machine learning? Mogi I/O is hiring a Principal AI/ML Architect to help lead innovative solutions in the industrial and automotive...PrincipalFull time- AMAX, located in Fremont, California, is seeking a hands-on pricing professional to manage pricing strategy for AI and HPC hardware solutions. The ideal candidate will analyze market dynamics and collaborate cross-functionally to optimize pricing based on data-driven insights...
- Proofpoint is seeking a Principal ML Architect to lead the design and development of next-generation AI systems focused on cybersecurity. In this role, you will leverage advanced machine learning techniques, including LLMs and SLMs, to create intelligent security solutions...Principal
- Northrop Grumman is seeking a Senior Principal Contract Administrator to support the Marine Systems division in Sunnyvale, CA. This role involves acting as the primary interface with customers for contract issues and ensuring compliance with contractual obligations. Ideal...PrincipalContract work
- Intel Corporation is seeking a Principal Engineer in Santa Clara, California, to architect the next generation of distributed AI systems. This role focuses on executing and optimizing large-scale AI computation graphs across diverse hardware. Ideal candidates will have...Principal
$210k - $260k
...deploy tailored architectures to meet their unique infrastructure requirements. Discover more at Role Overview As a Sr. Principal DSP Architect, you will be the technical visionary leading the definition and development of next-generation Digital Signal Processing (...PrincipalFlexible hours$216k - $345k
NVIDIA Corporation is seeking a Principal Solutions Architect in Santa Clara to drive the technical strategy for semiconductor testing. This role involves leading the architectural vision and modernizing test infrastructure. Candidates should have over 15 years of relevant...Principal$243k - $328.5k
Intuit Inc. is seeking a Principal for its AI Transformation Org to enhance workflows across various departments. This role requires a proven track record in designing and implementing production AI systems, influencing stakeholders, and driving change within organizations...Principal$126k - $229.8k
...Job Title Qualcomm is looking for an experienced SoC architect to work on the next generation AI products in the datacenter. We are looking for a data center engineer whose expertise spans security, RAS and/or virtualization. The selected candidate will help define...PrincipalWork experience placement- ...as we shape the future of AI and beyond. Together, we advance your career. THE ROLE: We are seeking a Robotics AI Architect to define and scale next-generation Physical AI systems, with a focus on complex robotic platforms (including humanoids). This role...Principal
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Principal HPC Architect. Be the first to apply!
Related searches
- principal Milpitas, CA
- principal cloud computing engineer Milpitas, CA
- senior principal scientist Milpitas, CA
- senior principal cloud computing engineer Milpitas, CA
- epic principal trainer
- principal network architect
- principal software architect
- principal user experience researcher
- principal financial group
- principal medical writer



