HPC Engineer

$130.1k - $176k

Arm, Inc.

HPC Operations Engineer

Engineering IT provides the high-performance compute platforms that enable Arm's engineering teams to design, verify, and deliver world-class products. The team operates a mix of on-premises and cloud-based HPC environments, EDA enablement services, job scheduling platforms, automation tooling, and custom workflows that are critical to engineering productivity across Arm.

We are looking for an HPC Operations Engineer to help run, improve, and modernize these services. This role combines production operations, site reliability engineering, automation, cloud integration, and close collaboration with engineering users and infrastructure teams.

Responsibilities
  • Operate, support, and continuously improve Arm's HPC platforms, with a strong focus on IBM Spectrum LSF and related job scheduling services.
  • Improve reliability, scalability, performance, and operational efficiency through automation, observability, standardization, and SRE practices.
  • Develop automation and self-service capabilities to reduce manual operational effort and improve the user experience.
  • Support production HPC environments, including incident response, troubleshooting, root cause analysis, service restoration, and continuous improvement.
  • Work directly with engineering users to improve job scheduling behavior, workload performance, resource utilization, and platform efficiency.
  • Develop and maintain scripts, tools, and automation frameworks using Python, Bash, and related technologies.
  • Support modernization initiatives involving containers, Kubernetes, Docker, cloud-native services, Infrastructure as Code, and alternative scheduling or orchestration technologies.
  • Contribute to cloud HPC integration across AWS, GCP, Azure, OpenStack, and hybrid environments.
  • Collaborate with platform, cloud, storage, infrastructure, networking, and security teams to deliver robust engineering services.
  • Contribute to project delivery by working with technical leads, architects, project managers, and operational team members.
  • Help define and promote standards for DevOps, SRE, platform engineering, CI/CD, monitoring, and infrastructure automation.
Required Skills and Experience
  • Experience operating HPC environments and job schedulers such as IBM Spectrum LSF, Slurm, PBS, Grid Engine, or similar.
  • Strong Linux system administration experience, preferably with RHEL or RHEL-based distributions.
  • Good scripting and automation skills using Python, Bash, Shell, or similar languages.
  • Experience supporting production infrastructure, including incident management, troubleshooting, operational recovery, and root cause analysis (RCA), or comparable experience.
  • Familiarity with monitoring, alerting, and observability platforms such as Dynatrace, Prometheus, Grafana, or similar.
  • Experience building, maintaining, or supporting CI/CD pipelines and automation frameworks.
  • Experience with public, private, or hybrid cloud platforms, including AWS, GCP, Azure, OpenStack, and Kubernetes-based services.
  • Understanding of DevOps, SRE, platform engineering, infrastructure automation, and operational excellence principles.
  • Familiarity with Agile delivery practices and collaboration tools such as Jira and Confluence.
  • Ability to work with engineering users, understand workload requirements, and translate operational issues into practical improvements.
Desirable Experience
  • Experience working in EDA or semiconductor engineering environments.
  • Familiarity with EDA tools, license-aware scheduling, large-scale batch workloads, and engineering compute workflows.
  • Exposure to container platforms and orchestration technologies such as Docker, Kubernetes, and Kubernetes-native scheduling.
  • Experience with Infrastructure as Code tools such as Terraform and Ansible.
  • Exposure to alternative schedulers such as Slurm or cloud-native workload orchestration systems.
  • Experience using AI-assisted tooling, MCP, agentic services, or automation agents to improve diagnostics, operations, optimization, or self-service support.
  • Experience operating large-scale distributed systems across both on-premises and cloud infrastructure.
Salary Range:

$130,100-$176,000 per year

We value people as individuals and are committed to rewarding them competitively and equitably for the work they do and the skills and experience they bring to Arm. Salary is only one component of Arm's offering. The total reward package will be shared with candidates during the recruitment and selection process.

Accommodations at Arm

At Arm, we want to build extraordinary teams. If you need an adjustment or an accommodation during the recruitment process, please email us. Please note that by sending us the requested information, you consent to its use by Arm to arrange for appropriate accommodations. All accommodation or adjustment requests will be treated with confidentiality, and information concerning these requests will only be disclosed as necessary to provide the accommodation. Although this is not an exhaustive list, examples of support include breaks between interviews, having documents read aloud, or office accessibility. Please email us about anything we can do to accommodate you during the recruitment process.

Hybrid Working at Arm

Arm's approach to hybrid working is designed to create a working environment that supports both high performance and personal wellbeing. We believe in bringing people together face to face to enable us to work at pace, whilst recognizing the value of flexibility. Within that framework, we empower groups/teams to determine their own hybrid working patterns, depending on the work and the team's needs. Details of what this means for each role will be shared upon application. In some cases, the flexibility we can offer is limited by local legal, regulatory, tax, or other considerations, and where this is the case, we will collaborate with you to find the best solution. Please talk to us to find out more about what this could look like for you.

Equal Opportunities at Arm

Arm is an equal opportunity employer, committed to providing an environment of mutual respect where equal opportunities are available to all applicants and colleagues. We are a diverse organization of dedicated and innovative individuals, and don't discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

Vacancy posted 11 hours ago