Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Operations Engineer, HPC Networking

$90k - $110k

CoreWeave

Job Description

Job Description

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at

About the Role

At CoreWeave we are seeking a dedicated and detail-oriented Operations Engineer to join our HPC Networking Team. HPC Networking at CoreWeave is tasked with developing and operating some of the largest InfiniBand fabrics, powering industry leading AI workloads.

What You'll Do

In this role, you will support the deployment, monitoring, troubleshooting, and maintenance of large-scale InfiniBand fabrics, ensuring their stability and performance. The ideal candidate will have a strong operations mindset, effective collaboration skills, and the ability to solve complex issues in a dynamic environment.

  • Regularly monitor the performance and health of InfiniBand fabrics, including switches, host adapters, and nodes.
  • Investigate and resolve operational issues within InfiniBand fabrics, such as network connectivity problems and performance bottlenecks.
  • Assist with the installation and operational bring-up of large InfiniBand fabrics in collaboration with onsite personnel and customer teams.
  • Perform routine maintenance and upgrades on InfiniBand switches and control plane components.
  • Collaborate with HPC cluster operations teams to provide troubleshooting and operational expertise.

Investing in our people is one of our top priorities, and we value candidates who can bring their diversified experiences to our teams. Here are some qualities we've found compatible with our team. We'd love to talk about whether this aligns with your experience and Interests and what you're excited to work on next.

Who You Are

Minimum Qualifications

  • At least 1 year of experience with InfiniBand or similar networking technologies.
  • Solid understanding of networking concepts, including architectures, topologies, operational best practices, and troubleshooting.
  • Experience with Linux system administration and maintenance.
  • Proficiency in at least one scripting language (e.g., Python) and hands-on experience with Ansible.
  • Applicants must have work authorization that does not require sponsorship from the company now or in the future
  • Experience with monitoring and visualization platforms such as Grafana or Prometheus.

Preferred Qualifications

  • Hands-on experience with Nvidia UFM or similar fabric management tools.
  • Experience with operational tooling and automation frameworks like Ansible.
  • Knowledge of data center operations, including server racks, and cabling.
  • Python or Bash scripting.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $90,000-$110,000. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.

What We Offer

The range we've posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.

In addition to a competitive salary, we offer a variety of benefits to support your needs. The benefits below reflect our US-based offerings; for roles in other locations, benefits vary and are shared during the hiring process. These include:

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement
  • Ability to Participate in Employee Stock Purchase Program (ESPP)
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption

California Applicants

California Consumer Privacy Act

Equal Opportunity & Accommodations

CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.

As part of this commitment and consistent with the Americans with Disabilities Act (ADA) , CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact: View email address on ziprecruiter.com.

Export Control Compliance

This position requires access to export controlled information. To conform to U.S. Government export regulations applicable to that information, applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency. CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.

Vacancy posted 13 days ago
Similar jobs that could be interesting for youBased on the Operations Engineer, HPC Networking in Sunnyvale, CA vacancy
  • $152k - $241.5k

     ...efficiency. Success in this role requires both operational precision along with developing and...  ...measurable, and aligned with long-term engineering demands. What you'll be doing:...  ...scheduling systems (LSF, Slurm, etc.) in HPC or silicon design environments Proficiency... 
    Suggested

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $124k - $195.5k

     ...NVIDIA. We seek an expert to build and operate these clusters at high reliability,...  ...and automation to improve engineers’ productivity. As an HPC Operations Engineer, you are responsible...  ...automounter, LDAP, DNS, and TCP/IP networking in Red Hat Linux distribution flavors... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $124k - $195.5k

    NVIDIA Gruppe in Santa Clara seeks an HPC Operations Engineer to design and implement compute clusters for silicon development. Ideal candidates will have experience troubleshooting in large-scale environments and enhancing deployment automation. Applicants should be proficient... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $181k - $297k

     ...Description LinkedIn is the world's largest professional network, built to create economic opportunity for every...  ...be based in Mountain View, CA. We are seeking an HPC Network Engineer to design, deploy, and operate high-performance, low-latency Ethernet fabrics for... 
    Suggested
    For contractors
    Work at office
    Flexible hours

    LinkedIn

    Mountain View, CA
    1 day ago
  • $152k - $241.5k

    NVIDIA Gruppe is seeking a motivated Performance Engineer to influence the roadmap of our communication libraries. The role involves conducting in-depth performance characterization on large multi-GPU and multi-node clusters and studying the interaction of our libraries... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...Our enterprise-level client is seeking to add a Network Operations Engineer to the team in Mountain View, CA. Please see below for full details- Job Notes: -- 3-month contract / extensions possible with good performance. -- Onsite in Mountain View, CA 94041... 
    Hourly pay
    Contract work
    For contractors

    Merge IT LLC

    Mountain View, CA
    4 days ago
  •  ...Network Engineer - AI/HPC Memphis, TN; Palo Alto, CA About XAI XAI's mission is to create AI systems that can accurately understand the...  ...appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are... 

    Xai

    Palo Alto, CA
    4 days ago
  • $248k - $396.75k

     ...highly skilled Principal AI/ML Engineer to join our dynamic team to...  ...build the next generation of IT Networking space and help lead the team...  ...GPU cluster networking, and HPC environments. Cloud and...  ...architecture/standards/reuse, and operational documentation via Confluence/... 

    NVIDIA

    Santa Clara, CA
    2 days ago
  • NVIDIA Corporation is seeking a talented SDK Engineer to join the NVLink SDK group in Santa Clara,...  ...and implement SDK features for cutting-edge networking products, contributing to the development of technologies for AI, HPC, and cloud environments. Applicants should have... 

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • A leading technology firm in California is seeking network engineers with hands-on experience in InfiniBand and Ethernet for managing high-performance computing (HPC) and artificial intelligence (AI) environments. Candidates should have advanced knowledge of networking... 

    TechDigital Group

    Santa Clara, CA
    1 day ago
  • $152k - $241.5k

    NVIDIA Gruppe in Santa Clara is seeking a Senior Software Engineer to enhance their HPC infrastructure. The role involves applying distributed systems patterns, automation, and building scalable services in a hybrid multi-cloud environment. Candidates should have strong... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $200k - $400k

    A dedicated research lab is seeking a Network Engineer to design and optimize low-latency, high-bandwidth networking solutions for AI supercomputing clusters. You will work on cutting-edge technologies in collaboration with world-class researchers. The ideal candidate... 

    Institute of Foundation Models

    Sunnyvale, CA
    3 days ago
  • $168k - $264.5k

    NVIDIA Corporation is seeking a Senior Network Engineer for complex campus network deployments in Santa Clara, California. This role involves leading design and implementation, managing site deployments, and utilizing network automation tools (Ansible, Python). Candidates... 

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $128k - $201.25k

     ...impact on the world. As a Senior Technical Marketing Engineer for Datacenter Networking, you will join a dedicated team that is passionate about...  ...The Crowd: Hands‑on experience setting up and tuning HPC clusters with Slurm, Kubernetes, or other schedulers.... 
    Work at office

    NVIDIA

    Santa Clara, CA
    23 hours ago
  • $184k - $287.5k

     ...Senior Firmware Engineer We are looking for an excellent Senior Firmware Engineer...  ...team develops groundbreaking networking features for AI, cloud, HPC and storage. We drive the data growth...  ...how a big software project is operated, maintained, qualified and released... 

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $166k - $214k

    A leading cybersecurity firm is seeking a Principal Software Development QA Engineer to enhance product reliability and performance. The successful candidate will manage test strategies, design test plans, and develop automated scripts for product evaluations. Candidates... 

    Fortinet, Inc.

    Sunnyvale, CA
    3 days ago
  • $152k - $241.5k

     ...NVIDIA seeks a senior software engineer to join the AI Networking co-design and benchmark R&D team. In this pivotal role, the candidate is responsible...  ...the intersection of at least two of the following areas: HPC, networking, and AI applications. ~ Hands-on experience... 

    NVIDIA

    Santa Clara, CA
    2 days ago
  • A high-performance AI infrastructure company in Santa Clara is looking for an IT Helpdesk and Operations Engineer. This role involves supporting and designing IT systems, managing security protocols, and leading significant IT projects. Candidates should have 2-3+ years... 

    Nexthop Systems Inc

    Santa Clara, CA
    3 days ago
  • $168k - $270.25k

     ...We are seeking an experienced Senior QA Automation Engineer to join our Network AI platform team. This role combines manual testing expertise...  ...architectures with AI/ML correlation engines using multiple network operating systems via WebUI, REST APIs, CLI, and shell interfaces.... 

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $193.3k - $261.5k

     ...We are seeking an experienced engineer and technical leader to join our team that owns the network stack for EC2 distributed AI/ML...  ...with high-speed networking or HPC/RDMA interconnects is highly valued...  ...types, software stacks, Linux operating systems, cutting-edge releases... 
    Internship
    Local area
    Flexible hours

    Amazon

    Cupertino, CA
    1 day ago
  • NVIDIA is looking for an excellent Software Engineer to join the InfiniBand Switch and NVLink...  ...effort for the next-generation networking products. The verification team develops modern networking features for cloud, HPC and storage. We drive the data growth of... 
    Shift work

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $124k - $195.5k

    NVIDIA is looking for an excellent SDK Engineer to join the NVLink SDK group in Santa Clara. As...  ...SDK features, delivering the next-generation networking products. The SDK team develops cutting-edge networking features for AI, HPC and cloud. We drive the data growth of the... 
    Shift work

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  •  ...established industry player is seeking a Senior Director of Solutions Engineering to lead innovative teams in AI and high-performance computing...  .... The ideal candidate will have extensive experience in HPC and AI systems design, with a proven track record in managing technical... 

    Skilltorch

    Santa Clara, CA
    1 day ago
  •  ...startup focused on AI solutions is seeking an experienced QA Engineer to test products across various platforms and enhance test automation...  ...and at least 5 years of hands-on testing experience in networking technologies. Strong communication and debugging skills are essential... 

    Nexthop Systems Inc

    Santa Clara, CA
    23 hours ago
  • $108k - $153k

    Google Inc. in Sunnyvale, CA is hiring an Optical Validation Engineer to develop and automate testing processes for optical products. The...  ...teams to ensure quality and performance of Google Cloud network infrastructures. The position requires a Bachelor's degree in a... 

    Google Inc.

    Sunnyvale, CA
    23 hours ago
  •  ...company located in California seeks a Principal Software Dev QA Engineer to join the FortiSwitch Team. This position involves designing...  ...years of experience, a degree in Computer Science, and strong networking technology knowledge. Benefits include competitive salary and... 

    Fortinet, Inc.

    Sunnyvale, CA
    3 days ago
  • $168k - $322k

    NVIDIA Gruppe is looking for an experienced Senior QA Automation Engineer to join our Network AI platform team in Santa Clara, California. This role involves manual and automated testing to ensure quality in AI/ML-powered network solutions. The ideal candidate will have... 

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...NFV Automation Engineer Main experience required in NFV (network function virtualization) and Python scripting. Duties: Understand virtual networks and be able to automate the NFV deployments with various vendors. Proficient in Python, Rest API automation and he/she... 
    Work experience placement

    InterSources

    Santa Clara, CA
    3 days ago
  •  ...Network Infrastructure Architect Architecting, deploying, and supporting highly available...  ...Participating in on-call rotation and operational readiness efforts to ensure network...  .... 4+ years of experience in network engineering, preferably in large-scale data center... 

    Tranzeal

    Mountain View, CA
    3 days ago
  • $62 - $67 per hour

     ...Network Automation Engineer Pay Range: $62hr - $67hr The Network Automation Engineer will be responsible for designing, executing, and validating network lab testing for enterprise networking technologies. The role involves translating business and architecture requirements... 

    Cynet Systems

    Santa Clara, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Operations Engineer, HPC Networking. Be the first to apply!