Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Operations Engineer, HPC Networking

$90k - $110k

CoreWeave

Job Description

Job Description

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at

About the Role

At CoreWeave we are seeking a dedicated and detail-oriented Operations Engineer to join our HPC Networking Team. HPC Networking at CoreWeave is tasked with developing and operating some of the largest InfiniBand fabrics, powering industry leading AI workloads.

What You'll Do

In this role, you will support the deployment, monitoring, troubleshooting, and maintenance of large-scale InfiniBand fabrics, ensuring their stability and performance. The ideal candidate will have a strong operations mindset, effective collaboration skills, and the ability to solve complex issues in a dynamic environment.

  • Regularly monitor the performance and health of InfiniBand fabrics, including switches, host adapters, and nodes.
  • Investigate and resolve operational issues within InfiniBand fabrics, such as network connectivity problems and performance bottlenecks.
  • Assist with the installation and operational bring-up of large InfiniBand fabrics in collaboration with onsite personnel and customer teams.
  • Perform routine maintenance and upgrades on InfiniBand switches and control plane components.
  • Collaborate with HPC cluster operations teams to provide troubleshooting and operational expertise.

Investing in our people is one of our top priorities, and we value candidates who can bring their diversified experiences to our teams. Here are some qualities we've found compatible with our team. We'd love to talk about whether this aligns with your experience and Interests and what you're excited to work on next.

Who You Are

Minimum Qualifications

  • Bachelor's degree in Computer Engineering, Electrical Engineering, Computer Science, or a related field.
  • At least 1 year of experience with InfiniBand or similar networking technologies.
  • Solid understanding of networking concepts, including architectures, topologies, operational best practices, and troubleshooting.
  • Experience with Linux system administration and maintenance.
  • Proficiency in at least one scripting language (e.g., Python) and hands-on experience with Ansible.
  • Applicants must have work authorization that does not require sponsorship from the company now or in the future
  • Experience with monitoring and visualization platforms such as Grafana or Prometheus.

Preferred Qualifications

  • Hands-on experience with Nvidia UFM or similar fabric management tools.
  • Experience with operational tooling and automation frameworks like Ansible.
  • Knowledge of data center operations, including server racks, and cabling.
  • Python or Bash scripting.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $90,000-$110,000. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.

What We Offer

The range we've posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.

In addition to a competitive salary, we offer a variety of benefits to support your needs. The benefits below reflect our US-based offerings; for roles in other locations, benefits vary and are shared during the hiring process. These include:

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement
  • Ability to Participate in Employee Stock Purchase Program (ESPP)
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption

California Applicants

California Consumer Privacy Act

Equal Opportunity & Accommodations

CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.

As part of this commitment and consistent with the Americans with Disabilities Act (ADA) , CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact: View email address on ziprecruiter.com.

Export Control Compliance

This position requires access to export controlled information. To conform to U.S. Government export regulations applicable to that information, applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency. CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.

Vacancy posted 15 days ago
Similar jobs that could be interesting for youBased on the Operations Engineer, HPC Networking in Sunnyvale, CA vacancy
  • $152k - $241.5k

     ...efficiency. Success in this role requires both operational precision along with developing and...  ...measurable, and aligned with long-term engineering demands. What you'll be doing:...  ...scheduling systems (LSF, Slurm, etc.) in HPC or silicon design environments Proficiency... 
    Suggested

    NVIDIA

    Santa Clara, CA
    21 hours ago
  • $124k - $195.5k

     ...NVIDIA. We seek an expert to build and operate these clusters at high reliability,...  ...and automation to improve engineers’ productivity. As an HPC Operations Engineer, you are responsible...  ...automounter, LDAP, DNS, and TCP/IP networking in Red Hat Linux distribution flavors... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $124k - $195.5k

    NVIDIA Gruppe in Santa Clara seeks an HPC Operations Engineer to design and implement compute clusters for silicon development. Ideal candidates will have experience troubleshooting in large-scale environments and enhancing deployment automation. Applicants should be proficient... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $181k - $297k

     ...Description LinkedIn is the world's largest professional network, built to create economic opportunity for every...  ...be based in Mountain View, CA. We are seeking an HPC Network Engineer to design, deploy, and operate high-performance, low-latency Ethernet fabrics for... 
    Suggested
    For contractors
    Work at office
    Flexible hours

    LinkedIn

    Mountain View, CA
    3 days ago
  • $152k - $241.5k

    NVIDIA Gruppe is seeking a motivated Performance Engineer to influence the roadmap of our communication libraries. The role involves conducting in-depth performance characterization on large multi-GPU and multi-node clusters and studying the interaction of our libraries... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...Our enterprise-level client is seeking to add a Network Operations Engineer to the team in Mountain View, CA. Please see below for full details- Job Notes: -- 3-month contract / extensions possible with good performance. -- Onsite in Mountain View, CA 94041... 
    Hourly pay
    Contract work
    For contractors

    Merge IT LLC

    Mountain View, CA
    1 day ago
  • The Network Operations Engineer is responsible for maintaining, supporting, and enhancing network infrastructure through safe change execution, incident response, and operational monitoring. This role requires strong analytical and troubleshooting skills, the ability to... 
    Night shift

    Compunnel, Inc.

    Sunnyvale, CA
    3 days ago
  •  ...Network Engineer - AI/HPC Memphis, TN; Palo Alto, CA About XAI XAI's mission is to create AI systems that can accurately understand the...  ...appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are... 

    Xai

    Palo Alto, CA
    1 day ago
  • $248k - $396.75k

     ...highly skilled Principal AI/ML Engineer to join our dynamic team to...  ...build the next generation of IT Networking space and help lead the team...  ...GPU cluster networking, and HPC environments. Cloud and...  ...architecture/standards/reuse, and operational documentation via Confluence/... 

    NVIDIA

    Santa Clara, CA
    4 days ago
  • NVIDIA Corporation is seeking a talented SDK Engineer to join the NVLink SDK group in Santa Clara,...  ...and implement SDK features for cutting-edge networking products, contributing to the development of technologies for AI, HPC, and cloud environments. Applicants should have... 

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • A leading tech company in California is looking for a Network Operations Engineer responsible for maintaining and enhancing network infrastructure. The ideal candidate will possess strong analytical and troubleshooting skills and will engage in operational rotations and... 
    Night shift
    Weekend work

    Compunnel, Inc.

    Sunnyvale, CA
    3 days ago
  • A leading technology firm in California is seeking network engineers with hands-on experience in InfiniBand and Ethernet for managing high-performance computing (HPC) and artificial intelligence (AI) environments. Candidates should have advanced knowledge of networking... 

    TechDigital Group

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

    NVIDIA Gruppe in Santa Clara is seeking a Senior Software Engineer to enhance their HPC infrastructure. The role involves applying distributed systems patterns, automation, and building scalable services in a hybrid multi-cloud environment. Candidates should have strong... 

    NVIDIA Gruppe

    Santa Clara, CA
    21 hours ago
  • $200k - $400k

    A dedicated research lab is seeking a Network Engineer to design and optimize low-latency, high-bandwidth networking solutions for AI supercomputing clusters. You will work on cutting-edge technologies in collaboration with world-class researchers. The ideal candidate... 

    Institute of Foundation Models

    Sunnyvale, CA
    21 hours ago
  • $168k - $264.5k

    NVIDIA Corporation is seeking a Senior Network Engineer for complex campus network deployments in Santa Clara, California. This role involves leading design and implementation, managing site deployments, and utilizing network automation tools (Ansible, Python). Candidates... 

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $128k - $201.25k

     ...impact on the world. As a Senior Technical Marketing Engineer for Datacenter Networking, you will join a dedicated team that is passionate about...  ...The Crowd: Hands‑on experience setting up and tuning HPC clusters with Slurm, Kubernetes, or other schedulers.... 
    Work at office

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...Senior Firmware Engineer We are looking for an excellent Senior Firmware Engineer...  ...team develops groundbreaking networking features for AI, cloud, HPC and storage. We drive the data growth...  ...how a big software project is operated, maintained, qualified and released... 

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $166k - $214k

    A leading cybersecurity firm is seeking a Principal Software Development QA Engineer to enhance product reliability and performance. The successful candidate will manage test strategies, design test plans, and develop automated scripts for product evaluations. Candidates... 

    Fortinet, Inc.

    Sunnyvale, CA
    21 hours ago
  • $80k

    A leading technology company based in Sunnyvale, California, is seeking an Engineer for Cloud Operations & Support. The successful candidate will deploy and maintain cloud services while developing automation tools to enhance operational efficiency. A Bachelor’s degree... 

    eGain Corporation

    Sunnyvale, CA
    3 days ago
  • A high-performance AI infrastructure company in Santa Clara is looking for an IT Helpdesk and Operations Engineer. This role involves supporting and designing IT systems, managing security protocols, and leading significant IT projects. Candidates should have 2-3+ years... 

    Nexthop Systems Inc

    Santa Clara, CA
    21 hours ago
  • $152k - $241.5k

     ...NVIDIA seeks a senior software engineer to join the AI Networking co-design and benchmark R&D team. In this pivotal role, the candidate is responsible...  ...the intersection of at least two of the following areas: HPC, networking, and AI applications. ~ Hands-on experience... 

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $168k - $270.25k

     ...We are seeking an experienced Senior QA Automation Engineer to join our Network AI platform team. This role combines manual testing expertise...  ...architectures with AI/ML correlation engines using multiple network operating systems via WebUI, REST APIs, CLI, and shell interfaces.... 

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $165k - $220k

     ...Senior Specialist Field Engineer - Networking CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a...  ...on networking technologies within high-performance compute (HPC) environments Collaborate closely with customers to understand... 
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    4 days ago
  • $193.3k - $261.5k

     ...We are seeking an experienced engineer and technical leader to join our team that owns the network stack for EC2 distributed AI/ML...  ...with high-speed networking or HPC/RDMA interconnects is highly valued...  ...types, software stacks, Linux operating systems, cutting-edge releases... 
    Internship
    Local area
    Flexible hours

    Amazon

    Cupertino, CA
    3 days ago
  • $172.1k - $258.6k

    Cupertino, California, United States Operations and Supply Chain Imagine what you could do here. At Apple, new ideas have a way of becoming...  ...most exciting new technologies at Apple. The Operations Test Engineering team is seeking an individual with strong technical and... 
    Contract work
    Work experience placement
    Worldwide
    Relocation package

    Apple Inc.

    Cupertino, CA
    3 days ago
  • General Motors is hiring a Staff Security Software Engineer for their Cybersecurity Team in Mountain View, California. This role requires expertise in software engineering and security integrations to define technical strategy and architecture for enterprise-scale projects... 

    General Motors

    Mountain View, CA
    3 days ago
  • $148k - $235.75k

    NVIDIA is looking for an excellent Software Engineer to join the InfiniBand Switch and NVLink...  ...effort for the next-generation networking products. The verification team develops modern networking features for cloud, HPC and storage. We drive the data growth of... 
    Shift work

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $124k - $195.5k

    NVIDIA is looking for an excellent SDK Engineer to join the NVLink SDK group in Santa Clara. As...  ...SDK features, delivering the next-generation networking products. The SDK team develops cutting-edge networking features for AI, HPC and cloud. We drive the data growth of the... 
    Shift work

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $159k - $231k

    Senior Data Center Operations Engineer, Google Cloud Sunnyvale, CA, USA Qualifications Bachelor’s degree in Electrical Engineering, Computer...  .... 4 years of experience working in a data center or networking operation center technical environment. Preferred Qualifications... 
    Full time
    Work at office
    Worldwide

    Google Inc.

    Sunnyvale, CA
    3 days ago
  •  ...established industry player is seeking a Senior Director of Solutions Engineering to lead innovative teams in AI and high-performance computing...  .... The ideal candidate will have extensive experience in HPC and AI systems design, with a proven track record in managing technical... 

    Skilltorch

    Santa Clara, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Operations Engineer, HPC Networking. Be the first to apply!