Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

RDMA Ops Engineer for Ultra-Low Latency Clusters

$104.4k - $171k

Alibaba Cloud

A leading cloud service provider is seeking an RDMA Ops Engineer to optimize high-performance networking infrastructure for computing clusters. Responsibilities include deploying RDMA-based network architectures and optimizing performance. The ideal candidate has strong scripting skills, experience with RDMA technologies, and a solid understanding of Linux internals. This position offers a competitive salary range between $104,400 and $171,000/year based on market location and experience. #J-18808-Ljbffr

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the RDMA Ops Engineer for Ultra-Low Latency Clusters in Sunnyvale, CA vacancy
  • $104.4k - $171k

     ...Overview We're seeking a skilled RDMA Ops Engineer to optimize and maintain high-performance networking infrastructure for our computing clusters. This role focuses on building and operating ultra-low latency, high-throughput networks using RDMA technologies to power... 
    Suggested

    Alibaba Cloud

    Sunnyvale, CA
    4 days ago
  • $147.4k - $272.1k

    Apple Inc. is seeking a Software Development Engineer for Siri Runtime Systems and Interaction in Cupertino, California. This role involves...  ...and integrating next-generation Siri experiences, focusing on low-latency interactions and system performance optimization. Ideal... 
    Suggested

    Apple Inc.

    Cupertino, CA
    2 days ago
  • $200k - $400k

     ...dedicated research lab is seeking a Network Engineer to design and optimize low-latency, high-bandwidth networking solutions for AI supercomputing clusters. You will work on cutting-edge...  ...candidate has strong experience with NVIDIA RDMA technologies, networking protocols,... 
    Suggested

    Institute of Foundation Models

    Sunnyvale, CA
    4 days ago
  • $150k - $230k

     ...Clockwork Systems, Inc. in Palo Alto, CA, is looking for a Senior Engineer to lead the development of their Core ClockSync systems. The role involves building and scaling low-latency time synchronization infrastructure. Candidates should have over 6 years of experience... 
    Suggested

    Clockwork Systems

    Palo Alto, CA
    4 days ago
  • Garuda Ventures in Palo Alto is seeking Robotics Software Engineers to build high-performance middleware and runtime systems for robotic platforms. You will design low-latency execution frameworks and optimize inter-process communication. Successful candidates will develop... 
    Suggested

    Garuda Ventures

    Palo Alto, CA
    2 days ago
  • $200k - $400k

     ...Institute Of Foundation Models Engineer The Institute of...  ...) designs and operates ultra-scale GPU...  ...awareness · Reduce tail latency and improve determinism...  ...execution under real-world cluster failures Core Technical...  ...Deep debugging of NCCL, RDMA, and custom... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    1 day ago
  • A leading computer technology company in California seeks an HPC Operations Engineer to provide leadership in designing and implementing compute clusters. You will troubleshoot issues, enhance automation, and collaborate across diverse teams to improve systems. The role... 

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • $224k - $356.5k

     ...Gruppe is seeking skilled professionals to advance their GeForceNOW cloud-gaming service. Candidates will collaborate to develop low-latency streaming technologies, improve user experience, and innovate networking solutions. A PhD or Master's degree in a related field and... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $135k - $155k

    Software Engineer, Low Latency Computing (Starlink) Sunnyvale, CA SpaceX was founded under the belief that a future where humanity is exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make... 
    Permanent employment
    Temporary work
    Internship
    Worldwide
    Weekend work

    SPACE EXPLORATION TECHNOLOGIES CORP

    Sunnyvale, CA
    4 days ago
  • A tech company in Mountain View seeks talented engineers for a role emphasizing high-performance systems, inference optimization, and model acceleration. You will thrive in ambiguity, tackle unclear problems, and design impactful solutions. The position offers a competitive... 

    Inworld

    Mountain View, CA
    3 days ago
  • $147.4k - $272.1k

    Silicon Validation Software Engineer: Embedded and Low-level Programming Cupertino, California, United States Hardware At Apple, new ideas have...  ...Develop boot and driver code for Apple SOC, including AP clusters, IO Co-Processor system, fabric, power management, memory... 
    Relocation

    Apple Inc.

    Cupertino, CA
    5 days ago
  • $172.5k - $210k

     ...a Virtualization Validation Engineer , you will be responsible for...  ...large-scale, multi-node GPU clusters. You will focus on high-performance...  ...environments to ensure low-latency, high-bandwidth communication...  ...Validation : Validate SR-IOV and RDMA configurations to ensure that... 
    Temporary work

    Crusoe

    Sunnyvale, CA
    1 day ago
  •  ...workflow automation with Moveworks’ Reasoning Engine and natural language capabilities, we...  ..., model evaluation frameworks, and LLM latency optimization. You will guide the team's...  ...synthesis and distributed training to ultra-low-latency inference and serving—for hundreds... 
    Work at office
    Remote work
    Flexible hours

    ServiceNow

    Mountain View, CA
    6 days ago
  •  ...Technician position in a semiconductor equipment company or a technical position in the engineering field) PVD or semiconductor vacuum equipment experience Multi chamber cluster systems and robotics experience is highly preferred Ability to understand electrical... 
    Full time
    Work at office
    Immediate start
    Flexible hours
    Shift work

    Talent Search PRO

    San Jose, CA
    28 days ago
  • NVIDIA is searching for a highly skilled HPC Cluster Engineer to design, deploy, and operate GPU Compute Clusters for Electronic Design Automation...  ...‑speed networking pertaining to HPC, including InfiniBand, RDMA and RoCE. Understanding of fast, distributed storage systems... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $300 per month

     ...runs on. We are looking for a Principal Engineer on our Production Engineering team. Someone...  ...: BGP, OSPF, ECMP, load balancing, and low-latency network design in production — you can...  ...Experience with HPC infrastructure: GPU cluster operations, job schedulers (Slurm, Kubernetes... 
    Full time
    Temporary work
    Immediate start

    Crusoe

    Sunnyvale, CA
    21 days ago
  •  ...development of a novel surgical robot system. The engineer will collaborate with a skilled team to...  ...Strong knowledge of real-time systems, low-latency pipelines, and performance tuning....  ...media transport (e.g., SMPTE ST 2110, RDMA, NVENC/NVDEC). Experience with working... 
    Local area

    Intuitive

    Sunnyvale, CA
    2 days ago
  • $165k - $242k

     ...Systems Engineer, Kernel Livingston, NJ / New York, NY / Sunnyvale,...  ...ideal for someone who thrives in low-level systems engineering, and...  ...Stability – Tune kernel subsystems for latency, throughput, and scalability in distributed HPC/AI clusters. About the Role:... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    2 days ago
  • $160k - $250k

    Staff Hardware Engineer - NFC Digital Key Hardware Digital Key enables customers to...  ...Field Communication (NFC), Bluetooth Low Energy (BLE) and Ultra‑Wideband (UWB) technologies that must...  ...NFC, including accuracy, precision, latency, robustness, and reader field strength... 
    Local area
    Worldwide
    Flexible hours

    General Motors

    Mountain View, CA
    2 days ago
  • $124k - $195.5k

    NVIDIA Gruppe in Santa Clara seeks an HPC Operations Engineer to design and implement compute clusters for silicon development. Ideal candidates will have experience troubleshooting in large-scale environments and enhancing deployment automation. Applicants should be proficient... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $320k

     ...of our data centers is the ability to engineer integrated system designs in close...  ...transition from "best-effort" transport to ultra-reliable, low-latency communication (URLLC) protocols....  ...interconnections for high-density GPU clusters with the unique backhaul requirements... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $131k - $175k

     ...Senior Hardware Systems Engineer – AI Rack & Cluster Infrastructure Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation.... 
    Remote work
    Flexible hours

    Arista Networks, Inc.

    Santa Clara, CA
    4 days ago
  • Decisive Point is looking for a Software Engineer to work on the NextGen OS team focused on building an innovative operating system stack...  ...and system services, requiring a strong background in C/C++ and low-level systems. The ideal candidate will have over 6 years of software... 

    Decisive Point

    Mountain View, CA
    1 day ago
  • $150k - $230k

     ...A technology firm in Palo Alto is seeking a Senior Software Engineer to lead the development of their Core ClockSync systems. This position focuses on building low-latency, highly accurate time synchronization infrastructure across distributed environments. The ideal candidate... 

    Clockwork Inc

    Palo Alto, CA
    4 days ago
  •  ...ServiceNow's leading workflow automation with Moveworks' Reasoning Engine and natural language capabilities, we deliver the AI platform...  ...Establish automation guidelines, standards, and patterns for future CS Ops engineering Partner cross-functionally Collaborate with... 
    Work at office
    Remote work
    Flexible hours

    ServiceNow

    Mountain View, CA
    5 days ago
  •  ...Senior Switchgear Electrical Engineer - Medium & Low Voltage (MV/LV) Role Summary The Senior Switchgear Engineer (MV/LV) is responsible for the design, engineering, validation, and technical leadership of medium- and low-voltage switchgear systems. This role provides... 

    Vital Chemicals

    Cupertino, CA
    3 days ago
  •  ...the intersection of inference engines, distributed systems, and GPU...  ...thinks in terms of throughput, latency, memory movement, and scheduling...  ...pipelines using RCCL, RDMA, and collective‑based execution...  ...across single‑GPU and multi‑GPU clusters. Optimize continuous batching... 

    Advanced Micro Devices

    Santa Clara, CA
    1 day ago
  • $181.1k - $318.4k

     ...and more. We're looking for an engineer who doesn't just build...  ...production platform with strict latency SLOs, complex failure domains,...  ...systems at the scale of billion ops. Solid understanding of async...  ...workloads across heterogeneous clusters (EKS and/or bare-metal) Familiarity... 
    Relocation

    Apple Inc.

    Santa Clara, CA
    2 days ago
  • $126.8k - $220.9k

    iPhone Touch Sensing Electrical Design Engineer Cupertino, California, United States Hardware...  ...develops, and delivers high-precision, low-latency MultiTouch solutions that are the gold...  ...from proof-of-concept through ultra-high-volume production! Our team features... 
    Relocation

    Apple Inc.

    Cupertino, CA
    4 days ago
  • $181.1k - $318.4k

     ...leading technology company is seeking a Senior Systems Software Engineer specializing in video technologies to enhance core application layers...  ...workflows. The role requires extensive experience in low-level application development and a strong understanding of macOS... 

    Apple Inc.

    Cupertino, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to RDMA Ops Engineer for Ultra-Low Latency Clusters. Be the first to apply!