Get new jobs by email
$109.2k - $223.4k
A leading cloud infrastructure company is seeking a Principal Network Engineer to support the design and operations of its global cloud computing environment. This role requires 6 to 10 years of experience and involves leading complex projects, managing network solutions...Suggested- ...an experienced network engineer to optimize high-performance networking protocols for AI models. The ideal candidate will integrate RDMA and InfiniBand into the inference stack, ensuring efficient communication and low latency. A deep understanding of NVIDIA architectures...Suggested
- A leading technology corporation is seeking a software engineer to design and develop Libfabric provider features for their networking product line. The ideal candidate will have at least 5 years of experience in networking software and a strong understanding of High-Performance...Suggested
- A leading data analytics company in the United States is looking for a Staff Software Engineer to lead the development of networking software for their massive parallel processing platform. The ideal candidate will have extensive experience in Linux kernel development and...Suggested
$184k - $287.5k
A leading technology company is looking for a Senior Linux Kernel Software Engineer to join their Linux networking drivers R&D team in Santa Clara. This role involves developing device drivers for network interface cards, integrating existing solutions, and leading engineering...Suggested- A leading technology company in Cupertino seeks a networking software developer to enhance high-performance networking solutions. Responsibilities include designing core functions, collaborating with various teams, and ensuring software release quality. Candidates should...Suggested
$150k - $230k
...in systems, HPC, or performance-critical software development ~ Strong proficiency in low-level C/C++ ~ Solid understanding of RDMA networking , including InfiniBand, RoCE, and IBVerbs ~ Experience working with multi-node, multi-GPU workloads ~ Familiarity...Suggested- ...seeking a Principal Network Engineer to support the design and operations of large-scale global cloud environments. This role focuses on RDMA/RoCE network fabrics which are vital for AI and HPC services. Ideal candidates will have deep knowledge of network systems and...SuggestedWorldwide
- ...OpenFlow -High-speed networks: ~1/10/40/100 Gigabit Ethernet ~ SDR, DDR, QDR, FDR IB ~ PCIe Gen 3 and Gen 4 ~ NVME and NVMEoF ~ RDMA over Ethernet (RoCE and NFS over RDMA) -Debugging of Embedded Hardware and Software -Experience Developing Portable, Embedded,...Suggested
$34 - $37 per hour
...backend, and management networks in on‑prem data centers, including new buildouts and upgrades. Optimizing network performance by tuning RDMA and minimizing latency, packet loss, and jitter across mission‑critical platforms. Troubleshooting complex issues end‑to‑end,...SuggestedHourly payContract workTemporary workLocal area- ...Qualifications Experience with different types of fabrics, such as PCIe, Infiniband, and RoCE Experience with fast networking stacks, such as RDMA Good communication skills and enthusiasm to help colleagues Submission Guidelines Please note that in order to be considered an...SuggestedFull timeTemporary workLocal areaFlexible hours
$165k - $241.4k
...including L2, L3, BGP, EVPN, VXLAN. Our Preferred Qualifications for this role Exposure to open source NoS SONiC. Familiarity with RDMA, RoCE and HPC networks. Knowledge of QoS and congestion control mechanism in data center network. GIT, Jira, Jenkins and CI/CD...SuggestedFull timeTemporary workWork at officeLocal areaFlexible hours$84k - $134k
...application benchmarks based on enterprise computing requirements and generic server architecture, key examples include HammerDB, MySQL, RDMA, flashed based storage (NVDIMM, and NVMe) • Execute benchmark testing and build proof of concepts based on latest data center...SuggestedWorldwide- ...the RMF process For HPC systems this includes: Linux based supercomputers MPI (Message passing interface) and OpenMP based clusters RDMA memory access Cray EX HPE slingshot Infiniband CXL memory Slurm and PBS job schedulers Cray SLES Suse Linux Troubleshooting LAPACK,...Suggested
$82k - $120k
...application benchmarks based on enterprise storage requirements and generic server architecture, key examples include Vdbench, FIO, RDMA, flash based storage (NVDIMM, and NVMe) Execute benchmark testing and build proof of concepts based on latest data center technologies...SuggestedWorldwide$2,000 per month
...scale compute systems, including 5+ years leading engineering teams Strong understanding of hardware/software interfaces such as PCIe, RDMA, memory hierarchies, interrupts, and device drivers Experience building or operating cluster-scale systems (HPC, AI infrastructure,...Work at officeRelocation package$175k - $250k
...and dynamic security policy frameworks Lead end‑to‑end design of advanced GPU interconnect fabrics (e.g., Infiniband, Spectrum‑X, RDMA/RoCEv2, DPUs/Smart NICs) leveraging CLOS/ECMP architectures Build and maintain carrier‑grade edge routing systems with BGP‑based peering...Remote work$125k - $175k
...'ll Do Provide front-line operational support for 24/7 Linux HPC compute, storage, and interconnects. Technologies involved include RDMA fabrics, parallel filesystems, HPC batch schedulers, FUSE filesystems, internal Jump software, multi-vendor hardware, cybersecurity...Work at officeAfternoon shift$291k
...Virtual Private Cloud (VPC), and Load Balancing, while advancing high-performance networking for AI and HPC workloads using InfiniBand, RDMA, and low-latency Ethernet. As the networking PM, you will shape the services that connect customers securely and efficiently to...Work at officeLocal areaWork from homeFlexible hours- ...operations including power, cooling, and colo/vendor engagements Strong Linux systems administration experience, including kernel drivers, RDMA stack tuning, and performance analysis Experience with schedulers and orchestration systems such as Slurm and Kubernetes Exposure...Long term contractContract workFixed term contractWork at officeLocal areaVisa sponsorshipShift work3 days per week
- ...expertise in: Distributed and parallel storage integration with Kubernetes for HPC workloads. High-performance networking (InfiniBand, RDMA, RoCE) in containerized environments. Proven ability to design scalable, secure, and resilient Kubernetes-based architectures for...Shift work
- ...NVIDIA Quantum InfiniBand switches, cable types (NDR/HDR), and troubleshooting commands. AI Ethernet (RoCEv2): Solid understanding of RDMA over Converged Ethernet (RoCEv2), including the configuration of PFC and ECN on switches (Spectrum/Arista/Cisco/Juniper). Fabric...Full timeImmediate startShift workEarly shift
$90k - $193.75k
...Expertise In The Following Domains Network programmability: proficiency in developing programmable networks, application offloading, and RDMA. SmartNICs: experience with Smart Network Interface Cards, leveraging their capabilities for improved network performance and...Work experience placementInternshipWork at office$100k
...Kubernetes (specifically Device Plugins for GPUs) and Terraform/Ansible for “Infrastructure as Code.” Networking Deep understanding of RDMA (Remote Direct Memory Access) and non‑blocking Clos topologies. AI Workloads Familiarity with distributed training techniques (...Minimum wageLocal areaRemote work- ...background in UCIe, CXL, NVLink, or UAL microarchitecture and protocols is a plus Familiarity with High-speed networking: InfiniBand, RDMA, NVLink is a plus Expert knowledge of transformer architectures, attention mechanisms, and model parallelism techniques Multi-...Temporary workRemote workFlexible hoursShift workNight shift
$164.43k - $274.04k
...interoperability testing across L2/L3, EVPN/VXLAN, QoS, ACL, and multi‑ASIC platforms. Validate AI data center networking features including RDMA, RoCEv2, ECN, PFC, and congestion management. Develop Python‑based automated test cases, integrate into CI/CD pipelines, GitHub...Flexible hours$109.2k - $223.4k
...following languages English Job Description We are the AI Infrastructure - Network Operations team at OCI. We support and operate the RDMA/RoCE network fabrics for OCI's largest AI and HPC customers. These fabrics are the foundation underneath OCI's AI, GPU and HPC...Temporary workImmediate startFlexible hours- ...management systems (e.g., PBS, LSF) Proven experience in InfiniBand networking design and operations, including subnet management, QoS, RDMA, and performance tuning Experience with high-speed Ethernet networks and associated protocols (e.g., VLAN, LACP, BGP, OSPF, EVPN,...Remote workFlexible hours
- .../spine network connectivity. Run cluster‑wide burn‑in and stress testing. Validate GPU‑to‑GPU and node‑to‑node performance (NCCL, RDMA, GPUDirect). Troubleshoot hardware, firmware, and fabric‑level issues. Automation & Process Contribute to automation for provisioning...Work at office
- ...platforms such as Slurm or PBS cluster provisioning frameworks (e.g., xCAT, Warewulf) high-performance networking technologies including RDMA / InfiniBand distributed parallel compute workloads utilizing MPI or OpenMP GPU-enabled compute resources supporting CUDA-based...