Get new jobs by email
- Job Description Design, develop, troubleshoot and debug software programs for databases, applications, tools, networks etc. Responsibilities As a member of the software engineering division, you will take an active role in the definition and evolution of...SuggestedFlexible hours
$180k
...strong communication skills. They should be able to concisely and accurately share knowledge with their teammates. About the Role RDMA Engineers on xAI’s Supercomputing team design and optimize low-latency, high-bandwidth networking solutions using NVIDIA’s RDMA-...SuggestedRemote jobLocal area$103.7k - $153.7k
...and optimize transport-level solutions that adaptively select paths and adjust rates in large-scale AI fabrics. ML Collective & RDMA Systems: Develop collective communication transports for ML workloads that maintain high performance through congestion,...SuggestedTemporary workFlexible hours$96.8k - $251.6k
Job Description Senior Principal Software Engineer (IC5) - Oracle Cloud Infrastructure (OCI) Build the Future of Cloud at Oracle Cloud Infrastructure (OCI) At Oracle Cloud Infrastructure (OCI), we enable mission-critical applications for the world's top enterprises...SuggestedTemporary workFlexible hours- ...multicloud environments centered on OCI . Build scalable compute environments using OCI bare metal , VMs , GPU instances , and RDMA cluster networking . Ensure architecture and deployments align with IL5‑approved patterns and federal security requirements....SuggestedRemote jobFull time
- ...familiarity with signal processing or mathematical modeling Have experience with GPU software development Have experience with RDMA Demonstrate familiarity with radar concepts Demonstrate familiarity with integrated systems combining software and hardware to...SuggestedLocal areaNight shift
$140k - $175k
...(HPC) systems design or deployment. ~ Familiary with NVIDIA CUDA-Q ~ Direct experience with NVIDIA Holoscan Sensor Bridge, RDMA over Converged Ethernet, or similar high‑speed coherent interconnects. ~ Experience supporting or developing error‑correction decoding...SuggestedFull timeTemporary workWork at officeFlexible hours- ...inform our future supercomputer network designs. You might thrive in this role if you: Have written distributed algorithms using RDMA in the past. Are comfortable writing low level performance sensitive CPU and/or GPU code. Are familiar with network simulation...SuggestedFull timeWork at officeRelocation package
- ...-on troubleshooting experience, and familiarity with device drivers and QoS technologies such as policy maps, class maps, WRED, and RDMA. Proficiency in collaborative tools and excellent communication are essential. #Skills: #QoSDevelopment #EmbeddedSystems #RouterSwitchSoftware...SuggestedContract work
- ...Experience with RH Satellite Experience with Lustre, NFS, and other file systems Experience with Ethernet networking Experience with RDMA communications (InfiniBand and OmniPath) Experience with InfiniBand or Omni-Path high speed fabrics, including subnet management,...SuggestedFull timeContract workWork at officeRemote workRelocationMonday to FridayFlexible hoursShift work
$175k - $220k
...Determine and implement in PyTorch an optimal sharding scheme for a novel attention variant Optimize communication patterns in RDMA networks (Infiniband, RoCE) Debug numerical instabilities for a given model for a small portion of requests when deployed at scale...SuggestedFull time$19.61 - $53 per hour
...working on Oracle Engineered Systems which provide utmost performance and availability using technologies such as Persistent Memory and RDMA. Disclaimer: Certain US customer or client-facing roles may be required to comply with applicable requirements, such as...SuggestedHourly payTemporary workInternshipWorldwideFlexible hours$147k - $240k
...oriented programming languages such as Golang, Python, Java, C++, Rust. ~ Experience with network protocols such as TCP/IP, BGP, MPLS, RDMA, network overlay technologies, and network-based Quality of Service (QoS). Preferred Qualifications BS and 8+ years of...SuggestedFull timeFor contractorsWork experience placementWork at officeFlexible hours$100k
...thinking to solve challenging problems in large-scale computing. Experience or familiarity with high-performance networking, MPI, RDMA, or cluster computing frameworks is advantageous but not required. Compensation for all engineers at Tenstorrent ranges from $...SuggestedPermanent employmentFull time- ...signal processing or mathematical modeling. Experience with GPU programming and high-performance computing. Familiarity with RDMA and advanced networking concepts. Expertise in modern C++ standards (C++17 and beyond). Experience in systems that...SuggestedContract workTemporary work
- ...Compute, nvprof, PyTorch Profiler KV cache optimization, Flash Attention, Mixture of Experts High-speed networking: InfiniBand, RDMA, NVLink Expertise in CUDA programming, GPU memory hierarchies, and hardware-specific optimizations Proven track record...Full timeTemporary workRemote workFlexible hoursShift workNight shift
- ...distributed training environments Contributions to ML infrastructure open-source projects Familiarity with storage, networking, or RDMA/GPU Direct technologies Understanding of observability in ML pipelines (metrics, logs, dashboards) Enjoy Challenging...Full time
- ...interpret Linux logs for diagnostics Nice-to-Have Skills: Familiarity with the Linux command line (CLI) Exposure to RoCE (RDMA over Converged Ethernet) networking Benefits Our comprehensive benefits package for full-time salaried employees is...Full timeTemporary workImmediate startMonday to FridayShift workRotating shiftDay shift
$325k
...experience with one or more ML hardware accelerators (GPUs, TPUs, Trainium). Understand ML-specific networking optimizations like RDMA and InfiniBand. Have expertise in AI-specific observability tools and frameworks. Have experience with chaos engineering and...Full timeWork at officeVisa sponsorshipFlexible hours$152k - $248k
...oriented programming languages such as Golang, Python, Java, C++, Rust. ~ Experience with network protocols such as TCP/IP, BGP, MPLS, RDMA, network overlay technologies, and network-based Quality of Service (QoS). Preferred Qualifications: BS and 8+ years of...Full timeFor contractorsWork experience placementWork at officeFlexible hours- ...~ Scripting and automation (Bash, Python) ~ Understanding of CPU/GPU architectures and NUMA ~ Networking fundamentals (TCP/IP, RDMA, firewalls) ~ Experience with xCAT, Bright Cluster Manager, or similar ~ Knowledge of containerization and orchestration technologies...Flexible hours
$87k - $178.1k
...Job Description The AI2NE Org strives to be global leaders in the RDMA cluster networking domain and enable seamless, accelerated High-Performance Compute (HPC), Artificial Intelligence and Machine Learning advancements. We envision a future where artificial intelligence...Temporary workFlexible hours- ...background in UCIe, CXL, NVLink, or UAL microarchitecture and protocols is a plus Familiarity with High-speed networking: InfiniBand, RDMA, NVLink is a plus Expert knowledge of transformer architectures, attention mechanisms, and model parallelism techniques Multi-...Full timeTemporary workRemote workFlexible hoursShift workNight shift
$200k - $400k
...Linux kernel, GPU/accelerator kernels, and interconnects. Develop and tune communication libraries such as NCCL, MPI, UCX, RCCL, and RDMA-based systems. Partner with ML researchers and engineers to support frameworks like PyTorch, MegatronLM, and DeepSpeed in large-...Full timeVisa sponsorship$342k
...support high-performance systems. Design and implement kernel drivers, including for functionality related to DMA, PCIe, NICs, and RDMA. Drive end-to-end development of system-scale networking, including required kernel and other low-level software. Collaborate...$186.55k - $279.77k
...threaded and/or asynchronous communications environments. 3+ years' experience with network stack, and/or network protocols such as RDMA. How to Stand out (Preferred Qualifications): Experience with Multi-NIC environments and features (e.g. load balancing and...Full timeLocal areaRemote workWork from home- ...Oracle Cloud). Experience with FinOps practices for AI workloads. Understanding of data locality, high-throughput networking (RDMA, InfiniBand), and parallel file systems. Ability to translate AI workload requirements into infrastructure reference...
- ...scale infrastructure design. ~ Deep knowledge of HPC cluster design, parallel computing and high-speed networking (e.g. InfiniBand, RDMA, RoCE). ~ Experience with NVIDIA GPU computing, CUDA, AI/ML accelerators and high-density compute architectures. ~ Strong...Full timeWork at officeRelocation package
$179.2k - $268.8k
...safety standards Familiarity with io_uring, DirectStorage or similar high performance storage APIs Developing software targeting RDMA systems such as InfiniBand Familiarity with a real time operating systems Experience with developing massively scalable...Permanent employmentFull timeWork at officeImmediate startVisa sponsorship- ...design, and system-level modeling ~ Strong working knowledge of various interconnect technologies including PCIe, CXL, NoCs, ethernet, RDMA ~ Excellent communication skills, demonstrated ability to influence key stakeholders, and mentorship capabilities....Full timeTemporary workRemote workFlexible hoursShift work