Get new jobs by email
$96.8k - $251.6k
...delivering networking mechanisms for large-scale AI systems. The ideal candidate has over 10 years of experience, strong expertise in RDMA, and proficiency in debugging complex systems. Competitive compensation between $96,800 and $251,600 annually, plus comprehensive...Suggested- ...functionality across networking hardware. Ideal candidates will have over 10 years of experience in systems engineering and a strong background in RDMA and high-performance protocols. A comprehensive benefits package is offered with competitive compensation. #J-18808-Ljbffr OracleSuggested
$96.8k - $251.6k
...Build and optimize transport-level solutions that adaptively select paths and adjust rates in large-scale AI fabrics. ML Collective & RDMA Systems: Develop collective communication transports for ML workloads that maintain high performance through congestion, failures,...SuggestedTemporary workFlexible hours$79.1k - $158.2k
A global technology leader in Nashville is seeking a Network Engineer to support the design and operations of a large-scale Oracle Cloud Infrastructure. Ideal candidates will have a bachelor's degree in Computer Science and experience in network engineering. The role includes...Suggested- A technology consulting firm is seeking a Consulting Member of Technical Staff (CMTS) in Seattle, Washington. This role is a hands-on technical leader responsible for architecting, prototyping, and delivering advanced networking mechanisms critical to large-scale AI systems...Suggested
- The Consulting Member of Technical Staff (CMTS) is a hands-on technical leader responsible for architecting, prototyping, and delivering advanced networking mechanisms that underpin large-scale AI systems. This role focuses on deep technical execution—designing and implementing...Suggested
$96.8k - $251.6k
Job Description Senior Principal Software Engineer (IC5) - Oracle Cloud Infrastructure (OCI) Build the Future of Cloud at Oracle Cloud Infrastructure (OCI) At Oracle Cloud Infrastructure (OCI), we enable mission-critical applications for the world's top enterprises...SuggestedTemporary workFlexible hours- ...interpret Linux logs for diagnostics Nice-to-Have Skills: Familiarity with the Linux command line (CLI) Exposure to RoCE (RDMA over Converged Ethernet) networking Benefits Our comprehensive benefits package for full-time salaried employees is effective...SuggestedFull timeTemporary workImmediate startRelocation packageFlexible hoursShift workRotating shiftDay shift
- ...~ Scripting and automation (Bash, Python) ~ Understanding of CPU/GPU architectures and NUMA ~ Networking fundamentals (TCP/IP, RDMA, firewalls) ~ Experience with xCAT, Bright Cluster Manager, or similar ~ Knowledge of containerization and orchestration technologies...SuggestedFlexible hours
- ...multicloud environments centered on OCI . Build scalable compute environments using OCI bare metal , VMs , GPU instances , and RDMA cluster networking . Ensure architecture and deployments align with IL5‑approved patterns and federal security requirements....SuggestedFull timeRemote work
- ...Compute, nvprof, PyTorch Profiler KV cache optimization, Flash Attention, Mixture of Experts High-speed networking: InfiniBand, RDMA, NVLink Expertise in CUDA programming, GPU memory hierarchies, and hardware-specific optimizations Proven track record...SuggestedFull timeTemporary workRemote workFlexible hoursShift workNight shift
- ...advancements in database technologies and cloud offerings Implement and manage Exadata X9 hardware including persistent memory PMem and RDMA algorithms to optimize performance and reduce latency Deploy and manage Exadata X1110M98 hardware leveraging its scaleout...SuggestedFor contractorsOverseas
$175k - $250k
...and dynamic security policy frameworks Lead end‑to‑end design of advanced GPU interconnect fabrics (e.g., Infiniband, Spectrum‑X, RDMA/RoCEv2, DPUs/Smart NICs) leveraging CLOS/ECMP architectures Build and maintain carrier‑grade edge routing systems with BGP‑based peering...SuggestedRemote work- ...scale infrastructure design. ~ Deep knowledge of HPC cluster design, parallel computing and high-speed networking (e.g. InfiniBand, RDMA, RoCE). ~ Experience with NVIDIA GPU computing, CUDA, AI/ML accelerators and high-density compute architectures. ~ Strong...SuggestedFull timeWork at officeRelocation package
$19.62 - $53 per hour
...working on Oracle Engineered Systems which provide utmost performance and availability using technologies such as Persistent Memory and RDMA. Disclaimer Certain US customer or client-facing roles may be required to comply with applicable requirements, such as...SuggestedHourly payTemporary workSummer workInternshipH1bWorldwideFlexible hours$38.03 - $76.06 per hour
...required to read, write, and speak the following languages English Job Description The AI2NE Org strives to be global leaders in the RDMA cluster networking domain and enable seamless, accelerated High-Performance Compute (HPC), Artificial Intelligence and Machine...Hourly payTemporary workFlexible hours$113.1k - $185.1k
...languages - Python, shell/bash, etc, Deployment automation using Terraform and Ansible Knowledge of TCP/IP fundamentals, RDMA, InfiniBand, cluster networking and running MPI jobs Working knowledge of distributed/parallel file systems and storage...Temporary workFlexible hours- ...CLI: Command-line experience in Linux for basic system navigation and troubleshooting. RoCE Networking: Familiarity with RoCE (RDMA over Converged Ethernet) network protocols. Additional Information: Onsite Work: This role requires 100% onsite presence in Memphis...Full timeTemporary workImmediate startRemote workShift workAfternoon shift
- ...with yearend 1099 vendor processing Review vendor statements and reconcile discrepancies using Bills.com Upload documentation into the RDMA portal in a timely manner Maintain electronic budget records in partnership with the Fractional CFO Review monthly budget-to-actuals...
- ...kernel drivers. PREFERRED EXPERIENCE Enabling clusters for end users, finding and resolving problems. Experience working with RoCE (RDMA over Converge Ethernet), Libfabric and InfiniBand among others. Experience working with Linux kernel, device drivers and network...
$144k - $230k
...protocols (InfiniBand, RoCE), and the crucial performance metrics relevant to tightly coupled AI workloads (Jitter, Tail Latency, GPUDirect RDMA).* GTM Track Record: Successful track record of defining, launching, and driving significant market activation for complex technical...Worldwide$132k - $207k
...Silicon One SDK).* Proficiency in datacenter networking protocols (VLAN, STP, ARP, OSPF, BGP, VXLAN, EVPN) and functionalities like RDMA, RoCE, ECN, and PFC or equivalent experience.* Strong Linux networking background (Netlink, iproute2, tc, DPDK) and experience with...Remote work- ...Linux Kernel, especially drivers and network stack Working knowledge of transport stack particularly Remote Direct Memory Access (RDMA) and/or RDMA over Converged Ethernet version 2 (RoCEv2) Qemu, FPGA Emulation environment is a plus Parallel computing...Local areaRemote work
$139.5k - $258.1k
...and optimize high‑performance virtual networking solutions for custom hardware, including Open vSwitch, DPDK, GPU Direct, and RoCE RDMA technologies Work with KVM, QEMU, and Linux kernel to efficiently enable functionality within virtual machines, including GPU passthrough...Relocation$150k - $250k
...schedules that adapt to operational needs. Nice to Haves AI/HPC Fabric Operations: Experience operating AI/ML or HPC fabrics with RDMA (RoCEv2), lossless Ethernet (PFC, ECN), or high‑performance networking. Regional/Campus Operations Leadership: Proven experience as...Local areaRemote workFlexible hoursNight shift$170.77k - $281.77k
...Rust, C, or C++ Working knowledge of high-performance networking protocols and technologies including UCX, RoCE, InfiniBand, and RDMA is a plus. Deep experience with the Kubernetes ecosystem, including core concepts, custom APIs, operators, and the Gateway API inference...Permanent employmentFull timeContract workWork experience placementWork at officeRemote workFlexible hours$64k - $104k
...Requirements: Strong background in object-oriented programming, preferably in C/C++ Familiarity with network protocols such as RDMA and TCP/IP Experience in data path is advantageous Ability to produce high-quality code with meticulous attention to detail...Full time$150k - $250k
...being wherever the infrastructure is being built. Nice to Haves AI Fabric Experience: Exposure to AI/ML networking environments with RDMA (RoCEv2), lossless Ethernet (PFC, ECN), or high‑performance compute fabrics. You understand the precision and validation required...Local area$200k - $300k
...protocols (IPMI, Redfish, BMC) and firmware management for server hardware Experience with high-performance networking (InfiniBand, RoCE, RDMA) and network troubleshooting in GPU cluster environments Familiarity with datacenter operations including rack installations,...Local area- ...partners PREFERRED EXPERIENCE Good experience with complex compute systems used in AI, HPC deployments, backend network designs in RDMA clusters Experience in validating complex AI infrastructure - GPUs, networking, ROCEv2, UEC, running benchmark tests like IBPerf...

