Get new jobs by email
  • Job Description Design, develop, troubleshoot and debug software programs for databases, applications, tools, networks etc. Responsibilities As a member of the software engineering division, you will take an active role in the definition and evolution of...
    Suggested
    Flexible hours

    Oracle

    Augusta, ME
    14 days ago
  • $180k

     ...strong communication skills. They should be able to concisely and accurately share knowledge with their teammates. About the Role RDMA Engineers on xAI’s Supercomputing team design and optimize low-latency, high-bandwidth networking solutions using NVIDIA’s RDMA-... 
    Suggested
    Remote job
    Local area

    xAI

    Palo Alto, CA
    more than 2 months ago
  • $103.7k - $153.7k

     ...and optimize transport-level solutions that adaptively select paths and adjust rates in large-scale AI fabrics. ML Collective & RDMA Systems: Develop collective communication transports for ML workloads that maintain high performance through congestion,... 
    Suggested
    Temporary work
    Flexible hours

    Oracle

    Augusta, ME
    2 days ago
  • $96.8k - $251.6k

    Job Description Senior Principal Software Engineer (IC5) - Oracle Cloud Infrastructure (OCI) Build the Future of Cloud at Oracle Cloud Infrastructure (OCI) At Oracle Cloud Infrastructure (OCI), we enable mission-critical applications for the world's top enterprises...
    Suggested
    Temporary work
    Flexible hours

    Oracle

    Jefferson City, MO
    13 days ago
  •  ...multicloud environments centered on OCI . Build scalable compute environments using OCI bare metal , VMs , GPU instances , and RDMA cluster networking . Ensure architecture and deployments align with IL5‑approved patterns and federal security requirements.... 
    Suggested
    Remote job
    Full time

    Mc3 Partners

    Remote
    13 hours ago
  •  ...familiarity with signal processing or mathematical modeling Have experience with GPU software development Have experience with RDMA Demonstrate familiarity with radar concepts Demonstrate familiarity with integrated systems combining software and hardware to... 
    Suggested
    Local area
    Night shift

    Str

    Woburn, MA
    9 hours agonew
  • $140k - $175k

     ...(HPC) systems design or deployment.  ~ Familiary with NVIDIA CUDA-Q  ~ Direct experience with NVIDIA Holoscan Sensor Bridge, RDMA over Converged Ethernet, or similar high‑speed coherent interconnects.  ~ Experience supporting or developing error‑correction decoding... 
    Suggested
    Full time
    Temporary work
    Work at office
    Flexible hours

    Infleqtion

    Louisville, CO
    13 hours ago
  •  ...inform our future supercomputer network designs.  You might thrive in this role if you: Have written distributed algorithms using RDMA in the past. Are comfortable writing low level performance sensitive CPU and/or GPU code. Are familiar with network simulation... 
    Suggested
    Full time
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    13 hours ago
  •  ...-on troubleshooting experience, and familiarity with device drivers and QoS technologies such as policy maps, class maps, WRED, and RDMA. Proficiency in collaborative tools and excellent communication are essential. #Skills: #QoSDevelopment #EmbeddedSystems #RouterSwitchSoftware... 
    Suggested
    Contract work

    Pddn

    Milpitas, CA
    13 hours ago
  •  ...Experience with RH Satellite Experience with Lustre, NFS, and other file systems Experience with Ethernet networking Experience with RDMA communications (InfiniBand and OmniPath) Experience with InfiniBand or Omni-Path high speed fabrics, including subnet management,... 
    Suggested
    Full time
    Contract work
    Work at office
    Remote work
    Relocation
    Monday to Friday
    Flexible hours
    Shift work

    Lockheed Martin

    Herndon, VA
    3 days ago
  • $175k - $220k

     ...Determine and implement in PyTorch an optimal sharding scheme for a novel attention variant Optimize communication patterns in RDMA networks (Infiniband, RoCE) Debug numerical instabilities for a given model for a small portion of requests when deployed at scale... 
    Suggested
    Full time

    Fireworks Ai

    Redwood City, CA
    13 hours ago
  • $19.61 - $53 per hour

     ...working on Oracle Engineered Systems which provide utmost performance and availability using technologies such as Persistent Memory and RDMA. Disclaimer: Certain US customer or client-facing roles may be required to comply with applicable requirements, such as... 
    Suggested
    Hourly pay
    Temporary work
    Internship
    Worldwide
    Flexible hours

    Arizona Staffing

    Redwood City, CA
    4 days ago
  • $147k - $240k

     ...oriented programming languages such as Golang, Python, Java, C++, Rust. ~ Experience with network protocols such as TCP/IP, BGP, MPLS, RDMA, network overlay technologies, and network-based Quality of Service (QoS). Preferred Qualifications BS and 8+ years of... 
    Suggested
    Full time
    For contractors
    Work experience placement
    Work at office
    Flexible hours

    Linkedin

    Mountain View, CA
    13 hours ago
  • $100k

     ...thinking to solve challenging problems in large-scale computing. Experience or familiarity with high-performance networking, MPI, RDMA, or cluster computing frameworks is advantageous but not required.   Compensation for all engineers at Tenstorrent ranges from $... 
    Suggested
    Permanent employment
    Full time

    Tenstorrent

    Santa Clara, TX
    13 hours ago
  •  ...signal processing or mathematical modeling. Experience with GPU programming and high-performance computing. Familiarity with RDMA and advanced networking concepts. Expertise in modern C++ standards (C++17 and beyond). Experience in systems that... 
    Suggested
    Contract work
    Temporary work

    Actalent

    Woburn, MA
    3 days ago
  •  ...Compute, nvprof, PyTorch Profiler KV cache optimization, Flash Attention, Mixture of Experts High-speed networking: InfiniBand, RDMA, NVLink Expertise in CUDA programming, GPU memory hierarchies, and hardware-specific optimizations Proven track record... 
    Full time
    Temporary work
    Remote work
    Flexible hours
    Shift work
    Night shift

    Sandisk

    Milpitas, CA
    17 days ago
  •  ...distributed training environments Contributions to ML infrastructure open-source projects Familiarity with storage, networking, or RDMA/GPU Direct technologies Understanding of observability in ML pipelines (metrics, logs, dashboards) Enjoy Challenging... 
    Full time

    Clockwork.io

    Palo Alto, CA
    13 hours ago
  •  ...interpret Linux logs for diagnostics Nice-to-Have Skills: Familiarity with the Linux command line (CLI) Exposure to RoCE (RDMA over Converged Ethernet) networking Benefits Our comprehensive benefits package for full-time salaried employees is... 
    Full time
    Temporary work
    Immediate start
    Monday to Friday
    Shift work
    Rotating shift
    Day shift

    PGTEK

    Muskogee, OK
    4 days ago
  • $325k

     ...experience with one or more ML hardware accelerators (GPUs, TPUs, Trainium). Understand ML-specific networking optimizations like RDMA and InfiniBand. Have expertise in AI-specific observability tools and frameworks. Have experience with chaos engineering and... 
    Full time
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    New York, NY
    13 hours ago
  • $152k - $248k

     ...oriented programming languages such as Golang, Python, Java, C++, Rust. ~ Experience with network protocols such as TCP/IP, BGP, MPLS, RDMA, network overlay technologies, and network-based Quality of Service (QoS). Preferred Qualifications: BS and 8+ years of... 
    Full time
    For contractors
    Work experience placement
    Work at office
    Flexible hours

    Linkedin

    Mountain View, CA
    13 hours ago
  •  ...~ Scripting and automation (Bash, Python) ~ Understanding of CPU/GPU architectures and NUMA ~ Networking fundamentals (TCP/IP, RDMA, firewalls) ~ Experience with xCAT, Bright Cluster Manager, or similar ~ Knowledge of containerization and orchestration technologies... 
    Flexible hours

    Keysight Technologies

    Colorado Springs, CO
    25 days ago
  • $87k - $178.1k

     ...Job Description The AI2NE Org strives to be global leaders in the RDMA cluster networking domain and enable seamless, accelerated High-Performance Compute (HPC), Artificial Intelligence and Machine Learning advancements. We envision a future where artificial intelligence... 
    Temporary work
    Flexible hours

    Oracle

    Bismarck, ND
    1 day ago
  •  ...background in UCIe, CXL, NVLink, or UAL microarchitecture and protocols is a plus Familiarity with High-speed networking: InfiniBand, RDMA, NVLink is a plus Expert knowledge of transformer architectures, attention mechanisms, and model parallelism techniques Multi-... 
    Full time
    Temporary work
    Remote work
    Flexible hours
    Shift work
    Night shift

    Sandisk

    Milpitas, CA
    3 days ago
  • $200k - $400k

     ...Linux kernel, GPU/accelerator kernels, and interconnects. Develop and tune communication libraries such as NCCL, MPI, UCX, RCCL, and RDMA-based systems. Partner with ML researchers and engineers to support frameworks like PyTorch, MegatronLM, and DeepSpeed in large-... 
    Full time
    Visa sponsorship

    Institute Of Foundation Models

    Sunnyvale, CA
    13 hours ago
  • $342k

     ...support high-performance systems. Design and implement kernel drivers, including for functionality related to DMA, PCIe, NICs, and RDMA. Drive end-to-end development of system-scale networking, including required kernel and other low-level software. Collaborate... 

    OpenAI

    San Francisco, CA
    13 hours ago
  • $186.55k - $279.77k

     ...threaded and/or asynchronous communications environments. 3+ years' experience with network stack, and/or network protocols such as RDMA. How to Stand out (Preferred Qualifications): Experience with Multi-NIC environments and features (e.g. load balancing and... 
    Full time
    Local area
    Remote work
    Work from home

    INTEL

    Indiana
    1 day ago
  •  ...Oracle Cloud). Experience with FinOps practices for AI workloads. Understanding of data locality, high-throughput networking (RDMA, InfiniBand), and parallel file systems. Ability to translate AI workload requirements into infrastructure reference... 

    Connvertex Technologies Inc.

    San Mateo, CA
    2 days ago
  •  ...scale infrastructure design. ~ Deep knowledge of HPC cluster design, parallel computing and high-speed networking (e.g. InfiniBand, RDMA, RoCE). ~ Experience with NVIDIA GPU computing, CUDA, AI/ML accelerators and high-density compute architectures. ~ Strong... 
    Full time
    Work at office
    Relocation package

    Atto Trading Technologies

    New York, NY
    a month ago
  • $179.2k - $268.8k

     ...safety standards Familiarity with io_uring, DirectStorage or similar high performance storage APIs Developing software targeting RDMA systems such as InfiniBand Familiarity with a real time operating systems Experience with developing massively scalable... 
    Permanent employment
    Full time
    Work at office
    Immediate start
    Visa sponsorship

    Latitude Ai

    Palo Alto, CA
    13 hours ago
  •  ...design, and system-level modeling ~ Strong working knowledge of various interconnect technologies including PCIe, CXL, NoCs, ethernet, RDMA ~ Excellent communication skills, demonstrated ability to influence key stakeholders, and mentorship capabilities.... 
    Full time
    Temporary work
    Remote work
    Flexible hours
    Shift work

    Sandisk

    Milpitas, CA
    11 days ago