Senior HPC Developer - RDMA Networking

$150k - $230k

Clockwork.io

Job Description

About Clockwork Systems

Clockwork.io – Software Driven Fabrics to increase GPU cluster utilization

Clockwork Systems was founded by Stanford researchers and veteran systems engineers who share a vision for redefining the foundations of distributed computing. As AI workloads grow increasingly complex, traditional infrastructure struggles to meet the demands of performance, reliability, and precise coordination. Clockwork is pioneering a software-driven approach to AI fabrics by delivering cross-stack observability to catch and quickly resolve problems, workload fault tolerance to keep jobs running through failures, and performance acceleration that dynamically routes and paces traffic to avoid congestion.

To learn more, visit

About the Role

We're looking for a Senior HPC Developer who wants to grow in a startup environment while working on cutting-edge GPU and high-performance networking problems. You'll work across multi-node, multi-GPU systems and have the opportunity to learn deeply across the full stack—from kernel and drivers to GPUs and networks.

This role is ideal for someone who is hands-on, curious, and excited to learn by building and debugging real systems at scale.

What You'll Do
  • Build and optimize high-performance GPU and networking subsystems
  • Work with collective communication libraries and algorithms for multi-node, multi-GPU workloads
  • Debug performance issues across kernel, driver, GPU, and network layers
  • Develop and improve GPU-aware networking solutions
  • Profile, analyze, and tune system performance using low-level tooling
  • Collaborate closely with a small engineering team and take ownership of core systems

What You Bring
  • 5+ years of experience in systems, HPC, or performance-critical software development
  • Strong proficiency in low-level C/C++
  • Solid understanding of RDMA networking, including InfiniBand, RoCE, and IBVerbs
  • Experience working with multi-node, multi-GPU workloads
  • Familiarity with collective communication libraries and communication algorithms
  • Ability and willingness to debug complex issues across hardware and software boundaries
  • Curiosity and eagerness to learn in a fast-moving startup environment

Bonus Points
  • Experience with congestion control mechanisms such as DCQCN
  • Exposure to GPU-aware networking or advanced communication optimizations
  • Experience with performance profiling, tracing, or observability tooling
  • Background in AI infrastructure, HPC clusters, or distributed systems

Enjoy

  • Challenging projects.
  • A friendly and inclusive workplace culture.
  • Competitive compensation.
  • A great benefits package.
  • Catered lunch.

Compensation for this position will vary based on the skills and experience you bring, as well as internal equity considerations. For candidates hired at the posted level, the expected base salary range is $150,000 - $230,000. The offered compensation package may also include stock options or other equity awards, subject to Clockwork's equity program and applicable approvals.

Clockwork Systems is an equal opportunity employer. We are committed to building world-class teams by welcoming bright, passionate individuals from all backgrounds. All qualified applicants will receive consideration for employment without regard to race, color, ancestry, religion, age, sex, sexual orientation, gender identity or expression, national origin, disability, or protected veteran status. We believe diversity drives innovation, and we grow stronger together.

Vacancy posted 8 days ago