Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Distinguished Software Architect - Deep Learning and HPC Communications

$320k

NVIDIA Gruppe

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. GPU Communications Libraries and Networking team Distinguished Software Architect We are looking for a Distinguished Software Architect to help co‑design our next generation data center platforms. DL and HPC applications have a huge compute demand already and run on scales which go up to tens of thousands of GPUs. The GPUs are connected with high‑speed interconnects (e.g., NVLink, PCIe) within a node and with high‑speed networking (e.g., Infiniband, Ethernet) across nodes. Communication performance between the GPUs directly impacts end‑to‑end application performance; the stakes are even higher at huge scales. This is an outstanding opportunity to push the limits of state‑of‑the‑art technologies and deliver platforms the world has never seen before. What you will be doing Research new communication technologies (e.g., expand the GPUDirect technology portfolio) and design new features for our communication libraries. Propose innovative solutions in hardware and software for our next‑gen platforms. You will co‑design these solutions with the GPU, Networking, and SW architects and ensure seamless integration with the software stacks. Inspire changes based on quantitative data from proof‑of‑concepts or detailed technical analysis/modeling. Drive the adoption of new communication technologies across application verticals. Keep up with the latest DL research and collaborate with diverse teams (internal and external), including DL researchers and customers. What we need to see PHD in Computer Science, Computer Engineering or related field or strong equivalent experience; 15+ years of relevant experience in academia or industry. Expertise in HPC, parallel programming models (MPI, SHMEM), at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC), computer and system architecture, GPU architecture and CUDA. Deep understanding of high‑performance networking aspects: network technologies (Infiniband, Ethernet), network design, topologies, debugging and performance analysis. Strong knowledge in at least a few of these areas: ML/DL fundamentals and their relation to communications, parallel algorithms, fault tolerance and resiliency, competitive assessments, performance analysis and optimizations for large clusters, developing applications using DL frameworks (PyTorch, TensorFlow). Programming fluency with C or C++ for systems software development. Flexibility to work and communicate effectively across different HW/SW teams and time zones. Ways to stand out from the crowd Industry‑recognized leader in HPC/DL communications with a history of patents, publications, conference talks and keynotes relevant to the role. Influential role in industry standards (e.g., MPI, OpenSHMEM) and open‑source software (e.g., PyTorch, UCX, Open MPI). Compensation and Benefits Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 320,000 USD – 488,750 USD. You will also be eligible for equity and benefits. Application Deadline Applications for this job will be accepted until May 26, 2026. Equal Opportunity Statement NVIDIA is committed to fostering a diverse work environment and is a proud equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Distinguished Software Architect - Deep Learning and HPC Communications in Santa Clara, CA vacancy
  •  ...intelligence to autonomous cars. We are the GPU Communications Libraries and Networking team at NVIDIA. We...  ...NVSHMEM, and UCX that are crucial for scaling Deep Learning and HPC. We're seeking a Senior Software Architect to help co‑design next‑gen data center platforms... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $152k - $241.5k

    NVIDIA Corporation is seeking a highly motivated Senior Software Engineer for its communication libraries and network software team in Santa Clara,...  ...designing and maintaining communication runtimes for Deep Learning frameworks and participating in parallel programming specifications... 
    Suggested

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  •  ...will be doing: Integrate new communication libraries features in AI...  ...to production Perform deep analysis of AI workloads and...  ...equivalent experience) with 5+ software engineering and HPC/AI experience Development...  ...experience with Deep Learning Frameworks such as PyTorch... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...We are the GPU Communications Libraries and Networking team...  ...libraries. The DL and HPC applications of today have...  ..., CPU, networking) and software components in the stack...  ...Adaptability and passion to learn new areas and tools;...  ...GPUs. Experience with deep learning frameworks... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  •  ...As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the design and implementation of ground-breaking GPU compute clusters that run demanding deep learning, high-performance computing, and computationally intensive workloads. We seek a technical... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $152k - $287.5k

     ...NVIDIA Gruppe is seeking a highly motivated Senior Software Engineer to join our communication libraries and network software team in Santa Clara, California...  ...systems used in High Performance Computing and Deep Learning. The ideal candidate should have a M.S./Ph.D. degree... 

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...inventions from artificial intelligence to autonomous cars. We are the GPU Communications Libraries and Networking team at NVIDIA. We deliver libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC. We are looking for a motivated Performance engineer to influence the... 

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...inventions from artificial intelligence to autonomous cars. We are the GPU Communications Libraries and Networking team at NVIDIA. We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  •  ...Performance Architect – Deep Learning Software We are seeking a Performance Architect to develop processor and system architectures that accelerate...  ...high‑performance power‑efficient designs, energy efficient HPC, performance analysis, and profiling. Experience with GPU... 
    Work experience placement

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $224k - $356.5k

     ...accelerated applications, such as Deep Learning. Data Center SW team architects and develops the end‑to‑end software and firmware stack for these...  ...interface. Understanding of HPC or Deep learning workloads...  ...written and verbal communication skills. BS or MS degree in Computer... 
    Shift work

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $320k

     ...offerings. We seek a technical architect to define end‑to‑end...  ...these products at the system software level, covering firmware, kernel...  ...shift strategies. Qualifications Deep expertise in scalable and performant...  ...DMTF. Familiarity with NVIDIA HPC programming models and... 
    Shift work

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $262k - $365k

    Google Inc. is seeking a Senior Staff Software Architect to lead the development of innovative software technologies for AI and HPC infrastructure. The ideal candidate will possess a Bachelor's degree in Computer Science or Electrical Engineering and have 8 years of experience... 

    Google Inc.

    Sunnyvale, CA
    4 days ago
  •  ...Principal AI/ML System Software Engineer At d-Matrix, we are focused...  ...and believe in direct communication. Our team is inclusive, and our...  ...computer architecture, and machine learning fundamentals Proficient in...  ..., etc.) Experience with deep learning frameworks (such as... 
    Work experience placement
    3 days per week

    d-Matrix

    Santa Clara, CA
    1 day ago
  •  ...Salesforce Technical Architect Location: Los Angeles, CA (100% Onsite...  ...implementations Strong communication skills Proven capability...  ...independently and designing software on the Force.com platform, with...  ...of data modeling/ security Deep understanding of RDBMS... 
    Contract work
    Temporary work
    Work at office

    Omega Solutions Inc

    Santa Clara, CA
    15 days ago
  • $152k - $241.5k

     ...Doing Helping our customers deploy cloud‑native software stacks, such as NVIDIA AI, to embed guidelines in machine learning operations. Collaborating closely with...  ...applying profiling and analysis tools to improve deep learning model performance. Working with internal... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...NVIDIA Gruppe is seeking an experienced Solutions Architect in Santa Clara to support accelerated computing networking solutions for AI/ML and HPC. You will develop and demonstrate solutions with major tech companies while addressing customer needs and performance issues... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $195.2k - $361.2k

     ...explored architectures, algorithms, and software inspired by the brain's extraordinary...  ...robotic systems. In this role, you will architect and lead the development of the firmware...  ...CD. ~5+ years of experience with AI, deep learning, or optimization workloads using... 
    Work at office
    Local area
    Immediate start
    Shift work

    Intel

    Santa Clara, CA
    3 days ago
  •  ...NVIDIA’s Networking Systems & Software Architecture group is...  ...hardware‑software co‑design, and communication frameworks that plug directly...  ...interconnects. The Senior Architect role is to own modules and projects...  ...Background of Reinforcement Learning systems. With competitive... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $272k - $431.25k

    ## Principal CPU Software ArchitectApplylocations: US, CA, Santa...  ...for a CPU System Software Architect!Do you want to be a part of...  ...class systems architects and deep learning professionals to define the...  ...with C* Excellent communication skills to describe hard problems... 

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  •  ...technology. We are at the forefront of software and hardware innovation, pushing...  ...humility and believe in direct communication. Our team is inclusive , and our differing...  ...large language models and machine learning fundamentals Experience with Deep Learning, Tensorflow, Transformers... 
    3 days per week

    d-Matrix

    Santa Clara, CA
    1 day ago
  • $124k - $195.5k

    Deep Learning Kernel Software Performance Architect - New College Grad 2026 page is loaded## Deep Learning Kernel Software Performance Architect - New College Grad 2026locations: US, CA, Santa Claratime type: Full timeposted on: Posted Todayjob requisition id: JR2011814... 
    Work experience placement

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  •  ...leading technology company is seeking a Senior Software Architect to innovate server systems for deep learning applications. This role involves leading software...  ...experience in system architecture, with strong communication skills. Attractive compensation is offered, alongside... 

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $150k

     ...global hub for high‑performance computing in deep learning, driving impactful discoveries that...  ...optimizing performance for the machine learning software stacks, especially at training and...  ...the institution’s cutting‑edge HPC and deep learning capabilities and establishing... 
    Work experience placement
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    3 days ago
  • $150k

     ...high-performance computing in deep learning, driving impactful...  ...institution’s cutting-edge HPC and deep learning capabilities...  ...~3 years of experience in software engineering, including experience...  ...technical challenges. ~ Effective communication and collaboration skills to... 
    Worldwide
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    3 days ago
  •  ...-driving cars to learning machines. We lead...  ...silicon chips and software content. Join us...  ...accomplished and visionary architect with a passion...  .... Leveraging a deep understanding of...  ...are an effective communicator across technical...  ...a leader among a distinguished Solutions... 
    Worldwide

    Synopsys

    Sunnyvale, CA
    4 days ago
  •  ...Clara is looking for an experienced AI Software Engineer to lead the integration of new communication libraries into AI frameworks. This role requires deep analysis and collaboration with teams...  ...5 years of software engineering and HPC/AI experience, along with proficiency... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $152k - $241.5k

     ...NVIDIA’s deep learning and HPC platforms have made a huge impact in various fields and are broadly...  ..., numeric libraries, modular software design. Deep knowledge of state‑of‑the...  ...environment. Solid written and oral communications skills. Good teamwork and interpersonal... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  •  ...high-performance computing in deep learning, driving impactful...  ...’s responsible for the full software development life cycle, from...  ...institution’s cutting-edge HPC and deep learning capabilities...  ...technical challenges. Effective communication and collaboration skills to... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    a month ago
  • $269.1k - $307.2k

     ...Distinguished Software Engineer - IFX As a Distinguished Engineer at Capital...  ...of developers, machine learning experts, product managers...  ...creating clear and concise communications, code samples, blog posts...  ...success. What you'll do: Architect and build control and data... 
    Full time
    Part time
    Local area

    Capital One

    San Jose, CA
    13 days ago
  • $136.3k - $231.7k

     ...KLA-Belgium is seeking an HPC System Software Engineer in Milpitas, California. The role involves building software for large-scale deep learning and image processing on clusters. The ideal candidate has significant experience in HPC software development and strong engineering... 

    KLA-Belgium

    Milpitas, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Distinguished Software Architect - Deep Learning and HPC Communications. Be the first to apply!