Get new jobs by email
  • $145k - $220k

     ...require the ability to access U.S.-only data systems Preferred Qualifications Experience developing near-real-time embedded software Experience with CUDA Experience with additional languages such as C++, Rust, NodeJS, Java, Go, etc. Development... 
    Suggested
    Full time
    Work experience placement
    Casual work
    Work at office
    Relocation package

    Chaos Inc

    Hawthorne, CA
    2 days ago
  • A leading technology company is seeking a Senior Software Engineer specializing in HPC libraries in Santa Clara. Applicants should have a strong background in C++ and parallel programming, with a PhD preferred. The role involves designing user-friendly APIs and collaborating...
    Suggested

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $160k - $250k

     ...-tenant serverless workloads and dedicated endpoints, enabling developers, enterprises, and researchers to harness the latest LLMs, multimodal...  ...is a strong plus. ~ Familiarity with GPU software stacks (CUDA, Triton, NCCL) and HPC technologies (InfiniBand, NVLink, MPI)... 
    Suggested
    Full time
    Local area

    Together AI

    San Francisco, CA
    2 days ago
  •  ...programming skills in JavaScript, Python, and modern web development frameworks, and familiarity with GPU-accelerated tools (e.g., CUDA, cuDNN, Triton).Proficiency in React (or similar frameworks like Vue/Angular) ~ Experience with front-end testing frameworks, animation... 
    Suggested
    Remote job
    Contract work
    Work at office

    GenBio AI

    Palo Alto, CA
    12 hours ago
  •  ...Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary, high-efficiency serving platform...  ...inference systems. Develop high-performance C++ and CUDA/HIP modules, including custom GPU kernels and memory-optimized... 
    Suggested
    Full time
    Work at office
    Flexible hours

    Sciforium

    San Francisco, CA
    4 hours agonew
  • $180k - $286k

     ...creating audio and video content quickly and easily. This involves developing a revolutionary way to record, transcribe, edit, and mix audio...  ...with audio and video processing Knowledge of Python, C/C++, CUDA, and experience profiling GPU performance and distributed... 
    Suggested
    Full time
    Work at office
    Remote work
    Flexible hours

    The Rundown AI, Inc.

    San Francisco, CA
    1 day ago
  • $139k - $229k

     ...containerized pipeline orchestration infrastructure, including developing and distributing stable base container images, providing advanced...  ..., enable GPU based inference for a large variety of use cases, cuda level optimizations for high performance, enable on-device and... 
    Suggested
    Work experience placement

    LinkedIn

    Mountain View, CA
    12 hours ago
  • $150k - $200k

     ...00.00/yr About Lumafield: Founded in 2019, Lumafield has developed the world's first accessible XRay CT scanner for engineers. Our...  ...hardwareaccelerated image processing edge solutions using NVIDIA CUDA and/or Jetson platform ~ Familiarity with Python for... 
    Suggested
    Full time
    Work at office
    Work visa
    Flexible hours

    Lumafield

    San Francisco, CA
    4 days ago
  • $160k - $230k

     ...Experience with DPUs/SmartNICs a plus ~ GPU programming, NCCL, CUDA knowledge a plus Responsibilities Perform...  ...open-source Together AI platform Create services, tools, and developer documentation Create testing frameworks for robustness and fault... 
    Suggested
    Full time
    Remote work

    Together AI

    San Francisco, CA
    2 days ago
  • $148k - $235.75k

     ...technologies, industry vertical business development, building a strong developer ecosystem and scaling out the business using partner channels...  ...in Remote Direct Memory Access (RDMA), AI workloads, NCCL, CUDA, Congestion Control, and performance tuning* Demonstrated drive... 
    Suggested
    Remote work

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

     ...Platform is our flagship open-source solution for sensor AI enabling developers to build, optimize, and deploy GPU-accelerated solutions that...  ...standards, and mentoring teammates.* Hands-on expertise with CUDA C/C++ and deep knowledge of parallel programming paradigms.**... 
    Suggested
    Flexible hours

    NVIDIA

    Santa Clara, CA
    12 hours ago
  • $217k - $307k

     ...instrumentation for performance monitoring (CPU, GPU, latency, memory) and develop offline benchmarking frameworks, tools, and scripts to evaluate...  ...field and 7+ years of experience. Strong knowledge of CUDA as applied to recent GPU microarchitectures (e.g., Ampere,... 
    Suggested
    Temporary work

    Zoox

    San Diego, CA
    1 day ago
  •  ...GPU And CPU Accelerated Software Libraries Developer Join our multidisciplinary team and help build and improve GPU and CPU accelerated...  ...on modern computing platforms, with GPU acceleration using CUDA. ~ Excellent C/C++ programming and software design skills.... 
    Suggested
    Worldwide

    NVIDIA

    Los Angeles, CA
    12 hours ago
  • $272k - $425.5k

     ...open source libraries like RAPIDS to accelerate reading, writing and batch data operations in Spark. What you'll be doing: Develop CUDA/C++ libraries to accelerate DataFrames and I/O operations on common file formats such as Parquet, ORC and JSON Collaborate... 
    Suggested
    Full time
    Work experience placement

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $148k - $235.75k

     ...machine instructions for Graphics (OpenGL, Vulkan, DX) and Compute (CUDA, PTX, OpenCL, Fortran, C++). This team is comprised of worldwide...  ...based backend JIT compiler targeting NVIDIA GPUs Design and develop new compiler passes and optimizations required to meet the high... 
    Suggested
    Worldwide

    NVIDIA

    Santa Clara, CA
    12 hours ago
  • $184k - $287.5k

     ...Developer Relations Manager We are looking for a Developer Relations Manager - AI Platform SW, passionate about developing modern Artificial...  ...NVIDIA products and SDKs (Megatron, TensorRT LLM, CUTLASS, CUDA Toolkit, Python ecosystem) Your base salary will be... 
    Work experience placement

    NVIDIA

    Santa Clara, CA
    12 hours ago
  • $81.71k - $130.74k

     ...the Capital System has sustained superior results over time. You may solve problems through rigorous analysis You may design, develop, and test creative technical solutions for business needs You will be working as part of an inclusive and collaborative team in... 
    Full time
    Temporary work
    Local area

    Capital Group

    Irvine, CA
    1 day ago
  • $165k - $242k

     ...Triton, TensorRT-LLM, Ray Serve, TorchServe). Experience with CUDA kernels, NCCL/SHARP, RDMA/NUMA, or GPU interconnect topologies....  ...environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for take... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours
    Shift work

    CoreWeave

    Sunnyvale, CA
    2 days ago
  •  ...perception (sensing, preprocessing, detection, tracking) ~ Python, C++ and ROS2 proficiency ~ The ability to work autonomously Preferred Qualifications PhD in Computer Science, Robotics, or a related discipline Experience developing CUDA kernels... 

    AeroVect

    Los Angeles, CA
    4 days ago
  • $184k - $356.5k

     ...through deployment. What You'll Be Doing: Design and develop software solutions for data center servers including Linux kernel...  ...Out From The Crowd: Experience with GPU computing (CUDA), deep learning workloads Expertise in Out of Band and In-band... 

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $148k - $235.75k

     ...-19 research to autonomous machines. Did you know that our team develops the GPU-accelerated mathematical libraries that make all of this...  ...meta-programming, RAII). ~ Parallel programming experience with CUDA or OpenCL. ~ Strong collaboration, communication, and... 
    Flexible hours
    Shift work

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $156k - $180k

     ...based architectures, or tensor-network simulation. On-device ML for adaptive calibration (Gaussian processes, RL bandits). GPU / CUDA-accelerated numerical kernels (cuBLAS/cuFFT/cuQuantum) and/or GPU-Direct RDMA pipelines. Contributions to open-source control or... 
    Full time

    PSI Quantum

    Palo Alto, CA
    3 days ago
  • $224k - $356.5k

     ...Software Architect For Developer Tools Always-On Profiling Team Are you ready to innovate GPU performance analysis for Machine Learning...  ...ecosystems and their limitations is a plus. ~ GPU & CUDA Proficiency: In-depth knowledge of CUDA APIs, runtime, streams,... 
    Flexible hours

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...Collaborate with infrastructure engineers and researchers to develop scalable APIs, services, and end-to-end inference workflows....  ...infrastructure. Strong background in GPU inference performance tuning, CUDA-based systems, and operating across cloud-native and hybrid... 
    Remote work

    NVIDIA

    San Jose, CA
    4 days ago
  •  ...Overview Title: C# Developer Location: Irvine, CA Duration: 5+ months plus Qualifications ~5+ years of C# .NET coding experience ~ Proven experience with C/C++, VB ~ Working experience in ActiveX ~ Working experience in COM object and IPC ~ Experience... 
    Work experience placement

    Inent Inc

    Irvine, CA
    2 days ago
  • Front is the leading AI-powered customer service platform built for collaboration. Front brings core support channels into a modern, intuitive workspace where teams can collaborate on requests, automate manual processes, and delight customers across their entire lifecycle...
    Full time
    Work at office
    Remote work
    Work from home
    Worldwide
    Flexible hours
    3 days per week

    Front

    San Francisco, CA
    12 hours ago
  •  ...robots must live and learn alongside us. Thats why were focused on developing friendly home robots designed to integrate seamlessly into...  ...to access low?level system resources (e.g. V4L, POSIX, OpenGL, CUDA) ~ Deep understanding of OS primitives and experience applying... 
    Local area

    1X Technologies AS

    Palo Alto, CA
    2 days ago
  •  ...performance tuning of large-scale HPC clusters Experience with MPI, RPC frameworks, or distributed runtimes Experience with NCCL, CUDA and GPU Kernels Knowledge of RDMA APIs (e.g., libibverbs) and transport semantics Experience with NIC drivers or NIC... 

    Clockwork Inc

    Palo Alto, CA
    3 days ago
  • $184k - $287.5k

     .../expert/pipeline-parallelism, prefill-decode disaggregation. Develop, optimize, and benchmark GPU kernels (hand-tuned and compiler-generated...  ...SGLang). Familiarity with GPU programming and performance: CUDA, memory hierarchy, streams, NCCL; proficiency with profiling/... 

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $241.8k - $409.2k

     ...'re completely overhauling our software stack and embracing the CUDA ecosystem. Our goal is to achieve over 90% compatibility with cuBLAS...  .... Job Responsibilities Software Technical Strategy Develop and refine a comprehensive 3-year roadmap for a software stack... 
    Full time

    XPENG Deutschland

    San Diego, CA
    1 day ago