Get new jobs by email
$145k - $220k
...require the ability to access U.S.-only data systems Preferred Qualifications Experience developing near-real-time embedded software Experience with CUDA Experience with additional languages such as C++, Rust, NodeJS, Java, Go, etc. Development...SuggestedFull timeWork experience placementCasual workWork at officeRelocation package- A leading technology company is seeking a Senior Software Engineer specializing in HPC libraries in Santa Clara. Applicants should have a strong background in C++ and parallel programming, with a PhD preferred. The role involves designing user-friendly APIs and collaborating...Suggested
$160k - $250k
...-tenant serverless workloads and dedicated endpoints, enabling developers, enterprises, and researchers to harness the latest LLMs, multimodal... ...is a strong plus. ~ Familiarity with GPU software stacks (CUDA, Triton, NCCL) and HPC technologies (InfiniBand, NVLink, MPI)...SuggestedFull timeLocal area- ...programming skills in JavaScript, Python, and modern web development frameworks, and familiarity with GPU-accelerated tools (e.g., CUDA, cuDNN, Triton).Proficiency in React (or similar frameworks like Vue/Angular) ~ Experience with front-end testing frameworks, animation...SuggestedRemote jobContract workWork at office
- ...Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary, high-efficiency serving platform... ...inference systems. Develop high-performance C++ and CUDA/HIP modules, including custom GPU kernels and memory-optimized...SuggestedFull timeWork at officeFlexible hours
$180k - $286k
...creating audio and video content quickly and easily. This involves developing a revolutionary way to record, transcribe, edit, and mix audio... ...with audio and video processing Knowledge of Python, C/C++, CUDA, and experience profiling GPU performance and distributed...SuggestedFull timeWork at officeRemote workFlexible hours$139k - $229k
...containerized pipeline orchestration infrastructure, including developing and distributing stable base container images, providing advanced... ..., enable GPU based inference for a large variety of use cases, cuda level optimizations for high performance, enable on-device and...SuggestedWork experience placement$150k - $200k
...00.00/yr About Lumafield: Founded in 2019, Lumafield has developed the world's first accessible XRay CT scanner for engineers. Our... ...hardwareaccelerated image processing edge solutions using NVIDIA CUDA and/or Jetson platform ~ Familiarity with Python for...SuggestedFull timeWork at officeWork visaFlexible hours$160k - $230k
...Experience with DPUs/SmartNICs a plus ~ GPU programming, NCCL, CUDA knowledge a plus Responsibilities Perform... ...open-source Together AI platform Create services, tools, and developer documentation Create testing frameworks for robustness and fault...SuggestedFull timeRemote work$148k - $235.75k
...technologies, industry vertical business development, building a strong developer ecosystem and scaling out the business using partner channels... ...in Remote Direct Memory Access (RDMA), AI workloads, NCCL, CUDA, Congestion Control, and performance tuning* Demonstrated drive...SuggestedRemote work$184k - $287.5k
...Platform is our flagship open-source solution for sensor AI enabling developers to build, optimize, and deploy GPU-accelerated solutions that... ...standards, and mentoring teammates.* Hands-on expertise with CUDA C/C++ and deep knowledge of parallel programming paradigms.**...SuggestedFlexible hours$217k - $307k
...instrumentation for performance monitoring (CPU, GPU, latency, memory) and develop offline benchmarking frameworks, tools, and scripts to evaluate... ...field and 7+ years of experience. Strong knowledge of CUDA as applied to recent GPU microarchitectures (e.g., Ampere,...SuggestedTemporary work- ...GPU And CPU Accelerated Software Libraries Developer Join our multidisciplinary team and help build and improve GPU and CPU accelerated... ...on modern computing platforms, with GPU acceleration using CUDA. ~ Excellent C/C++ programming and software design skills....SuggestedWorldwide
$272k - $425.5k
...open source libraries like RAPIDS to accelerate reading, writing and batch data operations in Spark. What you'll be doing: Develop CUDA/C++ libraries to accelerate DataFrames and I/O operations on common file formats such as Parquet, ORC and JSON Collaborate...SuggestedFull timeWork experience placement$148k - $235.75k
...machine instructions for Graphics (OpenGL, Vulkan, DX) and Compute (CUDA, PTX, OpenCL, Fortran, C++). This team is comprised of worldwide... ...based backend JIT compiler targeting NVIDIA GPUs Design and develop new compiler passes and optimizations required to meet the high...SuggestedWorldwide$184k - $287.5k
...Developer Relations Manager We are looking for a Developer Relations Manager - AI Platform SW, passionate about developing modern Artificial... ...NVIDIA products and SDKs (Megatron, TensorRT LLM, CUTLASS, CUDA Toolkit, Python ecosystem) Your base salary will be...Work experience placement$81.71k - $130.74k
...the Capital System has sustained superior results over time. You may solve problems through rigorous analysis You may design, develop, and test creative technical solutions for business needs You will be working as part of an inclusive and collaborative team in...Full timeTemporary workLocal area$165k - $242k
...Triton, TensorRT-LLM, Ray Serve, TorchServe). Experience with CUDA kernels, NCCL/SHARP, RDMA/NUMA, or GPU interconnect topologies.... ...environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for take...Permanent employmentTemporary workCasual workWork at officeRemote workFlexible hoursShift work- ...perception (sensing, preprocessing, detection, tracking) ~ Python, C++ and ROS2 proficiency ~ The ability to work autonomously Preferred Qualifications PhD in Computer Science, Robotics, or a related discipline Experience developing CUDA kernels...
$184k - $356.5k
...through deployment. What You'll Be Doing: Design and develop software solutions for data center servers including Linux kernel... ...Out From The Crowd: Experience with GPU computing (CUDA), deep learning workloads Expertise in Out of Band and In-band...$148k - $235.75k
...-19 research to autonomous machines. Did you know that our team develops the GPU-accelerated mathematical libraries that make all of this... ...meta-programming, RAII). ~ Parallel programming experience with CUDA or OpenCL. ~ Strong collaboration, communication, and...Flexible hoursShift work$156k - $180k
...based architectures, or tensor-network simulation. On-device ML for adaptive calibration (Gaussian processes, RL bandits). GPU / CUDA-accelerated numerical kernels (cuBLAS/cuFFT/cuQuantum) and/or GPU-Direct RDMA pipelines. Contributions to open-source control or...Full time$224k - $356.5k
...Software Architect For Developer Tools Always-On Profiling Team Are you ready to innovate GPU performance analysis for Machine Learning... ...ecosystems and their limitations is a plus. ~ GPU & CUDA Proficiency: In-depth knowledge of CUDA APIs, runtime, streams,...Flexible hours$184k - $287.5k
...Collaborate with infrastructure engineers and researchers to develop scalable APIs, services, and end-to-end inference workflows.... ...infrastructure. Strong background in GPU inference performance tuning, CUDA-based systems, and operating across cloud-native and hybrid...Remote work- ...Overview Title: C# Developer Location: Irvine, CA Duration: 5+ months plus Qualifications ~5+ years of C# .NET coding experience ~ Proven experience with C/C++, VB ~ Working experience in ActiveX ~ Working experience in COM object and IPC ~ Experience...Work experience placement
- Front is the leading AI-powered customer service platform built for collaboration. Front brings core support channels into a modern, intuitive workspace where teams can collaborate on requests, automate manual processes, and delight customers across their entire lifecycle...Full timeWork at officeRemote workWork from homeWorldwideFlexible hours3 days per week
- ...robots must live and learn alongside us. Thats why were focused on developing friendly home robots designed to integrate seamlessly into... ...to access low?level system resources (e.g. V4L, POSIX, OpenGL, CUDA) ~ Deep understanding of OS primitives and experience applying...Local area
- ...performance tuning of large-scale HPC clusters Experience with MPI, RPC frameworks, or distributed runtimes Experience with NCCL, CUDA and GPU Kernels Knowledge of RDMA APIs (e.g., libibverbs) and transport semantics Experience with NIC drivers or NIC...
$184k - $287.5k
.../expert/pipeline-parallelism, prefill-decode disaggregation. Develop, optimize, and benchmark GPU kernels (hand-tuned and compiler-generated... ...SGLang). Familiarity with GPU programming and performance: CUDA, memory hierarchy, streams, NCCL; proficiency with profiling/...$241.8k - $409.2k
...'re completely overhauling our software stack and embracing the CUDA ecosystem. Our goal is to achieve over 90% compatibility with cuBLAS... .... Job Responsibilities Software Technical Strategy Develop and refine a comprehensive 3-year roadmap for a software stack...Full time






