GPU Systems Engineer - HPC / Parallel Computing
$160k - $320k
Vast.ai
About Us

Vast.ai’s cloud powers AI projects and businesses all over the world. We are democratizing and decentralizing AI computing, reshaping our future for the benefit of humanity. We are a small, growing, and highly motivated team dedicated to an ambitious technical plan. We operate with a flat mobile organizational structure where everyone contributes directly to the company's mission. Leadership is earned by those who show initiative and deliver excellence. We seek engineers/researchers with strong intrinsic drive, a true passion for advancing the state of the art, and a mix of excellent research, coding, and communication skills.

Location

On-site at our office in San Francisco or Westwood, Los Angeles.

About the Role

We're looking for a systems engineer with HPC or parallel programming experience to help scale AI inference. You'll leverage your knowledge of high-performance systems to optimize GPU performance at the bleeding edge of AI.

Employment Details

- Full-time
- On-site at either our SF or LA offices

Tech Stack

CUDA/C++, GPGPU, Python, Linux.

Key Responsibilities

- Design and optimize GPU kernels and tensor libraries.
- Translate HPC techniques into scalable AI inference solutions.
- Evaluate emerging architectures and resource management approaches.
- Collaborate with technical leadership to improve GPU infrastructure efficiency.

Ideal Experience

- Advanced C++ (C++17/20 preferred).
- Expertise with at least one parallel framework (CUDA, HIP, SYCL, OpenCL, OpenACC, or similar).
- Strong background in systems optimization and HPC performance tooling.
- Familiarity with distributed training/inference frameworks (bonus).

Annual Salary Range

$160,000 – $320,000 + equity + benefits.

Benefits

- Comprehensive health, dental, vision, and life insurance.
- 401(k) with company match.
- Meaningful early-stage equity.
- Onsite meals, snacks, and close collaboration with founders/tech leaders.
- Ambitious, fast-paced startup culture where initiative is rewarded.



