GPU Systems Engineer - HPC / Parallel Computing

$160k - $320k

Vast.ai

About Us
Vast.ai’s cloud powers AI projects and businesses all over the world. We are democratizing and decentralizing AI computing, reshaping our future for the benefit of humanity. We are a small, growing, and highly motivated team dedicated to an ambitious technical plan. We operate with a flat mobile organizational structure where everyone contributes directly to the company's mission. Leadership is earned by those who show initiative and deliver excellence. We seek engineers/researchers with strong intrinsic drive, a true passion for advancing the state of the art, and a mix of excellent research, coding, and communication skills.

Location
On-site at our office in San Francisco or Westwood, Los Angeles.

About the Role
We're looking for a systems engineer with HPC or parallel programming experience to help scale AI inference. You'll leverage your knowledge of high-performance systems to optimize GPU performance at the bleeding edge of AI.

Employment Details
Full-time
On-site at either our SF or LA offices

Tech Stack
CUDA/C++, GPGPU, Python, Linux.

Key Responsibilities
Design and optimize GPU kernels and tensor libraries.
Translate HPC techniques into scalable AI inference solutions.
Evaluate emerging architectures and resource management approaches.
Collaborate with technical leadership to improve GPU infrastructure efficiency.

Ideal Experience
Advanced C++ (C++17/20 preferred).
Expertise with at least one parallel framework (CUDA, HIP, SYCL, OpenCL, OpenACC, or similar).
Strong background in systems optimization and HPC performance tooling.
Familiarity with distributed training/inference frameworks (bonus).

Annual Salary Range
$160,000 – $320,000 + equity + benefits.

Benefits
Comprehensive health, dental, vision, and life insurance.
401(k) with company match.
Meaningful early-stage equity.
Onsite meals, snacks, and close collaboration with founders/tech leaders.
Ambitious, fast-paced startup culture where initiative is rewarded.

Vacancy posted 4 days ago

Similar jobs that could be interesting for you
Based on the GPU Systems Engineer - HPC / Parallel Computing in San Francisco, CA vacancy

GPU HPC Systems Engineer for AI Inference & Equity
$160k - $320k
A leading AI computing firm is seeking a Systems Engineer in San Francisco or Los Angeles to scale AI inference... ...have strong C++ skills, HPC experience, and knowledge of parallel programming techniques.... ...Responsibilities include designing GPU kernels, optimizing...
Suggested
Vast.ai
San Francisco, CA
4 days ago
Senior HPC GPU Compute Engineer (Hybrid SF)
...Francisco is seeking an experienced engineer to manage and operate GPU clusters. The role requires over 5 years... ..., a deep understanding of hardware systems, and a passion for automating fleet... ...a collaborative work culture.
Suggested
The San Francisco Compute Company
San Francisco, CA
3 days ago
SYSTEMS ENGINEERING AND ANALYSIS
$300 per month
...site Department Cloud Engineering Crusoe's mission is to... ...Engineer specializing in Systems Applications. This... ...and development of our compute platform, specifically... ...etc.) in cutting-edge AI/HPC environments. Kernel &... ...ML workloads, including GPU virtualization. Previous...
Suggested
Full time
Temporary work
ProducePay
San Francisco, CA
2 days ago
Software Engineer — GPU Networking & Distributed Systems
...help build the platform engineers turn to to ship AI... ...the global operating system for distributed, heterogeneous... ..., the network is the computer. We are looking for... ...engineers to lead our GPU Networking efforts,... ...Serving, Wide Expert Parallelism (WideEP), and lightning...
Suggested
Flexible hours
Baseten
San Francisco, CA
4 days ago
Founding GPU Kernel Engineer
$285k - $315k
...re looking for a Founding GPU Kernel Engineer who lives right at the boundary... ...profiling tools: Nsight Compute, Nsight Systems, rocprof, or equivalents... ...patterns Nice to Have HPC background: experience with... ...checkpointing, pipeline/tensor parallelism, ZeRO-style optimizations...
Suggested
Full time
Work at office
Relocation package
SF Tensor
San Francisco, CA
4 days ago
Low-Latency Inference Systems Engineer - On-Device & GPU
...develop low-latency inference pipelines for on-device deployment in robotics. The role involves designing and optimizing distributed systems on GPU clusters, implementing efficient low-level code such as CUDA and Triton, and managing workloads to ensure high throughput and...
Genesis AI
San Francisco, CA
4 days ago
Storage Systems Engineer
...the Facility for Advanced Computing (FAC), storage and systems, including support for large... ...The Storage Systems Engineer will: Work with the... ...environments, including VAST, parallel filesystems, and enterprise... ...integration between storage and HPC compute systems,...
Work experience placement
Worldwide
University of California, San Francisco
San Francisco, CA
7 hours ago
Senior GPU HPC Platform Reliability Engineer
...research company in San Francisco is seeking a software engineer for its Fleet High Performance Computing team. In this role, you'll ensure the reliability and... ...uptime of the compute fleet, working with automation systems and monitoring tools. Ideal candidates have...
OpenAI
San Francisco, CA
3 days ago
HPC AI Technologist
$90k - $210k
...Overview: The Cambridge HPC AI Technologist is a... ...end-to-end research computing system solutions. You will leverage... ...Science, Computer Engineering, or science related... ...display solid knowledge of GPU-focused hardware/... ...storage technologies and parallel filesystems (Lustre,...
Full time
Local area
Remote work
Cambridge Computer Services, Inc
San Francisco, CA
1 day ago
Software Engineer, GPU Infrastructure - HPC
$230k
...Fleet team at OpenAI supports the computing environment that powers our... ...development. We oversee large-scale systems that span data centers, GPUs, networking... ...the role As a software engineer on the Fleet High Performance Computing (HPC) team, you will be responsible for...
OpenAI
San Francisco, CA
1 day ago
Senior Systems Engineer, Compute & AI Virtualization
$137k - $161k
ProducePay, located in San Francisco, CA, is seeking a passionate Senior/Staff Software Engineer specializing in Systems Applications. This role focuses on design and development for their AI compute platform, emphasizing the construction of virtualization applications. The...
ProducePay
San Francisco, CA
2 days ago
Remote- Systems Engineering -
Summary As a systems engineer, you’ll work on pioneering machine learning... ...large numbers of experiments in parallel across local and cloud GPUs,... ...cloud and physical GPU resources. Implementing a caching... ...the dream of the *personal* computer, where computers become...
Remote job
Local area
Flexible hours
Stars Arena
San Francisco, CA
3 days ago
Senior Storage Systems Engineer
$148.5k - $161k
...site Department Cloud Engineering Crusoe is on a mission... ...time. The demand for AI compute is boundless, and power... .... As a Senior Storage Systems Engineer, you will be... ...-class AI training and HPC workloads. You will lead... ...that our sustainable GPU clusters have the reliable...
Full time
Temporary work
Epoch Biodesign
San Francisco, CA
3 days ago
Founding Systems Engineer — AI‑Native HPC for Drugs
$150k - $240k
..., based in San Francisco, is looking for a founding engineer to contribute to a high-performance computing platform for AI-powered drug discovery. You'll join... ...responsibilities from day one. Ideal candidates excel in low-level system architecture and possess strong problem-solving...
10x Science (YC W26)
San Francisco, CA
3 days ago
Senior ML Training Systems Engineer - Distributed GPU Infra
...company in San Francisco is looking for a Senior Software Engineer to build scalable infrastructure for large‑scale training and... ...of foundation models. You will design distributed training systems and optimize GPU utilization while collaborating with cross-functional teams...
Baseten
San Francisco, CA
4 days ago
System & Network Engineering
$335k
...that powers large-scale AI systems. We design and deliver next-generation compute environments that... ...We are seeking a System Engineer (Network / Storage / Systems... ...faults across firmware, NIC, GPU, server, and platform... ..., AI clusters, HPC environments, or data center...
Work at office
Relocation package
OpenAI
San Francisco, CA
4 days ago
Staff ML Systems Engineer - GPU & Distributed Training
...an experienced professional to optimize and build distributed training systems using PyTorch. The ideal candidate has over 8 years of experience in distributed systems, high-performance computing, and extensive expertise in Python, CUDA, and cuDNN. You will work on profiling...
Genesis AI
San Francisco, CA
4 days ago
Compute Platform Engineer - GPU & Multi-Cloud Infra
B Capital is seeking a Systems Engineer to join its Compute Platform team in San Francisco. This role involves maintaining a K8s-based platform and solving complex systems challenges, focusing on GPU infrastructures and multi-cloud environments. The ideal candidate has...
B Capital
San Francisco, CA
5 days ago
Staff Hardware Systems Engineer
$50 per hour
...mission to align the future of computing with the future of the... ...About the Role: The Hardware Systems Engineering team at Crusoe is at the forefront... .... ~ Experience with CPU/GPU hardware and server system architecture... ...or coursework in AI/HPC systems hardware development...
Full time
Temporary work
Crusoe
San Francisco, CA
7 hours ago
Senior Hardware Systems Engineer
$172k - $209k
...time. The demand for AI compute is boundless, and power is... ...Production / Sustaining Engineer to strengthen Crusoe’s Hardware Systems Engineering team and close... ...reliability across Crusoe Cloud’s GPU- and CPU-based... ...how to leverage them in AI/HPC environments. Expertise...
Temporary work
Crusoe
San Francisco, CA
2 days ago
Founding GPU Compiler Engineer
$285k - $315k
...AI and high-performance computing depends on rethinking... ...foundations of AI and HPC. SF Tensor is proudly... ...partnering with researchers, engineers, and organizations who... ...'re hiring a Founding GPU Compiler Engineer to... ...performance regression systems Work closely with ML researchers...
Full time
Work at office
Relocation package
SF Tensor
San Francisco, CA
3 days ago
Senior C++ Systems Engineer — GPU Cloud, Low-Latency
Thunder Compute is seeking a talented C++ developer in San Francisco to focus on core systems development with responsibilities including performance optimization, systems debugging, and research. The role requires top-tier C++ skills, a strong background in low-level systems...
Full time
Thunder Compute
San Francisco, CA
2 days ago
Systems Engineer / DevOps
$120k - $180k
...DevOps and Systems Team Our unique machine learning needs led us to open our own data centers, with an emphasis on GPU resources. Even with these data centers, we maintain a hybrid... ...teams and management levels Degree in computer science, or similar, is an added plus...
Hive
San Francisco, CA
1 day ago
Software Engineer - GPU Kernel
About the job FriendliAI is looking for a GPU Kernel Engineer to design, build, and optimize the low-level compute kernels that power our large-scale, GPU-... ...years of experience in GPU programming, HPC, or performance-critical systems Bachelor’s or Master’s degrees in...
Flexible hours
FriendliAI
San Francisco, CA
4 days ago
Machine Learning Systems Engineer, Research Tools
$320k - $405k
...Machine Learning Systems Engineer, Research Tools San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic... ...enables scientific progress Distributed systems and parallel computing for ML workflows Large language models or other transformer...
Work at office
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
7 hours ago
Machine Learning Engineer - Distributed ML Systems
...looking for Senior/Staff engineers with 5+ years of experience in distributed systems and ML large‑scale training... ...and optimize model‑parallel training strategies (data... ...communication overhead. Optimize GPU utilization, memory efficiency, and compute performance across...
Remote work
Visa sponsorship
Pluralis Research
San Francisco, CA
4 days ago
GPU Systems Support Engineer for High-Performance Inference
$200k - $280k
A leading AI company in San Francisco is looking for a Staff Machine Learning Engineer to enhance inference systems at production scale. You will design algorithms, optimize performance, and collaborate on RL and post-training pipelines. Ideal candidates have 3+ years of...
Full time
AI Chopping Block, Inc.
San Francisco, CA
1 day ago
ML Systems Engineer, Robotics
$248.8k - $311k
...across Robotics, Autonomous Vehicles, and Computer Vision. This position will be a key... ...Physical AI. The Role As an ML Systems Engineer on the Physical AI team, you will design... ...for cloud environments, including GPU-level algorithm optimizations (e.g., CUDA...
Full time
Scale AI
San Francisco, CA
2 days ago
Tech Lead, AI Compute Infrastructure
...Tech Lead, AI Compute Infrastructure Los Angeles,... ...You will be the core engineer responsible for building... ...video. Optimize GPU Utilization: Design and... ...scaling of our distributed systems. Minimum Requirements... ...AI infrastructure, or HPC systems. ~...
Full time
HeyGen
San Francisco, CA
1 day ago
GPU Kernel Engineer
...GPU Kernel Engineer Sciforium is an AI infrastructure company developing next-generation multimodal... ...power next-generation large-scale AI systems. You will work across the hardware–... ...engineers, and model-serving teams to optimize compute performance across the stack. Work...
Flexible hours
Sciforium
San Francisco, CA
1 day ago
