Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer, High Performance Computing

Eventual

About Eventual Every breakthrough Physical AI system — humanoid robots, autonomous vehicles, video generation models — is trained on petabytes of video, lidar, radar, and sensor data. But today's data platforms (Databricks, Snowflake) were built for spreadsheet‑like analytics, not the multimodal corpora that power AI. Robotics and video‑AI teams now lose 20‑40% of their training time to dataloading alone. GPU bandwidth has grown 2‑3× per generation. Storage and pipelines haven't. The gap widens every year. Eventual was founded in 2022 to close it. Our open‑source engine, Daft, is the distributed data engine purpose‑built for multimodal AI — already running 2 PB/day at Amazon, 60‑100 PB at another FAANG company, and in production at Mobileye, TogetherAI, and CloudKitchens. We are building a video‑native index on top of our engine for Physical AI that streams curated datasets to GPUs at line rate. Saturates B200s today. Aimed at NVL72 and Vera Rubin tomorrow. We're building this in partnership with the top PhysicalAI labs and public AI infrastructure companies today. We have raised $30M from Felicis, CRV, Microsoft M12, Citi, Essence, Y Combinator, Caffeinated Capital, Array.vc, and angels from the co‑founders of Databricks and Perplexity. We've assembled a world‑class team from AWS, Render, Pinecone and Tesla. We have spent our careers powering the last generation of PhysicalAI in self‑driving, and are excited to now do this for the next. Join our small (but powerful!) team working together 4 days/week in our SF Mission district office. Your Role As a Systems Engineer on the Dataloading team, you'll build the layer that turns multi‑petabyte video corpora into dict[str, Tensor] already on the GPU at line rate. We work with the top labs training Physical AI on the newest generation hardware — H100, B200, GB200, NVL72, with Vera Rubin on the horizon — on billions of dollars worth of compute, in collaboration with partners that are the largest public AI companies on Earth. Our job is to keep those GPUs fed: rank‑aware sampling, NVMe caching, video and sensor co‑loading, random access into clips, decode pipelining. Streaming alone can already saturate a B200; the hard part is enabling the complex sampling patterns researchers actually need without giving up a single percentage point of MFU. This is a systems engineering role for someone who feels physical pain when a system is slow. You won't need GPU experience on day one — we'll uplevel you on NVL72, CUDA, and SLURM. We will need you to bring real expertise on what happens between NVMe, network, memory, and CPU, and a deep instinct for where bytes go. Key Responsibilities Design and build the video‑native dataloader: rank‑aware, NVMe‑cached, random‑access into clips, returns tensors directly to the GPU. Profile and optimize the full data path from object store → NVMe → page cache → host RAM → device RAM. Eliminate every avoidable copy and stall. Saturate the latest hardware (B200, GB200, NVL72) on real customer training jobs. Push toward Vera Rubin bandwidth requirements. Own performance benchmarks against customer baselines (custom DataLoaders, DALI, decord, LeRobot) and against our own historical numbers — regressions get caught at PR time. Partner with researchers at our partner labs to land the loader in their training stack and measure MFU end‑to‑end. Work cross‑team with Storage Infrastructure on the index/format boundary and with Visual Understanding on the model‑output ingestion path. What we look for Obsession with systems‑level performance. You can recite Jeff Dean's "numbers every programmer should know" in your sleep. You eat flamegraphs for breakfast. Strong opinions on io_uring — love it or hate it, you've earned the opinion. Live and breathe Rust, C++, or C. You reach for them when it matters and you know why. Strong familiarity with operating systems — page cache, scheduling, syscalls, NUMA, memory hierarchies. A sense for where bytes actually go: NVMe vs. memory vs. network vs. PCIe vs. NVLink, and the throughput and latency budgets of each. Nice to have Experience working with GPUs is a plus, but you don't need it on day one. Experience working with SLURM, Kubernetes for GPU workloads, or other HPC schedulers. Hands‑on CUDA experience. Deep expertise on memory and caching subsystems — page cache tuning, hugepages, NUMA pinning, GPU‑Direct Storage. Worked on video decode pipelines (PyAV, decord, NVDEC) or PyTorch DataLoader internals. Contributed to open‑source systems projects in Rust/C++. Perks & Benefits In‑person, tight‑knit team — 4 days/week in our SF Mission office. Competitive comp and meaningful startup equity. Catered lunches and dinners for SF employees. Commuter benefit. Team‑building events and poker nights. Health, vision, and dental coverage. Flexible PTO. Latest Apple equipment. 401(k) plan with match. If slow systems evoke emotional pain for you and you want to spend the next few years making the most expensive GPU clusters on the planet earn their keep, we'd love to talk. #J-18808-Ljbffr Eventual

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Software Engineer, High Performance Computing in San Francisco, CA vacancy
  •  ...Employment Type Full time Department Engineering About Eventual Every breakthrough AI application...  ...district office. Your Role As a Software Engineer, you will be responsible for building...  ...Key Responsibilities Design and build highly reliable and resilient products and... 
    Performance
    Full time
    Work at office
    Immediate start
    Flexible hours
    Night shift

    Alumni Ventures

    San Francisco, CA
    4 days ago
  • $230k - $347k

     ...debugging issues across hardware and software, building the tools needed to...  ...into job failures and performance, or building workflows that...  ...infrastructure teams to identify high-leverage operational problems...  ...scheduling, storage, and compute systems. Raise the bar for... 
    Performance

    OpenAI

    San Francisco, CA
    1 day ago
  • $150k - $170k

     ...Software Engineer Schmidt Sciences is a nonprofit organization founded...  ...impact including AI and advanced computing, astrophysics, biosciences,...  ...Improve AI reliability and performance in areas of limited...  ...AI. This includes selected high-impact grantmaking, such as... 
    Performance
    Local area

    Schmidt Entities

    San Francisco, CA
    1 day ago
  • $140k - $200k

     ...for physical AI—giving engineers a strong foundation...  ...models combine vision's high angular resolution with...  ...machine learning, and software engineering, with...  ...implement and optimize a high-performance pipeline capable of...  ...embedded computing platforms. This position... 
    Performance
    Full time
    Work at office

    Zendar

    Berkeley, CA
    6 days ago
  • $230k - $405k

    About the Team Compute Infrastructure builds the platform that turns...  ...of compute into a reliable engine for frontier AI. We design,...  ...data centers, orchestration software, agent infrastructure,...  ..., deep system optimization, high‑performance networking, storage, fleet health... 
    Performance

    Centaur Labs

    San Francisco, CA
    2 days ago
  • About the Team The Computer-Using Agent team is responsible for developing and deploying...  .... Combining rigorous research with high-quality engineering across evaluation, data, training, RL...  ...with researchers to build high-performance systems at massive scale for specialized... 
    Performance
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    18 hours ago
  • $164.2k - $205.2k

    Position Overview At Databricks, the Compute Infrastructure organization...  ...efficiency. Job Description As a Senior Software Engineer on the Compute Infra team, you...  ...build world‑class products with high velocity and best‑in‑class performance. Design the workload... 
    Performance
    Local area

    I did my part and supported the Regular Toilet

    San Francisco, CA
    2 days ago
  • $164.2k - $205.2k

    Senior Software Engineer, Compute Infrastructure RDQ427R175 Overview At Databricks, we are passionate about helping data teams...  ...engineers to build world‑class products with high velocity and best‑in‑class performance Design the workload orchestration and scheduling... 
    Performance
    Local area

    Databricks Inc.

    San Francisco, CA
    2 days ago
  • $196k - $339.9k

     ...work gets done. The Compute Platform team is...  ...services at scale. We enable engineers across the company to...  ..., governance, and performance guardrails. We are...  ...architecture proposals and guide high-impact infrastructure...  ...: ~8+ years of software engineering experience... 
    Performance
    Remote job
    Full time
    For contractors

    Airtable

    San Francisco, CA
    18 hours ago
  • $230k - $385k

     ...is a hands-on infrastructure role for engineers who want to work on deeply technical systems...  ...behind a simple interface Improve performance, reliability, and operational...  ...conditional offer of employment: protect computer hardware entrusted to you from theft, loss... 
    Performance

    OpenAI

    San Francisco, CA
    2 days ago
  •  ...proprietary visual intelligence engine with full spatial reasoning,...  ...focusing on deploying and optimizing computer vision systems on NVIDIA...  ...systems, computer vision, and software development, with a passion for delivering high-performance solutions in real-world... 
    Performance
    Flexible hours

    EchoTwin AI

    San Francisco, CA
    1 day ago
  •  ...the team The Stream Compute team at Stripe builds and...  ...distributed systems with high reliability and performance to meet Stripe's scaling,...  ...Partnering with infrastructure engineers, adjacent platform teams,...  ...infrastructure as a product Strong software engineering skills and a... 
    Performance
    Remote work

    Stripe

    San Francisco, CA
    4 days ago
  •  ...Introduction At IBM Software, we transform client...  ...the Team The Secure Compute team builds and operates...  ...in even the most highly regulated industries....  ...As a Staff Software Engineer on the Secure Compute...  ...services) with diverse performance and isolation needs.... 
    Performance

    IBM

    San Francisco, CA
    5 days ago
  • $148k - $260k

     ...world in a positive way. To learn more visit: As a Software Engineer in High-Performance Onboard Algorithms, you will be a key contributor to the...  ...signal processing software, leveraging parallel computing architectures (e.g., CPU, GPU, specialized accelerators... 
    Performance
    Full time
    Work at office
    Work from home
    Flexible hours

    Waabi

    San Francisco, CA
    2 days ago
  • Overview We\'re looking for a Staff Software Engineer - Computer Vision Deployment to build and scale the infrastructure that powers our AI-driven...  ...benchmarking, and evaluation frameworks to ensure model performance and reliability in production environments. Required... 
    Performance
    Work at office
    3 days per week

    Claryo

    San Francisco, CA
    18 hours ago
  • $180k - $280k

     ...building a platform that powers all of compute at Vercel. That means we provide all...  ...untrusted code from our customers: even a 1% performance improvement has massive repercussions...  ...About You: You have 5+ years of software engineering experience, Golang preferred. You have... 
    Performance
    Remote job
    Work at office
    Work from home
    Monday to Friday
    Flexible hours

    vercel.com

    San Francisco, CA
    1 day ago
  •  ...Developer to design, build, and maintain high-quality mobile applications. The ideal...  ...Android SDK, and a Bachelor's degree in Computer Science or related field. Join us to work...  ...cutting-edge mobile technologies and enhance application performance. #J-18808-Ljbffr Simera
    Performance

    Simera

    San Francisco, CA
    2 days ago
  • $192.2k - $260k

     ...The Center for Quantum Computing (CQC) is a multi-...  ...disciplinary team of scientists, engineers, and technicians, on a...  ...to join our growing software team. You will work...  ..., and improving the performance of our quantum devices...  ...be able to translate high-level science requirements... 
    Performance
    Permanent employment
    Local area
    Flexible hours

    Amazon

    San Francisco, CA
    1 day ago
  •  ...cloud provider in San Francisco is seeking a skilled Network Software Engineer to design and build an advanced networking foundation for...  ...will work with technologies like Rust and Go to develop high-performance systems and SDN solutions. The ideal candidate has strong... 
    Performance

    Blaxel

    San Francisco, CA
    4 days ago
  • A leading AI-native cloud startup is seeking a Network Software Engineer to architect and build high-performance networking solutions. The ideal candidate will have over 3 years of experience and a strong proficiency in Rust and Go. This role involves developing cutting... 
    Performance

    Jack & Jill/External ATS

    San Francisco, CA
    4 days ago
  • $180k - $250k

    Unto Labs is on the lookout for a Systems Engineer to work on pioneering blockchain technology. The role involves designing and optimizing high-performance systems, especially for distributed computing focusing on performance from commodity hardware. If you have expertise... 
    Performance

    Unto Labs

    San Francisco, CA
    2 days ago
  •  ...technology firm in San Francisco is seeking a Security Software Engineer to join their Infrastructure Security team. This role requires...  ...and partner with engineers to enhance security in high-performance compute clusters. This position offers a chance to work on critical... 
    Performance

    algojobs

    San Francisco, CA
    18 hours ago
  •  ...Tech Lead, AI Compute Infrastructure Los Angeles, Palo Alto, San...  ...training data pipelines to high-throughput, low-latency video...  ...Responsibilities You will be the core engineer responsible for building the...  ...will directly impact model performance, developer productivity, and... 
    Performance
    Full time

    HeyGen

    San Francisco, CA
    1 day ago
  •  ...the physical world. Our high-fidelity mapping...  ...Lead for the Applied Computer Vision Algorithms Team...  ...general ML code for high-performance execution on CPU and GPU...  ...Mentorship: Work with engineering leadership to define...  ...for production-level software development. Specialized... 
    Performance
    Work at office
    3 days per week

    Niantic Spatial, Inc

    San Francisco, CA
    4 days ago
  •  ...AI/ML Engineer (Computer Vision) Location: On site, Bay Area, CA A...  ...industrial market. This is a high impact opportunity for someone...  ...research into reliable software, working close to product decisions...  ...findings into measurable performance gains. Contribute to data... 
    Performance

    Blue Signal LLC

    San Francisco, CA
    18 hours ago
  •  ...experience with GPU programming and performance work (CUDA, Triton, CUTLASS,...  ..., 3+ years of professional software engineering experience with meaningful work on ML inference or high-performance systems ,...  ...and debugging tools: Nsight Compute/Systems, CUDA-GDB, PTX/SASS... 
    Performance

    Perplexity AI

    San Francisco, CA
    2 days ago
  • $175k - $250k

     ...Senior AI/ML Engineer: Python & Scientific Computing SF, NYC, Remote About Swayable Swayable is a...  ...a Senior Engineer blending Python software development expertise with scientific...  ...techniques, and architecture for high-performance computing. You will work with a talented... 
    Performance
    Remote work

    Swayable

    San Francisco, CA
    18 hours ago
  •  ...aspire to build a team of smart, high-caliber players that inspire each...  ...options. We're looking for a Backend Engineer with experience building high-performance micro-services and APIs. This is...  ...look for: ~ BS (or higher) in Computer Science, related technical field or... 
    Performance
    Remote work
    Flexible hours

    Fieldmaterials

    San Francisco, CA
    10 days ago
  • $105k - $125k

     ...intelligence collection enable engineering, safety, and security teams...  ...Design and implement highly reliable, distributed, backend...  ...architectures; Monitor and improve performance, reliability, and resource...  ...Qualifications Degree in Computer Science, Engineering, or related... 
    Performance
    Remote work

    10a Labs

    San Francisco, CA
    7 days ago
  • $230k - $385k

     ...goal is to make AI feel like a real software engineering teammate inside real workflows: editing...  ...- then turn them into reliable, high-performance product. You'll help define what great...  ...conditional offer of employment: protect computer hardware entrusted to you from theft,... 
    Performance

    OpenAI

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer, High Performance Computing. Be the first to apply!