Hybrid SF RL Infrastructure Engineer — GPU-Scale Orchestration

Vmax

Vmax is seeking a Member of Technical Staff for RL Infrastructure in San Francisco. This role involves building infrastructure for distributed RL training and inference across thousands of GPUs. You will improve the reliability and throughput of RL experiments while owning projects from architecture to deployment. Candidates should have strong software engineering experience, particularly in building infrastructure for LLM inference or RL training, and be able to work closely with ML researchers.

Vacancy posted 4 days ago
Similar jobs that could be interesting for you
Based on the Hybrid SF RL Infrastructure Engineer — GPU-Scale Orchestration in San Francisco, CA vacancy
  • $250k - $380k

     ...Employment Type Full time Department Scaling Compensation $250K - $380K •...  ...s LLM training and inference infrastructure that powers frontier models...  ..., and execution across vast GPU/accelerator fleets. By...  ...the Role We are looking for an engineer to design and implement the dataset... 
    Suggested
    Full time
    Work at office
    Local area
    Relocation package
    Flexible hours

    Slope

    San Francisco, CA
    1 day ago
  • $170k - $230k

     ...Senior Core Infrastructure Engineer San Francisco About Highnote...  ..., container orchestration frameworks, and infrastructure...  ...in building large-scale, distributed systems...  ...in San Francisco are hybrid and include core working...  ...from our downtown SF office. Highnote believes... 
    Suggested
    Work at office
    Local area
    Home office
    Flexible hours

    HighNote

    San Francisco, CA
    4 days ago
  • A cutting-edge AI video platform is seeking a Senior Software Engineer (Infrastructure) to manage its GPU deployments and maintain a reliable AWS backbone. You will collaborate with specialized providers to ensure high availability and architect scalable systems, impacting... 
    Suggested

    Jack & Jill/External ATS

    San Francisco, CA
    3 days ago
  •  ...Role We're hiring a Senior Infrastructure Engineer to own and scale the core systems behind a...  ...environments, Kubernetes orchestration, CI/CD, observability,...  ...work on‑site 5 days/week in SF or Vancouver Nice‑to‑...  ...experience Experience with hybrid or on‑prem deployments Background... 
    Suggested
    Work at office

    Emeraldadvantageconcepts

    San Francisco, CA
    1 day ago
  • A leading AI infrastructure company is seeking a Staff Infrastructure Engineer in San Francisco. In this role, you will own the systems that power the company at scale, focusing on reliability, scalability, and developer velocity. You will be responsible for designing... 
    Suggested
    Work at office

    Salient

    San Francisco, CA
    2 days ago
  •  ...Jill is looking for a Principal Software Engineer to join their team in San Francisco. In this...  ...with a seasoned team to shape a high-scale platform and innovate on a modern card-issuing...  ...-centric engineering are essential. This hybrid role offers unique opportunities to... 

    Jack & Jill

    San Francisco, CA
    3 days ago
  • A well-funded data platform startup is seeking a software engineer to contribute to their open-sourced data engine. The hybrid role involves working in the SF office three days a week, focusing on both frontend and backend development. Candidates should have 3+ years of... 
    Work at office
    3 days per week

    DRH Search

    San Francisco, CA
    4 days ago
  •  ...Senior HPC & GPU Infrastructure Engineer Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary...  ...AMD with hands-on support from AMD engineers the team is scaling rapidly to build the full stack powering frontier AI... 
    Flexible hours

    Sciforium

    San Francisco, CA
    3 days ago
  • $180k - $200k

     ...Infrastructure Engineer (Storage) New York, New York, United States...  ...cost-efficient, large-scale compute. Teams get the...  .... This role can work hybrid out of one of our US-...  ...hubs (Seattle, NYC, or SF) or fully remote...  ...technologies (e.g., RDMA, GPU Direct Storage) Experience... 
    Remote work
    Work from home
    Flexible hours

    Lightning AI

    San Francisco, CA
    4 days ago
  •  ...the globe, we offer an innovative GPU marketplace and AI inference service...  ...Role We're seeking a Senior Infrastructure Engineer to help build and scale Hyperbolic's GPU Cloud Marketplace...  ...suppliers into a programmable, orchestrated pool that serves thousands of AI developers... 
    Remote work

    Hyperbolic Labs

    San Francisco, CA
    1 day ago
  • Hayden AI Technologies, Inc. is looking for a Senior Software Engineer to join their Platform - Cloud Events team in San Francisco. The...  ...services, and expertise in Golang or Python. The position follows a hybrid schedule, requiring in-office presence at least 3 days a week.... 
    Work at office
    3 days per week

    Hayden AI Technologies, Inc.

    San Francisco, CA
    12 hours ago
  • The role As an infrastructure engineer, you'll build the foundational...  ...environments, orchestration infrastructure,...  ...on‑prem, VPC, or hybrid cloud). Strong...  ...concerns at scale. Strong candidates...  ...infrastructure: GPU scheduling, model...  ...founders. We've built RL infrastructure at... 
    Work at office
    Remote work
    Visa sponsorship
    Relocation package

    Applied Compute

    San Francisco, CA
    4 days ago
  •  ...building the next generation of AI infrastructure: large-scale AI datacenters and the orchestration platform that coordinates them....  ...Labs is seeking a Network Engineer to design, build, and scale the...  ...Experience with AI, HPC, GPU, or large-scale distributed infrastructure... 

    Gimlet Labs

    San Francisco, CA
    4 days ago
  •  ...technology firm in San Francisco is looking for an experienced backend engineer to own and evolve systems that support LLM-powered products....  ...ability to lead complex initiatives. This position offers a hybrid work model, competitive benefits, and the opportunity to... 

    EvenUp

    San Francisco, CA
    2 hours ago
  •  ...pioneering tech firm in San Francisco is looking for an experienced professional to manage GPU cluster deployments. This role involves overseeing vendor relations, supporting engineering teams, and facilitating smooth operations in data centers. Ideal candidates will have... 

    The San Francisco Compute Company

    San Francisco, CA
    3 days ago
  •  ...Forward Deployed Engineer (FDE) to assist enterprises...  ...in deploying, scaling, and operating...  ...on FriendliAI infrastructure. You will work...  ...challenges like scaling, orchestration, and monitoring....  ...Experience with GPU-based computing...  ...on-prem or hybrid-cloud deployments... 
    Temporary work
    Flexible hours

    FriendliAI

    San Francisco, CA
    4 days ago
  •  ...ML Infrastructure Engineer Spectral Labs is a spatial intelligence...  ...Role: In person in SF Comp: 350-600k+ TC...  ...pipelines to fine tune and run RL on our CAD foundation...  ...training & RL across our GPU cluster of hundreds of...  ...multi-node training at scale Deep understanding of... 

    Spectral Labs

    San Francisco, CA
    5 days ago
  • A technology company in San Francisco is seeking a Backend Engineer to design scalable systems enabling AI-influenced products. This hybrid role requires expertise in backend systems and collaboration with interdisciplinary teams to ensure high-quality outcomes. The ideal... 

    EvenUp

    San Francisco, CA
    1 day ago
  • Alumni Ventures is seeking a Senior Server Engineer to join their Identity Engineering Team in San Francisco. This role focuses on building...  ...strong knowledge in security systems and distributed databases. A hybrid working model is offered.

    Alumni Ventures

    San Francisco, CA
    1 day ago
  • $168.8k - $241.2k

     ...shoulder with Product Management, Engineering, and Sales to bring the...  ...and enablement content to scale product knowledge across Sales...  ...in San Jose / Bay Area; hybrid role requiring 3-4 days per...  ...revolutionizing how data and infrastructure connect and protect organizations... 
    Full time
    Temporary work
    Work experience placement
    Work at office
    Local area
    Flexible hours
    3 days per week

    Cisco

    San Francisco, CA
    2 days ago
  •  ...into actionable input for Product Management and Engineering. Develop training materials and enablement content to scale product knowledge across Sales, Systems...  ...Systems Engineer. Based in San Jose / Bay Area; hybrid role requiring 3-4 days per week in office.... 
    Work experience placement
    Work at office
    3 days per week

    Webex Events (formerly Socio)

    San Francisco, CA
    3 days ago
  • $160k - $200k

     ...Infrastructure Operations Engineer Lightning AI is the company behind PyTorch...  ...-efficient, large-scale compute. Teams get the...  ...scale for GPU infrastructure. This...  .... This role can work hybrid out of one of our US-...  ...hubs (Seattle, NYC, or SF) or fully remote within... 
    Remote work
    Work from home
    Flexible hours

    Lightning AI

    San Francisco, CA
    1 day ago
  •  ...first AI software engineer, and Windsurf, an...  ...of AI, including Scale AI, Palantir, Cursor...  ...the speed of the infrastructure underneath it. Every...  ..., experiment orchestration, data pipelines, and...  ...jobs reliably across GPU clusters. This...  ...up at our largest RL training scales.... 

    Cognition

    San Francisco, CA
    4 days ago
  • $350k

    Research Engineer, RL Infrastructure and Reliability (Knowledge Work) Anthropic’...  ...use every day. As that work scales, the systems behind it...  ...(schedulers, multi‑agent orchestration, sandboxed execution). Prior...  ...position. Location-based hybrid policy: Currently, we expect... 
    Visa sponsorship
    Shift work

    aijoblist

    San Francisco, CA
    3 days ago
  • $200k - $280k

    Engineering San Francisco Full-time $200,000...  ...Role Join our ML Infrastructure team to build...  ...our AI models at scale. You'll work at...  ...systems Optimize GPU utilization and...  ...Kubernetes and container orchestration Understanding of...  ...minimum Hybrid work with SF office access $5... 
    Full time
    Work at office

    Lattice, Inc.

    San Francisco, CA
    2 days ago
  •  ...take on a hands-on role focused on scaling and optimizing ML training...  ...responsibilities include owning the training infrastructure, improving performance, and managing GPU/TPU compute resources. Ideal...  ...will have strong software engineering foundations, hands-on experience... 

    Physical Intelligence

    San Francisco, CA
    1 day ago
  • $180k - $300k

     ...Technical Staff - Infrastructure Engineer Freiburg (...  ...Francisco, we're scaling fast while staying...  ...training possible, to orchestrate resources at...  ...Kubernetes Nvidia GPU drivers, and...  ...us in Freiburg or SF at least 2 days a...  ...office. We operate a hybrid model and cover... 
    Work at office
    Remote work
    Worldwide
    Relocation
    2 days per week

    Black Forest Labs

    San Francisco, CA
    4 days ago
  • $335k

    OpenAI in San Francisco seeks a System Engineer to architect and operationalize essential infrastructure for AI systems. The role demands 7+...  ...debugging and a solid grasp of clustering and scaling in production environments. Offers a hybrid work model with relocation assistance... 
    Relocation package

    OpenAI

    San Francisco, CA
    3 days ago
  • $350k

    Menlo Ventures is seeking a Research Engineer to enhance the reliability and infrastructure of AI systems focused on...  ...operating machine learning systems at scale. Responsibilities include ensuring...  ..., based in San Francisco, with a hybrid work policy requiring office... 
    Work at office

    Menlo Ventures

    San Francisco, CA
    1 day ago
  • $120k - $200k

     ...Senior Infrastructure Engineer At Bland.com, our goal is to empower enterprises to make AI-phone agents at scale. Based out of San Francisco, we're a quickly...  ..., model serving, or GPU computing. Experience...  ...office in Jackson Square, SF with rooftop views If... 
    Work at office
    Night shift

    Bland AI

    San Francisco, CA
    4 days ago
