Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Training Systems Engineer: Distributed & RL

B Capital

B Capital in San Francisco is looking for an engineering professional to architect and optimize core training infrastructure for their AI models. You will work on distributed systems and large-scale data pipelines, focusing on performance and numerical stability. Successful candidates will have strong software engineering skills and experience in either distributed training or data infrastructure. The role offers top-tier compensation and comprehensive health and wellness benefits. #J-18808-Ljbffr B Capital

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the AI Training Systems Engineer: Distributed & RL in San Francisco, CA vacancy
  • A leading AI technology company in San Francisco is looking for a Senior Software Engineer to build scalable infrastructure for large‑scale training and fine-tuning of foundation models. You will design distributed training systems and optimize GPU utilization while collaborating... 
    Training

    Baseten

    San Francisco, CA
    2 days ago
  • A leading AI research company in San Francisco seeks Senior/Staff Engineers skilled in distributed systems and large-scale ML training. Responsibilities include designing systems optimized for low-bandwidth conditions and implementing robust training strategies. Ideal... 
    Training
    Remote job

    Pluralis Research

    San Francisco, CA
    1 day ago
  • An AI and Robotics firm in San Francisco seeks a Staff/Principal ML Systems Engineer to enhance training performance for multimodal robotic data. You will lead efforts to improve end...  ...will have significant experience in distributed training, a strong background in PyTorch... 
    Training

    Maxwell Bond

    San Francisco, CA
    4 days ago
  •  ...nation states. Our team of AI researchers and company...  ...power our research, training, and production environments. These systems form the foundational...  ...multi-tenant isolation. Distributed Systems Architecture:...  ..., service reliability engineering. About You Strong... 
    Training
    Relocation package

    Reflection AI

    San Francisco, CA
    4 days ago
  •  ...AI/ML Engineer (RL & Physical Systems) FLUIX is building the AI Operating System for data centers. We deploy...  ..., from thermal systems to power distribution, where milliseconds and megawatts matter...  ...environments to accelerate training, testing, and Sim2Real deployment.... 
    Training
    Weekend work

    Fluix AI

    San Francisco, CA
    4 days ago
  • $117.2k - $313.7k

     ...Category Software Engineering Job Details About...  ...Salesforce is the #1 AI CRM, where humans with...  ...components/frameworks in distributed filesystems in an ever...  ...that improve system scalability, robustness...  ...promotion, benefits, training, assessment of job performance... 
    Training
    Immediate start
    Remote work

    Salesforce

    San Francisco, CA
    5 days ago
  •  ...About Us Most AI is frozen in place - it...  ...time. Our vision is AI systems that are flexible, personalized...  .... Researchers and ML engineers will hand you...  ...: Design and operate distributed inference systems for...  ...curate the datasets behind training and evaluation. The... 
    Training
    Flexible hours

    Adaption

    San Francisco, CA
    22 days ago
  •  ...Research Systems Engineer As a research systems engineer, you'll train frontier-scale models and develop the...  ..., explore cutting-edge RL techniques, and build the...  ..., with experience in distributed training Strong experimental...  ...learning. Our applied AI team sits side-by-side... 
    Training
    Visa sponsorship
    Relocation package

    Applied Compute

    San Francisco, CA
    3 days ago
  • $150k - $300k

     ...that enables anyone to create, train, and deploy them. We...  ...plane and pair it with the full rl post-training stack: environments...  ...contexts. As a Research Engineer working on Distributed Training, you'll play a...  ...focusing on our decentralizing AI training stack. If you love... 
    Training
    Remote work
    Worldwide
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime-Intellect

    San Francisco, CA
    3 days ago
  • $255k - $405k

    Slope is seeking a Software Engineer for its team in San Francisco, CA. The...  ...for large-scale multimodal training. Responsibilities include managing distributed data pipelines and collaborating...  ...strong experience in distributed systems and possess excellent organizational... 
    Training

    Slope

    San Francisco, CA
    2 days ago
  • $146.5k

     ...team: The ML Data Engineering team powers metadata...  ...users worldwide. Our systems operate at massive...  ...data engineering, and distributed systems, collaborating...  ...cutting-edge generative AI and metadata enrichment...  ...relevant education or training; and other business and... 
    Training
    For contractors
    Local area
    Worldwide
    Home office
    Flexible hours

    Scribd

    San Francisco, CA
    2 days ago
  •  ...AI Systems Engineer Transluce is a fast-moving research lab building the public tech stack for understanding and debugging AI systems...  ...to 400B+ parameter models Behavior elicitation: Distributed RL training and roll-outs allowing thousands of concurrent rollouts... 
    Training
    Flexible hours

    Transluce

    San Francisco, CA
    4 days ago
  • $295k

     ...to seamlessly blend high-level AI capabilities with the constraints of physical systems to improve peoples' lives....  ...About the Role As a Research Engineer, Distributed Data Systems, you will design...  ...powers large-scale multimodal training and evaluation at OpenAI. You'... 
    Training
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    4 days ago
  • $146.5k - $228k

     ...the team: The ML Data Engineering team powers metadata extraction...  ...users worldwide. Our systems operate at massive...  ...data engineering, and distributed systems, collaborating...  ...-edge generative AI and metadata enrichment...  ...relevant education or training; and other business... 
    Training
    Temporary work
    Local area
    Worldwide
    Home office
    Flexible hours

    Scribd

    San Francisco, CA
    3 days ago
  • $142.6k - $261.5k

     ...organizations. Using our product-driven, AI-centric approach, we empower...  ..., designers, and software engineers enable our clients to solve...  .... Knowledgeable in system development lifecycle and technology...  ...and interest in cloud and distributed systems architectures... 
    Summer holiday
    Flexible hours

    EY

    San Francisco, CA
    4 days ago
  • $255k - $405k

     ...multimodal functionalities into our AI products, ensuring they are reliable,...  ...benefit. About the Role As a Software Engineer, Distributed Data Systems, you will design and scale the...  ...infrastructure that powers large‑scale multimodal training and evaluation at OpenAI. You’ll... 
    Training
    Full time
    Work at office
    Local area
    Relocation package
    Flexible hours

    Slope

    San Francisco, CA
    2 days ago
  • $335k

     ...infrastructure that powers large-scale AI systems. We design and deliver next-...  ...that support frontier model training and inference across an...  ...We are seeking a System Engineer (Network / Storage / Systems...  ...firmware, Linux systems, or distributed infrastructure. ~... 
    Training
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    5 days ago
  •  ...research on Protocol Learning : multi-participant training of foundation models where no single participant has...  ...economics. We’re looking for Senior/Staff engineers with 5+ years of experience in distributed systems and ML large‑scale training. You’ll be implementing... 
    Training
    Remote work
    Visa sponsorship

    Pluralis Research

    San Francisco, CA
    1 day ago
  •  ...As a Research Engineer, Distributed Data Systems, you will design and scale the infrastructure that powers large-scale multimodal training and evaluation at OpenAI. You’ll manage distributed data pipelines, collaborate closely with researchers to translate requirements... 
    Training

    OpenAI

    San Francisco, CA
    2 days ago
  • $264.8k - $331k

     ...AI is becoming vitally important in every function...  ...state of the art post-training algorithms to reach the...  ...As an ML Sys Research Engineer, you'll work on building...  ...for our next-gen Agent RL training platform, support...  ...to optimize our ML system. Your customer will be... 
    Training
    Full time

    Scale AI

    San Francisco, CA
    5 days ago
  • A leading tech company based in San Francisco is seeking a Software Engineer to enhance its data and AI platform. The role involves developing high-performance distributed data systems and delivering on ambitious projects such as Delta Lake and performance engineering.... 

    Databricks Inc.

    San Francisco, CA
    1 day ago
  •  ...AI Systems Engineer - Codex Core Agents About The Team The Codex Core Agents team builds...  ...and increasingly part of how models are trained and evaluated, making this one of the...  ...or operated production systems in distributed systems, infrastructure, developer tooling... 
    Training

    OpenAI

    San Francisco, CA
    1 day ago
  • AI Systems Engineer - Codex Core Agents Location San Francisco Employment Type Full time Department...  ...or operated production systems in distributed systems, infrastructure, developer...  ...using LLM systems, model evals, or post‑training feedback loops. Background in compilers... 
    Training
    Full time
    Work at office
    Local area
    Relocation package
    Flexible hours

    Slope

    San Francisco, CA
    5 days ago
  • Acceler8 Talent is looking for a Senior Distributed Systems Engineer with over 7 years of experience in software engineering. This hybrid position in San Francisco focuses on building systems for AI-powered clinical environments, impacting patient care directly. The role... 

    Acceler8 Talent

    San Francisco, CA
    5 days ago
  • $120k - $170k

     ...America's manufacturing base. Our AI-powered robots automate food prep and...  ...About the Role Chef is seeking Systems Support Engineers to serve as a vital link between our customers...  ...experience, and relevant education or training. In addition to salary and early-... 
    Training
    Work at office
    Flexible hours
    Night shift

    Chef Robotics

    San Francisco, CA
    1 day ago
  • $220k - $320k

     ...Help us build the systems that train specialized AI models for the fastest-growing companies...  ...funded ten-person team of engineers who work in-person in...  ...latest techniques in SFT, RL, and model optimization to...  ...models Experience with distributed training at scale... 
    Training
    Work at office

    Inference

    San Francisco, CA
    4 days ago
  • $200k - $280k

     ...inference (algorithms, architectures, engines) and post-training / RL systems. We build and operate the systems...  ..., or similar), GPU performance, distributed serving. RL-first profile: RL /...  ...engineering. About Together AI Together AI is a research-driven... 
    Training
    Full time

    Together AI

    San Francisco, CA
    5 days ago
  •  ...enables anyone to create, train, and deploy them. We...  ...pair it with the full RL post-training stack: environments...  .... As a Research Engineer in our Reasoning team,...  ...utilization of AI inference workloads by...  ...synthetic data generation and distributed RL frameworks. Publish... 
    Training
    Remote work
    Worldwide
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime-Intellect

    San Francisco, CA
    4 days ago
  • B Capital is seeking a skilled software engineer in San Francisco to develop foundational AI systems. You will work on shared services and improve operational...  ...development, experience with APIs, and familiarity with distributed systems. This role offers top-tier compensation,... 

    B Capital

    San Francisco, CA
    2 days ago
  • MakerMaker.AI is seeking a Senior ML Engineer in San Francisco. In this role, you will build and maintain machine learning systems and pipelines for research purposes, ensuring accurate and...  ...and owning the data pipelines for training and evaluation. If you have 6+ years... 
    Training

    MakerMaker.AI

    San Francisco, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Training Systems Engineer: Distributed & RL. Be the first to apply!