ML Systems Engineer - Distributed Training for Robotics

Maxwell Bond

An AI and Robotics firm in San Francisco seeks a Staff/Principal ML Systems Engineer to enhance training performance for multimodal robotic data. You will lead efforts to improve end-to-end training efficiency and collaborate with a team dedicated to cutting-edge robotics research. Ideal candidates will have significant experience in distributed training, a strong background in PyTorch, and the ability to work in a startup environment with high ownership. The role offers a unique opportunity to directly impact research cycles and robotic capabilities.

Vacancy posted 1 day ago
Similar jobs that could be interesting for you

Based on the ML Systems Engineer - Distributed Training for Robotics in San Francisco, CA vacancy
  • A leading AI research company in San Francisco seeks Senior/Staff Engineers skilled in distributed systems and large-scale ML training. Responsibilities include designing systems optimized for low-bandwidth conditions and implementing robust training strategies. Ideal... 
    Training
    Remote work

    Pluralis Research

    San Francisco, CA
    1 day ago
  •  ...looking for a Senior Software Engineer to build scalable infrastructure for large‑scale training and fine-tuning of foundation models. You will design distributed training systems and optimize GPU utilization...  ...over 5 years of experience in ML infrastructure and a strong... 
    Training

    BaseTen

    San Francisco, CA
    8 hours ago
  •  ...ML Systems Engineer – Robotics & AI We are building the full-stack foundation for the next generation...  ...environments and handling scenarios unseen in training. We work at the intersection of...  ...counts. Drive measurable gains in distributed efficiency, compute efficiency, and... 
    Training

    Maxwell Bond

    San Francisco, CA
    1 day ago
  • Genesis AI in San Francisco is looking for an experienced professional to optimize and build distributed training systems using PyTorch. The ideal candidate has over 8 years of experience in distributed systems, high-performance computing, and extensive expertise in Python... 
    Training

    Genesis AI

    San Francisco, CA
    1 day ago
  • $295k

     ...About the Team The OpenAI Robotics team is focused on...  ...the constraints of physical systems to improve people's lives....  ...the Role As a Research Engineer, Distributed Data Systems, you will design...  ...powers large-scale multimodal training and evaluation at OpenAI.... 
    Training
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    3 days ago
  • $295k - $380k

     ...OpenAI is searching for a Senior Software Engineer to join their Robotics team in San Francisco. The role focuses on maintaining and improving the training framework while actively reviewing and debugging code within ML systems. The ideal candidate should thrive in hands... 
    Training

    OpenAI

    San Francisco, CA
    1 day ago
  • $218.4k - $273k

     ...solving the data bottleneck across Robotics, Autonomous Vehicles, and Computer...  ...in Physical AI and developing ML pipelines for processing, training, and fine-tuning on data collected...  ...for Physical AI. The Role As an ML Systems Engineer on the Physical AI team, you will... 
    Training
    Full time

    Scale AI

    San Francisco, CA
    3 days ago
  •  ...research on Protocol Learning: multi-participant training of foundation models where no single participant...  ...economics. We’re looking for Senior/Staff engineers with 5+ years of experience in distributed systems and large‑scale ML training. You’ll be implementing a novel... 
    Training
    Remote work
    Visa sponsorship

    Pluralis Research

    San Francisco, CA
    11 days ago
  •  ...An innovative company is seeking a Distributed Systems/ML Engineer to enhance the training throughput of its internal framework. This role involves collaborating with researchers to develop efficient video models and applying cutting-edge techniques to optimize training... 
    Training

    OpenAI

    San Francisco, CA
    8 hours ago
  •  ...As a Research Engineer, Distributed Data Systems, you will design and scale the infrastructure that powers large-scale multimodal training and evaluation at OpenAI. You’ll manage distributed data pipelines, collaborate closely with researchers to translate requirements... 
    Training

    OpenAI

    San Francisco, CA
    1 day ago
  • $255k - $405k

     ...with our mission of broad societal benefit. About the Role As a Software Engineer, Distributed Data Systems, you will design and scale the infrastructure that powers large‑scale multimodal training and evaluation at OpenAI. You’ll manage distributed data pipelines,... 
    Training
    Full time
    Work at office
    Local area
    Relocation package
    Flexible hours

    Slope

    San Francisco, CA
    8 hours ago
  •  ...infrastructure for robots operating in the...  ...need to be improved, engineers rely on data to...  ...robotics and autonomous systems teams to ingest,...  ...for an applied ML engineer with deep...  ...throughput to standing up training and eval workflows...  ...fundamentals: distributed systems, cloud... 
    Training
    Remote work

    Foxglove

    San Francisco, CA
    11 days ago
  • About Humble Robotics Working at Humble Robotics...  ...believe culture can be engineered - but when it falls...  ...’re looking for an ML engineer to design, train, and ship the vision...  ...training infrastructure (distributed training, data...  ...into production-grade systems Collaborate... 
    Training
    Local area

    Humble Robotics

    San Francisco, CA
    4 days ago
  • About Humble Robotics Working at Humble Robotics...  ...culture can be engineered - but when it falls...  ...re looking for an ML infrastructure...  ...the foundational systems we need to realize...  ...every stage of the ML training flywheel and be an...  ...Design and scale distributed ML training on our... 
    Training
    Local area

    Humble Robotics

    San Francisco, CA
    1 day ago
  •  ...ultimately become the perception engine for a company’s physical...  ...world of physical AI and robotics. We are a small, fast...  ...models to our world-class distributed perception system Building and scaling a production...  ..., labelling, and model re-training platform Driving the... 
    Training

    Specter

    San Francisco, CA
    4 days ago
  •  ...MakerMaker.AI is seeking a Senior ML Engineer in San Francisco. In this role, you will build and maintain machine learning systems and pipelines for research purposes, ensuring accurate...  ...and owning the data pipelines for training and evaluation. If you have 6+ years of experience... 
    Training

    MakerMaker.AI

    San Francisco, CA
    7 hours ago
  • $300k - $405k

     ...A leading AI research company in New York seeks a Machine Learning Systems Engineer to build cutting-edge systems for training AI models. This role involves developing critical algorithms, improving system performance, and collaborating with a dynamic research team. Ideal... 
    Training
    Work at office

    Menlo Ventures

    San Francisco, CA
    8 hours ago
  •  ...seeking an experienced Software Engineer to develop machine learning...  ...for monetization and ads systems. The role involves building data pipelines, creating training platforms, and...  ...engineering, particularly in distributed systems and ML workflows. Join us in shaping... 
    Training

    AI Chopping Block, Inc.

    San Francisco, CA
    8 hours ago
  • $140k - $185k

     ...WHO WE ARE Built Robotics’ mission is to build the robots...  ...develop edge machine learning systems to improve the...  ...construction robots Build scalable ML infrastructure for model training, validation, deployment,...  ...Experience building distributed ML training systems and... 
    Training
    Local area
    Flexible hours

    Built Robotics

    San Francisco, CA
    7 hours ago
  • $255k - $405k

    Slope is seeking a Software Engineer for its team in San Francisco, CA. The...  ...for large-scale multimodal training. Responsibilities include managing distributed data pipelines and collaborating...  ...strong experience in distributed systems and possess excellent organizational... 
    Training

    Slope

    San Francisco, CA
    1 day ago
  •  ...interpretable, and steerable AI systems. We want AI to be safe...  ...committed researchers, engineers, policy experts, and...  ...-edge systems that train AI models like Claude....  ...and steerable AI. As an ML Systems Engineer on our...  ..., large scale distributed systems Large scale... 
    Training
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    2 days ago
  •  ...A leading AI research organization in San Francisco seeks an Infrastructure Engineer to design and maintain large distributed ML training and inference clusters. The ideal candidate will have a strong grasp of optimizing training workloads and experience with distributed... 
    Training

    Causal Labs

    San Francisco, CA
    3 days ago
  •  ...streaming service is seeking a Staff Software Engineer to enhance ML infrastructure. The role involves designing scalable systems, mentoring engineers, and collaborating...  ...have over 8 years of experience in building distributed systems, strong skills in AWS, and knowledge... 

    Tubi TV

    San Francisco, CA
    8 hours ago
  • $200k - $350k

     ...A technology-focused company in San Francisco seeks candidates for a role specializing in robotic control systems. You will train whole-body policies, build simulation environments, and run GPU training experiments. Ideal candidates should have strong coding skills in... 
    Training

    Pantera Capital

    San Francisco, CA
    8 hours ago
  •  ...A pioneering AI firm based in San Francisco is seeking a Research Engineer, Distributed Data Systems. In this role, you will design and maintain infrastructure for large-scale multimodal training, ensuring scalability and reliability of data systems. Candidates should... 
    Training
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    7 hours ago
  •  ...infrastructure for robots operating in the...  ...need to be improved, engineers rely on data to...  ...robotics and autonomous systems teams to ingest,...  ...'re looking for an ML Platform Engineer...  ...orchestration to training infrastructure and...  ...foundation in distributed systems and cloud... 
    Training
    Remote work

    Foxglove Technologies, Inc

    San Francisco, CA
    4 days ago
  • $189.6k - $237k

     ...Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering...  ...to optimize our ML system Ideally you'd have:...  ...systems Strong software engineering skills, proficient in frameworks... 
    Training
    Full time

    Scale AI

    San Francisco, CA
    3 days ago
  •  ...Technical Staff to focus on cutting-edge AI research and development. The role involves building and scaling training and inference infrastructure, designing ML kernels, and optimizing performance. Ideal candidates should have a passion for addressing ambitious challenges... 
    Training

    Mirendil

    San Francisco, CA
    1 day ago
  • Modal Labs is seeking strong engineers to train production machine learning models and contribute to open-source projects. Candidates should have experience with high-performance code and ML training optimization, working in our NYC or San Francisco offices. Ideal applicants... 
    Training

    Modal Labs

    San Francisco, CA
    4 days ago
  •  ...safer and more secure. The AI Engineering Team is chartered with...  ...Language Models (LLMs) and agentic systems. Our mission is to build robust...  ...Role As a Senior or Staff ML Systems Engineer - LLM, you’...  ...reusable CI/CD workflows for model training, evaluation, and deployment —... 
    Training
    Remote work
    Worldwide

    TRM Labs

    San Francisco, CA
    1 day ago
