Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Machine Learning Engineer, Distributed Data Systems - Robotics

OpenAI

About the Team

The OpenAI Robotics team is focused on unlocking general-purpose robotics and pushing towards AGI-level intelligence in dynamic, real-world settings. Working across the entire model stack, we integrate cutting-edge hardware and software to explore a broad range of robotic form factors. We strive to seamlessly blend high-level AI capabilities with the constraints of physical systems to improve peoples' lives.

About the Role

As a Research Engineer, Distributed Data Systems, you will design and scale the infrastructure that powers large-scale multimodal training and evaluation at OpenAI. You'll manage distributed data pipelines, collaborate closely with researchers to translate requirements into robust systems, and harden pipelines that serve as the backbone for OpenAI's rapid iteration cycles.

We're looking for engineers who are detail-oriented, have strong experience with distributed systems, and excel at building reliable infrastructure in high-stakes environments.

This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:
  • Design, build, and maintain data infrastructure systems such as distributed compute, data orchestration, distributed storage, streaming infrastructure, machine learning infrastructure while ensuring scalability, reliability, and security.
  • Ensure our data platform can scale by orders of magnitude while remaining reliable and efficient.
  • Partner with researchers to deeply understand requirements and translate them into production-ready systems.
  • Harden, optimize, and maintain critical data infrastructure systems that power multimodal training and evaluation.
You might thrive in this role if you:
  • Have strong experience with distributed systems and large-scale infrastructure with a strong interest in data.
  • Are detail-oriented and bring rigor to building and maintaining reliable systems.
  • Demonstrate excellent software engineering fundamentals and organizational skills.
  • Are comfortable with ambiguity and rapid change.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.


We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.


For additional information, please see OpenAI's Affirmative Action and Equal Employment Opportunity Policy Statement.

Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Machine Learning Engineer, Distributed Data Systems - Robotics in United States vacancy
  •  ...As a Research Engineer, Distributed Data Systems, you will design and scale the infrastructure that powers large-scale multimodal training and evaluation at OpenAI. You’ll manage distributed data pipelines, collaborate closely with researchers to translate requirements... 
    Suggested

    OpenAI

    San Francisco, CA
    5 days ago
  • A leading robotics company in Palo Alto seeks a Staff/Principal ML Systems Engineer to enhance training performance for their innovative humanoid robots. You will optimize distributed training systems and engage closely with researchers to transform model changes into scalable... 
    Suggested

    Rhoda AI

    Palo Alto, CA
    3 days ago
  • An AI and Robotics firm in San Francisco seeks a Staff/Principal ML Systems Engineer to enhance training performance for multimodal robotic data. You will lead efforts to improve end-to-end training...  ...significant experience in distributed training, a strong background... 
    Suggested

    Maxwell Bond

    San Francisco, CA
    2 days ago
  • $255k - $405k

     ...life and disability coverage Annual learning and development stipend to fuel your...  .... About the Role As a Software Engineer, Distributed Data Systems, you will design and scale the infrastructure...  ...storage, streaming infrastructure, machine learning infrastructure while... 
    Suggested
    Full time
    Work at office
    Local area
    Relocation package
    Flexible hours

    Slope

    San Francisco, CA
    5 days ago
  • $175k - $250k

     ...Senior Machine Learning Engineer (ML Infrastructure & Data Systems) Our client is an early-stage robotics and AI company building autonomous systems that operate in real-world...  ...end-to-end ML infrastructure, including distributed training, experiment tracking, and compute... 
    Suggested

    Right Hand Talent

    Brooklyn, NY
    2 days ago
  •  ...Overview We’re looking for a Machine Learning Systems Engineer to strengthen the performance and scalability of our distributed training infrastructure. In this role, you'll work closely with researchers to streamline the development and execution of large-scale training... 
    Remote work

    Susquehanna International Group

    United States
    2 days ago
  •  ...foundational research on Protocol Learning: multi-participant training of...  ...We’re looking for Senior/Staff engineers with 5+ years of experience in distributed systems and ML large‑scale training. You...  ...model‑parallel training strategies (data, tensor, pipeline parallelism)... 
    Remote work
    Visa sponsorship

    Pluralis Research

    California, MO
    5 days ago
  •  ...generation of humanoid robots — from high-...  ...intersection of large-scale learning, robotics, and systems, with a research...  ...Principal ML Systems Engineer to own training performance...  ...multimodal robotic data (vision,...  ...measurable gains in: Distributed efficiency (overlap,... 

    Rhoda AI

    Palo Alto, CA
    4 days ago
  •  ...embodied AI meets real robots, real sensors,...  ..., field-ready AI systems that solve the...  ...go beyond typical data-driven approaches...  ...rigorous engineering with learning systems proven in...  ...architect and build the distributed infrastructure that...  ...large-scale machine learning workflows... 
    Local area

    FieldAI

    Irvine, CA
    5 days ago
  • $181.1k - $318.4k

     ...Machine Learning Engineer Computer Vision & Data Systems At Apple, we are dedicated to creating technologies that enrich people's lives. Our teams develop...  ...building or optimizing large-scale data pipelines (e.g., distributed ETL, dataset generation, annotation workflows,... 
    Relocation

    Apple

    Sunnyvale, CA
    2 days ago
  • ML Systems Engineer - Robotics & AI We are building the full-stack foundation for...  ...intersection of large-scale learning, robotics, and systems, with...  ...on multimodal robotic data including vision, proprioception...  ...Drive measurable gains in distributed efficiency, compute efficiency... 

    Maxwell Bond

    San Francisco, CA
    2 days ago
  • A leading AI research company in San Francisco seeks Senior/Staff Engineers skilled in distributed systems and large-scale ML training. Responsibilities include designing systems optimized for low-bandwidth conditions and implementing robust training strategies. Ideal... 
    Remote work

    Pluralis Research

    San Francisco, CA
    5 days ago
  • $192k - $260k

    A leading cloud computing company is seeking a Staff Software Engineer - Distributed Data Systems in Mountain View, California. The role focuses on developing cutting-edge data storage and processing systems, with the potential for significant customer impact. Candidates... 

    Databricks Inc.

    Mountain View, CA
    2 days ago
  • A mission-driven technology company in California is seeking experienced Senior/Staff Engineers proficient in building distributed ML systems. Applicants should possess strong experience in optimizing large-scale training under low-bandwidth conditions, with expertise in... 
    Remote work

    Pluralis Research

    California, MO
    5 days ago
  •  ...been doing cutting edge engineering work for Silicon...  ...dynamic and continuous learning environment, with highlycooperative...  ...and test software systems used in...  ...focusing on large-scale distributed cloud-based systems Hands...  ...foundation (algorithms, data structures, database,... 
    Remote work

    Veganetworks

    New York, NY
    5 days ago
  • $255k - $405k

    Slope is seeking a Software Engineer for its team in San Francisco, CA. The role focuses...  .... Responsibilities include managing distributed data pipelines and collaborating closely with...  ...exhibit strong experience in distributed systems and possess excellent organizational... 

    Slope

    San Francisco, CA
    5 days ago
  • DATA & CONTROL SYSTEMS ENGINEER (STARSHIP LAUNCH PAD) SpaceX was founded under the belief that a future...  ...experience Experience with electrical power distribution, working knowledge of NFPA 70 and...  ...from the U.S. Department of State. Learn more about the ITAR here. SpaceX is... 
    Permanent employment
    Internship
    Weekend work

    SPACE EXPLORATION TECHNOLOGIES CORP

    Florida, NY
    5 days ago
  • $157.7k - $213.8k

    A leading data and AI company is seeking a Software Engineer to join their Runtime team in Bellevue, United States. This role involves building next-generation distributed data storage and processing systems, requiring 5+ years of experience in Java, Scala, or C++. Candidates... 

    Menlo Ventures

    Bellevue, WA
    3 days ago
  • A leading tech company based in San Francisco is seeking a Software Engineer to enhance its data and AI platform. The role involves developing high-performance distributed data systems and delivering on ambitious projects such as Delta Lake and performance engineering.... 

    Databricks Inc.

    San Francisco, CA
    4 days ago
  • $182.4k - $247k

    A leading data and AI company in Bellevue is seeking a Software Engineer to join their Runtime team. You will build the next generation distributed data storage and processing systems that outperform traditional SQL query engines. Ideal candidates will have 8+ years of... 

    Menlo Ventures

    Bellevue, WA
    2 days ago
  • A leading data and AI company is seeking a Senior Software Engineer to join their team in Bellevue, Washington. You will work on building next-generation distributed data storage and processing systems that exceed traditional SQL performance. The ideal candidate will have... 

    Databricks Inc.

    Bellevue, WA
    3 days ago
  • Voiceflow is seeking a Software Engineer (Distributed Systems) in San Francisco. As a founding engineer, you will focus on building a real-time database...  ...processing, and prefers working in-person. Join us in shaping the future of data replication! #J-18808-Ljbffr Voiceflow

    Voiceflow

    San Francisco, CA
    4 days ago
  • A leading technology company in San Diego is looking for a Senior Software Engineer specializing in distributed systems. You will develop and maintain a device telemetry platform, working closely with a collaborative engineering team. Applicants should have significant... 

    Apple Inc.

    San Diego, CA
    5 days ago
  • $325k - $405k

     ...leading AI research company in San Francisco seeks a Software Engineer for their Data Acquisition team. You'll lead projects in data collection, collaborate with various teams, and develop scalable distributed systems. Candidates should hold a BS/MS/PhD in computer science... 

    OpenAI

    San Francisco, CA
    3 days ago
  •  ...Description SAIC is seeking a SIGINT Ground Data Distribution Systems Engineer to support the LANDMARK AOS Prime SETA program in Chantilly, VA. LANDMARK AOS supports the NRO's Ground Enterprise Directorate (GED) and delivers high-impact engineering expertise across... 
    Shift work

    SAIC

    Chantilly, Loudoun County, VA
    1 day ago
  •  ...technology company in San Francisco is looking for a Senior Software Engineer to build scalable infrastructure for large‑scale training and fine-tuning of foundation models. You will design distributed training systems and optimize GPU utilization while collaborating with cross-... 

    Baseten

    San Francisco, CA
    5 days ago
  • A robotics innovation company focused on home robotics seeks a Software Engineer to develop machine learning infrastructure. You will own training systems, optimize data pipelines, and work on real-time robot control...  ..., with experience in distributed systems, machine... 

    Sunday

    Mountain View, CA
    2 days ago
  •  ...technology company in the United States is seeking a Software Engineer II to join their innovative team. This fully remote position requires hands-on experience with large-scale distributed database systems and proficiency in languages like C++, Java, or C#. You will work... 
    Remote work

    Buoyant Inc

    United States
    5 days ago
  • $124.9k - $228.9k

     ...background, to work on large-scale distributed systems coordinating thousands of...  ...in cloud and physical data centers around the world,...  ...petabyte-scale data challenges, machine learning, advanced visualizations,...  ...you’ll do: As a Senior Engineer on this team, you will lead... 
    Full time
    Temporary work
    Local area

    The Trade Desk

    Bellevue, WA
    5 days ago
  • Databricks Inc. is seeking an experienced Software Engineer to join their Runtime team in Bellevue, Washington. In this role, you will develop next-generation distributed data storage and processing systems that enhance performance beyond traditional SQL engines. Ideal... 

    Databricks Inc.

    Bellevue, WA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning Engineer, Distributed Data Systems - Robotics. Be the first to apply!