Machine Learning Engineer, Distributed Data Systems - Robotics
OpenAI
About the Team
The OpenAI Robotics team is focused on unlocking general-purpose robotics and pushing towards AGI-level intelligence in dynamic, real-world settings. Working across the entire model stack, we integrate cutting-edge hardware and software to explore a broad range of robotic form factors. We strive to seamlessly blend high-level AI capabilities with the constraints of physical systems to improve people's lives.

About the Role
As a Research Engineer, Distributed Data Systems, you will design and scale the infrastructure that powers large-scale multimodal training and evaluation at OpenAI. You'll manage distributed data pipelines, collaborate closely with researchers to translate requirements into robust systems, and harden pipelines that serve as the backbone for OpenAI's rapid iteration cycles.

We're looking for engineers who are detail-oriented, have strong experience with distributed systems, and excel at building reliable infrastructure in high-stakes environments.

This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:
- Design, build, and maintain data infrastructure systems such as distributed compute, data orchestration, distributed storage, streaming infrastructure, and machine learning infrastructure, while ensuring scalability, reliability, and security.
- Ensure our data platform can scale by orders of magnitude while remaining reliable and efficient.
- Partner with researchers to deeply understand requirements and translate them into production-ready systems.
- Harden, optimize, and maintain critical data infrastructure systems that power multimodal training and evaluation.

You might thrive in this role if you:
- Have strong experience with distributed systems and large-scale infrastructure, along with a strong interest in data.
- Are detail-oriented and bring rigor to building and maintaining reliable systems.
- Demonstrate excellent software engineering fundamentals and organizational skills.
- Are comfortable with ambiguity and rapid change.
We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.
For additional information, please see OpenAI's Affirmative Action and Equal Employment Opportunity Policy Statement.

Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
Vacancy posted 3 days ago

