Machine Learning Engineer, Distributed Data Systems
OpenAI
About the Team The Sora team is pioneering multimodal capabilities for OpenAI’s foundation models. We’re a hybrid research and product team focused on integrating multimodal functionalities into our AI products, ensuring they are reliable, user-friendly, and aligned with our mission of broad societal benefit. About the Role As a Research Engineer, Distributed Data Systems, you will design and scale the infrastructure that powers large-scale multimodal training and evaluation at OpenAI. You’ll manage distributed data pipelines, collaborate closely with researchers to translate requirements into robust systems, and harden pipelines that serve as the backbone for Sora’s rapid iteration cycles. We’re looking for engineers who are detail-oriented, have strong experience with distributed systems, and excel at building reliable infrastructure in high-stakes environments. This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees. In this role, you will: Design, build, and maintain data infrastructure systems such as distributed compute, data orchestration, distributed storage, streaming infrastructure, machine learning infrastructure while ensuring scalability, reliability, and security. Ensure our data platform can scale by orders of magnitude while remaining reliable and efficient. Partner with researchers to deeply understand requirements and translate them into production-ready systems. Harden, optimize, and maintain critical data infrastructure systems that power multimodal training and evaluation. You might thrive in this role if you: Have strong experience with distributed systems and large-scale infrastructure with a strong interest in data. Are detail-oriented and bring rigor to building and maintaining reliable systems. Demonstrate excellent software engineering fundamentals and organizational skills. Are comfortable with ambiguity and rapid change. About OpenAI OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic. For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement. Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations. To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance. We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link. OpenAI Global Applicant Privacy Policy At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology. #J-18808-Ljbffr OpenAI
- ...As a Research Engineer, Distributed Data Systems, you will design and scale the infrastructure that powers large-scale multimodal training and evaluation at OpenAI. You’ll manage distributed data pipelines, collaborate closely with researchers to translate requirements...Suggested
$295k
...capabilities with the constraints of physical systems to improve peoples' lives. About the Role As a Research Engineer, Distributed Data Systems, you will design and scale the... ...storage, streaming infrastructure, machine learning infrastructure while ensuring...SuggestedWork at officeRelocation package- ...foundational research on Protocol Learning : multi-participant training of... ...We’re looking for Senior/Staff engineers with 5+ years of experience in distributed systems and ML large‑scale training. You... ...model‑parallel training strategies (data, tensor, pipeline parallelism)...SuggestedRemote workVisa sponsorship
- A leading AI research company in San Francisco seeks Senior/Staff Engineers skilled in distributed systems and large-scale ML training. Responsibilities include designing systems optimized for low-bandwidth conditions and implementing robust training strategies. Ideal...SuggestedRemote work
- A leading tech company based in San Francisco is seeking a Software Engineer to enhance its data and AI platform. The role involves developing high-performance distributed data systems and delivering on ambitious projects such as Delta Lake and performance engineering....Suggested
$255k - $405k
Slope is seeking a Software Engineer for its team in San Francisco, CA. The role focuses... .... Responsibilities include managing distributed data pipelines and collaborating closely with... ...exhibit strong experience in distributed systems and possess excellent organizational...$325k - $405k
...leading AI research company in San Francisco seeks a Software Engineer for their Data Acquisition team. You'll lead projects in data collection, collaborate with various teams, and develop scalable distributed systems. Candidates should hold a BS/MS/PhD in computer science...- Voiceflow is seeking a Software Engineer (Distributed Systems) in San Francisco. As a founding engineer, you will focus on building a real-time database... ...processing, and prefers working in-person. Join us in shaping the future of data replication! #J-18808-Ljbffr Voiceflow
- ...technology company in San Francisco is looking for a Senior Software Engineer to build scalable infrastructure for large‑scale training and fine-tuning of foundation models. You will design distributed training systems and optimize GPU utilization while collaborating with cross-...
- ...About the Role Join a startup building an agentic data lakehouse platform. As a Senior Software Engineer, Distributed Data Systems, you'll work on a greenfield project to build scalable data infrastructure that transforms enterprise data into actionable insights...
- Genesis AI in San Francisco is looking for an experienced professional to optimize and build distributed training systems using PyTorch. The ideal candidate has over 8 years of experience in distributed systems, high-performance computing, and extensive expertise in Python...
- ...in San Francisco seeks a Staff/Principal ML Systems Engineer to enhance training performance for multimodal robotic data. You will lead efforts to improve end-to-end... ...candidates will have significant experience in distributed training, a strong background in PyTorch, and...
- ...is seeking a Staff Software Engineer to enhance ML infrastructure... ...involves designing scalable systems, mentoring engineers, and collaborating... ...of experience in building distributed systems, strong skills in... ..., familiarity with machine learning infrastructure is a plus. This...
$166k - $225k
...passionate about enabling data teams to solve the... .... Founded by engineers — and customer... ...millions of virtual machines. And we're only... ...methods such as machine learning that go well... ...the next generation distributed data storage and processing systems that can outperform...Local areaWorldwide$255k - $405k
...life and disability coverage Annual learning and development stipend to fuel your... .... About the Role As a Software Engineer, Distributed Data Systems, you will design and scale the infrastructure... ...storage, streaming infrastructure, machine learning infrastructure while...Full timeWork at officeLocal areaRelocation packageFlexible hours- ...real-time. Our vision is AI systems that are flexible, personalized... ...serve LLMs at scale and the data pipelines that feed them. One... ...about both. Researchers and ML engineers will hand you workloads that... ...Scale: Design and operate distributed inference systems for LLMs, optimizing...Flexible hours
- ...Francisco, CA, is seeking a Senior Software Engineer (Infrastructure) to lead the design of scalable data and API systems. The role involves architecting real-time data... ...experience in software engineering with a focus on distributed systems. The position offers competitive...
- A pioneering AI firm based in San Francisco is seeking a Research Engineer, Distributed Data Systems. In this role, you will design and maintain infrastructure for large-scale multimodal training, ensuring scalability and reliability of data systems. Candidates should...Work at officeRelocation package
$200k - $300k
...tech startup in San Francisco seeks a Lead Software Engineer to build and optimize foundational backend systems for a massive AI video dataset. You will lead... ...years in backend engineering, strong experience with distributed systems, and is proficient in Go, Python, or Node...- ...interpretable, and steerable AI systems. We want AI to be safe and... ...of committed researchers, engineers, policy experts, and... ...to work at the frontier of machine learning, implementing and improving... ...High performance, large scale distributed systems Large scale LLM...Work at officeVisa sponsorshipFlexible hours
$90k
...Distributed Systems Software Engineer, Python / Go Join to apply for the Distributed Systems Software Engineer... ...bring deep engineering insights and a data driven approach to test automation,... ...remotely since 2004! Personal learning and development budget of USD 2,000...Full timeFreelanceInternshipLocal areaRemote workWorldwide$146.5k
...About the team: The ML Data Engineering team powers metadata extraction,... ...millions of users worldwide. Our systems operate at massive scale, supporting... ...We work at the intersection of machine learning, data engineering, and distributed systems, collaborating closely...For contractorsLocal areaWorldwideHome officeFlexible hours- ...have a lasting impact. Learn more at . At EvenUp,... ...to the legal system. Tackling the most complex... ...requires expertise in data quality, robust model... ...an experienced Staff Machine Learning Engineer eager to join EvenUp'... ...engineering skills (Python, distributed computing, APIs)....Full timeTemporary workLocal areaHome officeFlexible hours
$184.5k - $230.7k
...Join the team as Twilio's next L5 Machine Learning & Data Engineer to lead the design, build, and operation... ...of daily deployments. Own system design reviews, threat modeling, and... ...Go, or C++). ~ Hands-on mastery of distributed data frameworks (Spark/Flink), SQL/NoSQL...Local areaRemote workWorldwide- ...ML researchers, and engineers to work together to move... ...that can be learned, predicted, and designed... ...massive compute, massive data, and massive ambition... ...architecture to deployment on distributed infrastructure. We... ...the intersection of machine learning systems architecture and...Work at office
$229.9k - $262.4k
...Senior Lead Software Engineer, Distributed Systems (Golang + Python on Kubernetes) Do you love building... ...who are passionate about marrying data with emerging technologies. As a... ...transformation within Capital One. The Machine Learning Experience Team (MLX Tech) is...Full timePart timeInternshipLocal area$250k - $334.53k
...Perception team builds the system which learns the spatial-temporal... ...of miles of driving data from a diverse set of sensors, enabling engineers like you to (1)... ...recipes for human and machine labeling of data sets... ...Experience in designing distributed systems processing...Full timeRemote work$146.5k - $228k
...attitude. About the team: The ML Data Engineering team powers metadata extraction,... ...of users worldwide. Our systems operate at massive scale, supporting... ...We work at the intersection of machine learning, data engineering, and distributed systems, collaborating closely with...Temporary workLocal areaWorldwideHome officeFlexible hours- The **Machine Learning Experience Team (MLX Tech)** is committed to pioneering... ...developers with deep experience in distributed microservices, and full stack systems to create solutions that help... ...mentoring other members of the engineering community, and from time to...Full timePart timeInternship
$229.9k - $262.4k
Senior Lead Software Engineer, Distributed Systems (Golang + Python on Kubernetes) Do you love building... ...who are passionate about marrying data with emerging technologies. As a Capital... ...within Capital One. The Machine Learning Experience Team (MLX Tech) is committed...Full timePart timeInternshipLocal area
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Machine Learning Engineer, Distributed Data Systems. Be the first to apply!
- entry level machine learning engineer San Francisco, CA
- senior ml engineer San Francisco, CA
- data scientist machine learning engineer San Francisco, CA
- machine learning ai engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- computer vision machine learning engineer San Francisco, CA
- graduate machine learning engineer San Francisco, CA
- machine learning software engineer San Francisco, CA
- ai ml engineer San Francisco, CA
- machine learning engineer San Francisco, CA

