Machine Learning Engineer, Distributed Data Systems
OpenAI
About the Team

The Sora team is pioneering multimodal capabilities for OpenAI’s foundation models. We’re a hybrid research and product team focused on integrating multimodal functionalities into our AI products, ensuring they are reliable, user-friendly, and aligned with our mission of broad societal benefit.

About the Role

As a Research Engineer, Distributed Data Systems, you will design and scale the infrastructure that powers large-scale multimodal training and evaluation at OpenAI. You’ll manage distributed data pipelines, collaborate closely with researchers to translate requirements into robust systems, and harden pipelines that serve as the backbone for Sora’s rapid iteration cycles.

We’re looking for engineers who are detail-oriented, have strong experience with distributed systems, and excel at building reliable infrastructure in high-stakes environments.

This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:

- Design, build, and maintain data infrastructure systems such as distributed compute, data orchestration, distributed storage, streaming infrastructure, and machine learning infrastructure, while ensuring scalability, reliability, and security.
- Ensure our data platform can scale by orders of magnitude while remaining reliable and efficient.
- Partner with researchers to deeply understand requirements and translate them into production-ready systems.
- Harden, optimize, and maintain critical data infrastructure systems that power multimodal training and evaluation.

You might thrive in this role if you:

- Have strong experience with distributed systems and large-scale infrastructure, with a strong interest in data.
- Are detail-oriented and bring rigor to building and maintaining reliable systems.
- Demonstrate excellent software engineering fundamentals and organizational skills.
- Are comfortable with ambiguity and rapid change.
About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic. For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement.

Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates.

For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.
To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
