Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Machine Learning Engineer, Distributed Data Systems

OpenAI

About the Team The Sora team is pioneering multimodal capabilities for OpenAI’s foundation models. We’re a hybrid research and product team focused on integrating multimodal functionalities into our AI products, ensuring they are reliable, user-friendly, and aligned with our mission of broad societal benefit. About the Role As a Research Engineer, Distributed Data Systems, you will design and scale the infrastructure that powers large-scale multimodal training and evaluation at OpenAI. You’ll manage distributed data pipelines, collaborate closely with researchers to translate requirements into robust systems, and harden pipelines that serve as the backbone for Sora’s rapid iteration cycles. We’re looking for engineers who are detail-oriented, have strong experience with distributed systems, and excel at building reliable infrastructure in high-stakes environments. This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees. In this role, you will: Design, build, and maintain data infrastructure systems such as distributed compute, data orchestration, distributed storage, streaming infrastructure, machine learning infrastructure while ensuring scalability, reliability, and security. Ensure our data platform can scale by orders of magnitude while remaining reliable and efficient. Partner with researchers to deeply understand requirements and translate them into production-ready systems. Harden, optimize, and maintain critical data infrastructure systems that power multimodal training and evaluation. You might thrive in this role if you: Have strong experience with distributed systems and large-scale infrastructure with a strong interest in data. Are detail-oriented and bring rigor to building and maintaining reliable systems. Demonstrate excellent software engineering fundamentals and organizational skills. Are comfortable with ambiguity and rapid change. About OpenAI OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic. For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement. Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations. To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance. We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link. OpenAI Global Applicant Privacy Policy At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology. #J-18808-Ljbffr OpenAI

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Machine Learning Engineer, Distributed Data Systems in San Francisco, CA vacancy
  •  ...foundational research on Protocol Learning : multi-participant training of...  ...We’re looking for Senior/Staff engineers with 5+ years of experience in distributed systems and ML large‑scale training. You...  ...model‑parallel training strategies (data, tensor, pipeline parallelism)... 
    Suggested
    Remote work
    Visa sponsorship

    Pluralis Research

    San Francisco, CA
    1 day ago
  • $325k - $405k

     ...leading AI research company in San Francisco seeks a Software Engineer for their Data Acquisition team. You'll lead projects in data collection, collaborate with various teams, and develop scalable distributed systems. Candidates should hold a BS/MS/PhD in computer science... 
    Suggested

    OpenAI

    San Francisco, CA
    11 hours ago
  • $255k - $405k

    Slope is seeking a Software Engineer for its team in San Francisco, CA. The role focuses...  .... Responsibilities include managing distributed data pipelines and collaborating closely with...  ...exhibit strong experience in distributed systems and possess excellent organizational... 
    Suggested

    Slope

    San Francisco, CA
    2 days ago
  • A leading AI research company in San Francisco seeks Senior/Staff Engineers skilled in distributed systems and large-scale ML training. Responsibilities include designing systems optimized for low-bandwidth conditions and implementing robust training strategies. Ideal... 
    Suggested
    Remote job

    Pluralis Research

    San Francisco, CA
    1 day ago
  • $245k - $385k

    Dormont Manufacturing Co is seeking a Distributed Systems/ML engineer in San Francisco, CA. You'll improve training throughput for our internal framework and enable researchers to innovate. Strong Python skills are essential. The position offers a hybrid work model, robust... 
    Suggested

    Dormont Manufacturing Co

    San Francisco, CA
    2 days ago
  •  ...technology company in San Francisco is looking for a Senior Software Engineer to build scalable infrastructure for large‑scale training and fine-tuning of foundation models. You will design distributed training systems and optimize GPU utilization while collaborating with cross-... 

    Baseten

    San Francisco, CA
    2 days ago
  • $245k - $385k

     ...our cutting-edge models. We work on distributed model execution as well as the interfaces...  .... About the Role As a Distributed Systems/ML engineer, you will work on improving the...  ...the-art AI models), writing bug‑free machine learning code (surprisingly difficult!), and... 
    Work at office
    Relocation package

    Dormont Manufacturing Co

    San Francisco, CA
    1 day ago
  • Dormont Manufacturing Co is looking for a Software Engineer for their Pre-training Systems team in San Francisco. Your primary role will be to design and maintain the distributed infrastructure that trains long-context models at scale, tackling challenges related to memory... 

    Dormont Manufacturing Co

    San Francisco, CA
    2 days ago
  •  ...is seeking a Staff Software Engineer to enhance ML infrastructure...  ...involves designing scalable systems, mentoring engineers, and collaborating...  ...of experience in building distributed systems, strong skills in...  ..., familiarity with machine learning infrastructure is a plus. This... 

    Tubi Tv

    San Francisco, CA
    11 hours ago
  • $166k - $225k

     ...passionate about enabling data teams to solve the...  .... Founded by engineers — and customer...  ...millions of virtual machines. And we're only...  ...methods such as machine learning that go well...  ...the next generation distributed data storage and processing systems that can outperform... 
    Local area
    Worldwide

    Databricks Inc.

    San Francisco, CA
    11 hours ago
  • $255k - $405k

     ...life and disability coverage Annual learning and development stipend to fuel your...  .... About the Role As a Software Engineer, Distributed Data Systems, you will design and scale the infrastructure...  ...storage, streaming infrastructure, machine learning infrastructure while... 
    Full time
    Work at office
    Local area
    Relocation package
    Flexible hours

    Slope

    San Francisco, CA
    2 days ago
  • A pioneering AI firm based in San Francisco is seeking a Research Engineer, Distributed Data Systems. In this role, you will design and maintain infrastructure for large-scale multimodal training, ensuring scalability and reliability of data systems. Candidates should... 
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    3 days ago
  •  ...Manufacturing Co is seeking a Staff Machine-Learning Infrastructure Engineer to drive the development of our...  ...workplace safety. You will manage data pipelines and build large-scale training...  ...experience with a strong focus on distributed systems and machine learning technologies.... 

    Dormont Manufacturing Company

    San Francisco, CA
    11 hours ago
  • $200.8k - $251k

     ...company in San Francisco seeks a team member to build and optimize a machine learning framework for large language models. Candidates should have system optimization experience and solid software engineering skills, particularly in tools like CUDA and Pytorch. This full-... 
    Full time

    Scale AI

    San Francisco, CA
    1 day ago
  • $279.2k - $390.9k

     ...infrastructure that powers machine learning driven recommendations. We design and maintain systems for ML data ingestion, low-latency retrieval...  ...ML Indexing & Retrieval engine, integrating capabilities...  ...excellence in large-scale distributed systems. Mentor and guide engineers... 
    For contractors
    Work experience placement
    Flexible hours

    Tensec

    San Francisco, CA
    2 days ago
  •  ...have a lasting impact. Learn more at . At EvenUp,...  ...to the legal system. Tackling the most complex...  ...requires expertise in data quality, robust model...  ...an experienced Staff Machine Learning Engineer eager to join EvenUp'...  ...engineering skills (Python, distributed computing, APIs).... 
    Full time
    Temporary work
    Local area
    Home office
    Flexible hours

    B Capital

    San Francisco, CA
    2 days ago
  • $90k

    Distributed Systems Software Engineer, Python / Go Join to apply for the Distributed Systems Software Engineer...  ...bring deep engineering insights and a data driven approach to test automation,...  ...remotely since 2004! Personal learning and development budget of USD 2,000... 
    Full time
    Freelance
    Internship
    Local area
    Remote work
    Worldwide

    Canonical

    San Francisco, CA
    25 days ago
  •  ...ML researchers, and engineers to work together to move...  ...that can be learned, predicted, and designed...  ...massive compute, massive data, and massive ambition...  ...architecture to deployment on distributed infrastructure. We...  ...the intersection of machine learning systems architecture and... 
    Work at office

    Achira

    San Francisco, CA
    1 day ago
  • $146.5k - $228k

     ...attitude. About the team: The ML Data Engineering team powers metadata extraction,...  ...of users worldwide. Our systems operate at massive scale, supporting...  ...We work at the intersection of machine learning, data engineering, and distributed systems, collaborating closely with... 
    Temporary work
    Local area
    Worldwide
    Home office
    Flexible hours

    Scribd, Inc.

    San Francisco, CA
    3 days ago
  • $250.8k - $286.2k

    Capital One is seeking a Senior Lead Software Engineer specializing in distributed systems. This role involves leading technology projects and developing solutions to enhance financial empowerment for millions of Americans. The candidate should have extensive experience... 

    Information Technology Senior Management Forum

    San Francisco, CA
    4 days ago
  • $229.9k - $262.4k

    Senior Lead Software Engineer, Distributed Systems (Golang + Python on Kubernetes) Do you love building...  ...who are passionate about marrying data with emerging technologies. As a Capital...  ...within Capital One. The Machine Learning Experience Team (MLX Tech) is committed... 
    Full time
    Part time
    Internship
    Local area

    Capital One National Association

    San Francisco, CA
    1 day ago
  • $175k - $225k

    A cutting-edge technology firm is looking for a Senior Backend Engineer to design distributed systems for running AI agents. This role involves managing core data infrastructure and ensuring scalable solutions. The ideal candidate has 4+ years of backend engineering experience... 

    LangChain

    San Francisco, CA
    12 days ago
  • An innovative company is seeking a Distributed Systems/ML Engineer to enhance the training throughput of its internal framework. This role involves collaborating with researchers to develop efficient video models and applying cutting-edge techniques to optimize training... 

    OpenAI

    San Francisco, CA
    2 days ago
  • Gravity Engineering Services Pvt Ltd. in San Francisco seeks a Machine Learning Engineer to enhance our data processing and generation systems. The ideal candidate has strong Python skills and solid machine learning fundamentals, ideally with 3+ years of relevant experience... 

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    1 day ago
  •  ...States Digital Space LLC is looking for an experienced engineer to build and operate large-scale data systems in San Francisco. This role focuses on creating...  ...environment. The ideal candidate has expertise in distributed systems and strong programming skills. Competitive... 

    United States Digital Space LLC

    San Francisco, CA
    1 day ago
  •  ...is seeking an experienced Software Engineer to develop machine learning infrastructure for monetization and ads systems. The role involves building data pipelines, creating training platforms...  ...engineering, particularly in distributed systems and ML workflows. Join us in... 

    AI Chopping Block, Inc.

    San Francisco, CA
    4 days ago
  • United States Digital Space LLC is seeking a Machine Learning Expert to enhance customer service through cutting-edge AI technologies...  .... The role involves developing innovative ML systems, collaborating with engineering and product teams, and significantly contributing to... 

    United States Digital Space LLC

    San Francisco, CA
    1 day ago
  •  ...problems in AI, enhancing robot functionalities through extensive experience in building distributed applications and data pipelines. The ideal candidate will design scalable systems, write essential business logic, and utilize modern ML techniques. We seek individuals... 

    Generalist

    San Francisco, CA
    3 days ago
  • Acceler8 Talent is looking for a Senior Distributed Systems Engineer with over 7 years of experience in software engineering. This hybrid position in San Francisco focuses on building systems for AI-powered clinical environments, impacting patient care directly. The role... 

    Acceler8 Talent

    San Francisco, CA
    11 hours ago
  • B Capital in San Francisco is looking for an engineering professional to architect and optimize core training infrastructure for their AI models. You will work on distributed systems and large-scale data pipelines, focusing on performance and numerical stability. Successful... 

    B Capital

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning Engineer, Distributed Data Systems. Be the first to apply!