Data Scientist, Infrastructure

$230k - $385k

OpenAI

About the Team

Our infrastructure team helps deliver OpenAI's most capable models and products to the world by scaling infrastructure and turning demand into useful FLOPS. We collaborate across research, engineering, design, and business to turn cutting-edge AI advancements into impactful, real-world applications. Our team ensures the right compute is available, at the right time and place, to support some of the world's most demanding workloads. We empower all of OpenAI's products and research by scaling the infrastructure behind them. Our work makes it possible to launch new models and products reliably and at scale.

About the Role

As a Data Scientist on the Infra team, you will play a key role in shaping how we scale the infrastructure that powers OpenAI's products and research. This is critical as we operate one of the largest and most advanced compute fleets in the world, supporting millions of users and businesses globally. We focus on aligning infrastructure measurement, planning, scaling, allocation, and efficiency to drive measurable impact across the company.

You should expect to guide the definition of foundational datasets for infrastructure resources, develop metrics that inform key decisions, build forecasting and optimization models, and establish source-of-truth dashboards and analyses that enable teams to understand and improve infra usage. Most importantly, you should expect to be a core partner to engineering, research, and product teams in shaping the infrastructure that powers everything OpenAI builds.

This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:

  • Build and maintain foundational datasets and metrics that reflect infrastructure usage, efficiency, and scaling.

  • Develop forecasting and optimization models to support infra planning and resource allocation.

  • Partner with engineering, research, and product teams to shape infrastructure strategy through data.

  • Drive clarity with source-of-truth dashboards and analyses that guide infra decisions across OpenAI.

You might thrive in this role if you have:

  • 5+ years of experience in a quantitative role navigating ambiguous environments, ideally in infrastructure, systems, or platform domains at a high-growth company or research org

  • Experience defining and operationalizing metrics that reflect system performance, resource usage, or efficiency from the ground up

  • A strong foundation in SQL and Python, and a track record of building models and analyses that drive technical and strategic decisions

  • Excellent communication skills and the ability to partner effectively with engineers, researchers, and product stakeholders

  • A strategic mindset that goes beyond statistical testing to surface actionable insights and long-term tradeoffs

You could be an especially great fit if you have:

  • A proven track record of operating as a data partner in large-scale backend systems

  • Comfort navigating fast-paced execution while also anchoring decisions in long-term impact

  • A strong programming background, with the ability to run simulations and prototype variants

  • Experience in NLP, large language models, or generative AI

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

For additional information, please see OpenAI's Affirmative Action and Equal Employment Opportunity Policy Statement.

Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Compensation Range: $230K - $385K

Vacancy posted 3 days ago