Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Data Scientist, Infrastructure

$230k - $385k

OpenAI

About the Team

Our infrastructure team helps deliver OpenAI's most capable models and products to the world by scaling infrastructure and turning demand into useful FLOPS. We collaborate across research, engineering, design, and business to turn cutting-edge AI advancements into impactful, real-world applications. Our team ensures the right compute is available-at the right time and place-to support some of the world's most demanding workloads. We empower all of OpenAI's products and research by scaling the infrastructure behind them. Our work makes it possible to launch new models and products reliably and at scale.

About the Role

As a Data Scientist on the Infra team, you will play a key role in shaping how we scale the infrastructure that powers OpenAI's products and research. This is critical as we operate one of the largest and most advanced compute fleets in the world, supporting millions of users and businesses globally. We focus on aligning infrastructure measurement, planning, scaling, allocation, and efficiency to drive measurable impact across the company.

You should expect to guide the definition of foundational datasets for infrastructure resources, develop metrics that inform key decisions, build forecasting and optimization models, and establish source of truth dashboards and analyses that enable teams to understand and improve infra usage. Most importantly, you should expect to be a core partner to engineering, research, and product teams in shaping the infrastructure that powers everything OpenAI builds.

This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:

  • Build and maintain foundational datasets and metrics that reflect infrastructure usage, efficiency, and scaling.

  • Develop forecasting and optimization models to support infra planning and resource allocation.

  • Partner with engineering, research, and product teams to shape infrastructure strategy through data.

  • Drive clarity with source-of-truth dashboards and analyses that guide infra decisions across OpenAI.

You might thrive in this role if you have:

  • 5+ years of experience in a quantitative role navigating ambiguous environments, ideally in infrastructure, systems, or platform domains at a high-growth company or research org

  • Experience defining and operationalizing metrics that reflect system performance, resource usage, or efficiency from the ground up

  • A strong foundation in SQL and Python, and a track record of building models and analyses that drive technical and strategic decisions

  • Excellent communication skills and the ability to partner effectively with engineers, researchers, and product stakeholders

  • A strategic mindset that goes beyond statistical testing to surface actionable insights and long-term tradeoffs

You could be an especially great fit if you have:

  • Proven track record of operating as a data partner in large scale backend systems

  • Comfortable navigating fast-paced execution while also anchoring decisions in long-term impact

  • Strong programming background, with ability to run simulations and prototype variants

  • Experience in NLP, large language models, or generative AI

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

For additional information, please see OpenAI's Affirmative Action and Equal Employment Opportunity Policy Statement.

Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Compensation Range: $230K - $385K

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Data Scientist, Infrastructure in San Francisco, CA vacancy
  •  ...About the Role We are seeking a Data Infrastructure Engineer to build and operate the infrastructure that turns drone, aerial, and orbital sensing data into production datasets, models, and customer-facing insights. This role spans ingestion, processing, storage,... 
    Suggested
    Permanent employment
    Full time

    Matter Intelligence

    San Francisco, CA
    3 days ago
  • $165k - $205k

    Candid Health is looking for a passionate Data Engineer to join our growing data team in San Francisco. You will design, build, and support data infrastructure that meets the needs of our customers. With your expertise in data pipelines and modern architecture, you will... 
    Suggested

    Candid Health

    San Francisco, CA
    11 hours ago
  • A progressive technology company in San Francisco is looking for a Data Infrastructure Engineer to design and operate data and ML infrastructure on AWS. The ideal candidate will have strong software engineering fundamentals and experience building production systems, particularly... 
    Suggested

    Matter Intelligence

    San Francisco, CA
    2 days ago
  • Cartesia is looking for a Software Engineer to build the data infrastructure for its AI models in San Francisco. In this hands-on role, you will design and implement scalable data pipelines for multimodal data, particularly audio. Candidates should have experience with... 
    Suggested
    Work at office

    Cartesia

    San Francisco, CA
    1 day ago
  • Granica, based in San Francisco, is seeking an expert in distributed systems to enhance their data infrastructure. This role involves architecting a global metadata substrate, developing intelligent data layouts, and implementing algorithms for efficient data representation... 
    Suggested
    Flexible hours

    Granica

    San Francisco, CA
    2 days ago
  •  ...Judgment Labs builds infrastructure for Agent Behavior Monitoring (ABM). While traditional observability focuses on logging exceptions and...  ..., and others. The Role: We are looking for a Senior Data Infrastructure Engineer to build and scale the real-time data... 

    Judgment Labs

    San Francisco, CA
    2 days ago
  • $140k - $180k

     ...Data Infrastructure Engineer Alljoined is creating a future where humans are fully understood and augmented by technology. Our work solves the communication bottleneck between humans and computers by decoding thoughts from the brain, entirely non-invasively. We apply... 
    Local area
    Visa sponsorship

    Alljoined

    San Francisco, CA
    1 day ago
  •  ...Data Infrastructure Engineer Los Angeles, Palo Alto, San Francisco, Toronto About HeyGen At HeyGen, our mission is to make visual...  ...productionization infrastructure Collaborate with data scientists and machine learning engineers to understand their... 

    HeyGen

    San Francisco, CA
    4 days ago
  • $257k - $327k

     ...About the Team OpenAI is building the infrastructure foundation for the next generation of AI. The Data Center Engineering team defines the strategy, reference architectures, technical requirements, and delivery standards for the large-scale data centers that support... 
    For contractors
    Work at office

    OpenAI

    San Francisco, CA
    4 days ago
  • A leading AI company in San Francisco is seeking an experienced Data Engineer to work on innovative storage infrastructure and product launches. The role demands a strong command of Python and SQL, with 5+ years experience in production-grade data processing systems. Candidates... 
    Remote job
    Flexible hours

    Cohere

    San Francisco, CA
    11 hours ago
  • Epoch Biodesign in San Francisco is seeking a Senior Data Engineer to architect and build foundational data platform infrastructure for AI operations. This full-time position requires proficiency in Python and systems-level languages, experience with data platforms, and... 
    Full time

    Epoch Biodesign

    San Francisco, CA
    4 days ago
  • Gerra Group in San Francisco is seeking a Senior Software Engineer to build core infrastructure for petabyte-scale data collection for leading robotics companies. You will design distributed systems for real-time sensor data and own critical data pipeline systems. The... 

    Gerra Group

    San Francisco, CA
    1 day ago
  • A leading AI research organization located in San Francisco is seeking an experienced data infrastructure engineer to design and operate data infrastructure supporting extensive compute fleets. You will manage the lifecycle ownership and ensure high performance, scalability... 
    Relocation package

    OpenAI

    San Francisco, CA
    3 days ago
  • A leading medical AI platform in San Francisco is seeking a Data Infrastructure Software Engineer. You will build end-to-end systems vital for improving clinical decision-making. This role demands a commitment to performance, scalability, and precision in a fast-paced environment... 

    OpenEvidence

    San Francisco, CA
    2 days ago
  • A digital identity platform company in San Francisco is looking for a Data Infrastructure Engineer to design, build, and maintain their data platform. The role requires 3+ years of software engineering experience, proficiency in Python, and knowledge of technologies like... 

    Persona

    San Francisco, CA
    1 day ago
  • $250k - $380k

     ...for designing and running OpenAI’s LLM training and inference infrastructure that powers frontier models at massive scale. Our systems unify...  ...standardized dataset APIs, including for multimodal (MM) data that cannot fit in memory. Build proactive testing and scale... 
    Full time
    Work at office
    Local area
    Relocation package
    Flexible hours

    Slope

    San Francisco, CA
    11 hours ago
  •  ...general intelligence. In this role, you will design and build systems in-house while utilizing cloud technology to create reliable data infrastructure. The ideal candidate has 5+ years of software engineering experience and expertise in managing large datasets for machine... 

    I did my part and supported the Regular Toilet

    San Francisco, CA
    11 hours ago
  • Decagon AI, Inc. is looking for a Senior Data Infrastructure Engineer to design and operate the data systems that power its AI products. The successful candidate will own critical data pipelines and storage layers, improving reliability and creating clear data pathways... 

    Decagon AI, Inc.

    San Francisco, CA
    11 hours ago
  • $350k

     ...Software Engineer, Data Infrastructure Thinking Machines Lab's mission is to empower humanity through advancing collaborative general...  ...to make AI work for their unique needs and goals. We are scientists, engineers, and builders who've created some of the most widely... 
    Local area
    Immediate start
    Visa sponsorship
    Work visa
    Relocation package

    Thinking Machines Lab

    San Francisco, CA
    3 days ago
  • $160k - $225k

     ...platform synthesizes complex employee data, pinpoints risky behaviors, and deploys...  ...Build and scale the foundational data infrastructure powering a category-defining product...  ...and experimentation Partner with data scientists and engineers to productionize ML and... 
    Work experience placement
    Relocation package
    Flexible hours

    Fable

    San Francisco, CA
    4 days ago
  • $120k - $160k

     ...Founding Engineer For Airweave's Data And Infrastructure We're looking for a founding engineer to own Airweave's data and infrastructure layer, the systems that make our distributed search and data pipelines scalable, reliable and observable. At Airweave, you'll... 

    Airweave (yc X25)

    San Francisco, CA
    4 days ago
  • $200k - $400k

     ...Senior Data Infrastructure Engineer Decagon is the leading conversational AI platform empowering every brand to deliver concierge customer experiences. Our technology enables industry-defining enterprises like Avis Budget Group, Block's Cash App and Square, Chime, Oura... 
    Full time
    Work at office
    Local area

    Decagon

    San Francisco, CA
    10 days ago
  • $250k - $380k

     ...for designing and running OpenAI's LLM training and inference infrastructure that powers frontier models at massive scale. Our systems unify...  ...standardized dataset APIs, including for multimodal (MM) data that cannot fit in memory. Build proactive testing and scale... 

    OpenAI

    San Francisco, CA
    3 days ago
  •  ...Data Center Infrastructure Electrical Engineer OpenAI is building the infrastructure foundation for the next generation of AI. The Data Center Engineering team defines the strategy, reference architectures, technical requirements, and delivery standards for the large... 
    For contractors
    Work at office

    OpenAI

    San Francisco, CA
    4 days ago
  • $197.3k - $313.7k

     ...Agentforce is the future of AI, and you are the future of Salesforce. Slack is looking for a Staff Software Engineer to join the Data Infrastructure team within the broader Data Engineering organization. The mission of our team is to build secure, reliable, performant,... 

    Salesforce.Com Inc

    San Francisco, CA
    5 days ago
  • 100 Salesforce, Inc. is looking for a Staff Software Engineer to join the Data Infrastructure team. This role involves designing and operating reliable, scalable data infrastructure that supports analytics and machine learning workflows. The ideal candidate will have 10... 

    100 Salesforce, Inc.

    San Francisco, CA
    2 days ago
  • Slack Enterprise seeks a Staff Software Engineer to join its Data Infrastructure team. This role includes designing and building high-performance data systems that support analytics and machine learning needs. Candidates should have over 10 years of experience in software... 

    Slack Enterprise

    San Francisco, CA
    4 days ago
  •  ...Software Engineer in San Francisco to develop scalable software for data-driven operations. The role requires expertise in programming...  ..., along with familiarity in distributed systems and cloud infrastructure. The position offers significant autonomy in a collaborative... 
    Relocation package

    jobs.frontdoordefense.com - Jobboard

    San Francisco, CA
    3 days ago
  • $175k

     ...intelligence. Building these large-scale models requires performant data infrastructure to create and store the datasets used in all of our...  ...for company value Partner with engineers and research scientists to facilitate progress for both research and product development... 
    Work at office
    Remote work

    I did my part and supported the Regular Toilet

    San Francisco, CA
    11 hours ago
  • $160k - $230k

    A technology company in San Francisco is hiring for a foundational role to design and implement a large-scale data infrastructure. You'll develop the Models API and manage data pipelines using Kafka, Postgres, and Clickhouse. Ideal candidates will have experience in schema... 
    Flexible hours

    Meter

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Data Scientist, Infrastructure. Be the first to apply!