Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Data Scientist, Infrastructure

OpenAI

About the Team Our infrastructure team helps deliver OpenAI’s most capable models and products to the world by scaling infrastructure and turning demand into useful FLOPS. We collaborate across research, engineering, design, and business to turn cutting-edge AI advancements into impactful, real-world applications. Our team ensures the right compute is available—at the right time and place—to support some of the world’s most demanding workloads. We empower all of OpenAI’s products and research by scaling the infrastructure behind them. Our work makes it possible to launch new models and products reliably and at scale. About the Role As a Data Scientist on the Infra team, you will play a key role in shaping how we scale the infrastructure that powers OpenAI’s products and research. This is critical as we operate one of the largest and most advanced compute fleets in the world, supporting millions of users and businesses globally. We focus on aligning infrastructure measurement, planning, scaling, allocation, and efficiency to drive measurable impact across the company. You should expect to guide the definition of foundational datasets for infrastructure resources, develop metrics that inform key decisions, build forecasting and optimization models, and establish source of truth dashboards and analyses that enable teams to understand and improve infra usage. Most importantly, you should expect to be a core partner to engineering, research, and product teams in shaping the infrastructure that powers everything OpenAI builds. This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees. In this role, you will: Build and maintain foundational datasets and metrics that reflect infrastructure usage, efficiency, and scaling. Develop forecasting and optimization models to support infra planning and resource allocation. Partner with engineering, research, and product teams to shape infrastructure strategy through data. Drive clarity with source-of-truth dashboards and analyses that guide infra decisions across OpenAI. You might thrive in this role if you have: 5+ years of experience in a quantitative role navigating ambiguous environments, ideally in infrastructure, systems, or platform domains at a high-growth company or research org Experience defining and operationalizing metrics that reflect system performance, resource usage, or efficiency from the ground up A strong foundation in SQL and Python, and a track record of building models and analyses that drive technical and strategic decisions Excellent communication skills and the ability to partner effectively with engineers, researchers, and product stakeholders A strategic mindset that goes beyond statistical testing to surface actionable insights and long-term tradeoffs You could be an especially great fit if you have: Proven track record of operating as a data partner in large scale backend systems Comfortable navigating fast-paced execution while also anchoring decisions in long-term impact Strong programming background, with ability to run simulations and prototype variants Experience in NLP, large language models, or generative AI About OpenAI OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic. For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement. Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. We comply with applicable safety and privacy regulations. To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance. We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link. OpenAI Global Applicant Privacy Policy. At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology. #J-18808-Ljbffr OpenAI

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Data Scientist, Infrastructure in San Francisco, CA vacancy
  • $140k - $180k

     ...live our lives. We are actively growing our founding engineering team to build the underlying infrastructure that makes this ambitious future a reality. About the Role As a Data Infrastructure Engineer , you will build the backend and hardware architecture that allows us... 
    Suggested
    Local area
    Visa sponsorship

    Alljoined

    San Francisco, CA
    3 days ago
  •  ...robots generate high-rate video, telemetry, and demonstration data every time they move — and that data is what trains our models...  ...team builds the systems that connect robots to learning. Data infrastructure sits at the center of that loop: what gets captured, how it moves... 
    Suggested
    Immediate start

    Droyd

    San Francisco, CA
    2 days ago
  • $165k - $205k

    Candid Health is looking for a passionate Data Engineer to join our growing data team in San Francisco. You will design, build, and support data infrastructure that meets the needs of our customers. With your expertise in data pipelines and modern architecture, you will... 
    Suggested

    Candid Health

    San Francisco, CA
    2 days ago
  • About the Role We are seeking a Data Infrastructure Engineer to build and operate the infrastructure that turns drone, aerial, and orbital sensing data into production datasets, models, and customer-facing insights. This role spans ingestion, processing, storage, compute... 
    Suggested
    Permanent employment
    Full time

    Matter Intelligence

    San Francisco, CA
    4 days ago
  • A progressive technology company in San Francisco is looking for a Data Infrastructure Engineer to design and operate data and ML infrastructure on AWS. The ideal candidate will have strong software engineering fundamentals and experience building production systems, particularly... 
    Suggested

    Matter Intelligence

    San Francisco, CA
    4 days ago
  • Cartesia is looking for a Software Engineer to build the data infrastructure for its AI models in San Francisco. In this hands-on role, you will design and implement scalable data pipelines for multimodal data, particularly audio. Candidates should have experience with... 
    Work at office

    Cartesia

    San Francisco, CA
    3 days ago
  • $191k - $225k

     ...for guests to connect with communities in a more authentic way. The Community You Will Join: Data represents the voice of Airbnb’s users at scale. The Data Warehouse Infrastructure team is responsible for the foundational big data infrastructure, which is used by hundreds... 
    Work experience placement

    Nerdleveltech

    San Francisco, CA
    5 days ago
  •  ...for designing and running OpenAI’s LLM training and inference infrastructure that powers frontier models at massive scale. Our systems unify...  ...standardized dataset APIs, including for multimodal (MM) data that cannot fit in memory. Build proactive testing and scale validation... 

    Slope

    San Francisco, CA
    4 days ago
  • Scribd is looking for a Senior AI Data Engineer to lead AI engineering efforts on the Data Platform team. The role includes building infrastructure for AI applications, aiding platform development, and mentoring engineers. Ideal candidates have 5+ years in data engineering... 

    Scribd

    San Francisco, CA
    3 days ago
  •  ...Francisco is seeking an experienced Rack Product Engineer to manage the technical development and lifecycle performance of rack infrastructure for datacenters. The ideal candidate will have10+ years of experience in hardware product engineering and a strong background... 

    OpenAI

    San Francisco, CA
    4 days ago
  • Epoch Biodesign in San Francisco is seeking a Senior Data Engineer to architect and build foundational data platform infrastructure for AI operations. This full-time position requires proficiency in Python and systems-level languages, experience with data platforms, and... 
    Full time

    Epoch Biodesign

    San Francisco, CA
    1 day ago
  • $200k - $400k

     ..., and The Polymath Principle — shape how we work and grow as a team. About the Team The Infrastructure team builds and operates the foundations that power Decagon: networking, data, ML serving, developer platform, and real‑time voice. We partner closely with product, data... 
    Full time
    Work at office
    Local area

    Decagon

    San Francisco, CA
    3 days ago
  •  ...offices in New York, WashingtonD.C., London and Amsterdam. Making data driven decisions is key to Plaid's culture. To support that,...  ...Plaid serve our customers more effectively. Engineers on Data Infrastructure are domain experts in Data Warehouse, Data Lakehouse, Spark,... 
    Work experience placement
    Local area

    Plaid

    San Francisco, CA
    3 days ago
  • Gerra Group in San Francisco is seeking a Senior Software Engineer to build core infrastructure for petabyte-scale data collection for leading robotics companies. You will design distributed systems for real-time sensor data and own critical data pipeline systems. The... 

    Gerra Group

    San Francisco, CA
    3 days ago
  • Droyd in San Francisco is seeking a Staff Software Engineer focused on data infrastructure. You will own data pipelines that convert robot telemetry into valuable training signals. Collaborate directly with a small, senior team across robotics and machine learning to improve... 

    Droyd

    San Francisco, CA
    4 days ago
  • A leading AI research organization located in San Francisco is seeking an experienced data infrastructure engineer to design and operate data infrastructure supporting extensive compute fleets. You will manage the lifecycle ownership and ensure high performance, scalability... 
    Relocation package

    OpenAI

    San Francisco, CA
    5 days ago
  • A digital identity platform company in San Francisco is looking for a Data Infrastructure Engineer to design, build, and maintain their data platform. The role requires 3+ years of software engineering experience, proficiency in Python, and knowledge of technologies like... 

    Persona

    San Francisco, CA
    3 days ago
  • $250k - $380k

     ...for designing and running OpenAI’s LLM training and inference infrastructure that powers frontier models at massive scale. Our systems unify...  ...standardized dataset APIs, including for multimodal (MM) data that cannot fit in memory. Build proactive testing and scale... 
    Full time
    Work at office
    Local area
    Relocation package
    Flexible hours

    Slope

    San Francisco, CA
    2 days ago
  • A technology startup in San Francisco is seeking a Data Infrastructure Engineer to build backend systems and manage the data lifecycle. You will be responsible for creating pipelines that process large datasets and ensuring the infrastructure supports world-class research... 
    Visa sponsorship

    Alljoined

    San Francisco, CA
    4 days ago
  • $160k - $230k

    REACH INDUSTRIES is seeking a Founding Data Infrastructure Engineer in San Francisco. You will build the backend for data collection efforts, designing scalable architectures for multimodal data. Ideal candidates have a strong background in Python, Rust, or Go, and experience... 
    Relocation package

    REACH INDUSTRIES

    San Francisco, CA
    5 days ago
  • $160k - $225k

    A technology-driven security company based in California is looking for a Data Infrastructure Engineer. This role focuses on designing and maintaining scalable data pipelines and infrastructure, ensuring data quality and reliability. Ideal candidates should have 3-7+ years... 
    Flexible hours

    Fable Security LLP

    San Francisco, CA
    2 days ago
  • Judgment Labs builds infrastructure for Agent Behavior Monitoring (ABM). While traditional observability focuses on logging exceptions and...  ...Kevin Hartz, and others. The Role: We are looking for a Senior Data Infrastructure Engineer to build and scale the real-time data... 

    Judgment Labs Inc.

    San Francisco, CA
    3 days ago
  • Decagon AI, Inc. is looking for a Senior Data Infrastructure Engineer to design and operate the data systems that power its AI products. The successful candidate will own critical data pipelines and storage layers, improving reliability and creating clear data pathways... 

    Decagon AI, Inc.

    San Francisco, CA
    2 days ago
  • 11x in San Francisco is looking for a Data Engineer who operates like a founder to build systems for AI applications. This role involves owning critical infrastructure for AI workers, designing scalable data systems, and moving quickly through ambiguity. Candidates should... 

    11x

    San Francisco, CA
    1 day ago
  • $160k - $230k

    Open role Founding Data Infrastructure Engineer San Francisco (On-site) About Us Constellation is creating the AI-human translation layer that ensures humanity evolves alongside our technology. Our mission is to leverage AI towards addressing deep and meaningful problems... 
    Work at office
    Local area
    Relocation package

    REACH INDUSTRIES

    San Francisco, CA
    5 days ago
  •  ...general intelligence. In this role, you will design and build systems in-house while utilizing cloud technology to create reliable data infrastructure. The ideal candidate has 5+ years of software engineering experience and expertise in managing large datasets for machine... 

    I did my part and supported the Regular Toilet

    San Francisco, CA
    2 days ago
  • OpenAI is looking for a Data Engineer to build and scale data products that support its infrastructure operations. This role involves designing data pipelines, developing datasets for critical operational metrics, and collaborating with various teams including Hardware... 

    OpenAI

    San Francisco, CA
    5 days ago
  • A leading AI company in San Francisco is seeking an experienced Data Engineer to work on innovative storage infrastructure and product launches. The role demands a strong command of Python and SQL, with 5+ years experience in production-grade data processing systems. Candidates... 
    Remote job
    Flexible hours

    Cohere

    San Francisco, CA
    2 days ago
  • $300k - $400k

    AI Talent Now in San Francisco is looking for an Engineering Manager to lead a team of engineers building core infrastructure for AI labs. In this role, you will manage engineering operations, mentor teams, and work closely with the CTO. Ideal candidates have 6-10 years... 

    AI Talent Now

    San Francisco, CA
    2 days ago
  • About the Team OpenAI is building the infrastructure foundation for the next generation of AI. The Data Center Engineering team defines the strategy, reference architectures, technical requirements, and delivery standards for the large-scale data centers that support OpenAI... 
    For contractors
    Work at office

    OpenAI

    San Francisco, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Data Scientist, Infrastructure. Be the first to apply!