Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer, Platform Systems

$310k

OpenAI

About the Team

The Platform Systems team at OpenAI operates at the intersection of cutting-edge AI and large-scale distributed systems. We build the engineering and research infrastructure required to train OpenAI's flagship models on some of the world's largest, custom-built supercomputers.

Our team develops core model training software and works deep in the stack - spanning collective communication, compute efficiency, parallelism strategies, fault tolerance, failure detection, and observability. The systems we build are foundational to OpenAI's research velocity, enabling reliable, efficient training at frontier scale.

We collaborate closely with researchers across the organization, continuously incorporating learnings from across OpenAI into the evolution of our training platform.

About the Role

As a Software Engineer, Platform Systems, you will design and build distributed systems that provide visibility into large-scale training workloads and help operate them reliably at scale.

You'll work on failure detection, tracing, and observability systems that identify slow or faulty nodes, surface performance bottlenecks, and help engineers understand and optimize massive distributed training jobs. This infrastructure is critical to operating OpenAI's training stack and is actively evolving to support new use cases and increasingly complex workloads.

This role sits at the core of our training infrastructure, blending systems engineering, performance analysis, and large-scale debugging.

In This Role, You Will
  • Design and build distributed failure detection, tracing, and profiling systems for large-scale AI training jobs

  • Develop tooling to identify slow, faulty, or misbehaving nodes and provide actionable visibility into system behavior

  • Improve observability, reliability, and performance across OpenAI's training platform

  • Debug and resolve issues in complex, high-throughput distributed systems

  • Collaborate with systems, infrastructure, and research teams to evolve platform capabilities

  • Extend and adapt failure detection systems or tracing systems to support new training paradigms and workloads

You Might Thrive in This Role If You
  • Care deeply about performance, stability, and observability in distributed systems

  • Enjoy finding and fixing issues in large-scale systems and automating operational workflows

  • Have experience writing low-level software where system details matter

  • Understand hardware, operating systems, networking, concurrency, and distributed systems

  • Have a background in high-performance computing or low-level systems engineering

  • Are excited to work on critical infrastructure that powers frontier AI research

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

For additional information, please see OpenAI's Affirmative Action and Equal Employment Opportunity Policy Statement.

Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Compensation Range: $310K - $460K

Vacancy posted 21 hours ago
Similar jobs that could be interesting for youBased on the Software Engineer, Platform Systems in San Francisco, CA vacancy
  •  ...innovative energy technology firm located in San Francisco is seeking a Staff Software Engineer to design, build, and scale customer-facing managed services. The ideal candidate will utilize systems programming expertise, ensuring technical oversight on edge systems while... 
    Software

    Crusoe Energy Systems LLC

    San Francisco, CA
    3 days ago
  • Perplexity AI is seeking a skilled software engineer to join their Enterprise Platform team in San Francisco, California. The role involves building user-facing products that facilitate enterprise adoption of Perplexity products while ensuring a seamless onboarding process... 
    Software

    Perplexity AI

    San Francisco, CA
    4 days ago
  • B Capital in San Francisco is looking for highly motivated college graduates for a Graduate Software Engineer role. This position offers a chance to work with world-class engineers and deliver scalable cloud computing products. Responsibilities include architecting and... 
    Software

    B Capital

    San Francisco, CA
    1 day ago
  • $293.6k - $335.1k

    COMFORT SYSTEMS is seeking a Distinguished Software Engineer to join our innovative team in San Francisco, CA. You will lead technical contributions and mentor colleagues in a collaborative environment. The ideal candidate will have extensive experience in software engineering... 
    Software

    COMFORT SYSTEMS

    San Francisco, CA
    1 day ago
  • B Capital is seeking a skilled software engineer in San Francisco to develop foundational AI systems. You will work on shared services and improve operational reliability, ensuring performance under load and addressing complex challenges. Ideal candidates will have a strong... 
    Software

    B Capital

    San Francisco, CA
    1 day ago
  • $180k - $280k

     ...infrastructure and reliability engineer, you will join the team...  ...and maintaining TypeSafe’s API platform for inference. These APIs will...  ...Experience designing resilient systems and improving on-call...  ...Have 5+ years of professional software engineering experience (3+ years... 
    Software
    Visa sponsorship

    TypeSafe AI

    San Francisco, CA
    2 days ago
  •  ...computing and make it accessible to software developers of all skill...  ...needing to be a distributed systems expert. Proud to be backed by...  ...a Senior Site Reliability Engineer to join the Infrastructure team...  ...that powers Anyscale’s cloud platform. You will have the opportunity... 
    Software

    Anyscale

    San Francisco, CA
    3 days ago
  • $157.36k - $281k

    A leading IoT company is looking for a Staff Engineer to drive the technical direction of its team and build foundational systems for scaling its software products. The ideal candidate will have extensive experience in software development and architecture, aiming to create... 
    Software
    Remote job

    Samsara

    San Francisco, CA
    4 days ago
  • Golunar, based in San Francisco, is seeking a Staff Software Engineer to tackle complex technical challenges in healthcare. You will design and build modern, AI-powered software systems that improve hospital operations and patient care. The ideal candidate will have over... 
    Software

    Golunar

    San Francisco, CA
    4 days ago
  • $217k - $312.2k

     ...Senior Engineering Manager – Workspace Platform – San Francisco, California At Databricks, we are passionate...  ...opportunity to guide a team of ~20 software engineers in creating platform features...  ...for high‑volume distributed systems. Cross‑Functional Collaboration... 
    Software
    Local area
    Worldwide

    Databricks Inc.

    San Francisco, CA
    4 days ago
  • SupportFinity™ is seeking a Senior AI Engineer to join the AI Platform team in San Francisco. In this role,...  ...design and implement LLM-powered AI systems to optimize insights from data. The...  ...has over 5 years of experience in software engineering and machine learning. You... 
    Software

    SupportFinity™

    San Francisco, CA
    5 days ago
  •  ...We achieve this by building platforms that enable the rapid and responsible...  ..., and full stack systems to create solutions that help...  ...mentoring other members of the engineering community, and from time to...  ...6 years of experience in software engineering (Internship experience... 
    Software
    Full time
    Part time
    Internship

    Capital One

    San Francisco, CA
    21 hours ago
  • $144k - $240k

    Lila Sciences is seeking a Sr Principal / Principal Software Engineer to join their innovative team in San Francisco, CA. You will design and build AI-driven applications, focusing on performance, reliability, and cross-functional collaboration with scientists. Ideal candidates... 
    Software
    Flexible hours

    Jobr

    San Francisco, CA
    4 days ago
  • Homebase in San Francisco is looking for a Senior Software Engineer, AI Systems, to enhance AI capabilities across engineering. This role includes building workflow automation and shared developer platforms, partnering with cross-functional teams, and evaluating emerging... 
    Software

    Homebase

    San Francisco, CA
    12 hours ago
  • $200k - $300k

    A tech startup in San Francisco seeks a Lead Software Engineer to build and optimize foundational backend systems for a massive AI video dataset. You will lead architecture, ensuring reliability and scalability while collaborating with cross-functional teams. The ideal... 
    Software

    Troveo AI

    San Francisco, CA
    3 days ago
  • $285k - $330k

    Parafin in San Francisco is seeking an experienced platform-focused software engineer to join our Merchant Platform team. The role involves designing scalable systems, enhancing the merchant experience, and collaborating with cross-functional teams to deliver product integrations... 
    Software
    Work from home

    Parafin Inc

    San Francisco, CA
    21 hours ago
  • $140k - $265k

     ...Software Engineer, Platform Mountain View, CA About Glean: Glean is the Work AI platform that helps everyone work smarter with AI. What began...  ...re excited to shape how the world works, you'll help build systems used daily across Microsoft Teams, Zoom, ServiceNow,... 
    Software
    Work at office
    Home office
    Flexible hours

    Glean.info

    San Francisco, CA
    21 hours ago
  • Avive Solutions is seeking a Technical Support Engineer in San Francisco, California. This role is focused on providing technical support for our connected hardware and software platform, diagnosing issues in real-time while communicating clearly with customers. The ideal... 
    Software

    Avive Solutions

    San Francisco, CA
    1 day ago
  •  ...Koah Labs Adtech Engineer Koah Labs is building the ad network to power the next generation...  ...infrastructure that make up our adtech platform. You might be a fit if: You have maintained or operated serious systems in production at scale You are detail-oriented... 
    Software

    Koah Labs

    San Francisco, CA
    2 days ago
  • $229.9k - $262.4k

     ...Senior Lead Software Engineer, Distributed Systems (Golang + Python on Kubernetes) Do you love building and pioneering in the technology space? Do you...  ...AI/ML across Capital One . We achieve this by building platforms that enable the rapid and responsible development and... 
    Software
    Full time
    Part time
    Internship
    Local area

    Information Technology Senior Management Forum

    San Francisco, CA
    21 hours ago
  • $229.9k - $262.4k

     ...Senior Lead Software Engineer, Distributed Systems (Golang + Python on Kubernetes) Do you love building and pioneering in the technology space? Do you...  ...AI/ML across Capital One. We achieve this by building platforms that enable the rapid and responsible development and deployment... 
    Software
    Full time
    Part time
    Internship
    Local area

    Capital One National Association

    San Francisco, CA
    21 hours ago
  • $140k - $200k

     ...– Speechify has no office. These include frontend and backend engineers, AI research scientists, and others from Amazon, Microsoft, and...  ...own companies. Overview The responsibilities of our Platform team include building and maintaining all backend services, including... 
    Software
    Work at office

    Speechify

    San Francisco, CA
    2 days ago
  •  ...A leading open-source software provider is looking for an Engineering Manager in San Francisco. In this role, you will lead a team working with major cloud partners like Amazon and Google, focusing on optimizing Ubuntu infrastructure. You will need strong technical leadership... 
    Software
    Remote work

    Canonical

    San Francisco, CA
    1 day ago
  •  ...the only unified payments and financial platform for global businesses. Powered by our unique...  ...of proprietary infrastructure and software, we empower over 200,000 businesses worldwide...  ...Are? As a high level architect (staff engineer), you will oversee the strategy,... 
    Software
    Work at office
    Worldwide

    Airwallex-

    San Francisco, CA
    2 days ago
  •  ...AI-native financial operating system for health systems, founded...  ...technical foundation of the platform . You'll work at the intersection...  ...You'll partner closely with engineers and leadership to understand...  ...looking for a systems-minded software engineer who cares deeply... 
    Software
    Contract work

    MidStream PA

    San Francisco, CA
    21 hours ago
  • $133.65k - $222k

     ...verification plans for complex embedded systems based on requirements. -...  ...-in-the-Loop (HIL) and Software-in-the-Loop (SIL) tooling....  ...for verification of embedded platform components (including embedded...  ...Qualifications: - Bachelors in an engineering discipline (MS/PhD preferred)... 
    Software
    Full time
    Work at office
    Work from home
    Flexible hours

    Waabi

    San Francisco, CA
    14 hours ago
  •  ...company in San Francisco is seeking a Lead Software Engineer to design and develop distributed filesystems for their innovative platform. You will research and oversee software...  ...required alongside significant experience in systems design. Benefits include comprehensive... 
    Software
    Remote job

    Salesforce

    San Francisco, CA
    4 days ago
  • $300k - $320k

     ...interpretable, and steerable AI systems. We want AI to be safe and...  ...group of committed researchers, engineers, policy experts, and business...  ..., select, and implement GRC platforms and tools, configuring and...  ...engineering, data engineering, software development, or related... 
    Software
    Full time
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    2 days ago
  • Zendesk in San Francisco is seeking an experienced Engineering Manager to lead the Authentication team within Core Services Engineering....  ...across various teams. The ideal candidate will have 8+ years of software engineering experience and 2+ years in management, excellent... 
    Software

    Ultimate.ai

    San Francisco, CA
    4 days ago
  • $200k - $250k

     ...We're building Skyway, a platform to help companies find, procure,...  ...are the strategic finance and engineering leaders at AI labs, inference...  ...providers, neoclouds, and AI-native software companies who are making...  ...data that flows through our systems. Under the hood, we're... 
    Software
    Contract work

    Duckbill

    San Francisco, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer, Platform Systems. Be the first to apply!