Software Engineer, Platform Systems
$310k
OpenAI
The Platform Systems team at OpenAI operates at the intersection of cutting-edge AI and large-scale distributed systems. We build the engineering and research infrastructure required to train OpenAI's flagship models on some of the world's largest, custom-built supercomputers.
Our team develops core model training software and works deep in the stack, spanning collective communication, compute efficiency, parallelism strategies, fault tolerance, failure detection, and observability. The systems we build are foundational to OpenAI's research velocity, enabling reliable, efficient training at frontier scale.
We collaborate closely with researchers across the organization, continuously incorporating learnings from across OpenAI into the evolution of our training platform.
About the Role
As a Software Engineer, Platform Systems, you will design and build distributed systems that provide visibility into large-scale training workloads and help operate them reliably at scale.
You'll work on failure detection, tracing, and observability systems that identify slow or faulty nodes, surface performance bottlenecks, and help engineers understand and optimize massive distributed training jobs. This infrastructure is critical to operating OpenAI's training stack and is actively evolving to support new use cases and increasingly complex workloads.
This role sits at the core of our training infrastructure, blending systems engineering, performance analysis, and large-scale debugging.
In This Role, You Will
Design and build distributed failure detection, tracing, and profiling systems for large-scale AI training jobs
Develop tooling to identify slow, faulty, or misbehaving nodes and provide actionable visibility into system behavior
Improve observability, reliability, and performance across OpenAI's training platform
Debug and resolve issues in complex, high-throughput distributed systems
Collaborate with systems, infrastructure, and research teams to evolve platform capabilities
Extend and adapt failure detection or tracing systems to support new training paradigms and workloads
You Might Thrive in This Role If You
Care deeply about performance, stability, and observability in distributed systems
Enjoy finding and fixing issues in large-scale systems and automating operational workflows
Have experience writing low-level software where system details matter
Understand hardware, operating systems, networking, concurrency, and distributed systems
Have a background in high-performance computing or low-level systems engineering
Are excited to work on critical infrastructure that powers frontier AI research
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.
For additional information, please see OpenAI's Affirmative Action and Equal Employment Opportunity Policy Statement.
Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.
To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
OpenAI Global Applicant Privacy Policy
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
Compensation Range: $310K - $460K


