Software Engineer, Compute Infrastructure
OpenAI
Compute Infrastructure Engineer
Compute Infrastructure builds the platform that turns enormous amounts of compute into a reliable engine for frontier AI. We design, provision, schedule, operate, and optimize the systems that connect accelerators, CPUs, networks, storage, data centers, orchestration software, agent infrastructure, developer tools, and observability into one coherent experience for researchers and product teams.
Our work spans the entire stack: capacity planning and cluster lifecycle, bare-metal automation, distributed systems, Kubernetes and scheduling, deep system optimization, high-performance networking, storage, fleet health, reliability, workload profiling, benchmarking, and the developer experience that lets teams use enormous compute systems with confidence. At this scale, small improvements to communication, scheduling, hardware efficiency, or debugging workflows can compound into meaningful research velocity. We are hiring across Compute Infrastructure rather than for a single narrow team, and we use this opening to match strong engineers to the problems where they can have the most leverage.
We are looking for engineers who want to build the compute platform behind OpenAI's research and products. You may be strongest in low-level systems, high-performance computing, distributed infrastructure, reliability, CaaS, agent infrastructure, developer platforms, tooling, or the user experience around infrastructure. What matters is that you can reason carefully about complex systems, write durable software, and raise the quality and velocity of the people around you.
Depending on your background and interests, you might work close to hardware, close to users, on CaaS and agent infrastructure, or on the control planes and data planes in between. You could help bring new supercomputing capacity online, optimize training workloads from profiler traces and benchmarks, improve NCCL and collective communication behavior, reason about GPUs, NICs, topology, firmware, thermals, and failure modes, or design abstractions that make heterogeneous clusters feel like one coherent platform.
We do not expect every candidate to have worked at every layer. Some engineers will go deep on systems performance, kernel or runtime behavior, large-scale networking protocols, RDMA, NCCL, GPU hardware behavior, benchmarking, scheduling, or hardware reliability; others will make the platform more usable through APIs, tools, workflows, and developer experience. The common thread is strong engineering judgment and excitement about making enormous compute systems faster, more reliable, and easier to use.
This is a general opening for Compute Infrastructure. We will consider candidates for teams across Compute Infrastructure and match you based on your strengths, the problems that motivate you, and where the infrastructure needs are highest.
In this role, you will:
- Build and deeply optimize reliable system software for large-scale compute systems that run some of the world's most demanding AI workloads
- Design and operate infrastructure across accelerators, CPUs, NICs, switches, networking protocols, storage, data centers, cluster orchestration, scheduling, and fleet health
- Profile, benchmark, and optimize training workloads across compute, memory, storage, networking, NCCL and collective communication, and cluster scheduling bottlenecks
- Create hardware-aware automation that makes provisioning, firmware and driver upgrades, incident response, and day-to-day operations faster and less error-prone
- Build CaaS, agent infrastructure, profiling, observability, benchmarking, and platform tools that help researchers, product engineers, and operators launch, debug, and optimize workloads with less friction
- Turn operational lessons into better systems, stronger abstractions, and clearer ownership boundaries across teams
- Collaborate across research, engineering, security, networking, hardware, and data center teams to make compute capacity more capable and easier to use
You might thrive in this role if you:
- Have built or operated distributed systems, infrastructure platforms, high-performance computing environments, large-scale networking systems, Kubernetes clusters, developer tools, or production systems with demanding reliability requirements
- Enjoy working across layers of the stack and are comfortable moving between software, hardware, networking, systems performance, reliability, and user needs
- Care about making complex infrastructure understandable, observable, and usable for the people depending on it
- Can diagnose hard problems under real operational pressure while still investing in long-term engineering quality
- Like building leverage for others, whether through APIs, automation, debugging tools, CaaS and agent infrastructure primitives, workflow improvements, or better platform abstractions
- Are motivated by scale, efficiency, reliability, and disciplined measurement through benchmarks, profiles, and production evidence
- Communicate clearly, take ownership, and work well with teams whose constraints and goals differ from your own
Qualifications:
- Strong software engineering skills and experience building, operating, or improving production infrastructure systems
- Experience in one or more relevant areas such as distributed systems, operating systems, networking protocols, RDMA, NCCL or collective communication, storage, Kubernetes, scheduling, observability, reliability engineering, high-performance computing, GPU infrastructure, CaaS, agent infrastructure, hardware-aware performance optimization, benchmarking, developer experience, or infrastructure tooling
- Ability to debug complex system behavior across software, hardware, networking, and workload layers, then turn findings into robust improvements
- Comfort with ambiguity, strong ownership, and a bias toward practical, durable solutions
- Interest in working on infrastructure that directly enables frontier AI research and product impact
About OpenAI:
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.
Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.
To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link. OpenAI Global Applicant Privacy Policy
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
$164.2k - $205.2k
...the world's best data and AI infrastructure platform so our customers... ...their business. Founded by engineers - and customer obsessed - we... ...started. At Databricks, the Compute Infrastructure organization... ...efficiency. As a Senior Software Engineer on the Compute Infra...SuggestedLocal areaWorldwide$156k - $387.6k
...Responsibilitie About the Team The Compute Infrastructure - Orchestration & Scheduling team uses Kubernetes and Serverless technologies... ...growing compute infrastructure. We're seeking talented software engineers excited to optimize our infrastructure for AI & LLM models...SuggestedTemporary workLocal areaOverseas$148.2k - $300.96k
...Software Engineer - AI Compute Infrastructure Location: Seattle Team: Infrastructure Employment Type: Regular Job Code: A111013C Responsibilities About the Team The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes...SuggestedTemporary workLocal area- ...Software Engineer, AI Compute Infrastructure Los Angeles, Palo Alto, San Francisco, Toronto, Singapore About HeyGen At HeyGen, our mission is to make visual storytelling accessible to all. Over the last decade, visual content has become the preferred method of...SuggestedFull time
- ...Team We build and scale the Compute foundation that powers frontier... ...world, rapidly bringing new infrastructure online across a wide range... ...the Role We’re looking for engineers to help build and operate the... ...bring‑up, and build the software layers that make heterogeneous...Suggested
$164.2k - $205.2k
Position Overview At Databricks, the Compute Infrastructure organization builds and operates the foundation that runs all Data, AI, and stateful... ..., and cost efficiency. Job Description As a Senior Software Engineer on the Compute Infra team, you will design and build the...Local area$174k - $252k
Senior Software Engineer, Infrastructure, Google Cloud Compute Infrastructure corporate_fare Google place Kirkland, WA, USA ; Seattle, WA, USA Apply In accordance with Washington state law, we are highlighting our comprehensive benefits package, which is available to...Full timeTemporary workWorldwide- ...Fortanix we are pioneers in confidential computing and Confidential AI for hybrid and... ...and data across clouds, on-premises infrastructure, and devices. Our platform enables... ...and security. The Role Staff Software Engineer (Rust) - Confidential Computing Infrastructure...H1b
$248k - $391k
...NVIDIA has been reinventing computer graphics, PC gaming, and accelerated computing... ...are seeking a highly skilled Principal Software Engineer to join our dynamic team. Our company... ...and optimizing the performance of our infrastructure both on-prem and in the cloud. You will...Remote work$198k - $326k
...and from a LinkedIn office on select days, as determined by the business needs of the team. As a Sr. Staff Software Engineer of the Compute Infrastructure team at LinkedIn, you will play a crucial role in our ongoing efforts to re-architect our compute infrastructure...For contractorsWork at officeFlexible hours- ...Cloud Computing Sr Software Engineer Resolve incidents associated with EUC equipment and/or EUC software, failure or degradation of EUC services, and provide break/fix support, advice, and assistance to end users across all company locations or working from home. Work...Remote workWork from home
$100k - $150k
...evolve a unified cloud-native compute and network platform that... ...implement compute and network infrastructure capabilities on AWS,... ...Collaborate closely with application engineering, architecture, and platform... ...above in Computer Science, Software Engineering, or a related field...Local areaWorldwide$135k - $216k
...Cloud Computing Engineer - RHEL Infrastructure Job Locations US-VA-Chantilly Requisition ID 2026-167068 Position Category... ...include building systems up from bare metal, performing software package installation and update, operating system configuration...Contract workWork at officeRemote workShift work$160k - $240k
...Senior Software Engineer - Public Cloud Engineering Managed Compute Location New York Business Area Engineering and CTO Ref # 10050591 Description... ...machines and containers, they're using the infrastructure and patterns our team built. We own the full...Temporary workFor contractorsWork experience placement$124.84k - $154.08k
...Software Engineer II, Computational Platform Remote; Watertown, Massachusetts, United States The Role Software Engineer II A collaborative... ...AI products. You will drive development of the cloud infrastructure that makes them reliable, scalable, and secure. You...Remote workFlexible hoursShift work$165k - $225k
...Senior Software Engineer, Compute Platform Chicago, IL or Remote Moonlite delivers high-performance AI infrastructure for organizations running intensive computational research, large-scale model training, and demanding data processing workloads. We provide infrastructure...Immediate startRemote workFlexible hours- ...Replit is the agentic software creation platform that enables anyone... ...distributed systems engineers who are passionate about building... ...the capabilities of Replit Infrastructure, optimize performance across... ...application deployment, serverless computing, or container orchestration....Full timeTemporary workWork at officeWorldwideMonday to FridayFlexible hours
- ...Senior Software Backend Engineer, Platform Computing We are seeking a Senior Software Backend Engineer, Platform Computing to integrate and operate... ...focus will be on integrating and operating compute infrastructure and orchestration systems that enable scientific workflows...Flexible hours
$125k - $160k
...embedded systems, radar sensing, cloud computing, and AI to unlock powerful real-world... ...intelligence. We're looking for a software engineer to help build and scale our edge and... ...services, distributed systems, and infrastructure that enable real-time data processing...$196.75k - $243.29k
...experiences for everyone. As a senior software engineer on the Cell Platform team at Roblox,... ...K8s controllers, and UX, simplifying infrastructure for our internal customers. You will also... ...engineer ~ Bachelor's degree in Computer Science or an equivalent field You...Full timeWork experience placementH1bWork at officeLocal areaVisa sponsorshipMonday to Friday$166k - $244k
Senior Software Engineer, Machine Learning, Google Cloud Compute Apply Benefits for this role include: Health, dental, vision, life, disability insurance Retirement Benefits: 401(k) with company match Paid Time Off: 20 days of vacation per year, accruing at a rate of...Full timeTemporary work$214k - $295k
...Staff Software Engineer, Data Infrastructure, AI Compute Platform Redwood City, CA (Hybrid) Biohub is the first large-scale initiative bringing frontier AI models, massive compute, and frontier experimental capabilities under one roof. We're building a general-purpose...Work at officeWorldwideRelocation packageFlexible hours3 days per week$174k - $252k
A leading technology company is seeking a Senior Software Engineer to work on Infrastructure for Google Cloud. This role requires a Bachelor's degree and significant software development experience in languages like C++, C, or Python. Responsibilities include code development...$174k - $252k
Google is looking for a Software Engineer in Kirkland, WA to contribute to innovative technologies that connect users globally. This role focuses on scientific computing and high performance computing on the Google Cloud Platform. Candidates must have a Bachelor's degree...- ...We are seeking a Senior Software Engineer to join a high-performance engineering team responsible... ...for building and evolving the core compute platform that underpins large-scale data... ...and developing robust, scalable infrastructure to support complex workloads, including...
$96k - $132k
Software Engineer, Computational Microscopy Platform (Biohub SF) Job Description The Chan Zuckerberg Biohub San Francisco (CZ Biohub SF) is an independent nonprofit research institute that brings together three powerhouse universities - Stanford, UC Berkeley, and UC San...InternshipFlexible hours$160k - $240k
Bloomberg L.P. is seeking a Senior Software Engineer specializing in Compute Management in New York. This role involves designing and developing applications to maintain a healthy production environment and improve the reliability of platforms. The ideal candidate should...$171.6k - $302.2k
Senior Software Release Engineer, Private Cloud Computing Seattle, Washington, United States Software and Services Apple Service Engineering is seeking... ...validated, and scaled at one of the most ambitious infrastructure efforts in the industry. You will be a technical leader...Relocation$190k - $235k
Databricks is looking for an Engineering Manager to lead a team responsible for critical components of their compute platform. This role will significantly impact product... ...in engineering management. Strong cloud infrastructure knowledge is required. A competitive pay...$97.1k - $164k
...Research Computing Software Engineer We are seeking a Research Computing Software Engineer to join the Visualization and Decision Support... ...Integrate software solutions with existing research computing infrastructure, including cloud platforms Collaborate with other...Full timeWork experience placement
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Software Engineer, Compute Infrastructure. Be the first to apply!
- graduate software developer United States
- rust software engineer United States
- senior software design engineer United States
- software engineer student United States
- software engineer amazon United States
- software developer positions United States
- software engineer full time United States
- software qa engineer United States
- new graduate software engineer United States
- junior software developer United States

