Software Engineer, Compute Infrastructure

OpenAI

Compute Infrastructure Engineer

Compute Infrastructure builds the platform that turns enormous amounts of compute into a reliable engine for frontier AI. We design, provision, schedule, operate, and optimize the systems that connect accelerators, CPUs, networks, storage, data centers, orchestration software, agent infrastructure, developer tools, and observability into one coherent experience for researchers and product teams.

Our work spans the entire stack: capacity planning and cluster lifecycle, bare-metal automation, distributed systems, Kubernetes and scheduling, deep system optimization, high-performance networking, storage, fleet health, reliability, workload profiling, benchmarking, and the developer experience that lets teams use enormous compute systems with confidence. At this scale, small improvements to communication, scheduling, hardware efficiency, or debugging workflows can compound into meaningful research velocity. We are hiring across Compute Infrastructure rather than for a single narrow team, and we use this opening to match strong engineers to the problems where they can have the most leverage.

We are looking for engineers who want to build the compute platform behind OpenAI's research and products. You may be strongest in low-level systems, high-performance computing, distributed infrastructure, reliability, CaaS, agent infrastructure, developer platforms, tooling, or the user experience around infrastructure. What matters is that you can reason carefully about complex systems, write durable software, and raise the quality and velocity of the people around you.

Depending on your background and interests, you might work close to hardware, close to users, on CaaS and agent infrastructure, or on the control planes and data planes in between. You could help bring new supercomputing capacity online, optimize training workloads from profiler traces and benchmarks, improve NCCL and collective communication behavior, reason about GPUs, NICs, topology, firmware, thermals, and failure modes, or design abstractions that make heterogeneous clusters feel like one coherent platform.

We do not expect every candidate to have worked at every layer. Some engineers will go deep on systems performance, kernel or runtime behavior, large-scale networking protocols, RDMA, NCCL, GPU hardware behavior, benchmarking, scheduling, or hardware reliability; others will make the platform more usable through APIs, tools, workflows, and developer experience. The common thread is strong engineering judgment and excitement about making enormous compute systems faster, more reliable, and easier to use.

This is a general opening for Compute Infrastructure. We will consider candidates for teams across Compute Infrastructure and match you based on your strengths, the problems that motivate you, and where the infrastructure needs are highest.

In this role, you will:

  • Build and deeply optimize reliable system software for large-scale compute systems that run some of the world's most demanding AI workloads
  • Design and operate infrastructure across accelerators, CPUs, NICs, switches, networking protocols, storage, data centers, cluster orchestration, scheduling, and fleet health
  • Profile, benchmark, and optimize training workloads across compute, memory, storage, networking, NCCL and collective communication, and cluster scheduling bottlenecks
  • Create hardware-aware automation that makes provisioning, firmware and driver upgrades, incident response, and day-to-day operations faster and less error-prone
  • Build CaaS, agent infrastructure, profiling, observability, benchmarking, and platform tools that help researchers, product engineers, and operators launch, debug, and optimize workloads with less friction
  • Turn operational lessons into better systems, stronger abstractions, and clearer ownership boundaries across teams
  • Collaborate across research, engineering, security, networking, hardware, and data center teams to make compute capacity more capable and easier to use

You might thrive in this role if you:

  • Have built or operated distributed systems, infrastructure platforms, high-performance computing environments, large-scale networking systems, Kubernetes clusters, developer tools, or production systems with demanding reliability requirements
  • Enjoy working across layers of the stack and are comfortable moving between software, hardware, networking, systems performance, reliability, and user needs
  • Care about making complex infrastructure understandable, observable, and usable for the people depending on it
  • Can diagnose hard problems under real operational pressure while still investing in long-term engineering quality
  • Like building leverage for others, whether through APIs, automation, debugging tools, CaaS and agent infrastructure primitives, workflow improvements, or better platform abstractions
  • Are motivated by scale, efficiency, reliability, and disciplined measurement through benchmarks, profiles, and production evidence
  • Communicate clearly, take ownership, and work well with teams whose constraints and goals differ from your own

Qualifications:

  • Strong software engineering skills and experience building, operating, or improving production infrastructure systems
  • Experience in one or more relevant areas such as distributed systems, operating systems, networking protocols, RDMA, NCCL or collective communication, storage, Kubernetes, scheduling, observability, reliability engineering, high-performance computing, GPU infrastructure, CaaS, agent infrastructure, hardware-aware performance optimization, benchmarking, developer experience, or infrastructure tooling
  • Ability to debug complex system behavior across software, hardware, networking, and workload layers, then turn findings into robust improvements
  • Comfort with ambiguity, strong ownership, and a bias toward practical, durable solutions
  • Interest in working on infrastructure that directly enables frontier AI research and product impact

About OpenAI:

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Vacancy posted 10 hours ago