Software Engineer, RL Training Infra

Full-time

OpenAI

About the Team

The Post-Training Frontiers team creates the frontier agents OpenAI ships to the world. We do the reinforcement learning training for the agentic models we ship in Codex, ChatGPT, and the API (from o1 to 5.5).

Our role consists of (1) shepherding all integrations that should go into the final RL run and deciding what can make it in, (2) babysitting and scaling the final run, and (3) building the research and infra for horizontal integrations, such as improving function calling, factuality, multi-agent capabilities, memory, calibrated thinking, etc.

About the Role

This role focuses on keeping our frontier RL training runs fast, reliable, and unblocked. You will work across engineering and infrastructure problems as they emerge, from scaling and orchestration issues to inference bottlenecks, numerical problems, and hardware failures, as well as supporting large horizontal integrations in the big run, like multi-agent capabilities or memory. This is a role for a strong generalist who quickly learns anything needed for the task, has high attention to detail, debugs deeply, and is motivated by fixing the highest-impact problem in front of the team.

In this role, you will:

- Keep large-scale RL training runs moving by jumping into the most urgent engineering and infrastructure problems.

- Debug issues across training systems, inference, orchestration, scaling, and distributed infrastructure.

- Solve hard technical problems at the boundary between research and engineering: scaling experiments, improving training reliability, debugging distributed systems, reducing latency and cost, and making new capabilities robust under real workloads.

- Improve reliability and efficiency for RL training runs.

- Help researchers who are developing infra-heavy integrations, such as multi-agent capabilities or memory.

- Turn recurring operational issues into better tools, systems, processes, or abstractions.

- Work closely with research, infrastructure, and partner teams during tight model run timelines.

- Become useful quickly in messy, ambiguous areas where ownership matters more than a perfectly scoped project.

- Debug failures that cut across model behavior, training data, RL systems, evaluation infrastructure, serving systems, and agent harnesses, then turn those failures into hypotheses, fixes, and durable improvements.

You might thrive in this role if you:

- Want to train and ship our frontier models and ensure we make agents genuinely useful for developers, enterprises, researchers, and everyday users.

- Are a strong generalist engineer with experience in some layer of ML infrastructure.

- Have worked on RL, inference, scaling, training systems, orchestration, or adjacent ML infrastructure.

- Learn extremely quickly and are comfortable operating across unfamiliar layers.

- Are a strong debugger with high ownership, low ego, and excellent communication.

- Can land in a messy area with tight timelines, become useful quickly, and gradually raise the quality of the whole system.

- Are energized by fast-moving environments where reliability, speed, and judgment matter.

- Like building load-bearing systems and processes when that is what the team needs, even if the work is not glamorous.

Nice to have:

- Experience supporting large-scale model training, async RL systems, or high-throughput ML infrastructure.

- Experience debugging distributed systems across GPUs, networking, orchestration, or inference stacks.

- Background in performance optimization, scaling, or production-critical infrastructure.

- Experience working directly with researchers or fast-moving model teams.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement .

Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form . No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link .

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Apply

Vacancy posted 17 hours ago

Similar jobs that could be interesting for youBased on the Software Engineer, RL Training Infra in San Francisco, CA vacancy

Software Engineer - RL Environments
$200k
...About AfterQuery AfterQuery builds the training data and evaluation infrastructure that frontier... ...if they've worked for/interned for any RL environment companies in the past or any... ...results ~ Former founders and early engineers at early stage startups are a plus. We don...
Training
Full time
AfterQuery
San Francisco, CA
17 hours ago
Experienced Software Engineer - Machine Learning Infra
$182.52k - $297k
...Learning Infrastructure (ML Infra) team is responsible for creating... ...cases to streamline feature engineering for both batch and real-time... ...environment, ML model training, ML data feature creation/serving... ...Qualifications ~8+ years of software engineering experience. ~ Extensive...
Training
Full time
Work experience placement
Plaid
San Francisco, CA
17 hours ago
AI Engineer — LLM Infra
...entire stack to be agent‑first, from training our own models to generative... ...etc. Responsibilities: Scale infra for post-training of multimodal LLMs (CPT, SFT, RL, search, reward models) Scale infra... ...Work closely with product engineers to translate cutting‑edge AI capabilities...
Training
Work at office
Relocation
Visa sponsorship
Yutori
San Francisco, CA
1 day ago
Software Engineer: ML Infra
...About the Role Generalist trains very large robot foundation models. This requires utilizing very large numbers of the latest generation... ...scaled ChatGPT and GPT-4 to hundreds of millions of users, engineered the foundations of autonomous driving, built next-generation...
Training
Full time
The Generalist
San Francisco, CA
17 hours ago
Software Engineer, ML Infra
$127.1k - $185k
...content hyper-personalized for them. Twitch is looking for a Software Engineer to join our Machine Learning Infrastructure team. You will work... ..., including owning the software and data systems to develop, train and manage our real time and batch models at scale. We own...
Training
Full time
Flexible hours
Twitch
San Francisco, CA
17 hours ago
Software Engineer, RL Environments
$180k - $220k
...About the Company Our client builds the training data and evaluation infrastructure that... ...people · Industry: AI / ML The Role As a Software Engineer (Environments), you design the datasets... ...Nice to Haves Experience at RL-environment companies A background in...
Training
Full time
Visa sponsorship
David Joseph & Company
San Francisco, CA
17 days ago
Relocate to SF: Software Engineer (AI Infra)
...to get your help as we're hiring several extremely talented software engineers across the stack. In this role, you will... Build the... ...platforms that power Pylon's AI features - prompt executions, search infra, and more! Improve LLM observability - AI evals (online and...
Full time
Work at office
Relocation
Pylon
San Francisco, CA
17 hours ago
Software Engineer, Inference - TL
...systems. In this role, you’ll lead engineering efforts to ensure our largest... ...direction across research, infra, and product teams. Mentor... ...kernels for inference or training workloads. Have experience... ...performance issues across hardware and software layers. Have strong...
Training
Full time
OpenAI
San Francisco, CA
17 hours ago
Senior Software Engineer
...learning environments that teach AI models to code like 0.01% engineers. Our training environments are based on real-world coding scenarios that... ...you'll: Design and build scaleable systems that generate RL environments Create automated QA systems to validate environment...
Training
Full time
Contract work
Relocation package
Idler
San Francisco, CA
17 hours ago
Software Engineer
$200k
...than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal. About the role As a Software Engineer at Magic, you will work on core systems or product surfaces...
Training
Full time
Relocation
Visa sponsorship
Magic
San Francisco, CA
17 hours ago
Software Engineer - Product
$225k
...than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal. About the role: As a Software Engineer on the product team, you’ll be responsible for building and...
Training
Full time
Local area
Relocation
Visa sponsorship
Magic Ai
San Francisco, CA
17 hours ago
Software Engineer, Research Acceleration
$350k
...needs and goals. We are scientists, engineers, and builders who’ve created some of the... ...infrastructure — evaluation libraries, RL training libraries, experiment tracking... ...machine learning, or similar. Strong software engineering fundamentals with a track record...
Training
Full time
Visa sponsorship
Work visa
Relocation package
Thinking Machines Lab
San Francisco, CA
17 hours ago
Senior Software Engineer, Machine Learning (Commerce)
$220k - $247.5k
...are looking for a Senior Machine Learning Engineer to join our Revenue ML team at Discord.... ...closely with Shop, Game Commerce, Revenue Infra, ML Infra and Data Engineering teams to define... ..., experience, and relevant education or training. Please note that the compensation...
Training
Full time
Seasonal work
Discord
San Francisco, CA
17 hours ago
Software Engineer, Inference - Multi Modal
...a small, fast-moving team of engineers focused on delivering a world... ...Role We’re looking for a software engineer to help us serve OpenAI... ...directly with researchers training these models and with product... ...Collaborate closely with researchers, infra teams, and product engineers...
Training
Full time
OpenAI
San Francisco, CA
17 hours ago
Software Architecture Engineering
...or SSMs, a new primitive for training efficient, large-scale foundation... ...model innovation and systems engineering paired with a design-minded... ...Role Cartesia is hiring a Software Engineer, Product to build... ...end-to-end, such as realtime infra. Work alongside product teams...
Training
Full time
Work at office
Visa sponsorship
Flexible hours
Cartesia
San Francisco, CA
17 hours ago
Software Engineer, Computer-Using Agent
...we collaborate with cross-functional teams at every training and deployment stage to advance the capabilities of... ...more. Combining rigorous research with high-quality engineering across evaluation, data, training, RL environments and shared infrastructures, we aim to create...
Training
Full time
Work at office
Relocation package
OpenAI
San Francisco, CA
17 hours ago
Software Engineer (Ray Core)
...computing and make it accessible to software developers of all skill... .... We're looking for engineers with systems software experience... ...distributed libraries, test infra improvements, debugging, and... ...Knowledge of distributed model training and inference (e.g. tensor parallel...
Training
Full time
Work experience placement
Anyscale
San Francisco, CA
17 hours ago
Software Engineer, Logs Infrastructure
$175k - $215k
...Perception, Analysis, Release Eval, Onboard Infra, Mapping, Ops, Hardware, Depot Infra,... ...have: ~4+ years of professional software engineering experience ~ Experience working on large... ...work location, experience, relevant training and education, and skill level. Your recruiter...
Training
Full time
Remote work
Waymo
San Francisco, CA
17 hours ago
Senior Software Engineer
...Founding Member of Technical Staff — RL Environments & Platform Engineering We're partnering with a fast-... ...well-funded AI startup building the training and evaluation infrastructure that... ...difficult for frontier models Build software to 10–100x the quality and throughput...
Training
Digital Waffle
San Francisco, CA
1 day ago
Software Engineer, Agent Infrastructure
...robust and secure systems that power the training and advanced use cases of next-gen AI models... ...complex infrastructure that powers RL training and advanced model use cases.... ...training at scale, identifying bottlenecks and engineering solutions to optimize system performance...
Training
Full time
Work at office
Relocation package
OpenAI
San Francisco, CA
17 hours ago
Software Engineer, Frontier AI Infrastructure
$184k - $259.44k
...seeking a highly skilled and motivated Software Engineer, Frontier AI Infrastructure to join... ...features before they break, moving us from "infra-only debugging" to proactive... ...performance, and relevant education or training. Scale employees in eligible roles are...
Training
Full time
Work at office
3 days per week
Early shift
Scale AI
San Francisco, CA
18 hours ago
Junior Software Engineer
$300k
We're partnering with a top AI training lab in San Francisco to hire a junior software engineer — and they pay like it's a senior role: $300K base + equity + bonus,... ...About the company One-liner: Building high-fidelity RL environments used to train frontier AI models on...
Training
Simplify
San Francisco, CA
8 days ago
Software Engineer
$75k - $100k
...Emergent, we’re reimagining how software gets built. Our vision is to... ...breakthroughs in AI, systems engineering, and product design. Our team... ...infrastructure for AI model training and inference Collaborate... ...with distributed systems, cloud infra, and high-performance services...
Training
Full time
Worldwide
Emergent
San Francisco, CA
2 days ago
Software Engineer, Platform
$300k - $320k
...group of committed researchers, engineers, policy experts, and business... ...role We are looking for software engineers to join our... ...accelerating our mission. Service Infra: we build and maintain the... ...Intelligence: We build the training systems that adapt Claude to...
Training
Full time
Currently hiring
Work at office
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
17 hours ago
Software Engineer (Infra)
...hiring in our SF and NY offices, but do hire remotely in some cases About the role: We’re looking for a foundational Software Engineer (Infrastructure) who thrives on solving complex distributed systems problems at scale. You’ll design and build core infrastructure...
Full time
Work at office
Remote work
Numeral
San Francisco, CA
17 hours ago
Staff Software Engineer, Code RL
$405k
...growing group of committed researchers, engineers, policy experts, and business leaders working... ...AI systems.About the role Code RL at Anthropic drives reinforcement learning... ...learning concepts, agentic systems, or LLM training pipelines Experience building or operating...
Training
Work at office
Remote work
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
3 days ago
Software Engineer, Front-End Systems
$150k - $250k
...them. We own every layer, from infra to interface, so intelligence... .... We’re starting with expert software developers. But the platform... ...to experts across fields: engineering, research, operations, and beyond... ...Software Engineer - Post-training Alameda, CA $130,000.00-$160,...
Training
Full time
Summer work
Internship
Remote work
Flexible hours
Stealth AI Startup
San Francisco, CA
12 days ago
Software Engineer (Agent Infra)
...most recently from Google. The platform is already in production at Fortune 500 enterprises. The role This is a critical engineering hire with broad ownership over core platform infrastructure. Depending on your strengths, you may lead the systems that...
Full time
Hinoki Security
San Francisco, CA
17 hours ago
Junior Software Engineer - RL Environments for Frontier AI
RippleMatch Inc. is seeking an innovative and motivated individual to design and refine reinforcement learning tasks in San Francisco. This role requires a strong command of Python and the ability to work independently with coding agents. Responsibilities include the full...
RippleMatch
San Francisco, CA
4 days ago
Ground Software Solutions Engineer
$120k - $170k
...Wanna join the adventure? Loft Orbital is looking for a Software Engineer to join our Ground Software Solutions team. About the... ...problems as they occur. On top of this we also offer SatDevOps training to enable this person to actually operate our spacecraft!...
Training
Full time
Temporary work
Work at office
Relocation package
Flexible hours
Loft Orbital Solutions
San Francisco, CA
17 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer, RL Training Infra. Be the first to apply!