Member of Engineering (Reinforcement Learning Infrastructure)

Poolside

ABOUT POOLSIDE

In this decade, the world will create Artificial General Intelligence. There will only be a small number of companies who will achieve this. Their ability to stack advantages and pull ahead will define the winners. These companies will move faster than anyone else. They will attract the world's most capable talent. They will be on the forefront of applied research, engineering, infrastructure and deployment at scale. They will continue to scale their training to larger & more capable models. They will be given the right to raise large amounts of capital along their journey to enable this. They will create powerful economic engines. They will obsess over the success of their users and customers.

Poolside exists to be this company : to build a world where AI will be the engine behind economically valuable work and scientific progress. We believe the fastest way to reach AGI lies in accelerating software development itself, by reshaping the developer experience with agentic systems, coding assistants, and the frontier models that power them. We deploy these systems directly into the development environments of security-conscious enterprises.

ABOUT OUR TEAM

We were founded in the US and have our home there, but our team is distributed across Europe and North America. We get our fix of in-person collaboration (and croissants) in Paris each month for 3 days, always Monday-Wednesday, with an open invitation to stay the whole week. We also do longer off-sites once a year.

Our team is a multidisciplinary blend of research, engineering, and business experts. What unites us is our deep care for what we build together. We're in a race that requires hard work, intellectual curiosity, and obsession; to balance this intensity, we've assembled a team of low ego and kind-hearted individuals who have built the special culture Poolside has. By building collaboratively and with intention, we create a compounding effect that moves the entire company forward towards our mission: reaching AGI through intelligence systems built for software development.

ABOUT THE ROLE

You would be working on our reinforcement learning team focused on improving reasoning and coding abilities of Large Language Models through reinforcement learning. This is a hands-on role where you'll work end-to-end from researching new exploration or training algorithms, to designing and scaling up RL environments, to implementing your ideas across the stack. You will have access to thousands of GPUs in this team.

YOUR MISSION

Build and scale the infrastructure that enables reliable, efficient training of Large Language Models with Reinforcement Learning at the frontier.

RESPONSIBILITIES

Keep up with the latest research, and be familiar with the state of the art in LLMs, RL, and code generation
Develop methods for tuning training and inference end-to-end for high throughput
Design data control systems in an RL pipeline that govern what the model sees and when
Debug cases where infrastructure decisions are silently degrading learning dynamics
Build observability tooling that surfaces when a system-level issue is the root cause of a training regression
Help build robust, flexible and scalable RL pipelines
Optimize performance across the stack - networking, memory, compute scheduling, and I/O
Write high-quality, pragmatic code
Work in the team: plan future steps, discuss, and always stay in touch

SKILLS & EXPERIENCE

Experience with LLMs and model post-training workflows
Understanding how Reinforcement Learning works and what its main bottlenecks are
Solid software engineering fundamentals (testing, code review, debugging complex systems)
Proficiency in Python with knowledge of concurrency, asynchronous programming, multiprocessing and performance optimization
Familiarity with deep learning frameworks (PyTorch or JAX) and RL workflows (rollouts, replay buffers, policy updates)
Experience designing and maintaining distributed RL training systems
Experience with large-scale LLM training infrastructure
Experience with profiling tools across the stack (e.g. py-spy)
Experience with inference stacks (e.g. vLLM)
Nice to have: Open-source contributions to RL or distributed ML projects

PROCESS

Intro call with one of our Founding Engineers
Technical Interview(s) with one of our Founding Engineers
Team fit call with the People team
Final interview with one of our Founding Engineers

BENEFITS

Fully remote work & flexible hours
37 days/year of vacation & holidays
Health insurance allowance for you & dependents
16 weeks of flexible, full-pay parental leave
Well-being, always-be-learning & home office allowances
Company-provided equipment
Frequent team get togethers
Diverse & inclusive people-first culture

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Member of Engineering (Reinforcement Learning Infrastructure) in United States vacancy

Reinforcement learning engineer
...Reinforcement Learning Expert Dexmate is building the foundation for physical AI — a unified... ...environments Collaborate with robotics engineers to integrate RL models into... ...algorithm performance Scale training infrastructure for efficient learning across multiple...
Suggested
Dexmate
Santa Clara, CA
18 days ago
Reinforcement Learning Engineer
$188k - $275k
...CoreWeave's industry-leading cloud infrastructure with the best-in-class tools... ...tools to help agents learn from experience . This is a... ..., we're looking for strong engineers with great taste. The most important... ..., with a strong focus on reinforcement learning or PhD + 2 years...
Suggested
Permanent employment
Temporary work
Casual work
Work at office
Flexible hours
CoreWeave
Bellevue, WA
5 days ago
Reinforcement Learning Engineer
...Reinforcement Learning (RL) Engineer Location: New York(Office) On-site| Full-time Compensation: Competitive Our client is an elite development... ...-growth software company responsible for building the infrastructure behind the world’s largest crypto social networks and...
Suggested
Full time
Work at office
Immediate start
MLabs
New York, NY
9 hours ago
Reinforcement Learning Engineer
$100k - $150k
...Reinforcement Learning Engineer Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative... ...models when relevant. Build scalable training infrastructure for distributed RL, including efficient experience collection...
Suggested
Full time
H1b
Local area
Immediate start
Remote work
Visa sponsorship
Bright Vision Technologies
United States
3 days ago
Intern - Reinforcement Learning Engineer
$30 - $45 per hour
...Reinforcement Learning Intern Ghost Robotics is the industry leader in legged robotic systems... .... Key Duties: Develop infrastructure - scripts for training and evaluating... ...Holds Bachelor's degree in Computer Engineering, Software Engineering, or a related...
Suggested
Hourly pay
Part time
Internship
Remote work
Flexible hours
Ghost Robotics
Philadelphia, PA
5 days ago
Reinforcement Learning Engineer
...platform optimizes data center power infrastructure to maximize AI token generation... ...growth, mission‑driven environment. Learn from an experienced team that has built... ...startups before. Role Description As a Reinforcement Learning Engineer, you will be the architect of the...
Shift work
Hammerhead AI
Redwood City, CA
1 day ago
Reinforcement Learning Environment Engineer
...Reinforcement Learning Environment Engineer RL Environments; MLE; LLM Tasks; Difficulty Distribution; Remote Contractor; PST Overlap (≥4h); Advanced... ...previous experience on Anthropic’s data team building data infrastructure, tokenizers, and datasets behind the Claude model....
Full time
For contractors
Remote work
Relocation
Open Data Science
San Francisco, CA
11 days ago
Machine Learning Infrastructure Engineer, GenAI Technology
$180k - $300k
...constantly evolving our firm’s IT infrastructure and engineering capabilities, positioning... ...and applications. As a member of Point72’s Technology... ...generative AI and machine learning workloads, enabling faster... ...Strong understanding of reinforcement learning concepts and their...
Work experience placement
Point72 Asset Management, L.P
New York, NY
9 hours ago
Software Engineer, Machine Learning Infrastructure
...Whatnot updates on our news and engineering blogs and join us as we enable... ...You’ll design and scale the core infrastructure that powers machine learning and self-hosted large language model... ...problem-solving, and connection. Team members in this role must live within...
Work experience placement
Work at office
Local area
Remote work
Work from home
Home office
Flexible hours
Whatnot
San Francisco, CA
1 day ago
Member of Technical Staff - RL Infrastructure
$180k
...motivated, and focused on engineering excellence. This organization... ...to build the software infrastructure that enables our models to... ...directly support advances in reinforcement learning, agentic AI, and model... ...interview”) during which a member of our team will ask some...
Local area
Relocation
Pantera Capital
Palo Alto, CA
1 day ago
Machine Learning Engineer, ML Systems and Infrastructure
...ML Engineer, ML Systems and Infrastructure The work we do at Autodesk touches nearly every person on the planet... ...behind large-scale machine learning systems. In this role, you will partner... ...is fully remote-friendly, with team members distributed across the US and Canada...
Remote work
Autodesk
United States
3 days ago
Member of Technical Staff - Infrastructure & Engineering (Austin, TX)
...Member of Technical Staff – Infrastructure & Engineering Location: Austin, TX Wind River is a global leader in delivering software for mission-critical... ...Employee assistance program for mental well-being Learning benefits, including a LinkedIn Learning subscription...
Permanent employment
Temporary work
Local area
Visa sponsorship
Flexible hours
Aptiv
Austin, TX
3 days ago
Anthropic Fellows Program — Reinforcement Learning
...growing group of committed researchers, engineers, policy experts, and business leaders... ...Fellows will primarily use external infrastructure (e.g. open‑source models, public APIs... ...ML Systems & Performance Fellows Reinforcement Learning Fellows Economics & Societal Impacts...
Full time
Remote work
Anthropic
San Francisco, CA
5 days ago
Reinforcement Learning Engineer - Locomanipulation
$200k - $350k
...industrial pilots - and we’re growing the team to take it even further. About the Role We are looking for a Senior or Staff Reinforcement Learning Engineer to develop learning-based control policies for humanoid robots. You will design and train reinforcement learning...
Work at office
Thehumanoid
Boston, MA
8 hours ago
Senior Engineering Manager, Reinforcement Learning Environments (RLE)
$230k - $310k
...educational institutions Work together with engineers, scientists, operators, and more from... ...AI Human data is the core infrastructure to AI advancement. Frontier AI labs... ...Senior Engineering Manager to lead our Reinforcement Learning Environments (RLE) team - the group...
Full time
Work at office
Remote work
Flexible hours
Handshake
San Francisco, CA
15 days ago
Reinforcement Learning Engineer
...Codertal is hiring a Reinforcement Learning Engineer for a remote opportunity within the European Union on a B2B contract . If you’re passionate about designing intelligent agents that learn through interaction, optimize rewards, and shape autonomous decision‑making...
Daily paid
Contract work
Remote work
Flexible hours
Codertal
Union, NJ
1 day ago
Reinforcement Learning Fellow - 4-Month Research Mentorship
...growing group of committed researchers, engineers, policy experts, and business leaders... ...Fellows will primarily use external infrastructure (e.g. open-source models, public APIs... ...ML Systems & Performance Fellows Reinforcement Learning Fellows Economics & Societal Impacts...
Full time
Part time
Remote work
Visa sponsorship
Nerdleveltech
San Francisco, CA
3 days ago
Remote Reinforcement Learning Engineer - EU
A technology firm is seeking a Reinforcement Learning Engineer for a remote opportunity. This role involves developing and optimizing RL algorithms and collaborating with teams to implement solutions. Candidates should have strong skills in Python and notable RL frameworks...
Daily paid
Remote work
Flexible hours
Codertal
Union, NJ
1 day ago
Reinforcement Learning Engineer
...workflows across large-scale AI model training and evaluation. Research Drive innovation by developing evaluation frameworks and reinforcement learning solutions, including recent advancements in Reinforcement Learning with Human Feedback (RLHF). Engage with frontier...
Relocation package
Flexible hours
Code Metal
San Francisco, CA
8 hours ago
Reinforcement Learning Engineer, Self-Driving
$176k - $420k
...Overview Tesla is looking for strong Machine Learning Engineers to help build foundation models for robotics to drive the future of autonomy... ...from generative modeling, imitation learning, and reinforcement learning to improve the planning and reasoning capabilities...
Hourly pay
Temporary work
Flexible hours
Tesla
Palo Alto, CA
3 days ago
Civil Engineer - Infrastructure
...committed to building the infrastructure that supports growing... ...Florida. As a Civil Engineer in our Product... ...with Us? Growth and Learning: We provide an environment... ...: Work as a team member within the Product... ...plus. Experience in reinforced concrete design and...
For contractors
Work at office
Rinker Materials
Deland, FL
6 days ago
Senior Reinforcement Learning Engineer - Remote
...Bright Vision Technologies is seeking a remote Reinforcement Learning Engineer to design and implement RL solutions for decision-making problems. The ideal candidate should have a Master’s or PhD in Computer Science or a related field, with over 6 years of relevant experience...
Full time
Remote work
Bright Vision Technologies
Edison, NJ
1 day ago
Senior Reinforcement Learning Engineer (Remote)
$100k - $150k
...Bright Vision Technologies is seeking a Reinforcement Learning Engineer to design and deploy RL-based systems for complex decision-making tasks. The successful candidate will manage the training of RL algorithms and engage in simulation environments, contributing to vital...
Full time
Immediate start
Remote work
Bright Vision Technologies
Suwanee, GA
2 days ago
Senior Reinforcement Learning Engineer - Remote
...Bright Vision Technologies is seeking a Reinforcement Learning Engineer to design, train, and deploy RL-based systems for critical decision-making problems. This full-time position is remote and requires a Master’s or PhD in a relevant field along with significant RL experience...
Full time
Remote work
Bright Vision Technologies
Alpharetta, GA
8 hours ago
Remote Reinforcement Learning Engineer
$100k - $150k
...Bright Vision Technologies is seeking a skilled Reinforcement Learning Engineer to design and deploy RL systems that tackle decision-making challenges. This role offers the chance to be part of a dynamic team and contribute to transformative business solutions. The ideal...
Remote work
Bright Vision Technologies
Johns Creek, GA
10 hours ago
Remote Reinforcement Learning Engineer: Scale & Deploy RL
...Bright Vision Technologies is hiring a Reinforcement Learning Engineer to design and deploy state-of-the-art RL-based systems crucial for decision-making in complex environments. This fully remote role requires strong expertise in modern RL algorithms and engineering skills...
Remote work
Bright Vision Technologies
Rockville, MD
3 days ago
Reinforcement Learning Engineer, Policy, Optimus
$176k - $420k
...embodied intelligence through humanoid robots. The goal of our reinforcement learning team is to build and demonstrate a general robot learning... ..., and more. Our reinforcement and imitation learning engineers are responsible for end-to-end robotic learning and own this...
Hourly pay
Temporary work
Flexible hours
Tesla
Palo Alto, CA
4 days ago
Reinforcement Learning Engineer ($400k - $800k salary)
$400k
...systems are low latency, high throughput, live under constant load, and break if you get them wrong. What You’ll Do As our Reinforcement Learning Engineer, you will own a production trading system that directly deploys real capital. This is not a research role – it’s about...
Immediate start
Baton, Inc.
New York, NY
9 hours ago
Remote Reinforcement Learning Engineer Scale High-Impact AI
...Bright Vision Technologies is seeking a Reinforcement Learning Engineer to design, train, and deploy RL-based systems for high-impact decision-making problems. The ideal candidate will have a Master's or PhD in a related field and strong proficiency in Python. This position...
Full time
Remote work
Bright Vision Technologies
Naperville, IL
8 hours ago
Staff Machine Learning Engineer - ML Training Infrastructure
$185k - $335.3k
...driven expert in ML Training Infrastructure with a demonstrated ability... ...at scale. As a Staff ML Engineer, you will operate as a technical... ...closely with machine learning engineers, research scientists... ...equitable for all by becoming a member of GM's Talent Community (...
Local area
Remote work
Work from home
Relocation
Relocation package
Flexible hours
General Motors
Madison, WI
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Engineering (Reinforcement Learning Infrastructure). Be the first to apply!