Member of Engineering (Reinforcement Learning Infrastructure)
Poolside
ABOUT POOLSIDE In this decade, the world will create Artificial General Intelligence. There will only be a small number of companies who will achieve this. Their ability to stack advantages and pull ahead will define the winners. These companies will move faster than anyone else. They will attract the world's most capable talent. They will be on the forefront of applied research, engineering, infrastructure and deployment at scale. They will continue to scale their training to larger & more capable models. They will be given the right to raise large amounts of capital along their journey to enable this. They will create powerful economic engines. They will obsess over the success of their users and customers. Poolside exists to be this company : to build a world where AI will be the engine behind economically valuable work and scientific progress. We believe the fastest way to reach AGI lies in accelerating software development itself, by reshaping the developer experience with agentic systems, coding assistants, and the frontier models that power them. We deploy these systems directly into the development environments of security-conscious enterprises. ABOUT OUR TEAM We were founded in the US and have our home there, but our team is distributed across Europe and North America. We get our fix of in-person collaboration (and croissants) in Paris each month for 3 days, always Monday-Wednesday, with an open invitation to stay the whole week. We also do longer off-sites once a year. Our team is a multidisciplinary blend of research, engineering, and business experts. What unites us is our deep care for what we build together. We're in a race that requires hard work, intellectual curiosity, and obsession; to balance this intensity, we've assembled a team of low ego and kind-hearted individuals who have built the special culture Poolside has. By building collaboratively and with intention, we create a compounding effect that moves the entire company forward towards our mission: reaching AGI through intelligence systems built for software development. ABOUT THE ROLE You would be working on our reinforcement learning team focused on improving reasoning and coding abilities of Large Language Models through reinforcement learning. This is a hands-on role where you'll work end-to-end from researching new exploration or training algorithms, to designing and scaling up RL environments, to implementing your ideas across the stack. You will have access to thousands of GPUs in this team. YOUR MISSION Build and scale the infrastructure that enables reliable, efficient training of Large Language Models with Reinforcement Learning at the frontier. RESPONSIBILITIES
- Keep up with the latest research, and be familiar with the state of the art in LLMs, RL, and code generation
- Develop methods for tuning training and inference end-to-end for high throughput
- Design data control systems in an RL pipeline that govern what the model sees and when
- Debug cases where infrastructure decisions are silently degrading learning dynamics
- Build observability tooling that surfaces when a system-level issue is the root cause of a training regression
- Help build robust, flexible and scalable RL pipelines
- Optimize performance across the stack - networking, memory, compute scheduling, and I/O
- Write high-quality, pragmatic code
- Work in the team: plan future steps, discuss, and always stay in touch
- Experience with LLMs and model post-training workflows
- Understanding how Reinforcement Learning works and what its main bottlenecks are
- Solid software engineering fundamentals (testing, code review, debugging complex systems)
- Proficiency in Python with knowledge of concurrency, asynchronous programming, multiprocessing and performance optimization
- Familiarity with deep learning frameworks (PyTorch or JAX) and RL workflows (rollouts, replay buffers, policy updates)
- Experience designing and maintaining distributed RL training systems
- Experience with large-scale LLM training infrastructure
- Experience with profiling tools across the stack (e.g. py-spy)
- Experience with inference stacks (e.g. vLLM)
- Nice to have: Open-source contributions to RL or distributed ML projects
- Intro call with one of our Founding Engineers
- Technical Interview(s) with one of our Founding Engineers
- Team fit call with the People team
- Final interview with one of our Founding Engineers
- Fully remote work & flexible hours
- 37 days/year of vacation & holidays
- Health insurance allowance for you & dependents
- 16 weeks of flexible, full-pay parental leave
- Well-being, always-be-learning & home office allowances
- Company-provided equipment
- Frequent team get togethers
- Diverse & inclusive people-first culture
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Member of Engineering (Reinforcement Learning Infrastructure) in United States vacancy
- ...Reinforcement Learning Expert Dexmate is building the foundation for physical AI — a unified... ...environments Collaborate with robotics engineers to integrate RL models into... ...algorithm performance Scale training infrastructure for efficient learning across multiple...Suggested
$188k - $275k
...CoreWeave's industry-leading cloud infrastructure with the best-in-class tools... ...tools to help agents learn from experience . This is a... ..., we're looking for strong engineers with great taste. The most important... ..., with a strong focus on reinforcement learning or PhD + 2 years...SuggestedPermanent employmentTemporary workCasual workWork at officeFlexible hours- ...Reinforcement Learning (RL) Engineer Location: New York(Office) On-site| Full-time Compensation: Competitive Our client is an elite development... ...-growth software company responsible for building the infrastructure behind the world’s largest crypto social networks and...SuggestedFull timeWork at officeImmediate start
$100k - $150k
...Reinforcement Learning Engineer Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative... ...models when relevant. Build scalable training infrastructure for distributed RL, including efficient experience collection...SuggestedFull timeH1bLocal areaImmediate startRemote workVisa sponsorship$30 - $45 per hour
...Reinforcement Learning Intern Ghost Robotics is the industry leader in legged robotic systems... .... Key Duties: Develop infrastructure - scripts for training and evaluating... ...Holds Bachelor's degree in Computer Engineering, Software Engineering, or a related...SuggestedHourly payPart timeInternshipRemote workFlexible hours- ...platform optimizes data center power infrastructure to maximize AI token generation... ...growth, mission‑driven environment. Learn from an experienced team that has built... ...startups before. Role Description As a Reinforcement Learning Engineer, you will be the architect of the...Shift work
- ...Reinforcement Learning Environment Engineer RL Environments; MLE; LLM Tasks; Difficulty Distribution; Remote Contractor; PST Overlap (≥4h); Advanced... ...previous experience on Anthropic’s data team building data infrastructure, tokenizers, and datasets behind the Claude model....Full timeFor contractorsRemote workRelocation
$180k - $300k
...constantly evolving our firm’s IT infrastructure and engineering capabilities, positioning... ...and applications. As a member of Point72’s Technology... ...generative AI and machine learning workloads, enabling faster... ...Strong understanding of reinforcement learning concepts and their...Work experience placement- ...Whatnot updates on our news and engineering blogs and join us as we enable... ...You’ll design and scale the core infrastructure that powers machine learning and self-hosted large language model... ...problem-solving, and connection. Team members in this role must live within...Work experience placementWork at officeLocal areaRemote workWork from homeHome officeFlexible hours
$180k
...motivated, and focused on engineering excellence. This organization... ...to build the software infrastructure that enables our models to... ...directly support advances in reinforcement learning, agentic AI, and model... ...interview”) during which a member of our team will ask some...Local areaRelocation- ...ML Engineer, ML Systems and Infrastructure The work we do at Autodesk touches nearly every person on the planet... ...behind large-scale machine learning systems. In this role, you will partner... ...is fully remote-friendly, with team members distributed across the US and Canada...Remote work
- ...Member of Technical Staff – Infrastructure & Engineering Location: Austin, TX Wind River is a global leader in delivering software for mission-critical... ...Employee assistance program for mental well-being Learning benefits, including a LinkedIn Learning subscription...Permanent employmentTemporary workLocal areaVisa sponsorshipFlexible hours
- ...growing group of committed researchers, engineers, policy experts, and business leaders... ...Fellows will primarily use external infrastructure (e.g. open‑source models, public APIs... ...ML Systems & Performance Fellows Reinforcement Learning Fellows Economics & Societal Impacts...Full timeRemote work
$200k - $350k
...industrial pilots - and we’re growing the team to take it even further. About the Role We are looking for a Senior or Staff Reinforcement Learning Engineer to develop learning-based control policies for humanoid robots. You will design and train reinforcement learning...Work at office$230k - $310k
...educational institutions Work together with engineers, scientists, operators, and more from... ...AI Human data is the core infrastructure to AI advancement. Frontier AI labs... ...Senior Engineering Manager to lead our Reinforcement Learning Environments (RLE) team - the group...Full timeWork at officeRemote workFlexible hours- ...Codertal is hiring a Reinforcement Learning Engineer for a remote opportunity within the European Union on a B2B contract . If you’re passionate about designing intelligent agents that learn through interaction, optimize rewards, and shape autonomous decision‑making...Daily paidContract workRemote workFlexible hours
- ...growing group of committed researchers, engineers, policy experts, and business leaders... ...Fellows will primarily use external infrastructure (e.g. open-source models, public APIs... ...ML Systems & Performance Fellows Reinforcement Learning Fellows Economics & Societal Impacts...Full timePart timeRemote workVisa sponsorship
- A technology firm is seeking a Reinforcement Learning Engineer for a remote opportunity. This role involves developing and optimizing RL algorithms and collaborating with teams to implement solutions. Candidates should have strong skills in Python and notable RL frameworks...Daily paidRemote workFlexible hours
- ...workflows across large-scale AI model training and evaluation. Research Drive innovation by developing evaluation frameworks and reinforcement learning solutions, including recent advancements in Reinforcement Learning with Human Feedback (RLHF). Engage with frontier...Relocation packageFlexible hours
$176k - $420k
...Overview Tesla is looking for strong Machine Learning Engineers to help build foundation models for robotics to drive the future of autonomy... ...from generative modeling, imitation learning, and reinforcement learning to improve the planning and reasoning capabilities...Hourly payTemporary workFlexible hours- ...committed to building the infrastructure that supports growing... ...Florida. As a Civil Engineer in our Product... ...with Us? Growth and Learning: We provide an environment... ...: Work as a team member within the Product... ...plus. Experience in reinforced concrete design and...For contractorsWork at office
- ...Bright Vision Technologies is seeking a remote Reinforcement Learning Engineer to design and implement RL solutions for decision-making problems. The ideal candidate should have a Master’s or PhD in Computer Science or a related field, with over 6 years of relevant experience...Full timeRemote work
$100k - $150k
...Bright Vision Technologies is seeking a Reinforcement Learning Engineer to design and deploy RL-based systems for complex decision-making tasks. The successful candidate will manage the training of RL algorithms and engage in simulation environments, contributing to vital...Full timeImmediate startRemote work- ...Bright Vision Technologies is seeking a Reinforcement Learning Engineer to design, train, and deploy RL-based systems for critical decision-making problems. This full-time position is remote and requires a Master’s or PhD in a relevant field along with significant RL experience...Full timeRemote work
$100k - $150k
...Bright Vision Technologies is seeking a skilled Reinforcement Learning Engineer to design and deploy RL systems that tackle decision-making challenges. This role offers the chance to be part of a dynamic team and contribute to transformative business solutions. The ideal...Remote work- ...Bright Vision Technologies is hiring a Reinforcement Learning Engineer to design and deploy state-of-the-art RL-based systems crucial for decision-making in complex environments. This fully remote role requires strong expertise in modern RL algorithms and engineering skills...Remote work
$176k - $420k
...embodied intelligence through humanoid robots. The goal of our reinforcement learning team is to build and demonstrate a general robot learning... ..., and more. Our reinforcement and imitation learning engineers are responsible for end-to-end robotic learning and own this...Hourly payTemporary workFlexible hours$400k
...systems are low latency, high throughput, live under constant load, and break if you get them wrong. What You’ll Do As our Reinforcement Learning Engineer, you will own a production trading system that directly deploys real capital. This is not a research role – it’s about...Immediate start- ...Bright Vision Technologies is seeking a Reinforcement Learning Engineer to design, train, and deploy RL-based systems for high-impact decision-making problems. The ideal candidate will have a Master's or PhD in a related field and strong proficiency in Python. This position...Full timeRemote work
$185k - $335.3k
...driven expert in ML Training Infrastructure with a demonstrated ability... ...at scale. As a Staff ML Engineer, you will operate as a technical... ...closely with machine learning engineers, research scientists... ...equitable for all by becoming a member of GM's Talent Community (...Local areaRemote workWork from homeRelocationRelocation packageFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Member of Engineering (Reinforcement Learning Infrastructure). Be the first to apply!
Related searches
- security infrastructure engineer United States
- principal infrastructure engineer United States
- associate infrastructure engineer United States
- lead infrastructure engineer United States
- remote infrastructure engineer United States
- infrastructure developer United States
- senior infrastructure engineer United States
- entry level infrastructure engineer United States
- infrastructure automation engineer United States
- infrastructure engineer United States


