Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Member of Engineering (Reinforcement Learning Infrastructure)

Poolside

ABOUT POOLSIDE

In this decade, the world will create Artificial General Intelligence. There will only be a small number of companies who will achieve this. Their ability to stack advantages and pull ahead will define the winners. These companies will move faster than anyone else. They will attract the world's most capable talent. They will be on the forefront of applied research, engineering, infrastructure and deployment at scale. They will continue to scale their training to larger & more capable models. They will be given the right to raise large amounts of capital along their journey to enable this. They will create powerful economic engines. They will obsess over the success of their users and customers.

Poolside exists to be this company : to build a world where AI will be the engine behind economically valuable work and scientific progress. We believe the fastest way to reach AGI lies in accelerating software development itself, by reshaping the developer experience with agentic systems, coding assistants, and the frontier models that power them. We deploy these systems directly into the development environments of security-conscious enterprises.

ABOUT OUR TEAM

We were founded in the US and have our home there, but our team is distributed across Europe and North America. We get our fix of in-person collaboration (and croissants) in Paris each month for 3 days, always Monday-Wednesday, with an open invitation to stay the whole week. We also do longer off-sites once a year.

Our team is a multidisciplinary blend of research, engineering, and business experts. What unites us is our deep care for what we build together. We're in a race that requires hard work, intellectual curiosity, and obsession; to balance this intensity, we've assembled a team of low ego and kind-hearted individuals who have built the special culture Poolside has. By building collaboratively and with intention, we create a compounding effect that moves the entire company forward towards our mission: reaching AGI through intelligence systems built for software development.

ABOUT THE ROLE

You would be working on our reinforcement learning team focused on improving reasoning and coding abilities of Large Language Models through reinforcement learning. This is a hands-on role where you'll work end-to-end from researching new exploration or training algorithms, to designing and scaling up RL environments, to implementing your ideas across the stack. You will have access to thousands of GPUs in this team.

YOUR MISSION

Build and scale the infrastructure that enables reliable, efficient training of Large Language Models with Reinforcement Learning at the frontier.

RESPONSIBILITIES
  • Keep up with the latest research, and be familiar with the state of the art in LLMs, RL, and code generation
  • Develop methods for tuning training and inference end-to-end for high throughput
  • Design data control systems in an RL pipeline that govern what the model sees and when
  • Debug cases where infrastructure decisions are silently degrading learning dynamics
  • Build observability tooling that surfaces when a system-level issue is the root cause of a training regression
  • Help build robust, flexible and scalable RL pipelines
  • Optimize performance across the stack - networking, memory, compute scheduling, and I/O
  • Write high-quality, pragmatic code
  • Work in the team: plan future steps, discuss, and always stay in touch
SKILLS & EXPERIENCE
  • Experience with LLMs and model post-training workflows
  • Understanding how Reinforcement Learning works and what its main bottlenecks are
  • Solid software engineering fundamentals (testing, code review, debugging complex systems)
  • Proficiency in Python with knowledge of concurrency, asynchronous programming, multiprocessing and performance optimization
  • Familiarity with deep learning frameworks (PyTorch or JAX) and RL workflows (rollouts, replay buffers, policy updates)
  • Experience designing and maintaining distributed RL training systems
  • Experience with large-scale LLM training infrastructure
  • Experience with profiling tools across the stack (e.g. py-spy)
  • Experience with inference stacks (e.g. vLLM)
  • Nice to have: Open-source contributions to RL or distributed ML projects
PROCESS
  • Intro call with one of our Founding Engineers
  • Technical Interview(s) with one of our Founding Engineers
  • Team fit call with the People team
  • Final interview with one of our Founding Engineers
BENEFITS
  • Fully remote work & flexible hours
  • 37 days/year of vacation & holidays
  • Health insurance allowance for you & dependents
  • 16 weeks of flexible, full-pay parental leave
  • Well-being, always-be-learning & home office allowances
  • Company-provided equipment
  • Frequent team get togethers
  • Diverse & inclusive people-first culture
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Member of Engineering (Reinforcement Learning Infrastructure) in United States vacancy
  •  ...Reinforcement Learning Expert Dexmate is building the foundation for physical AI — a unified...  ...environments Collaborate with robotics engineers to integrate RL models into...  ...algorithm performance Scale training infrastructure for efficient learning across multiple... 
    Suggested

    Dexmate

    Santa Clara, CA
    18 days ago
  • $188k - $275k

     ...CoreWeave's industry-leading cloud infrastructure with the best-in-class tools...  ...tools to help agents learn from experience . This is a...  ..., we're looking for strong engineers with great taste. The most important...  ..., with a strong focus on reinforcement learning or PhD + 2 years... 
    Suggested
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Bellevue, WA
    5 days ago
  •  ...Reinforcement Learning (RL) Engineer Location: New York(Office) On-site| Full-time Compensation: Competitive Our client is an elite development...  ...-growth software company responsible for building the infrastructure behind the world’s largest crypto social networks and... 
    Suggested
    Full time
    Work at office
    Immediate start

    MLabs

    New York, NY
    9 hours ago
  • $100k - $150k

     ...Reinforcement Learning Engineer Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative...  ...models when relevant. Build scalable training infrastructure for distributed RL, including efficient experience collection... 
    Suggested
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship

    Bright Vision Technologies

    United States
    3 days ago
  • $30 - $45 per hour

     ...Reinforcement Learning Intern Ghost Robotics is the industry leader in legged robotic systems...  .... Key Duties: Develop infrastructure - scripts for training and evaluating...  ...Holds Bachelor's degree in Computer Engineering, Software Engineering, or a related... 
    Suggested
    Hourly pay
    Part time
    Internship
    Remote work
    Flexible hours

    Ghost Robotics

    Philadelphia, PA
    5 days ago
  •  ...platform optimizes data center power infrastructure to maximize AI token generation...  ...growth, mission‑driven environment. Learn from an experienced team that has built...  ...startups before. Role Description As a Reinforcement Learning Engineer, you will be the architect of the... 
    Shift work

    Hammerhead AI

    Redwood City, CA
    1 day ago
  •  ...Reinforcement Learning Environment Engineer RL Environments; MLE; LLM Tasks; Difficulty Distribution; Remote Contractor; PST Overlap (≥4h); Advanced...  ...previous experience on Anthropic’s data team building data infrastructure, tokenizers, and datasets behind the Claude model.... 
    Full time
    For contractors
    Remote work
    Relocation

    Open Data Science

    San Francisco, CA
    11 days ago
  • $180k - $300k

     ...constantly evolving our firm’s IT infrastructure and engineering capabilities, positioning...  ...and applications. As a member of Point72’s Technology...  ...generative AI and machine learning workloads, enabling faster...  ...Strong understanding of reinforcement learning concepts and their... 
    Work experience placement

    Point72 Asset Management, L.P

    New York, NY
    9 hours ago
  •  ...Whatnot updates on our news and engineering blogs and join us as we enable...  ...You’ll design and scale the core infrastructure that powers machine learning and self-hosted large language model...  ...problem-solving, and connection. Team members in this role must live within... 
    Work experience placement
    Work at office
    Local area
    Remote work
    Work from home
    Home office
    Flexible hours

    Whatnot

    San Francisco, CA
    1 day ago
  • $180k

     ...motivated, and focused on engineering excellence. This organization...  ...to build the software infrastructure that enables our models to...  ...directly support advances in reinforcement learning, agentic AI, and model...  ...interview”) during which a member of our team will ask some... 
    Local area
    Relocation

    Pantera Capital

    Palo Alto, CA
    1 day ago
  •  ...ML Engineer, ML Systems and Infrastructure The work we do at Autodesk touches nearly every person on the planet...  ...behind large-scale machine learning systems. In this role, you will partner...  ...is fully remote-friendly, with team members distributed across the US and Canada... 
    Remote work

    Autodesk

    United States
    3 days ago
  •  ...Member of Technical Staff – Infrastructure & Engineering Location: Austin, TX Wind River is a global leader in delivering software for mission-critical...  ...Employee assistance program for mental well-being Learning benefits, including a LinkedIn Learning subscription... 
    Permanent employment
    Temporary work
    Local area
    Visa sponsorship
    Flexible hours

    Aptiv

    Austin, TX
    3 days ago
  •  ...growing group of committed researchers, engineers, policy experts, and business leaders...  ...Fellows will primarily use external infrastructure (e.g. open‑source models, public APIs...  ...ML Systems & Performance Fellows Reinforcement Learning Fellows Economics & Societal Impacts... 
    Full time
    Remote work

    Anthropic

    San Francisco, CA
    5 days ago
  • $200k - $350k

     ...industrial pilots - and we’re growing the team to take it even further. About the Role We are looking for a Senior or Staff Reinforcement Learning Engineer to develop learning-based control policies for humanoid robots. You will design and train reinforcement learning... 
    Work at office

    Thehumanoid

    Boston, MA
    8 hours ago
  • $230k - $310k

     ...educational institutions Work together with engineers, scientists, operators, and more from...  ...AI Human data is the core infrastructure to AI advancement. Frontier AI labs...  ...Senior Engineering Manager to lead our Reinforcement Learning Environments (RLE) team - the group... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Handshake

    San Francisco, CA
    15 days ago
  •  ...Codertal is hiring a Reinforcement Learning Engineer for a remote opportunity within the European Union on a B2B contract . If you’re passionate about designing intelligent agents that learn through interaction, optimize rewards, and shape autonomous decision‑making... 
    Daily paid
    Contract work
    Remote work
    Flexible hours

    Codertal

    Union, NJ
    1 day ago
  •  ...growing group of committed researchers, engineers, policy experts, and business leaders...  ...Fellows will primarily use external infrastructure (e.g. open-source models, public APIs...  ...ML Systems & Performance Fellows Reinforcement Learning Fellows Economics & Societal Impacts... 
    Full time
    Part time
    Remote work
    Visa sponsorship

    Nerdleveltech

    San Francisco, CA
    3 days ago
  • A technology firm is seeking a Reinforcement Learning Engineer for a remote opportunity. This role involves developing and optimizing RL algorithms and collaborating with teams to implement solutions. Candidates should have strong skills in Python and notable RL frameworks... 
    Daily paid
    Remote work
    Flexible hours

    Codertal

    Union, NJ
    1 day ago
  •  ...workflows across large-scale AI model training and evaluation. Research Drive innovation by developing evaluation frameworks and reinforcement learning solutions, including recent advancements in Reinforcement Learning with Human Feedback (RLHF). Engage with frontier... 
    Relocation package
    Flexible hours

    Code Metal

    San Francisco, CA
    8 hours ago
  • $176k - $420k

     ...Overview Tesla is looking for strong Machine Learning Engineers to help build foundation models for robotics to drive the future of autonomy...  ...from generative modeling, imitation learning, and reinforcement learning to improve the planning and reasoning capabilities... 
    Hourly pay
    Temporary work
    Flexible hours

    Tesla

    Palo Alto, CA
    3 days ago
  •  ...committed to building the infrastructure that supports growing...  ...Florida. As a Civil Engineer in our Product...  ...with Us? Growth and Learning: We provide an environment...  ...: Work as a team member within the Product...  ...plus. Experience in reinforced concrete design and... 
    For contractors
    Work at office

    Rinker Materials

    Deland, FL
    6 days ago
  •  ...Bright Vision Technologies is seeking a remote Reinforcement Learning Engineer to design and implement RL solutions for decision-making problems. The ideal candidate should have a Master’s or PhD in Computer Science or a related field, with over 6 years of relevant experience... 
    Full time
    Remote work

    Bright Vision Technologies

    Edison, NJ
    1 day ago
  • $100k - $150k

     ...Bright Vision Technologies is seeking a Reinforcement Learning Engineer to design and deploy RL-based systems for complex decision-making tasks. The successful candidate will manage the training of RL algorithms and engage in simulation environments, contributing to vital... 
    Full time
    Immediate start
    Remote work

    Bright Vision Technologies

    Suwanee, GA
    2 days ago
  •  ...Bright Vision Technologies is seeking a Reinforcement Learning Engineer to design, train, and deploy RL-based systems for critical decision-making problems. This full-time position is remote and requires a Master’s or PhD in a relevant field along with significant RL experience... 
    Full time
    Remote work

    Bright Vision Technologies

    Alpharetta, GA
    8 hours ago
  • $100k - $150k

     ...Bright Vision Technologies is seeking a skilled Reinforcement Learning Engineer to design and deploy RL systems that tackle decision-making challenges. This role offers the chance to be part of a dynamic team and contribute to transformative business solutions. The ideal... 
    Remote work

    Bright Vision Technologies

    Johns Creek, GA
    10 hours ago
  •  ...Bright Vision Technologies is hiring a Reinforcement Learning Engineer to design and deploy state-of-the-art RL-based systems crucial for decision-making in complex environments. This fully remote role requires strong expertise in modern RL algorithms and engineering skills... 
    Remote work

    Bright Vision Technologies

    Rockville, MD
    3 days ago
  • $176k - $420k

     ...embodied intelligence through humanoid robots. The goal of our reinforcement learning team is to build and demonstrate a general robot learning...  ..., and more. Our reinforcement and imitation learning engineers are responsible for end-to-end robotic learning and own this... 
    Hourly pay
    Temporary work
    Flexible hours

    Tesla

    Palo Alto, CA
    4 days ago
  • $400k

     ...systems are low latency, high throughput, live under constant load, and break if you get them wrong. What You’ll Do As our Reinforcement Learning Engineer, you will own a production trading system that directly deploys real capital. This is not a research role – it’s about... 
    Immediate start

    Baton, Inc.

    New York, NY
    9 hours ago
  •  ...Bright Vision Technologies is seeking a Reinforcement Learning Engineer to design, train, and deploy RL-based systems for high-impact decision-making problems. The ideal candidate will have a Master's or PhD in a related field and strong proficiency in Python. This position... 
    Full time
    Remote work

    Bright Vision Technologies

    Naperville, IL
    8 hours ago
  • $185k - $335.3k

     ...driven expert in ML Training Infrastructure with a demonstrated ability...  ...at scale. As a Staff ML Engineer, you will operate as a technical...  ...closely with machine learning engineers, research scientists...  ...equitable for all by becoming a member of GM's Talent Community (... 
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Madison, WI
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Engineering (Reinforcement Learning Infrastructure). Be the first to apply!