Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Remote RL Environments Engineer for LLM Training

HeadHR

HeadHR is looking for candidates to design and build reinforcement learning environments for training large language models. You'll work with top AI labs to create environments where models face realistic challenges, implementing production-quality Python code in Docker environments. This fully remote position requires strong Python skills, an understanding of LLMs, and advanced English. The role involves impactful work in the AI sector, offering flexibility and collaboration with leading researchers. #J-18808-Ljbffr HeadHR

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Remote RL Environments Engineer for LLM Training in San Francisco, CA vacancy
  •  ...A company specializing in AI training data is seeking a Reinforcement Learning Environment Engineer to design and build MLE/SWE environments. This remote contractor position requires strong Python skills, hands-on LLM experience, and the ability to operate independently... 
    Remote work
    Training
    Full time
    For contractors

    Open Data Science

    San Francisco, CA
    2 days ago
  •  ...Reinforcement Learning Environment Engineer RL Environments; MLE; LLM Tasks; Difficulty Distribution; Remote Contractor; PST Overlap (≥4h); Advanced English (C1/C2); We’re...  ...Preference Model is building the next generation of training data to power the future of AI. Today's... 
    Remote work
    Training
    Full time
    For contractors
    Relocation

    Open Data Science

    San Francisco, CA
    2 days ago
  • $210k - $250k

     ...companies. Our team is 100% remote and we work with...  ...help them hire. RL Deep Learning Engineer Location: New York...  ...team building RL environments, evaluation harnesses...  ...legal AI models are trained and evaluated. The...  ...systems Experience with LLM evaluations or AI... 
    Remote work
    Training
    Work at office

    Recruiting from Scratch

    New York, NY
    21 hours ago
  • $180k - $270k

     ...data extraction company is seeking a Research Engineer focused on reinforcement learning in San Francisco or Remote. In this full-time role, you will build training infrastructures, fine-tune models, and bridge classical RL and modern agent systems. Ideal candidates have... 
    Remote work
    Training
    Full time

    Firecrawl

    San Francisco, CA
    2 days ago
  • $238k - $302k

     ...Staff Machine Learning Engineer – VLM/LLM Evaluation Waymo is an autonomous...  ...) to reinforcement learning (RL), for evaluating the quality,...  ...ML infra experience: training, evaluating and deploying ML...  ...if the role can be performed remote, the specific salary range for... 
    Remote work
    Training
    Full time

    Waymo

    San Francisco, CA
    2 days ago
  • $30 - $50 per hour

     ...A tech company specializing in AI research is looking for a STEM Research Engineer to enhance applied AI/ML workflows including LLM training and dataset development. This remote, full-time position requires mid-senior experience in STEM research and strong Python skills... 
    Remote work
    Training
    Hourly pay
    Full time

    Rex USA

    New York, NY
    21 hours ago
  • $90 - $130 per hour

     ...ML Engineer, LLM Safety & Security Sustainable Talent is partnering with Nvidia,...  ...based out of Santa Clara, CA with remote/hybrid work options. This is a full...  ...-to-txt. Develop datasets for training LLM models with SFT and RL techniques, for Content Safety, ML... 
    Remote work
    Training
    Full time
    Contract work
    Work experience placement

    Sustainable Talent

    Santa Clara, CA
    21 hours ago
  • $150k - $300k

     ...platform, Lab, unifies environments, evaluations,...  ...high-performance training into a single full...  ...frontier scale, from RL and SFT to tool...  ...machine learning engineering, with experience in...  ...Experience running LLM evaluations and/or...  ...Francisco or hybrid-remote) Visa... 
    Remote work
    Training
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime Intellect

    San Francisco, CA
    3 days ago
  •  ...Scaled Cognition is seeking an AI QA Engineer (Multilingual) to work remotely from New York, NY. This role involves ensuring the quality of LLM training data, maintaining testing environments, and analyzing errors. Candidates should possess strong coding skills, fluency... 
    Remote work
    Training

    ChatGPT Jobs

    New York, NY
    12 days ago
  • $30 - $70 per hour

     ...in designing robotics simulation environments and reinforcement learning algorithms. You will work fully remote and engage with various teams to ensure robust training for AI agents. Applicants should...  ...similar tools, a solid grasp of RL theory, and strong communication... 
    Remote job
    Training
    Hourly pay

    G2i Inc.

    Miami, FL
    3 days ago
  • $233k - $350k

     ...is seeking a Principal Machine Learning Engineer to lead technical efforts in optimizing...  ...advanced reinforcement learning. This fully remote role requires extensive experience in...  ...candidate will possess deep expertise in Post-training RL and demonstrate proficiency with... 
    Remote job
    Training

    Paramount Pictures

    New York, NY
    1 day ago
  • $350k

    Research Engineer, RL Infrastructure and Reliability (Knowledge Work) Anthropic’s mission...  ...the role The Knowledge Work team builds training environments and evaluations that make Claude...  ...RL environments, agent harnesses, or LLM evaluation frameworks. Familiarity with... 
    Training
    Visa sponsorship
    Shift work

    aijoblist

    San Francisco, CA
    2 days ago
  •  ...A leading tech firm is seeking an AI Software Engineer with strong LLM experience to work on advanced AI model training. This role involves collaborating with cross-functional teams, curating code examples, evaluating AI-generated code, and ensuring production-grade software... 
    Remote work
    Training

    SWITS DIGITAL Private Limited

    New York, NY
    21 hours ago
  • $80 - $150 per hour

    Mercor is seeking an experienced AI Trainer to support remote engineering and manufacturing operations in Oakley, California....  ...work on engineering tasks and be confident in designing LLM prompts suitable for training projects. The ideal candidate will have relevant experience... 
    Remote job
    Training
    Hourly pay

    Mercor

    Oakley, CA
    2 days ago
  • Bright Vision Technologies is seeking an LLM Fine-Tuning Engineer to design and operationalize workflows for...  ...of experience, especially with modern training stacks and dataset construction. As part of a collaborative environment, you will engage with product, engineering... 
    Remote job
    Training
    H1b

    Bright Vision Technologies

    Frisco, TX
    21 hours ago
  • $150k - $300k

     ...Solutions, Inc. is seeking an Applied Reinforcement Learning Engineer to design and build RL environments simulating enterprise workflows. The ideal candidate will have deep RL expertise and experience in LLM post-training, with a strong Python background. Responsibilities... 
    Training

    Centific Global Solutions, Inc.

    Redmond, WA
    3 days ago
  • Bright Vision Technologies is seeking an LLM Fine-Tuning Engineer to design and operationalize fine-...  ...for large language models. This is a remote, full-time role requiring over 6 years...  ...PyTorch. Responsibilities include building training pipelines, conducting experiments, and... 
    Remote job
    Training
    Full time

    Bright Vision Technologies

    Bellevue, WA
    4 days ago
  •  ...AI Engineer - NLP/LLM Data Specialist We are growing our Data Science and Data Engineering team and are looking for an experienced AI Engineer...  ...technical support during the implementation phase. Training and Documentation: Develop training materials for end-... 
    Remote work
    Training

    Saviance

    United States
    4 days ago
  •  ...Technologies is seeking an LLM Fine-Tuning Engineer to design and operationalize...  ...language models. This remote, full-time position requires...  ...and is expected to manage training pipelines, hyperparameter tuning...  ...Vision offers an inclusive environment with a commitment to equal... 
    Remote job
    Training
    Full time

    Bright Vision Technologies

    Edison, NJ
    21 hours ago
  • AI Research Engineer - Reinforcement Learning Full-time Who we are Helsing...  ...The day-to-day You'll build, train and integrate reinforcement...  .... Experience with large-scale RL training. Experience modeling...  ...family emergency leave, 100% remote work option during pregnancy and... 
    Remote work
    Training
    Full time
    Temporary work
    Work at office
    Relocation package

    helsing.ai

    New Bremen, OH
    1 day ago
  •  ...balance by offering flexible/remote work opportunities, part-time...  ...an  Environmental Compliance Engineer to join our Environmental Compliance...  ...in a fast-paced consulting environment and enjoys balancing...  ...consulting experience  ~ Engineer-in-Training (E.I.T.) certification... 
    Remote work
    Training
    Contract work
    Part time
    Work at office
    Local area
    Flexible hours

    Fuss & O'Neill

    Hartford, CT
    12 days ago
  • $213k - $263k

     ...Senior ML Engineer, LLM / VLM Distillation Waymo is an autonomous driving technology company...  .... We prefer: Experience in training or deploying multi-modal LLMs....  ...location or, if the role can be performed remote, the specific salary range for your preferred... 
    Remote work
    Training
    Full time

    Waymo

    Mountain View, CA
    1 day ago
  •  ...currently searching for 2 ML/AI Engineers with a solid software...  ...a way which would boost LLM accuracy in inference & training - this is the job for you...  ...stimulating work environment. Be a technology innovator...  ...salary bands. Location : Remote work. Possibility to work... 
    Remote work
    Training
    Permanent employment
    Full time
    Contract work
    Immediate start

    Pathway

    Palo Alto, CA
    4 days ago
  • $126k - $423k

     ...This may include occasional remote work, starting the day with...  ...looking for a passionate Research Engineer (AI/RL Infrastructure) to join the...  ...WILL: * Design and build training and evaluation...  ...across heterogeneous cloud environments, focusing on reliability, efficiency... 
    Remote work
    Training
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Immediate start
    Day shift

    Applied Intuition

    Sunnyvale, CA
    2 days ago
  • $130k - $150k

     ...seeking to join a growth-focused consulting & engineering firm? Are you looking for a company that...  ...tuition assistance, and more. ~ Cross-training and the ability to work on a variety of...  ...the US. The salary range for this remote U.S.-based position is $130,000–$150,000... 
    Remote work
    Training
    Temporary work
    Work at office

    Apex Companies

    Decatur, AL
    8 days ago
  • $204k - $259k

     ...lifecycle, including pre-training and post-training....  ...We are looking for engineers with ML system expertise...  ...reinforcement learning (RL), building systems that...  ...compute, data, and environments to improve model intelligence...  ...role can be performed remote, the specific salary... 
    Remote work
    Training
    Full time

    Waymo

    Mountain View, CA
    21 hours ago
  •  ...dynamic individual with an engineering background, the ability to work...  ...fieldwork, potentially in remote locations Level - II...  ...working experience, specialized training, and/or an advanced degree,...  ...of employment. Working Environment: ~ May be indoors or... 
    Remote work
    Training
    Work experience placement
    For subcontractor
    Work at office
    Monday to Friday
    Flexible hours

    Ahtna Solutions LLC

    Golden, CO
    20 days ago
  • $35 - $40 per hour

     ...Lead to take the helm of our LLM team (currently 3 engineers) and own the architecture, training, and deployment of the models...  ...illegal content within an explicit environment. Move beyond binary flags to...  .... Work From Anywhere: Fully remote. Choose your environment. Paid... 
    Remote work
    Training
    Contract work
    Flexible hours

    Ever AI

    United States
    21 hours ago
  • $77k - $135k

     ...for geotechnical, dam, and tunnel engineering projects worldwide. As a growing,...  ...successfully tackled projects in remote locations, challenging environments, and for communities large and small...  .... Perform geophysical field work; train and manage field personnel. Over time... 
    Remote work
    Training
    Worldwide

    GeoWorld

    Seattle, WA
    4 days ago
  • $57.2k - $108.8k

     ...than 100 years of experience and renowned engineering expertise to meet the needs of today's...  ...with exciting new challenges, on-the-job training, mentoring and purposeful rotations....  ...role is designated as on-site, hybrid or remote. The salary range for this role is 57... 
    Remote work
    Training
    Full time
    Temporary work
    Work experience placement
    Work at office
    Worldwide
    Relocation
    Flexible hours

    RTX

    Marlborough, MA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Remote RL Environments Engineer for LLM Training. Be the first to apply!