Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Machine Learning Engineer, RL Environments - Internship

Internship

Preference Model

Location: San Francisco preferred, remote considered Compensation: Paid internship About Us Preference Model is building automated ML research engineering. Existing frontier models are brittle when applied to real-world ML tasks. The present bottleneck is the lack of high-quality RL training environments. Our first step is to build RL environments that reflect real-world complexity, with diverse tasks and robust reward functions. Our founding team has previous experience on Anthropic’s data team building data infrastructure, and datasets behind Claude. We are partnering with leading AI labs to push AI closer to achieving its transformative potential. About the Role We're looking for PhD or Master's students, and gifted undergrads to spend an internship with us working on building RL training environments for large language models. This role blends research and engineering. It will require you to both develop novel approaches and realize them in code. Your work will include designing and implementing RL environments, conducting experiments and evaluations, delivering your work into production training runs, and collaborating with other researchers and engineers. What you'll do Design and build RL environments that test LLM reasoning on ML, systems, and research problems Write clean, production-grade Python (not notebooks) Work with Docker, build reproducible environments, debug when things break Translate ML papers and concepts into concrete training tasks What We are Looking For (Qualifications): You're an undergrad or PhD student in CS, ML, math, physics, or a related field. You write real code, not just research prototypes. You read ML papers for fun in your free time. Must have Strong Python skills Familiarity with how LLMs work, what they're good at, and where they fall short Ability to work independently, take feedback, and iterate fast You may be a good fit if one of the following applies You understand transformer internals and have worked with training or inference code You've written CUDA kernels or worked with low-level GPU programming You have a research area you know deeply (publications, public code, or strong coursework) You read broadly across ML and can connect ideas from different subfields You've built interactive environments, simulations, or complex software systems What We Offer: Paid Internship with opportunity to return full time based on performance Ownership and autonomy in a fast moving startup environment Opportunity to work with top machine learning engineersCompetitive cash and equity compensation (>90th percentile) Lunch provided everyday onsite Weekly snack orders Note: We utilize AI note-taking during our interview sessions to ensure we capture all answers and details accurately. Candidates are allowed to use AI note-takers as well, however, no other AI tools are permitted during any live interviews.

Vacancy posted 13 hours ago
Similar jobs that could be interesting for youBased on the Machine Learning Engineer, RL Environments - Internship in United States vacancy
  •  ...seeking interns to assist in creating reinforcement learning environments. The role requires strong Python coding skills and offers opportunities for internships in winter, spring, and summer 2026. No prior experience in machine learning or AI is necessary. This is a great... 
    Internship
    Summer work

    Mechanize, Inc.

    San Francisco, CA
    23 hours ago
  •  ...AI training gigs to first internships to full-time careers and...  ...work is in reinforcement learning: realistic, end-to-end environments where models can be evaluated...  ...Today, building a single RL environment is a...  ...steps across operators and engineers, and depends on tribal knowledge... 
    Internship
    Full time
    Freelance
    Work at office
    Immediate start
    Remote work
    Flexible hours

    Handshake

    San Francisco, CA
    3 days ago
  • $220k - $336k

     ...Senior Machine Learning Engineer, RL / Locomotion Costa Mesa, California, United States Anduril Industries is a defense technology company...  ...field. Our systems operate in unstructured, contested environments where robustness and reliability are non-negotiable. We... 
    Suggested
    Full time
    Work experience placement
    Immediate start

    anduril

    Costa Mesa, CA
    4 days ago
  •  ...Machine Learning Systems Engineer, RL Engineering San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission...  ...system that regularly launches training jobs in a test environment so that we can quickly detect problems in the training... 
    Suggested
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    New York, NY
    3 days ago
  • $30 per hour

     ...us push both the research quality and production quality of our ML systems forward. We are hiring a Machine Learning Engineer Intern for a paid 12-week summer internship from May through July 2026. This is a remote role based in the United States and is expected to be 4... 
    Internship
    Summer internship
    Remote work

    Good At Numbers

    United States
    5 days ago
  • $35 - $70 per hour

     ...Feedinkoo is seeking a Machine Learning Contractor to engage in research and engineering tasks on LLMs, reinforcement learning, and infrastructure for advanced agent systems. This position is remote, with flexible engagement and potential monthly office visits to Mercor... 
    Internship
    Hourly pay
    For contractors
    Freelance
    Work at office
    Remote work
    Flexible hours

    Feedinkoo

    New York, NY
    2 days ago
  •  ...Sit at the intersection of software engineering and data science. Leverage big...  ...as needed. Prior experience with machine vision and machine learning in financial services is essential....  ...working in frameworks in last job or internship Preferred degree in Computer... 
    Internship

    Veryfi

    San Mateo, CA
    23 hours ago
  • $40 - $70 per hour

     ...Machine Learning Engineer Intern Pulse is tackling one of the most persistent challenges in data infrastructure: extracting accurate, structured...  ...by tier-1 investors and scaling quickly. About the Internship As a Machine Learning Engineer Intern, you'll work... 
    Internship
    Hourly pay
    Summer work
    Work at office

    PULSE

    San Francisco, CA
    3 days ago
  •  ...Machine Learning Engineer Title of Role: Machine Learning Engineer Location: San Francisco...  ...to thrive in fast-paced, ambiguous environments with a high degree of ownership and...  ...or classification models. Prior internships or 1-3 years of experience at top-tier... 
    Internship
    Work at office
    Visa sponsorship

    Recruiting from Scratch

    San Francisco, CA
    2 days ago
  •  ...The Hong Kong Study Skills Research Institute is excited to announce a remote internship for a Machine Learning Engineer Intern. In this role, you will develop and implement machine learning models to enhance business applications. Ideal candidates will be proficient... 
    Internship
    Remote work

    Hong Kong Study Skills Research Institute

    New York, NY
    1 day ago
  •  ...unstructured data. We are a team of engineers and scientists from Berkeley, CMU, Ecole...  ...practices. Experience or willingness to learn about scalability technologies like AWS/...  ...competitions. About the Internship Based in person in San Francisco.... 
    Internship
    Summer work

    Zeroentropy (yc W25)

    San Francisco, CA
    3 days ago
  • $90k - $115k

     ...Junior Machine Learning Engineer — Predictive Analytics Remote / New York / New Jersey Full-Time...  ...skills in a real-world healthcare environment. What You'll Work On Assisting...  ...skills Nice to Have Internship or project experience with ML/data science... 
    Internship
    Full time
    Remote work

    Birth Model

    United States
    4 days ago
  •  ...Machine Learning Engineering Intern Mariana Minerals is a software-first, vertically integrated minerals...  ...solutions in production-like environments. You will work closely with engineers...  ...clear deliverables by the end of the internship Build and experiment with models... 
    Internship

    Mariana Minerals

    San Francisco, CA
    3 days ago
  • $95k - $150k

     ...Junior Machine Learning Engineer Since 2010, SynergisticIT has helped more than 10,000 jobseekers get hired at hundreds of tech clients by...  ...companies demand project-based evidence of your skills—real internship or portfolio work utilizing modern ML stacks makes you... 
    Internship

    SynergisticIT

    Denver, CO
    2 days ago
  • Internship Dates: June 8 - August 28 or June 22 - September 11 Location: Austin, Texas...  ...About the role We are looking for a Machine Learning Engineer Intern to join our Austin team to...  ...solutions in a globally distributed environment. Leverage Cloudflare products and... 
    Internship
    Summer work
    Summer internship
    Work at office
    3 days per week

    CloudFlare

    Austin, TX
    4 days ago
  • $107.66k - $161.7k

     ...providing a unique platform to learn and connect with others. Poe...  ...the Team and Role: Our small engineering team works on challenging...  ...our company. At Poe, we use Machine Learning in various parts of...  ...engineering experience via an internship, work experience, or coding competition... 
    Internship
    Work experience placement
    Remote work

    Quora

    New York, NY
    4 days ago
  • $19 - $65 per hour

     ...planning models. Support Reinforcement Learning: Create the infrastructure necessary for planning models to undergo self-play RL fine-tuning within the bridged BEV feature...  ...foundation in deep learning, computer vision, and machine learning. Proficiency in Python and deep... 
    Internship

    PlusAI, Inc.

    Santa Clara, CA
    3 days ago
  •  ...Senior ML/RL Engineer, Behavior Planning At Bot Auto, we are revolutionizing the transportation...  ...of Multi-Agent Reinforcement Learning (MARL) and safety-critical system design...  ...large-scale, high-throughput training environments to enable rapid iteration on complex multi... 
    Shift work

    Bot Auto

    Houston, TX
    1 day ago
  • $100k - $150k

     ...Machine Learning Engineer I Seattle Scowtt is an early-stage startup transforming the way businesses...  ...a fast-paced, collaborative startup environment. What You'll Do Assist in...  ...through coursework, projects, or internships Curiosity, ownership mindset, and... 
    Internship
    Immediate start

    Scowtt

    Seattle, WA
    3 days ago
  • $45 - $60 per hour

     ...individuals to join us for an internship in 2026. Internships at...  ...of state-of-the-art applied machine learning projects. - Own key targeting...  ...Computer Science, Computer Engineering, or other relevant majors....  ...voices and to creating an environment that reflects the many communities... 
    Internship
    Hourly pay
    Full time
    Summer work
    Local area

    Tik Tok

    San Jose, CA
    23 hours ago
  • $168.1k - $227.4k

     ...Description The Product: Annapurna Labs Machine Learning accelerators are at the forefront of...  ...disciplines including silicon engineering, hardware design and verification, software...  ...Basic Qualifications ~5+ years of non-internship professional software development... 
    Internship
    Relocation
    Flexible hours

    Amazon

    Seattle, WA
    9 days ago
  • $168.1k - $227.4k

     ...right in. The Role We're seeking a talented Senior Machine Learning Engineer with expertise in agentic system, production ML systems, and...  ...impact BASIC QUALIFICATIONS - 5+ years of non-internship professional software development experience - 5+ years... 
    Internship
    Worldwide
    Flexible hours

    Amazon

    Seattle, WA
    23 hours ago
  • $45 - $55 per hour

     ...Opportunity Adobe is looking for a Machine Learning intern who will apply AI and machine learning...  ...with product management and engineering groups to develop new products and features...  ...Ability to participate in a full-time internship between May-September About Adobe... 
    Internship
    Hourly pay
    Full time
    Work at office
    Local area
    Worldwide

    Adobe

    San Jose, CA
    3 days ago
  • $45 per hour

     ...individuals to join us for an internship in 2026. Internships at...  ...Master in Computer Science, Engineering, Math, Economics, Statistics...  ...challenge is an opportunity to learn and innovate as one team. We...  ...diverse voices and to creating an environment that reflects the many... 
    Internship
    Hourly pay
    Summer work
    Local area
    Shift work

    Tik Tok

    San Jose, CA
    1 day ago
  • $124.72k - $243.2k

     ...and implement cutting-edge machine learning algorithms to enhance our recommendation...  ...above in Computer Science, Engineering, Operations Research, or...  ...experience from previous internship, work experience, coding...  ...voices and to creating an environment that reflects the many... 
    Internship
    Temporary work
    Work experience placement
    Local area

    ByteDance

    Seattle, WA
    1 day ago
  • $60 per hour

     ...motivated graduate software engineers, who are eager to apply their knowledge in machine learning (ML), operations research (OR...  ...individuals to join us for an internship in 2026. Internships at ByteDance...  ...voices and to creating an environment that reflects the many... 
    Internship
    Hourly pay
    Work experience placement
    Local area

    ByteDance

    San Jose, CA
    1 day ago
  • $122.57k - $187.2k

     ...a) developing state-of-art Machine Learning (ML) solutions to prevent customers...  ...in Computer Science, Engineering, Operations Research, or...  ...engineering experience from previous internship, work experience, coding...  ...voices and to creating an environment that reflects the many... 
    Internship
    Temporary work
    Work experience placement
    Local area
    Worldwide

    ByteDance

    San Jose, CA
    1 day ago
  • $165.2k - $223.6k

     ...our vision is to make deep learning pervasive for everyday developers...  ...This role is for a software engineer in the Compiler team for AWS...  ...a startup-like development environment, where you’re always working...  ...~3+ years of non-internship professional software development... 
    Internship
    Local area
    Flexible hours

    Amazon

    Cupertino, CA
    4 days ago
  • $193.3k - $261.5k

     ...passionate generative/agentic Senior Machine Learning Engineer to join our team and lead the...  ...Amazon tenures, and we’re building an environment that celebrates knowledge sharing and...  ...Basic Qualifications ~5+ years of non-internship professional software development... 
    Internship
    Temporary work
    Local area
    Flexible hours

    Amazon

    Seattle, WA
    5 days ago
  • $19 - $65 per hour

     ...and multi-step losses. Area of work Deep Learning Models, Planning, Prediction....  ...for ambiguous scenarios. Loss Function Engineering: Formulate and experiment with recursive...  ...investigation skills. $19 - $65 an hour Our internship hourly rates are a standard pay determined... 
    Internship
    Hourly pay

    Medium

    Santa Clara, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning Engineer, RL Environments - Internship. Be the first to apply!