Research Internship Reinforcement Learning (Summer)
Cohere
Internship Opportunity
Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.
Join us on our mission and shape the future!
Duration: Minimum 4 months (summer 2026, with potential extension)
About the Project
This internship offers a unique opportunity to contribute to cutting-edge research in reinforcement learning (RL) and large language models (LLMs), focusing on two interconnected projects:
Combining Self-Distillation and Reinforcement Learning for LLMs, with Applications to Code and Agentic Tasks
This project explores how LLMs can improve through self-reflection and iterative learning by combining reinforcement learning with verifiable rewards (RLVR) and self-distillation. The focus is on scenarios where structured feedback from verifiers, compilers, unit tests, or tool calls enables models to detect errors, revise outputs, and learn from failures. The internship will bridge theoretical mathematical modeling of self-distillation with practical, production-oriented implementation.
Dealing with Extremely Large Rollouts in RLVR
As RLVR becomes a cornerstone for training reasoning-oriented LLMs, the challenge of handling extremely large rollouts grows. This project investigates mechanisms such as summarization, memory, context compaction, hierarchical sub-agents, and resumable rollouts to enable unbounded or very long trajectories. It also explores how to effectively learn from such trajectories, as traditional RLVR objectives fail when episodes exceed context window limits.
Both projects are grounded in recent research and aim to advance the state of the art in LLM training and deployment.
Responsibilities
Conduct literature reviews and implement state-of-the-art algorithms in RL and self-distillation.
Design and execute experiments to evaluate the effectiveness of proposed methods on code generation and agentic tasks.
Develop and maintain codebases for both theoretical modeling and practical implementations.
Collaborate with researchers to analyze results, refine methodologies, and prepare findings for publication.
Contribute to the design of mechanisms for handling large rollouts, such as summarization and hierarchical sub-agents.
Document progress, methodologies, and outcomes clearly and comprehensively.
Requirements
Technical Skills:
Strong background in machine learning, particularly reinforcement learning and deep learning.
Proficiency in Python and experience with ML frameworks (e.g., PyTorch, TensorFlow).
Familiarity with LLMs and their training paradigms.
Experience with coding tasks, unit testing, or compiler tools is a plus.
Educational Background:
Currently pursuing a Master's or PhD in Computer Science, Machine Learning, or a related field.
Soft Skills:
Ability to work independently and manage complex projects.
Strong problem-solving and analytical skills.
Excellent communication skills for collaborating with a research team.
Additional:
Prior experience with RLVR, self-distillation, or large-scale ML experiments is highly desirable.
Willingness to learn and adapt to new methodologies and tools.
If some of the above doesn't line up perfectly with your experience, we still encourage you to apply!
We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities.
$19 - $65 per hour



