Research Internship - Reinforcement Learning for Large Foundation Models
$80.17k - $124.8kTencent
Business Unit Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers, TEG provides users with a full range of customer services. As the operator of the largest networking, devices, and data center in Asia,TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation. What the Role Entails About Tencent AI Lab at Seattle Area Tencent is a leading internet company in China. Tencent AI Lab at Seattle Area was established in May 2017. The lab strives to continuously improve AI's capability in perception, cognition, and creativity. Researchers there aim at solving challenging real-world problems with advanced technologies and publish extensively at top conferences and journals. Research Internship – Reinforcement Learning for Large Foundation Models Tencent AI Lab is dedicated to advancing cutting-edge AI technologies, with a particular focus on innovative breakthroughs in large foundation models. The lab's long-term ambition is to drive the development of Artificial General Intelligence (AGI), and ultimately, Artificial Superintelligence (ASI). We are currently seeking research interns for the year of 2026, in the area of reinforcement learning (RL) for large foundation models, with an emphasis on developing stable and efficient RL algorithms. The goal is to empower large foundation models in complex reasoning ang agent tasks and enhance their capabilities in autonomous exploration and continuous learning. Our Seattle area office is located in Bellevue WA. Every research intern will work with researchers on a research project aimed at attacking one of the core problems on the design and optimization of RL algorithms for large foundation models. Research areas include but are not limited to Reinforcement Learning Algorithms, Reward Modeling, and World Models. We will conduct large-scale experiments of RL algorithms in scenarios such as complex reasoning and autonomous agents, deliver impactful algorithms for real world applications, and publish influential research papers. Who We Look For Requirements & Qualifications The ideal intern candidates are those who Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or related fields from a top university, are self-motivated and excited about developing novel techniques, have research experiences in natural language processing or machine learning, are proficient in Python programming and experienced in developing with deep learning frameworks such as PyTorch. have good publication track records and history of creativity and intellectual flexibility, have excellent communication and teamwork skills, capable of collaborating with cross-functional teams to drive project success and innovation. Intern duration: 3 months (with the possibility of extension). Can start any time in the year 2026. Location State(s) US-Washington-Bellevue The expected base pay range for this position in the location(s) listed above is $80,168.40 to $124,800.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. This position will be eligible for 1 hour of paid sick leave for every 30 hours worked and up to 13 paid holidays throughout the calendar year. Subject to the terms and conditions of the applicable plans then in effect, full-time interns are also eligible to enroll in the Company-sponsored medical plan. Equal Employment Opportunity at Tencent As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals. Who we are Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life for people around the world. Equal Employment Opportunity at Tencent As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
- ...are hiring an AI Research Scientist (New Grad... ...reason, code, and learn at scale inside the... ...of agentic AI and reinforcement learning, where... ...research experience) Foundational expertise in... ...tuning, or reasoning model development Demonstrated... ...Experience with large‑scale distributed...Foundation
$57 per hour
...the distributed training, reinforcement learning framework, high-performance... ...technologies for AI foundation models. Conduct research on infrastructure and systems for large‑scale models. Explore methods... ...Python or C/C++. Research or internship experience related to...InternshipFoundationHourly payLocal area$139.5k - $258.1k
AIML - Machine Learning Researcher, Foundation Models Seattle, Washington, United States Machine Learning and AI We are a group... ...or editing Post-training, mid-training large language models or multimodal models. Reinforcement learning, on-policy distillation. Further,...FoundationRelocation$6,290 per month
Overview Research Internships at Microsoft provide a dynamic environment for research careers... ...for rapid iteration and continuous learning. Project Focus The selected Research... ...project centered around the use of Large Language Models (LLMs) and related technologies to analyse...InternshipWorldwide- ...Hybrid Team: AI Research Business Unit: Engineering... ...applying machine learning to high-impact... ...and execute large-scale experiments... ...reusable code. Develop models and systems that... ...experience, research internships, or publications).... ...LLMs or foundation models in applied...InternshipFoundationContract workFixed term contract
$112.7k - $169.1k
...builds the machine learning systems that decide... ...frontier has shifted — large language models, reinforcement learning from human... ...to bring research ideas into production... .... Strong research foundations in one or more of:... ...research or industry internships. Fluency in Python...InternshipFoundationWork at officeWorldwideRelocation packageShift work- Pangleglobal is seeking a Student Researcher in Seattle to conduct research on infrastructure for AI foundation models. This role requires pursuing a PhD in computer science... ..., focusing on efficiency and reliability in large-scale systems. Interns enjoy many benefits including...InternshipFoundation
$6,710 per month
...Overview Research Internships at Microsoft provide a dynamic environment for research careers... ...and the environment. The Machine Learning and Optimization (MLO) group in MSR-... ...Our focus right now is in combining Large Language Model (LLM) technology with optimization...InternshipOngoing contractSummer workLocal area- ByteDance is looking for a Campus Intern in Seattle, WA, specializing in AI infrastructure research. This role involves conducting research on large-scale model systems and exploring efficiency improvements across various platforms. Interns will benefit from day one access...InternshipFoundation
$250k - $350k
...Technical Staff — RL Research (New PhD Grad) Seattle... ...and we're developing foundation models designed for it from... ...and post‑training for large-scale omni models.... ...pipelines through research, internships, or open-source —... ...yet; you need to learn fast and go deep. Strong...InternshipFoundationH1bWork at officeVisa sponsorshipShift work- ...do About the internship Stripe's Applied... ...PhD machine learning engineering internships... ...areas such as foundation models used for dozens... ...and deploy large-scale machine learning... ..., classwork, or research. Familiarity... ...Language Models or Reinforcement Learning....InternshipFoundationRemote jobSummer workSummer internship
$173k
Senior Machine Learning Scientist The Senior Machine Learning... ...preparation through model/agent design,... ...lifecycle for medium‑to‑large projects: problem framing, ideation, research, prototyping, deployment... ...advanced ML techniques (reinforcement learning, sequence modeling...Flexible hours$176k - $255k
...progress in GenAI research. We are looking... ...SFT, RLHF, reward modeling). This role will... ...generalization of large-scale generative... ...partner with top foundation model labs to provide... ...Science, Machine Learning, AI, or a related... ...deep learning, reinforcement learning, and...FoundationFull timeShift work$192k - $304.75k
...calibration, and machine learning. This role will... ..., real‑time models for fault‑... ...Quantum Applied Research Scientist, you will... ...build on shared foundations. Build physics‑informed... ...feedback loops. Apply reinforcement learning and... ...Experience with large‑scale model training...Foundation$233k - $341k
...CRWV) in March 2025. Learn more at [ THE... ...VP of Product, Research Training Infrastructure... ...where frontier models are born. THE... ...for sophisticated Reinforcement Learning (RL) and... ...enabling labs to refine foundation models with... ...5+ years managing large-scale infrastructure...FoundationPermanent employmentFull timeTemporary workCasual workWork at officeFlexible hours$192.2k - $260k
...a hands-on machine learning practitioner and a research leader. You will play... ...machine learning models from the ground up.... ...learning, Gen AI, Reinforcement Learning, and optimization... ...approaches such as foundation models to connect... ...we don't have a large behavioral signal (...FoundationLocal areaWorldwideFlexible hours$107.66k - $161.7k
...providing a unique platform to learn and connect with others.... ...wide variety of AI language models (bots), including o3, o4-mini... ...understanding of mathematical foundations of Machine Learning algorithms... ...engineering experience via an internship, work experience, or coding competition...InternshipFoundationRemote jobFull timeWork experience placement$168k - $264.5k
...fundamental and applied robotics research across the full robotics... ..., planning, control, reinforcement learning, imitation learning, and simulation... ...the Cosmos and GR00T-N foundation models, as well as the Newton... ...advancements, large-scale training, and test-time...FoundationWork at office$42.75 per hour
...is made up of machine learning researchers and engineers, who... ...production recommendation models and drive product... ...individuals to join us for an internship in Fall 2025.... ...while laying a strong foundation for personal and... ...learning infrastructure, large-scale recommendation...InternshipFoundationHourly payLocal area$184.7k - $200.2k
Research Scientist at Meta - Bellevue, WA Meta Platforms... ...Design and implement large‑scale distributed... ...background, train new ranking models, and run experiments.... .... Use machine learning, statistics, or other... ...course/research project/internship/thesis, or 6 months of...InternshipHourly payLive in$192k - $304.75k
...looking for a Senior Robotics Research Scientist! NVIDIA is at... ..., planning, control, reinforcement learning, imitation learning, and simulation... ...the Cosmos and GR00T-N foundation models, as well as the Newton... ...architectural advancements, large‑scale training, and test‑time...FoundationWork at office- ...looking for a Senior Applied Research Scientist who is experienced with training large language models and/or large multimodal models... ...and trends in LLM/LMM and foundation models. THE PERSON The ideal... ...degree or equivalent in machine learning, computer science, artificial...Foundation
$137.5k
...Introduction to Team Our Machine Learning and Data Science team is... ...! We are looking to hire researchers and data scientists who want... ...analyze tests to validate your models, quantify business impact,... ...e‑commerce. Experience with reinforcement learning or other advanced ML...Flexible hours$188.2k - $325.5k
Sr Research Manager, Multimodal Reasoning - SIML... ...United States - Machine Learning and AI The System... ...to interface with large‑scale modeling & data infrastructure... ...one of the following: Reinforcement Learning, Multimodal... .../ Post‑training foundation models Proficiency...FoundationRelocation- 2026 Applied Science Internship - Reinforcement Learning & Optimization (Machine Learning... ...it. You'll be conducting research into the theory and application... ..., advanced statistical modeling, and graph models.... ...Learning, Causal Inference, Large Language Models, Time Series...InternshipRelocation
$148.2k - $300.96k
Research Scientist - TikTok E-Commerce Recommendation Foundation Location: Seattle Employment Type: Regular... ...a unified Foundation Model that supports multi-... ..., deeply integrating large language models (LLMs... ...multimodal understanding, reinforcement learning, and system...FoundationTemporary workLocal area$124.9k - $228.9k
...all aspects of designing, researching, building, and delivering... ...develop advanced machine learning or deep learning models that directly influence... ...deep understanding of the foundations of statistics and machine... ...like Spark to work with large datasets. Experience in programmatic...FoundationFull timeTemporary work- Advanced Micro Devices is seeking a Senior Applied Research Scientist to work with large language models and multimodal models in a hybrid environment. The ideal candidate should have a PhD in machine learning or related fields. Responsibilities include training and optimizing...
- ...are seeking a seasoned Senior Research Scientist for our AI team... ...computer vision and machine learning. In this role you will advance... ...cross‑functional teams to train models and develop cutting‑edge algorithms... ...models at scale. Strong foundation in computer vision...FoundationWork at officeRemote work
- ...seeking an innovative and experienced Embodied AI Researcher, focusing on the development and application of foundation models in robotics. The successful candidate will... ...that advance the field of robotics through deep learning and zero‑shot generalization....Foundation
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Research Internship - Reinforcement Learning for Large Foundation Models. Be the first to apply!
- science internship Bellevue, WA
- software developer internship Bellevue, WA
- sql internship Bellevue, WA
- sales internship summer Bellevue, WA
- creative internship summer Bellevue, WA
- marketing communications internship Bellevue, WA
- marketing internship remote Bellevue, WA
- coding internship Bellevue, WA
- research and development internship Bellevue, WA
- internship computer science Bellevue, WA


