Reinforcement Learning Fellow - 4-Month Research Mentorship
Nerdleveltech
About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. Anthropic Fellows Program overview The Anthropic Fellows Program is designed to foster AI research and engineering talent. We provide funding and mentorship to promising technical talent – regardless of previous experience. Fellows will primarily use external infrastructure (e.g. open-source models, public APIs) to work on an empirical project aligned with our research priorities, with the goal of producing a public output (e.g. a paper submission). In one of our earlier cohorts, over 80% of fellows produced papers. We run multiple cohorts of Fellows each year and review applications on a rolling basis. This application is for cohorts starting in July 2026 and beyond. What to expect 4 months of full‑time research Direct mentorship from Anthropic researchers Access to a shared workspace (in either Berkeley, California or London, United Kingdom) Connection to the broader AI safety and security research community Weekly stipend of 3,850 USD / 2,310 GBP / 4,300 CAD + benefits (varies by country) Funding for compute (≈15k USD per month) and other research expenses Interview process The interview process will include an initial application and reference check, technical assessments and interviews, and a research discussion. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification listed. Research shows that people who identify as being from underrepresented groups are more prone to imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you’re interested in this work. Compensation The expected base stipend for this role is 3,850 USD / 2,310 GBP / 4,300 CAD per week, with an expectation of 40 hours per week for 4 months (possible extension). Fellows workstreams Due to the success of the Anthropic Fellows for AI Safety Research program, we are expanding it across teams at Anthropic. We expect a significant overlap in the types of skills and responsibilities across the roles and will by default consider candidates for all the workstreams. AI Safety Fellows AI Security Fellows ML Systems & Performance Fellows Reinforcement Learning Fellows Economics & Societal Impacts Fellows Across the workstreams, you may be a good fit if you: Are motivated by making sure AI is safe and beneficial for society as a whole Are excited to transition into empirical AI research and would be interested in a full‑time role at Anthropic Have a strong technical background in computer science, mathematics, or physics Thrive in fast‑paced, collaborative environments Can implement ideas quickly and communicate clearly Strong candidates may also have: Strong background in a discipline relevant to a specific Fellows workstream (e.g. economics, social sciences, or cybersecurity) Experience in areas of research or engineering related to their workstream Candidates must be: Fluent in Python programming Available to work full‑time on the Fellows program Reinforcement Learning Fellows Fellows will undergo a project selection and mentor matching process. Potential research areas and mentors include: Ruhua Jiang Kaidi Cao Sunny Duan David Brandfonbrener Colt Steele Dino Distefano Will Williams Projects in this workstream may include: Building model‑based tools to better understand AI training data and improve training data quality A research project to better understand generalization Creating RL environments to improve Claude models within your domain of expertise Building RL environments for safety‑related tasks Conducting research and implementing solutions in areas such as RL algorithms Unique candidate criteria You might be a particularly great fit for this workstream if you: Have strong software engineering skills with experience building complex ML systems Can balance research exploration with engineering rigor and operational reliability Enjoy collaborating across research and engineering disciplines Are comfortable working with large‑scale distributed systems and high‑performance computing Have experience with training, fine‑tuning, or evaluating large language models Are adept at analyzing and debugging model training processes Logistics To participate in the Fellows program, you must have work authorization in the United States, United Kingdom, or Canada and be located in that country during the program. We have designated shared workspaces in London and Berkeley where fellows will work from and mentors will visit. We are also open to remote fellows in the United Kingdom, United States, or Canada. We will ask you about your availability to work from Berkeley or London (full‑ or part‑time) during the program. We are not currently able to sponsor visas for fellows. To participate, you need to have or independently obtain full‑time work authorization in the United Kingdom, United States, or Canada. The program runs for 4 months, full‑time. If you can’t commit to the full duration, please still apply and note your constraints in the application. We review these requests on a case‑by‑case basis. Please note: We do not guarantee that we will make any full‑time offers to fellows. However, strong performance during the program may indicate that a Fellow would be a good fit for full‑time roles at Anthropic. In previous cohorts, 25–50% of fellows received a full‑time offer, and we’ve supported many more to go on to do great work on AI safety and security at other organizations. We encourage diverse applicants We encourage you to apply even if you do not believe you meet every single qualification. We value representation and strive to include a range of diverse perspectives on our team. #J-18808-Ljbffr Nerdleveltech
- Menlo Ventures is offering a 4-month AI research Fellowship designed for candidates with strong technical backgrounds. Fellows will receive mentorship, a weekly stipend of 3,850 USD, and funding for research expenses. Responsibilities include designing and implementing...SuggestedRemote job
- Anthropic is offering a 4-month fellowship program aimed at nurturing talent in AI research and engineering. As a fellow, you will conduct a research project focused on AI systems... ...of 3,850 USD weekly and access to mentorship. Eligible candidates must be motivated to...SuggestedRemote jobFull time
- ...unique opportunity through the Anthropic Fellows Program for individuals interested in AI research and engineering. The program provides funding, mentorship, and the chance to work on impactful... ...engage in full-time research for 4 months, with an expectation of contributing...SuggestedFull timeRemote work
- ...Menlo Ventures is excited to support positions in the Anthropic Fellows Program, seeking talented individuals to engage in AI research and engineering. The position involves full-time research over four months, with a weekly stipend of $3,850. Candidates will be mentored...SuggestedFull time
- Nerdleveltech is seeking committed individuals for their Fellows program to support empirical AI research. Based in Berkeley, candidates will participate in a full-time research initiative lasting 4 months, including mentorship and collaboration with top researchers. Strong...SuggestedFull timeRemote work
$180k - $270k
Research Engineer (Focused on RL) You'll bring reinforcement learning to Firecrawl's core product — building the training infrastructure... ...week than one polished one next month. And when you have results, you... ...— 3 paid months off after 4 years, do something fun and new...Full timeTemporary workRemote work- ...Member Technical Staff- Reinforcement Learning & Open-Ended Learning. Full-time, permanent opportunity in San Francisco. We're representing an early-stage applied research lab building AI capable of open-ended learning , systems that keep getting better by discovering...Permanent employmentFull time
- ...group of committed researchers, engineers, policy experts... ...cohort of Anthropic fellows starts on July 20, 2... ...provide funding and mentorship to promising... ...beyond. What To Expect 4 months of full-time research... ...Performance Fellows Reinforcement Learning Fellows Economics &...Full timePart timeWork at officeRemote workVisa sponsorshipFlexible hours
$192.6k - $344.85k
## AI Research Manager/Scientist, Reinforcement LearningApplylocations: San Francisco, CA, USA: AMER - United States - Massachusetts - Boston - Drydock... ...OverviewAs an **AI Scientist Manager Reinforcement Learning** at Autodesk Research, you will be doing fundamental...Remote work- ...Reinforcement Learning Researcher (Humanoid) Reinforcement Learning Researcher (Humanoid) Location: San Francisco, CA (On-site at REK HQ) Reports to: CTO Company: REK Inc. REK is pioneering a new global sport: Robot Kombat. Its VR-controlled humanoid robot fighting...
- ...group of committed researchers, engineers, policy experts... ...systems. Anthropic Fellows Program Overview The... ...provide funding and mentorship to promising... ...papers. What to Expect 4 months of full‑time research... ...Performance Fellows Reinforcement Learning Fellows Economics &...Full timePart timeRemote work
- ...technology firm in San Francisco is seeking an AI Research Engineer to enhance security operations. You will... ...AI agents, refine existing frameworks, and apply reinforcement learning techniques. Ideal candidates have 4+ years of engineering experience and strong programming...
- ...is a quickly growing group of committed researchers, engineers, policy experts, and... ...AI systems. About the teams Our Reinforcement Learning teams lead Anthropic's reinforcement learning... ...and coding capabilities of Claude Sonnet 4.5 and Opus 4.5. Our work spans several...Work at officeVisa sponsorshipFlexible hours
- Member of Research Staff, Reinforcement Learning, Voleon Securities Employment Type Full time Location Type Hybrid Department Securities Compensation The listed base salary range for this position is based upon the location(s) of this posting. Individual salaries are...Full timeLocal areaImmediate startRelocationWork visa
- ...training stack: environments, secure sandboxes, verifiable evals, and our async RL trainer. We enable researchers, startups and enterprises to run end-to-end reinforcement learning at frontier scale, adapting models to real tools, workflows, and deployment contexts. As a...Remote workWorldwideVisa sponsorshipRelocation packageFlexible hours
$250k - $350k
...running experiments, and turning research insights into products that... ...Most of us are in the office 4 days a week in SF; hybrid... ...exploring new architectures and learning methods to optimizing latency... ...techniques Perform reinforcement learning research to improve...Work at office$189.72k - $332.01k
...Pinterest’s advanced machine learning team. ATG’s goal is to keep Pinterest... ...publishes its work in applied research conferences, but the main goal... ...the interests of our 550M+ monthly active users and recommend... ...with tangible product impact. 4+ years working experience in...Work experience placementWork at officeLocal areaRemote workRelocationRelocation package$230k - $400k
...is a quickly growing group of committed researchers, engineers, policy experts, and... ...projects including: Complex multimodal reinforcement learning environments. High-performance RPC servers... ...your dependents. ~401(k) plan with 4% matching. ~22 weeks of paid parental...Work experience placementWork at officeHome officeVisa sponsorshipRelocation packageFlexible hours$200k - $330k
...Convergent Ventures, and have raised over $150M to date. We're looking for a motivated and creative Machine Learning (ML) Scientist to drive research into reinforcement learning for biomolecular design. This position offers an opportunity to work at the forefront of...$160k - $210k
...patients in need. To learn more and to view a... .... We\'re looking for a Research Scientist specializing... ...conferences Working with fellow researchers and engineers... ...imitation learning, reinforcement learning, and vision-language... ...Benefits ~401K with 4% company match ~...Work experience placementWork at officeWorldwideFlexible hours- ...the real world. Created by researchers from UC Berkeley’s SkyLab, our... ...of people use LMArena each month to explore how frontier systems... ...a variety of Machine Learning Scientist to help advance how... ...Transformers, diffusion models, reinforcement learning with human feedback...Permanent employmentWork at office
$144k - $187k
...MSCI is establishing a Machine Learning Center of Excellence within the Research & Development team to develop machine learning models that power investment... ...learning techniques including deep learning and reinforcement learning to systematic strategies, portfolio...Flexible hours$300k - $405k
...a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together... ...About Horizons The Horizons team leads Anthropic's reinforcement learning (RL) research and development, playing a critical role in...Work at officeVisa sponsorshipFlexible hours$127k - $150k
...One of Australia’s leading research & teaching universities. Vibrant campus life with a strong sense of community... ...that makes a difference by collaborating & learning from the best. Job No: 540338. Senior Research Fellow - Psychology at UNSW, Sydney, NSW. Full Time. Salary...Full timeFixed term contract- ...of experience in AI/ML research or applied science,... ...equivalent technical field 4+ years of... ...experience using deep learning frameworks such as PyTorch... ...strong background in reinforcement learning, tool‑use, or... ...largest organizations Mentorship & Culture: act as a force...
$195k - $254k
...Team Responsibilities MSCI is establishing a Machine Learning Center of Excellence within the Research & Development team to develop machine learning... ...machine learning techniques including deep learning and reinforcement learning to systematic strategies, portfolio...Flexible hours$300k
Vmax is seeking a Member of Technical Staff to develop new reinforcement learning algorithms for post-training language models in San Francisco. The successful candidate will own a research agenda, establish baselines for evaluating efficiency, and collaborate with peers...$150k - $300k
...across time, causality, and context. As a Research Scientist, you will tackle fundamental... ...micro-events into durable knowledge, and learn patterns that predict events before it happens... ...Wellness & Productivity Stipend: $2,500/month to cover meals, transport, gym...Relocation packageFlexible hours- ...Pantograph in San Francisco is looking for research scientists who specialize in reinforcement learning and multimodal representation learning. Ideal candidates should have experience with large GPU clusters and comfortable working with Kubernetes. You will work alongside...
$160k - $250k
Machine Learning Researcher, Audio Location: San Francisco, CA or Remote (US) About Bland At Bland.com, our mission is to empower enterprises... ...single experiment that can validate an idea in days, not months. You measure everything and let data drive decisions. You...Work at officeRemote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Reinforcement Learning Fellow - 4-Month Research Mentorship. Be the first to apply!
- research dietitian San Francisco, CA
- history research San Francisco, CA
- education policy research San Francisco, CA
- research pharmacist San Francisco, CA
- research professional San Francisco, CA
- student research intern San Francisco, CA
- research intern San Francisco, CA
- physics research San Francisco, CA
- pharmaceutical research San Francisco, CA
- cancer research San Francisco, CA



