Reinforcement Learning Fellow - 4-Month Research Mentorship

Nerdleveltech

About Anthropic

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

Anthropic Fellows Program overview

The Anthropic Fellows Program is designed to foster AI research and engineering talent. We provide funding and mentorship to promising technical talent – regardless of previous experience. Fellows will primarily use external infrastructure (e.g. open-source models, public APIs) to work on an empirical project aligned with our research priorities, with the goal of producing a public output (e.g. a paper submission). In one of our earlier cohorts, over 80% of fellows produced papers.

We run multiple cohorts of Fellows each year and review applications on a rolling basis. This application is for cohorts starting in July 2026 and beyond.

What to expect

- 4 months of full‑time research
- Direct mentorship from Anthropic researchers
- Access to a shared workspace (in either Berkeley, California or London, United Kingdom)
- Connection to the broader AI safety and security research community
- Weekly stipend of 3,850 USD / 2,310 GBP / 4,300 CAD + benefits (varies by country)
- Funding for compute (≈15k USD per month) and other research expenses

Interview process

The interview process will include an initial application and reference check, technical assessments and interviews, and a research discussion.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification listed. Research shows that people who identify as being from underrepresented groups are more prone to imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you’re interested in this work.

Compensation

The expected base stipend for this role is 3,850 USD / 2,310 GBP / 4,300 CAD per week, with an expectation of 40 hours per week for 4 months (possible extension).

Fellows workstreams

Due to the success of the Anthropic Fellows for AI Safety Research program, we are expanding it across teams at Anthropic. We expect a significant overlap in the types of skills and responsibilities across the roles and will by default consider candidates for all the workstreams.

- AI Safety Fellows
- AI Security Fellows
- ML Systems & Performance Fellows
- Reinforcement Learning Fellows
- Economics & Societal Impacts Fellows

Across the workstreams, you may be a good fit if you:

- Are motivated by making sure AI is safe and beneficial for society as a whole
- Are excited to transition into empirical AI research and would be interested in a full‑time role at Anthropic
- Have a strong technical background in computer science, mathematics, or physics
- Thrive in fast‑paced, collaborative environments
- Can implement ideas quickly and communicate clearly

Strong candidates may also have:

- Strong background in a discipline relevant to a specific Fellows workstream (e.g. economics, social sciences, or cybersecurity)
- Experience in areas of research or engineering related to their workstream

Candidates must be:

- Fluent in Python programming
- Available to work full‑time on the Fellows program

Reinforcement Learning Fellows

Fellows will undergo a project selection and mentor matching process. Potential research areas and mentors include:

- Ruhua Jiang
- Kaidi Cao
- Sunny Duan
- David Brandfonbrener
- Colt Steele
- Dino Distefano
- Will Williams

Projects in this workstream may include:

- Building model‑based tools to better understand AI training data and improve training data quality
- A research project to better understand generalization
- Creating RL environments to improve Claude models within your domain of expertise
- Building RL environments for safety‑related tasks
- Conducting research and implementing solutions in areas such as RL algorithms

Unique candidate criteria

You might be a particularly great fit for this workstream if you:

- Have strong software engineering skills with experience building complex ML systems
- Can balance research exploration with engineering rigor and operational reliability
- Enjoy collaborating across research and engineering disciplines
- Are comfortable working with large‑scale distributed systems and high‑performance computing
- Have experience with training, fine‑tuning, or evaluating large language models
- Are adept at analyzing and debugging model training processes

Logistics

To participate in the Fellows program, you must have work authorization in the United States, United Kingdom, or Canada and be located in that country during the program. We have designated shared workspaces in London and Berkeley where fellows will work from and mentors will visit. We are also open to remote fellows in the United Kingdom, United States, or Canada. We will ask you about your availability to work from Berkeley or London (full‑ or part‑time) during the program.

We are not currently able to sponsor visas for fellows. To participate, you need to have or independently obtain full‑time work authorization in the United Kingdom, United States, or Canada.

The program runs for 4 months, full‑time. If you can’t commit to the full duration, please still apply and note your constraints in the application. We review these requests on a case‑by‑case basis.

Please note: We do not guarantee that we will make any full‑time offers to fellows. However, strong performance during the program may indicate that a Fellow would be a good fit for full‑time roles at Anthropic. In previous cohorts, 25–50% of fellows received a full‑time offer, and we’ve supported many more to go on to do great work on AI safety and security at other organizations.

We encourage diverse applicants

We encourage you to apply even if you do not believe you meet every single qualification. We value representation and strive to include a range of diverse perspectives on our team.


Vacancy posted 3 days ago
