Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Anthropic Fellows Program — Reinforcement Learning

Anthropic

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. Anthropic Fellows Program Overview The Anthropic Fellows Program is designed to foster AI research and engineering talent. We provide funding and mentorship to promising technical talent—regardless of previous experience. Fellows will primarily use external infrastructure (e.g. open‑source models, public APIs) to work on an empirical project aligned with our research priorities, with the goal of producing a public output (e.g. a paper submission). In one of our earlier cohorts over 80% of fellows produced papers. Responsibilities Conduct a 4‑month full‑time research project, developing and evaluating AI systems within the chosen workstream. Produce a public output such as a research paper, technical report, or other contribution to the AI safety and security community. Participate in weekly mentorship sessions with Anthropic researchers and collaborate with peers in the shared workspace or online community. What to Expect 4 months of full‑time research Direct mentorship from Anthropic researchers Access to a shared workspace in either Berkeley, California or London, UK Connection to the broader AI safety and security research community Weekly stipend of 3,850 USD / 2,310 GBP / 4,300 CAD + benefits (which vary by country) Funding for compute (~$15k/month) and other research expenses Interview process The interview process will include an initial application & reference check, technical assessments & interviews, and a research discussion. Fellows Workstreams AI Safety Fellows AI Security Fellows ML Systems & Performance Fellows Reinforcement Learning Fellows Economics & Societal Impacts Fellows Eligibility & Qualifications Motivated by ensuring AI is safe and beneficial for society as a whole Strong technical background in computer science, mathematics, or physics Ability to implement ideas quickly and communicate clearly Fluent in Python programming Available to work full‑time during the program Additional Strengths (workstream‑specific) Strong background in a discipline relevant to the workstream (e.g. economics, social sciences, cybersecurity) Experience in areas of research or engineering related to the workstream Strong software engineering skills building complex ML systems Experience with large‑scale distributed systems and high‑performance computing Experience with training, fine‑tuning, or evaluating large language models Logistics Program duration: 4 months, full‑time Stipend: 3,850 USD / 2,310 GBP / 4,300 CAD per week Work authorization required in the US, UK, or Canada, and located in that country during the program Workplace available in Berkeley or London; remote fellowship possible in the US, UK, or Canada Visa sponsorship is not available for fellows; applicants must have or independently obtain work authorization #J-18808-Ljbffr Anthropic

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Anthropic Fellows Program — Reinforcement Learning in San Francisco, CA vacancy
  • About the Program Anthropic’s Fellows Program is designed to foster AI research and engineering talent. Fellows receive funding and mentorship to conduct empirical research projects aligned with our research priorities, with the goal of producing a public outcome such as... 
    Suggested
    Full time
    Remote work

    Menlo Ventures

    San Francisco, CA
    1 day ago
  • About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable...  ...our users and society. Anthropic Fellows Program Overview The Fellows Program is...  ...ML Systems & Performance Fellows Reinforcement Learning Fellows Economics & Societal Impacts... 
    Suggested
    Full time
    Remote work

    Menlo Ventures

    San Francisco, CA
    2 days ago
  • About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. Anthropic Fellows Program Overview The Fellows Program fosters AI research and engineering...  ...Systems & Performance Fellows, Reinforcement Learning Fellows, and Economics &... 
    Suggested
    Full time

    Menlo Ventures

    San Francisco, CA
    2 days ago
  • About Anthropic Anthropic’s mission is to create reliable, interpretable...  ...next cohort of Anthropic fellows starts on July 20, 2026....  .... Anthropic Fellows Program Overview The Anthropic Fellows...  ...Systems & Performance Fellows Reinforcement Learning Fellows Economics &... 
    Suggested
    Full time
    Part time
    Work at office
    Remote work
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    1 day ago
  • Anthropic is offering a 4-month fellowship program aimed at nurturing talent in AI research and engineering. As a fellow, you will conduct a research project focused on AI systems, produce a public output, and collaborate with experts in your field. The role demands a... 
    Suggested
    Remote job
    Full time

    Anthropic

    San Francisco, CA
    4 days ago
  •  ...Fellowship designed for candidates with strong technical backgrounds. Fellows will receive mentorship, a weekly stipend of 3,850 USD, and...  ...must have a Bachelor's degree and be fluent in Python programming. Remote work options are available for eligible participants from... 
    Remote job

    Menlo Ventures

    San Francisco, CA
    1 day ago
  •  ...workflows across large-scale AI model training and evaluation. Research Drive innovation by developing evaluation frameworks and reinforcement learning solutions, including recent advancements in Reinforcement Learning with Human Feedback (RLHF). Engage with frontier... 
    Relocation package
    Flexible hours

    Code Metal

    San Francisco, CA
    5 days ago
  • $180k - $270k

    Research Engineer (Focused on RL) You'll bring reinforcement learning to Firecrawl's core product — building the training infrastructure, reward pipelines, and fine-tuning systems that make our models meaningfully better at extracting, understanding, and structuring web... 
    Full time
    Temporary work
    Remote work

    Firecrawl

    San Francisco, CA
    4 days ago
  •  ...Responsibilities Build a superhuman generalist web-agent Scale infra, data, algorithms for large‑scale distributed async reinforcement learning with multimodal LLMs acting in web environments Work closely with product engineers to translate cutting‑edge AI capabilities... 
    Work at office
    Relocation
    Visa sponsorship

    Yutori

    San Francisco, CA
    5 days ago
  •  ...secure sandboxes, verifiable evals, and our async RL trainer. We enable researchers, startups and enterprises to run end-to-end reinforcement learning at frontier scale, adapting models to real tools, workflows, and deployment contexts. As a Research Engineer in our... 
    Remote work
    Worldwide
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime-Intellect

    San Francisco, CA
    3 days ago
  • $188k - $275k

     ...at CoreWeave is building tools to help agents learn from experience. This is a critical step to make...  ...in machine learning, with a strong focus on reinforcement learning or PhD + 2 years experience * Strong programming skills in Python and experience with ML frameworks... 
    Permanent employment
    Full time
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    Weights & Biases

    San Francisco, CA
    22 hours ago
  • Handshake is hiring a Senior Software Engineer in San Francisco to develop their Reinforcement Learning Environments platform. In this high-ownership role, you will build and scale systems for training AI models, ensuring reliability and data quality. Ideal candidates... 

    Handshake

    San Francisco, CA
    3 days ago
  • $192.6k - $344.85k

    ## AI Research Manager/Scientist, Reinforcement LearningApplylocations: San Francisco, CA, USA: AMER - United States - Massachusetts - Boston...  ...Position OverviewAs an **AI Scientist Manager Reinforcement Learning** at Autodesk Research, you will be doing fundamental and... 
    Remote work

    Autodesk, Inc.

    San Francisco, CA
    4 days ago
  • $190k - $230k

     ...models. This role sits at the intersection of simulation, machine learning, distributed systems, and content generation and has a high...  ...leadership/architecture experience. Strong experience with Reinforcement Learning (PPO, SAC, A3C, DQN, multi-agent RL, or equivalents)... 
    Full time
    Local area
    Remote work

    Serve Robotics

    San Francisco, CA
    3 days ago
  •  ...operations. You will develop new AI agents, refine existing frameworks, and apply reinforcement learning techniques. Ideal candidates have 4+ years of engineering experience and strong programming skills in Python and PyTorch. This role offers the opportunity to shape... 

    Alaris

    San Francisco, CA
    3 days ago
  •  ...seeking a member of the AI technical staff to build superhuman AI agents. Responsibilities include scaling infrastructure for reinforcement learning and collaborating with product engineers on product design. Ideal candidates will have experience in RL, distributed... 
    Visa sponsorship
    Relocation package

    Yutori

    San Francisco, CA
    5 days ago
  • $250k - $365k

    Data Operations Manager — Anthropic Location: San Francisco, CA | New York City, NY Work...  ...research initiatives, including: Reinforcement Learning from Human Feedback (RLHF) AI safety...  ...operations, consulting, product management, program management, or related roles... 
    Temporary work
    For contractors
    Work at office
    Relocation
    Visa sponsorship
    Flexible hours

    Globalsouthopportunities

    San Francisco, CA
    1 day ago
  • $295k - $345k

    About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial...  ...challenges the Institute is focused on, synthesizing what you learn from across the organization, and turning that into rigorous analysis... 
    Work at office
    Visa sponsorship
    Flexible hours
    Shift work

    Anthropic

    San Francisco, CA
    4 days ago
  • Project Coordinator, Learning & Development On-site San Francisco Role Summary: Lead the logistical planning, coordination, and execution...  ...end-to-end logistical planning for learning and development programs and workshops at NSMs and launch meetings. Partner with cross-functional... 

    MatchPointe Group

    San Francisco, CA
    1 day ago
  • An international nonprofit organization seeks a U.C. Berkeley LeadEx Fellow to provide key programmatic and administrative support for leadership development and exchange programs. Responsibilities include collaborating with staff on selection processes, providing logistical... 

    Employment - The Asia Foundation

    San Francisco, CA
    3 days ago
  •  ...Technical Enablement Director to drive usage and adoption of its platform through education and enablement. You'll create training programs for customers and employees, develop certification strategies, and ensure material relevance. Ideal candidates have a strong... 

    Sentry

    San Francisco, CA
    2 days ago
  • $70.7k - $123.8k

    Twitch is seeking a Program Manager for the Culture & People Development team based in San Francisco. The role focuses on AI upskilling and creating engaging learning experiences across hybrid and in-person formats. Candidates should have over 4 years of program management... 

    Twitch

    San Francisco, CA
    5 days ago
  • MatchPointe Group is seeking a Project Coordinator for Learning & Development in San Francisco. The role involves leading logistical planning for training events, managing vendor contracts, and collaborating with cross-functional teams. The ideal candidate will have strong... 

    MatchPointe Group

    San Francisco, CA
    1 day ago
  • $60 - $110 per hour

    A leading game development company is seeking an experienced L&D Specialist to join their Learning Programs team. The candidate will design and facilitate engaging learning experiences that drive measurable business performance. This role requires 5+ years in learning and... 
    Hourly pay

    Unity Technologies

    San Francisco, CA
    2 days ago
  • Twitch Interactive, Inc. - D48 is seeking a Program Manager based in San Francisco to contribute to the Culture & People Development team. In this role, you will design impactful learning experiences focusing on AI and manage a hybrid portfolio of culture programs. The... 
    Flexible hours

    Twitch Interactive, Inc. - D48

    San Francisco, CA
    5 days ago
  • $137k - $299k

     ...We're looking for an experienced senior machine learning engineer to help us develop the cutting-edge reinforcement learning models that power DoorDash's growing...  ...decision making tasks. Strong machine learning and programming skills, particularly in Python, with experience... 
    Hourly pay
    Work at office
    Local area
    Remote work
    Flexible hours

    DoorDash

    San Francisco, CA
    more than 2 months ago
  • $260k - $305k

    Web Designer, Enterprise job at Anthropic. San Francisco, CA. About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing... 
    Full time
    Visa sponsorship

    Carlsbad Tech

    San Francisco, CA
    4 days ago
  • $230k - $310k

     ...their most complex data at the largest scale.   About the Role We’re hiring a Senior Engineering Manager to lead our Reinforcement Learning Environments (RLE) team - the group building the interactive sandboxes where frontier models learn to complete real work.... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Handshake

    San Francisco, CA
    4 days ago
  •  ...'re building the first truly private, personal AI that learns your skills, judgment, and preferences without big tech...  ...through a PhD, a research‑oriented master’s, or research programs like MATS or Anthropic Fellows Program. We encourage speculative applications; we... 
    Shift work

    Workshop Labs

    San Francisco, CA
    3 days ago
  • $200k - $400k

     ...from investors like Menlo, Lightspeed and Anthropic and work with customers including Arc...  ...role We’re looking for Machine Learning Engineers to help build our platform for...  ...infra, research engineering, or systems programming. ~ Comfort working across research and... 
    Full time

    Goodfire

    San Francisco, CA
    1 hour ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Anthropic Fellows Program — Reinforcement Learning. Be the first to apply!