Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Research Scientist- Reinforcement Learning (Open-Endedness)

Brahma Consulting Group

Member Technical Staff- Reinforcement Learning & Open-Ended Learning. Full-time, permanent opportunity in San Francisco.

We're representing an early-stage applied research lab building AI capable of open-ended learning , systems that keep getting better by discovering their own goals rather than optimizing ones we hand them

What you'll do:

  • Develop RL methods for agents that discover useful objectives, tasks, and curricula without relying entirely on human-specified rewards
  • Design systems for open-ended learning including unsupervised / automated environment design, asymmetric self-play, and intrinsic motivation
  • Build training loops where agents learn from interaction, exploration, novelty, competence progress, self-generated challenges, or other nonstandard reward signals
  • Investigate how agents can avoid collapsing into trivial, degenerate, or easily exploitable objectives
  • Own and develop a research agenda end to end — from identifying promising directions, to running experiments, to communicating results

What we're looking for

  • 5+ years in reinforcement learning resea rch (PhD and academic years count).
  • Strong RL fundamentals, with exposure to one or more of: open-endedness, quality-diversity methods, intrinsic motivation, self-play, multi-agent RL, or goal-conditioned RL.
  • Proficiency in Python and modern ML framework s (PyTorch and/or JAX)
  • A track record of research output and/or shipping research-grade code.
  • High agency and comfort operating in an early-stage, fast-moving research environment.

Nice to have

  • Experience with LLM post-training and/or coding agents.
  • Familiarity with population-based training, POET, or related open-ended / evolutionary approaches
  • Publications at top venues (NeurIPS, ICML, ICLR, GECCO, RLC, or similar).

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Research Scientist- Reinforcement Learning (Open-Endedness) in Santa Clara, CA vacancy
  •  ...Member Technical Staff- Reinforcement Learning & Open-Ended Learning. Full-time, permanent opportunity...  ...representing an early-stage applied research lab building AI capable of open-ended...  ...exposure to one or more of: open-endedness, quality-diversity methods, intrinsic... 
    Suggested
    Permanent employment
    Full time

    Brahma Consulting Group

    Sunnyvale, CA
    2 days ago
  •  ...We are a dedicated research lab for building, understanding...  ...researchers, data scientists, and engineers,...  ...performance computing in deep learning, driving impactful...  ...Scientist within our Reinforcement Learning team, you will...  ...Proactively engage with the open-source community.   -... 
    Suggested
    Visa sponsorship
    Shift work

    Institute of Foundation Models

    Sunnyvale, CA
    22 days ago
  • $126k - $423k

     ...Bangalore; Seoul; and Tokyo. Learn more at applied.co. We...  ...looking for multiple passionate Research Scientists to join the Research Group...  ...to our business. We are open to all years of experience as...  ...will: Conduct research on reinforcement learning (RL) related topics... 
    Suggested
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Immediate start
    Remote work
    Day shift

    Applied Intuition

    Sunnyvale, CA
    17 days ago
  •  ...speeds and empowers machine learning users to effortlessly run...  ...an Applied Machine Learning Research Scientist at Cerebras, you will play...  ...pretraining, fine-tuning, and reinforcement learning-based post-...  ...meaningful codebases, projects, or open-source systems Why... 
    Suggested
    Internship

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    4 days ago
  •  ...are now looking for an Applied Deep Learning Research Scientist, Efficiency! Join our ADLR - Efficiency...  ...quantization/sparsity/optimizers/reinforcement learning, efficient architectures and...  ...efficient while retaining accuracy, and open‑source or publish these algorithms for... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    19 hours ago
  • $175k - $250k

     ...intelligence by combining machine learning with large-scale...  ...sits at the frontier of AI research, and we believe the highest-...  ...Role We are hiring a Research Scientist to advance the learning algorithms...  ...models, action models, reinforcement learning, and learning from... 
    Full time
    Visa sponsorship
    Flexible hours

    Metamorphic

    Palo Alto, CA
    1 day ago
  • $176k - $253.5k

     ...At Toyota Research Institute (TRI), we're on a mission...  ...and advanced machine learning. The Opportunity...  ...looking for an AI Research Scientist, or Senior Machine...  ...work on cutting-edge open-ended research projects...  ...uncertainty modeling, reinforcement learning, and physiological... 
    Temporary work
    Local area
    Shift work

    Toyota Research Institute

    Los Altos, CA
    4 days ago
  • $300k

     ...Models We are a dedicated research lab for building, understanding...  ...-class researchers, data scientists, and engineers, tackling the...  ...performance computing in deep learning, driving impactful discoveries...  ...’s rare elsewhere. ·      Open, mission-driven science – Join... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    22 days ago
  • $167.2k - $250.8k

     ...motion planner combining machine learning and classic robotics...  ...parking lots). About the Work Research and Implement state of the art...  ...the hardest and most exciting open‑world robotics problems and drive...  ...Carlo Tree Search (MCTS), Reinforcement Learning, Machine Learning... 

    Icehouseventures

    Mountain View, CA
    2 days ago
  •  ...Machine Learning Research Scientist At Autoscience Institute, we create AI systems that autonomously conduct AI research. Recently, we announced...  ...machine learning models, including deep learning, reinforcement learning or genetic algorithms. This does not include building... 
    Full time
    Flexible hours

    Autoscience Institute

    Menlo Park, CA
    4 days ago
  • $160.36k - $240.54k

     ...About The Role The mandate of the learned behavior team is to use advanced machine...  ...to solve long tail problems, adjusting reinforcement learning techniques for motion planning...  ...trajectories for autonomous driving. Research generative sequence modeling and sequential... 

    Nuro

    Mountain View, CA
    4 days ago
  • $173k

     ...Leadership Agreements, we foster an open culture where everyone...  ...us. Senior Machine Learning Scientist The Senior Machine...  ...framing and ideation through research, prototyping, deployment,...  ...advanced ML techniques (e.g., reinforcement learning, sequence modeling... 
    Local area
    Worldwide
    Flexible hours

    Expedia Group

    San Jose, CA
    1 day ago
  •  ...At Toyota Research Institute (TRI), we're on a mission to improve the quality of human life. We're developing new tools and capabilities...  ...and curation strategies, and a range of supervised and reinforcement learning techniques for physical manipulation. Develop and... 
    Work experience placement
    Internship
    Local area
    Shift work

    Toyota Research Institute

    Los Altos, CA
    4 days ago
  • $160.36k - $240.54k

     ...Machine Learning Research Scientist: Generative Modeling for Planning Mountain View, California (HQ) Nuro is a self-driving technology...  ...modality generation. Optimize generative models using reinforcement learning to improve interactive reasoning. Explore reward... 

    Nuro

    Mountain View, CA
    3 days ago
  • $148.75k - $361k

     ...scale and low latencies. We use Machine Learning, Reinforcement Learning, AI, Control and Optimization...  ...role you will work on applying SOTA research and conduct your own research to develop...  ...Advertising domain Contributions to open-source ML projects #LI-DH2 Our... 
    Work at office
    Local area
    Remote work
    Monday to Thursday
    Flexible hours

    Roku

    San Jose, CA
    1 day ago
  •  ...Machine Learning Scientist Recommender systems (e.g., image and content...  .... Conduct applied research to improve recommender systems...  ...techniques, deep learning and reinforcement learning. Build scalable...  ...Leverage and contribute to open-source ML frameworks while... 
    Work at office
    Remote work
    2 days per week

    Wayfair

    Mountain View, CA
    1 day ago
  • $148.75k - $361k

     ...scale and low latencies. We use Machine Learning, Reinforcement Learning, AI, Control and Optimization...  ...role you will work on applying SOTA research and conduct your own research to develop...  ...domain Contributions to open-source ML projects #LI-DH2 Our Hybrid... 
    Work at office
    Local area
    Remote work
    Monday to Thursday
    Flexible hours

    Roku

    San Jose, CA
    1 day ago
  • $140k - $195k

     ...the Team : Our AI Research team, reporting...  ...cutting-edge machine learning can transform the way...  ...a world-class team of scientists and engineers, and represent...  ...applications Experience with reinforcement learning for robotic...  ...updates about GM, open roles, career insights... 
    Work at office
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Mountain View, CA
    1 day ago
  • $165k - $185k

     ...Description The Bosch Research and Technology Center...  ...As a Research Scientist- Vision-Language-Action...  ...in core AI and machine learning fields to enable Embodied...  ...autonomous planning, open-world learning, and so...  ...Hands-on experience in reinforcement learning for behavior... 
    Work experience placement
    Local area
    Worldwide

    Bosch Group

    Sunnyvale, CA
    6 days ago
  • $176k - $264k

     ...At Toyota Research Institute (TRI), we’re on a mission...  ...complex tasks in open and interactive environments...  ...and generative deep learning. However, it remains a...  ...for a driven research scientist or research engineer with...  ..., Imitation Learning, Reinforcement Learning, Planning &... 
    Local area
    Shift work

    Toyota Research Institute

    Los Altos, CA
    more than 2 months ago
  •  ...team. Role We are seeking a Machine Learning Scientist to lead the research and development of Large Language...  ...leveraging modern preference learning and reinforcement learning to enhance reasoning, tool...  ...evaluate and integrate SOTA open‑source and commercial models while... 
    Hourly pay
    Full time
    Immediate start
    Flexible hours

    Netflix, Inc.

    Los Gatos, CA
    19 hours ago
  • $184k - $299k

    Senior Research Scientist, Efficient Deep Learning NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning to join our learning and perception research team. We are passionate about research that pushes boundaries and has impact in the... 

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $184k - $356.5k

    A leading technology company in California seeks a Senior Research Scientist specializing in efficient deep learning. The role focuses on designing and implementing innovative research methods while collaborating with various teams. A Ph.D. in a related field and 3+ years... 

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $176k - $253k

    At Toyota Research Institute (TRI), we’re on a mission to improve...  ...parts — and developing the learning infrastructure needed to train...  ...are looking for a Research Scientist to join us in building intelligent...  ...of policy learning, reinforcement learning, and physical reasoning... 
    Work experience placement
    Internship
    Local area
    Shift work

    Toyota Research Institute

    Los Altos, CA
    1 day ago
  • A leading AI research institute in Sunnyvale, California, seeks a Research Scientist specializing in Natural Language Processing (NLP) to advance cutting-edge technologies...  ...experience with large language models and deep learning. This role offers a competitive salary,... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    1 day ago
  •  ...Models We are a dedicated research lab for building,...  ...class researchers, data scientists, and engineers,...  ...performance computing in deep learning, driving impactful...  ...training systems, and reinforcement learning algorithms and...  ...research publications, and open-source software.... 

    Institute of Foundation Models

    Sunnyvale, CA
    7 days ago
  •  ...We are a dedicated research lab for building, understanding...  ...researchers, data scientists, and engineers,...  ...performance computing in deep learning, driving impactful...  ...evaluations to innovative reinforcement learning...  ...research publications, and open-source software ·... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    24 days ago
  •  ...mission to make frontier AI truly open for all. We are founded on...  .... Oumi also develops an open research stack and models in...  ...Role Overview The Research Scientist will be an integral part of Oumi...  ...in computer science, machine learning, artificial intelligence, or... 
    Worldwide
    Flexible hours

    Oumi

    Palo Alto, CA
    4 days ago
  •  ...possible by our cutting edge research and end‑to‑end system design....  ...We’re looking for a Research Scientist or Research Engineer to own the...  ...scaling high‑quality robot learning data. This role sits at the intersection...  ...‑embodiment datasets (e.g., Open X‑Embodiment, DROID)... 

    Rhoda ai

    Palo Alto, CA
    3 days ago
  •  ...possibly by our cutting edge research and end-to-end system design...  ...We're looking for Research Scientists and Research Engineers with...  ...performance beyond what imitation learning alone achieves — reward...  ...Required) Hands-on experience with reinforcement learning — reward design,... 
    Shift work

    Rhoda ai

    Palo Alto, CA
    19 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Scientist- Reinforcement Learning (Open-Endedness). Be the first to apply!