Research Scientist- Reinforcement Learning (Open-Endedness)

Brahma Consulting Group

Member Technical Staff- Reinforcement Learning & Open-Ended Learning. Full-time, permanent opportunity in San Francisco.

We're representing an early-stage applied research lab building AI capable of open-ended learning , systems that keep getting better by discovering their own goals rather than optimizing ones we hand them

What you'll do:

Develop RL methods for agents that discover useful objectives, tasks, and curricula without relying entirely on human-specified rewards
Design systems for open-ended learning including unsupervised / automated environment design, asymmetric self-play, and intrinsic motivation
Build training loops where agents learn from interaction, exploration, novelty, competence progress, self-generated challenges, or other nonstandard reward signals
Investigate how agents can avoid collapsing into trivial, degenerate, or easily exploitable objectives
Own and develop a research agenda end to end — from identifying promising directions, to running experiments, to communicating results

What we're looking for

5+ years in reinforcement learning resea rch (PhD and academic years count).
Strong RL fundamentals, with exposure to one or more of: open-endedness, quality-diversity methods, intrinsic motivation, self-play, multi-agent RL, or goal-conditioned RL.
Proficiency in Python and modern ML framework s (PyTorch and/or JAX)
A track record of research output and/or shipping research-grade code.
High agency and comfort operating in an early-stage, fast-moving research environment.

Nice to have

Experience with LLM post-training and/or coding agents.
Familiarity with population-based training, POET, or related open-ended / evolutionary approaches
Publications at top venues (NeurIPS, ICML, ICLR, GECCO, RLC, or similar).

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the Research Scientist- Reinforcement Learning (Open-Endedness) in Santa Clara, CA vacancy

Research Scientist- Reinforcement Learning (Open-Endedness)
...Member Technical Staff- Reinforcement Learning & Open-Ended Learning. Full-time, permanent opportunity... ...representing an early-stage applied research lab building AI capable of open-ended... ...exposure to one or more of: open-endedness, quality-diversity methods, intrinsic...
Suggested
Permanent employment
Full time
Brahma Consulting Group
Sunnyvale, CA
2 days ago
Research Scientist - Reinforcement Learning
...We are a dedicated research lab for building, understanding... ...researchers, data scientists, and engineers,... ...performance computing in deep learning, driving impactful... ...Scientist within our Reinforcement Learning team, you will... ...Proactively engage with the open-source community. -...
Suggested
Visa sponsorship
Shift work
Institute of Foundation Models
Sunnyvale, CA
22 days ago
Research Scientist - Reinforcement Learning, Robotics
$126k - $423k
...Bangalore; Seoul; and Tokyo. Learn more at applied.co. We... ...looking for multiple passionate Research Scientists to join the Research Group... ...to our business. We are open to all years of experience as... ...will: Conduct research on reinforcement learning (RL) related topics...
Suggested
Full time
For contractors
For subcontractor
Casual work
Work at office
Immediate start
Remote work
Day shift
Applied Intuition
Sunnyvale, CA
17 days ago
Applied Machine Learning Research Scientist
...speeds and empowers machine learning users to effortlessly run... ...an Applied Machine Learning Research Scientist at Cerebras, you will play... ...pretraining, fine-tuning, and reinforcement learning-based post-... ...meaningful codebases, projects, or open-source systems Why...
Suggested
Internship
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
4 days ago
Senior Applied Deep Learning Research Scientist, Efficiency
...are now looking for an Applied Deep Learning Research Scientist, Efficiency! Join our ADLR - Efficiency... ...quantization/sparsity/optimizers/reinforcement learning, efficient architectures and... ...efficient while retaining accuracy, and open‑source or publish these algorithms for...
Suggested
NVIDIA Gruppe
Santa Clara, CA
19 hours ago
ML Research Scientist (Embodied AI & Reinforcement Learning)
$175k - $250k
...intelligence by combining machine learning with large-scale... ...sits at the frontier of AI research, and we believe the highest-... ...Role We are hiring a Research Scientist to advance the learning algorithms... ...models, action models, reinforcement learning, and learning from...
Full time
Visa sponsorship
Flexible hours
Metamorphic
Palo Alto, CA
1 day ago
Machine Learning Research Scientist, Adaptive Behavioral Systems
$176k - $253.5k
...At Toyota Research Institute (TRI), we're on a mission... ...and advanced machine learning. The Opportunity... ...looking for an AI Research Scientist, or Senior Machine... ...work on cutting-edge open-ended research projects... ...uncertainty modeling, reinforcement learning, and physiological...
Temporary work
Local area
Shift work
Toyota Research Institute
Los Altos, CA
4 days ago
Research Scientist - Distributed Machine Learning
$300k
...Models We are a dedicated research lab for building, understanding... ...-class researchers, data scientists, and engineers, tackling the... ...performance computing in deep learning, driving impactful discoveries... ...’s rare elsewhere. · Open, mission-driven science – Join...
Visa sponsorship
Institute of Foundation Models
Sunnyvale, CA
22 days ago
Machine Learning Research Scientist: Generative Modeling for Planning
$167.2k - $250.8k
...motion planner combining machine learning and classic robotics... ...parking lots). About the Work Research and Implement state of the art... ...the hardest and most exciting open‑world robotics problems and drive... ...Carlo Tree Search (MCTS), Reinforcement Learning, Machine Learning...
Icehouseventures
Mountain View, CA
2 days ago
Machine Learning Research Scientist
...Machine Learning Research Scientist At Autoscience Institute, we create AI systems that autonomously conduct AI research. Recently, we announced... ...machine learning models, including deep learning, reinforcement learning or genetic algorithms. This does not include building...
Full time
Flexible hours
Autoscience Institute
Menlo Park, CA
4 days ago
Machine Learning Research Scientist, Behavior Planning and Prediction
$160.36k - $240.54k
...About The Role The mandate of the learned behavior team is to use advanced machine... ...to solve long tail problems, adjusting reinforcement learning techniques for motion planning... ...trajectories for autonomous driving. Research generative sequence modeling and sequential...
Nuro
Mountain View, CA
4 days ago
Senior Machine Learning Scientist
$173k
...Leadership Agreements, we foster an open culture where everyone... ...us. Senior Machine Learning Scientist The Senior Machine... ...framing and ideation through research, prototyping, deployment,... ...advanced ML techniques (e.g., reinforcement learning, sequence modeling...
Local area
Worldwide
Flexible hours
Expedia Group
San Jose, CA
1 day ago
Machine Learning Research Scientist, Mechanical Intuition in Multimodal Models
...At Toyota Research Institute (TRI), we're on a mission to improve the quality of human life. We're developing new tools and capabilities... ...and curation strategies, and a range of supervised and reinforcement learning techniques for physical manipulation. Develop and...
Work experience placement
Internship
Local area
Shift work
Toyota Research Institute
Los Altos, CA
4 days ago
Machine Learning Research Scientist: Generative Modeling for Planning
$160.36k - $240.54k
...Machine Learning Research Scientist: Generative Modeling for Planning Mountain View, California (HQ) Nuro is a self-driving technology... ...modality generation. Optimize generative models using reinforcement learning to improve interactive reasoning. Explore reward...
Nuro
Mountain View, CA
3 days ago
Senior Machine Learning Scientist, Conversion Modeling
$148.75k - $361k
...scale and low latencies. We use Machine Learning, Reinforcement Learning, AI, Control and Optimization... ...role you will work on applying SOTA research and conduct your own research to develop... ...Advertising domain Contributions to open-source ML projects #LI-DH2 Our...
Work at office
Local area
Remote work
Monday to Thursday
Flexible hours
Roku
San Jose, CA
1 day ago
Machine Learning Scientist III - Recommendations
...Machine Learning Scientist Recommender systems (e.g., image and content... .... Conduct applied research to improve recommender systems... ...techniques, deep learning and reinforcement learning. Build scalable... ...Leverage and contribute to open-source ML frameworks while...
Work at office
Remote work
2 days per week
Wayfair
Mountain View, CA
1 day ago
Senior Machine Learning Scientist, Creative Generation & Personalization
$148.75k - $361k
...scale and low latencies. We use Machine Learning, Reinforcement Learning, AI, Control and Optimization... ...role you will work on applying SOTA research and conduct your own research to develop... ...domain Contributions to open-source ML projects #LI-DH2 Our Hybrid...
Work at office
Local area
Remote work
Monday to Thursday
Flexible hours
Roku
San Jose, CA
1 day ago
Sr. Artificial Intelligence and Machine Learning Scientist, Early Career
$140k - $195k
...the Team : Our AI Research team, reporting... ...cutting-edge machine learning can transform the way... ...a world-class team of scientists and engineers, and represent... ...applications Experience with reinforcement learning for robotic... ...updates about GM, open roles, career insights...
Work at office
Local area
Work from home
Relocation package
Flexible hours
General Motors
Mountain View, CA
1 day ago
Research Scientist- Vision-Language-Action (VLA) Models
$165k - $185k
...Description The Bosch Research and Technology Center... ...As a Research Scientist- Vision-Language-Action... ...in core AI and machine learning fields to enable Embodied... ...autonomous planning, open-world learning, and so... ...Hands-on experience in reinforcement learning for behavior...
Work experience placement
Local area
Worldwide
Bosch Group
Sunnyvale, CA
6 days ago
Research Scientist, Trustworthy Learning under Uncertainty (TLU) - Large Behavior Models
$176k - $264k
...At Toyota Research Institute (TRI), we’re on a mission... ...complex tasks in open and interactive environments... ...and generative deep learning. However, it remains a... ...for a driven research scientist or research engineer with... ..., Imitation Learning, Reinforcement Learning, Planning &...
Local area
Shift work
Toyota Research Institute
Los Altos, CA
more than 2 months ago
Machine Learning Scientist (L4/L5) - Multi-modal Algorithms for Games
...team. Role We are seeking a Machine Learning Scientist to lead the research and development of Large Language... ...leveraging modern preference learning and reinforcement learning to enhance reasoning, tool... ...evaluate and integrate SOTA open‑source and commercial models while...
Hourly pay
Full time
Immediate start
Flexible hours
Netflix, Inc.
Los Gatos, CA
19 hours ago
Senior Research Scientist, Efficient Deep Learning
$184k - $299k
Senior Research Scientist, Efficient Deep Learning NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning to join our learning and perception research team. We are passionate about research that pushes boundaries and has impact in the...
NVIDIA Corporation
Santa Clara, CA
1 day ago
Senior Research Scientist, Efficient Deep Learning — Pruning & NAS
$184k - $356.5k
A leading technology company in California seeks a Senior Research Scientist specializing in efficient deep learning. The role focuses on designing and implementing innovative research methods while collaborating with various teams. A Ph.D. in a related field and 3+ years...
NVIDIA Corporation
Santa Clara, CA
1 day ago
Machine Learning Research Scientist, Mechanical Intuition in Multimodal Models
$176k - $253k
At Toyota Research Institute (TRI), we’re on a mission to improve... ...parts — and developing the learning infrastructure needed to train... ...are looking for a Research Scientist to join us in building intelligent... ...of policy learning, reinforcement learning, and physical reasoning...
Work experience placement
Internship
Local area
Shift work
Toyota Research Institute
Los Altos, CA
1 day ago
NLP Research Scientist: LLMs, Deep Learning & AI
A leading AI research institute in Sunnyvale, California, seeks a Research Scientist specializing in Natural Language Processing (NLP) to advance cutting-edge technologies... ...experience with large language models and deep learning. This role offers a competitive salary,...
Visa sponsorship
Institute of Foundation Models
Sunnyvale, CA
1 day ago
Research Scientist - Vision Language Model
...Models We are a dedicated research lab for building,... ...class researchers, data scientists, and engineers,... ...performance computing in deep learning, driving impactful... ...training systems, and reinforcement learning algorithms and... ...research publications, and open-source software....
Institute of Foundation Models
Sunnyvale, CA
7 days ago
Research Scientist - Agents
...We are a dedicated research lab for building, understanding... ...researchers, data scientists, and engineers,... ...performance computing in deep learning, driving impactful... ...evaluations to innovative reinforcement learning... ...research publications, and open-source software ·...
Visa sponsorship
Institute of Foundation Models
Sunnyvale, CA
24 days ago
Open-Source AI Research Scientist
...mission to make frontier AI truly open for all. We are founded on... .... Oumi also develops an open research stack and models in... ...Role Overview The Research Scientist will be an integral part of Oumi... ...in computer science, machine learning, artificial intelligence, or...
Worldwide
Flexible hours
Oumi
Palo Alto, CA
4 days ago
Research Scientist / Engineer - Robot Learning Data
...possible by our cutting edge research and end‑to‑end system design.... ...We’re looking for a Research Scientist or Research Engineer to own the... ...scaling high‑quality robot learning data. This role sits at the intersection... ...‑embodiment datasets (e.g., Open X‑Embodiment, DROID)...
Rhoda ai
Palo Alto, CA
3 days ago
Research Scientist / Engineer - Post-training & Robot Learning
...possibly by our cutting edge research and end-to-end system design... ...We're looking for Research Scientists and Research Engineers with... ...performance beyond what imitation learning alone achieves — reward... ...Required) Hands-on experience with reinforcement learning — reward design,...
Shift work
Rhoda ai
Palo Alto, CA
19 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Scientist- Reinforcement Learning (Open-Endedness). Be the first to apply!