Research Scientist - Reinforcement Learning (Open-Endedness)

Brahma Consulting Group

Member Technical Staff - Reinforcement Learning & Open-Ended Learning. Full-time, permanent opportunity in San Francisco.

We're representing an early-stage applied research lab building AI capable of open-ended learning: systems that keep getting better by discovering their own goals rather than optimizing ones we hand them.

What you'll do:

Develop RL methods for agents that discover useful objectives, tasks, and curricula without relying entirely on human-specified rewards
Design systems for open-ended learning including unsupervised / automated environment design, asymmetric self-play, and intrinsic motivation
Build training loops where agents learn from interaction, exploration, novelty, competence progress, self-generated challenges, or other nonstandard reward signals
Investigate how agents can avoid collapsing into trivial, degenerate, or easily exploitable objectives
Own and develop a research agenda end to end — from identifying promising directions, to running experiments, to communicating results

What we're looking for

5+ years in reinforcement learning research (PhD and academic years count).
Strong RL fundamentals, with exposure to one or more of: open-endedness, quality-diversity methods, intrinsic motivation, self-play, multi-agent RL, or goal-conditioned RL.
Proficiency in Python and modern ML frameworks (PyTorch and/or JAX).
A track record of research output and/or shipping research-grade code.
High agency and comfort operating in an early-stage, fast-moving research environment.

Nice to have

Experience with LLM post-training and/or coding agents.
Familiarity with population-based training, POET, or related open-ended / evolutionary approaches.
Publications at top venues (NeurIPS, ICML, ICLR, GECCO, RLC, or similar).

Vacancy posted 1 day ago
