Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Research Scientist- Reinforcement Learning (Open-Endedness)

Brahma Consulting Group

Member Technical Staff- Reinforcement Learning & Open-Ended Learning. Full-time, permanent opportunity in San Francisco.

We're representing an early-stage applied research lab building AI capable of open-ended learning , systems that keep getting better by discovering their own goals rather than optimizing ones we hand them

What you'll do:

  • Develop RL methods for agents that discover useful objectives, tasks, and curricula without relying entirely on human-specified rewards
  • Design systems for open-ended learning including unsupervised / automated environment design, asymmetric self-play, and intrinsic motivation
  • Build training loops where agents learn from interaction, exploration, novelty, competence progress, self-generated challenges, or other nonstandard reward signals
  • Investigate how agents can avoid collapsing into trivial, degenerate, or easily exploitable objectives
  • Own and develop a research agenda end to end — from identifying promising directions, to running experiments, to communicating results

What we're looking for

  • 5+ years in reinforcement learning resea rch (PhD and academic years count).
  • Strong RL fundamentals, with exposure to one or more of: open-endedness, quality-diversity methods, intrinsic motivation, self-play, multi-agent RL, or goal-conditioned RL.
  • Proficiency in Python and modern ML framework s (PyTorch and/or JAX)
  • A track record of research output and/or shipping research-grade code.
  • High agency and comfort operating in an early-stage, fast-moving research environment.

Nice to have

  • Experience with LLM post-training and/or coding agents.
  • Familiarity with population-based training, POET, or related open-ended / evolutionary approaches
  • Publications at top venues (NeurIPS, ICML, ICLR, GECCO, RLC, or similar).

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Research Scientist- Reinforcement Learning (Open-Endedness) in San Jose, CA vacancy
  •  ...Member Technical Staff- Reinforcement Learning & Open-Ended Learning. Full-time, permanent opportunity...  ...representing an early-stage applied research lab building AI capable of open-ended...  ...exposure to one or more of: open-endedness, quality-diversity methods, intrinsic... 
    Suggested
    Permanent employment
    Full time

    Brahma Consulting Group

    Sunnyvale, CA
    1 day ago
  •  ...We are a dedicated research lab for building, understanding...  ...researchers, data scientists, and engineers,...  ...performance computing in deep learning, driving impactful...  ...Scientist within our Reinforcement Learning team, you will...  ...Proactively engage with the open-source community.   -... 
    Suggested
    Visa sponsorship
    Shift work

    Institute of Foundation Models

    Sunnyvale, CA
    21 days ago
  • $126k - $423k

     ...Bangalore; Seoul; and Tokyo. Learn more at applied.co. We...  ...looking for multiple passionate Research Scientists to join the Research Group...  ...to our business. We are open to all years of experience as...  ...will: Conduct research on reinforcement learning (RL) related topics... 
    Suggested
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Immediate start
    Remote work
    Day shift

    Applied Intuition

    Sunnyvale, CA
    16 days ago
  •  ...speeds and empowers machine learning users to effortlessly run...  ...an Applied Machine Learning Research Scientist at Cerebras, you will play...  ...pretraining, fine-tuning, and reinforcement learning-based post-...  ...meaningful codebases, projects, or open-source systems Why... 
    Suggested
    Internship

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    3 days ago
  • $148.75k - $361k

     ...scale and low latencies. We use Machine Learning, Reinforcement Learning, AI, Control and Optimization...  ...role you will work on applying SOTA research and conduct your own research to develop...  ...domain Contributions to open-source ML projects #LI-DH2 Our Hybrid... 
    Suggested
    Work at office
    Local area
    Remote work
    Monday to Thursday
    Flexible hours

    Roku

    San Jose, CA
    13 hours ago
  • $148.75k - $361k

     ...scale and low latencies. We use Machine Learning, Reinforcement Learning, AI, Control and Optimization...  ...role you will work on applying SOTA research and conduct your own research to develop...  ...Advertising domain Contributions to open-source ML projects #LI-DH2 Our... 
    Work at office
    Local area
    Remote work
    Monday to Thursday
    Flexible hours

    Roku

    San Jose, CA
    5 days ago
  • $173k

     ...Leadership Agreements, we foster an open culture where everyone...  ...us. Senior Machine Learning Scientist The Senior Machine...  ...framing and ideation through research, prototyping, deployment,...  ...advanced ML techniques (e.g., reinforcement learning, sequence modeling... 
    Local area
    Worldwide
    Flexible hours

    Expedia Group

    San Jose, CA
    13 hours ago
  • $137.5k

     ...Agreements, we foster an open culture where everyone belongs...  ...about using machine learning to improve Customer Experience...  ...! We are looking to hire researchers and data scientists interested in breaking new...  ...commerce. Experience with reinforcement learning or other advanced... 
    Worldwide
    Flexible hours

    Expedia, Inc.

    San Jose, CA
    1 day ago
  • $300k

     ...Models We are a dedicated research lab for building, understanding...  ...-class researchers, data scientists, and engineers, tackling the...  ...performance computing in deep learning, driving impactful discoveries...  ...’s rare elsewhere. ·      Open, mission-driven science – Join... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    21 days ago
  • $181.1k - $318.4k

     ...AIML - Machine Learning Researcher, MLR Play a part in building the next revolution in machine learning technology. We're looking for passionate...  ...in a diverse, collaborative environment. High-quality open-source contributions to related projects. PhD or... 
    Relocation

    Apple

    Cupertino, CA
    13 hours ago
  • $187k

     ...and Leadership Agreements, we foster an open culture where everyone belongs,...  ...world. Join us. Senior Machine Learning Scientist Introduction to the Team: Expedia...  ...management, and multi-agent collaboration Research and implement state-of-the-art techniques... 
    Local area
    Worldwide
    Flexible hours

    Expedia Group

    San Jose, CA
    13 hours ago
  • $60 per hour

     ...Machine Learning Scientist Intern (TikTok-Content Ecology—LLM application) - 2026 Start (PhD)...  ...actively contribute to our products and research, and to the organization's future plans...  ...ACL, etc.) or strong contributions to open-source AI projects. By submitting... 
    Hourly pay
    Internship
    Local area
    Worldwide

    Tik Tok

    San Jose, CA
    1 day ago
  • $187k

     ...and Leadership Agreements, we foster an open culture where everyone belongs,...  ...world. Join us. Senior Machine Learning Scientist – Personalization Introduction...  ...role for someone who can translate recent research into production-quality systems, influence... 
    Local area
    Flexible hours

    Expedia Group

    San Jose, CA
    3 days ago
  • $187k - $261.5k

    Senior Machine Learning Scientist - Personalization We are looking for a Senior Machine Learning...  ...role that involves translating recent research into production‑quality systems, influencing...  ...academic publications, patents, open‑source contributions, technical blog posts... 

    Traveltechessentialist

    San Jose, CA
    4 days ago
  • $165k - $185k

     ...Description The Bosch Research and Technology Center...  ...As a Research Scientist- Vision-Language-Action...  ...in core AI and machine learning fields to enable Embodied...  ...autonomous planning, open-world learning, and so...  ...Hands-on experience in reinforcement learning for behavior... 
    Work experience placement
    Local area
    Worldwide

    Bosch Group

    Sunnyvale, CA
    5 days ago
  •  ...Models We are a dedicated research lab for building,...  ...class researchers, data scientists, and engineers,...  ...performance computing in deep learning, driving impactful...  ...training systems, and reinforcement learning algorithms and...  ...research publications, and open-source software.... 

    Institute of Foundation Models

    Sunnyvale, CA
    6 days ago
  •  ...We are a dedicated research lab for building, understanding...  ...researchers, data scientists, and engineers,...  ...performance computing in deep learning, driving impactful...  ...evaluations to innovative reinforcement learning...  ...research publications, and open-source software ·... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    23 days ago
  • $150k - $290k

     ...status as a U.S. Citizen, a lawful permanent resident, or a protected individual under 8 U.S.C. 1324b(a)(3). Role: Machine Learning Researcher Location: 2550 N First Street Suite 250, San Jose, California 95131 Compensation*: $150,000-$290,000 + benefits Role... 
    Permanent employment

    Rivet Industries

    San Jose, CA
    3 days ago
  •  ...contexts much more efficiently than GPUs. Sohu enables entirely new research directions and products. When our chips come out, these use...  ...with python, pytorch, CUDA, DDP/FSDP • Deep understanding of open and closed source model architectures and open source... 

    OpenReq

    Cupertino, CA
    13 hours ago
  •  ...Role Number: 200601297-3760 Summary We are hiring a researcher with a strong technical background in Image/Video generation and...  ...Apple. We are looking for people with excellent applied machine learning, computer vision/graphics experience, and solid engineering skills... 

    Apple

    Santa Clara, CA
    4 days ago
  •  ...Role: Data Scientist - Machine Learning Focus (This role is open to US Citizens, Green Card holders, GC-EAD only. We do not sponsor visas.) Adidev Technologies Inc Adidev Technologies is seeking 2 yrs of relevant experience in Data Science. A project can last... 
    Full time
    Contract work
    Immediate start
    Remote work
    Visa sponsorship
    Relocation package

    Adidev Technologies Inc

    San Jose, CA
    1 day ago
  • $179.2k - $374.4k

     ...Data Scientist (Open Platform Mini Games) - TikTok Infrastructure (San Jose) Location: San...  ...Social, Creation, Product Infrastructure, Research and Science, etc. The goal of the team...  ...Research and Development, and Machine Learning Engineering, to improve user experience... 
    Temporary work
    Casual work
    Local area

    Tik Tok

    San Jose, CA
    3 days ago
  •  ...computer vision and solving deep learning problems? The Video...  ...including ML engineers, data scientists, and ML Infrastructure engineers...  ...judges to perform subjective and open-ended model evaluations (e.g....  ...Python. Collaborate with research, engineering, and product teams... 

    Apple

    Sunnyvale, CA
    4 days ago
  • $357k

     ...across the business. To learn more, visit Why...  ...Responsibilities Workato's AI Research Lab is seeking an...  ...Lead AI Research Scientist to join our growing team...  ...authorship, and open scientific contribution...  ...transformer architectures, reinforcement learning techniques,... 
    Work at office
    Remote work
    Flexible hours

    Workato

    San Jose, CA
    20 days ago
  • $60 per hour

     ...Machine Learning Scientist Intern (TikTok-Recommendation) - 2026 Start (PhD) Location: San Jose Employment Type: Intern Job Code...  ...and platform growth. The team is made up of machine learning researchers and engineers, who support and innovate on production... 
    Hourly pay
    Internship
    Local area

    Tik Tok

    San Jose, CA
    2 days ago
  • $60 per hour

     ...constantly stretching the limits of deep learning and large-scale system design, we're...  ...and feed quality needs. Conduct original research on applying RL (e.g., bandit models,...  ...world applications. - Familiarity with reinforcement learning, bandit algorithms, or offline... 
    Hourly pay
    Internship
    Local area

    Tik Tok

    San Jose, CA
    4 days ago
  • $181.1k - $318.4k

     ...Sr. Machine Learning Scientist, Siri Speech The Speech Team within the Siri organization drives major speech recognition, synthesis and...  ...and services ecosystem. This allows for cutting edge applied research anchored in Apple specific production needs, while improving... 
    Relocation

    Apple

    Cupertino, CA
    2 days ago
  • $156k - $387.6k

     ...TikTok's products and platforms. About the role The Machine Learning Scientist will lead and collaborate with cross-functional teams to design...  ...'s degree in Finance, Mathematics, Statistics, Operations Research or other related field and two years of experience in the... 
    Temporary work
    Work at office
    Local area

    Tik Tok

    San Jose, CA
    2 days ago
  •  ...analysis (EDA) to identify patterns, trends, and relationships within large datasets. Assist in building and optimizing machine learning models to solve business problems. Develop data visualizations, dashboards, and reports using tools like Power BI or Python... 
    Hourly pay
    Permanent employment
    Internship

    Marvell

    Santa Clara, CA
    3 days ago
  • $60 per hour

    Student Researcher (AI Foundation Models Infrastructure - Seed Infra) - 2026 Start (PhD...  ...efficiency, fault tolerance) Contribute to reinforcement learning training frameworks and large‑scale...  ...projects Contributions to open‑source ML systems or performance tooling... 
    Hourly pay
    Internship
    Local area

    ByteDance

    San Jose, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Scientist- Reinforcement Learning (Open-Endedness). Be the first to apply!