Research Scientist - Reinforcement Learning (Open-Endedness)

Brahma Consulting Group

Member of Technical Staff - Reinforcement Learning & Open-Ended Learning. Full-time, permanent opportunity in San Francisco.

We're representing an early-stage applied research lab building AI capable of open-ended learning: systems that keep getting better by discovering their own goals rather than optimizing ones we hand them.

What you'll do:

  • Develop RL methods for agents that discover useful objectives, tasks, and curricula without relying entirely on human-specified rewards
  • Design systems for open-ended learning, including unsupervised/automated environment design, asymmetric self-play, and intrinsic motivation
  • Build training loops where agents learn from interaction, exploration, novelty, competence progress, self-generated challenges, or other nonstandard reward signals
  • Investigate how agents can avoid collapsing into trivial, degenerate, or easily exploitable objectives
  • Own and develop a research agenda end to end — from identifying promising directions, to running experiments, to communicating results

What we're looking for

  • 5+ years in reinforcement learning research (PhD and academic years count).
  • Strong RL fundamentals, with exposure to one or more of: open-endedness, quality-diversity methods, intrinsic motivation, self-play, multi-agent RL, or goal-conditioned RL.
  • Proficiency in Python and modern ML frameworks (PyTorch and/or JAX).
  • A track record of research output and/or shipping research-grade code.
  • High agency and comfort operating in an early-stage, fast-moving research environment.

Nice to have

  • Experience with LLM post-training and/or coding agents.
  • Familiarity with population-based training, POET, or related open-ended/evolutionary approaches.
  • Publications at top venues (NeurIPS, ICML, ICLR, GECCO, RLC, or similar).

Vacancy posted 8 hours ago