Research Scientist, Reinforcement Learning

Deeproute.ai

We are building next‑generation end-to-end autonomous driving systems powered by reinforcement learning. You will work on applying RL in closed-loop, safety‑critical environments, leveraging large‑scale simulation and real‑world driving data to improve safety, comfort, and robustness.

  • Train and deploy RL policies in closed-loop driving environments
  • Scale RL training using massively parallel simulation systems
  • Design and optimize reward functions for complex driving behaviours
  • Improve sim‑to‑real transfer for real‑world robustness
  • Collaborate with cross‑functional teams to integrate models into production systems

Core Technical Skills

  • Proficiency in modern RL algorithms: DQN, PPO, SAC, TD3, etc.
  • Proficiency in modern RLHF algorithms: PPO, DPO, GRPO, etc.
  • Hands‑on experience training reward models and finetuning LLM/VLM/VLA
  • Knowledge of distributed RL training at scale
  • Proficiency with massively parallel simulation environments
  • Knowledge of sim‑to‑real transfer techniques and domain randomisation
  • Proficiency in Python, comfortable with C++
  • Proficiency in deep learning frameworks such as PyTorch
  • Experience with distributed training frameworks (Ray, Horovod, etc.)
  • Knowledge of model optimisation (quantisation, pruning) and CUDA is a plus
  • Knowledge of traffic rules, driving behaviour modelling

Preferred Qualifications

  • Publications in top‑tier venues (ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV, ICRA, IROS, etc.)
  • Open‑source contributions to RL libraries or autonomous driving projects
  • Previous experience with LLM fine‑tuning using RLHF
  • Knowledge of safe RL, interpretable AI, or robustness techniques
  • Familiarity with autonomous vehicle regulations and safety standards

Vacancy posted 9 hours ago