Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Staff Research Scientist, Reinforcement Learning

Centific Global Solutions, Inc.

About Centific Centific is a frontier AI data foundry that curates diverse, high‑quality data, using our purpose‑built technology platforms to empower the Magnificent Seven and our enterprise clients with safe, scalable AI deployment. Our team includes more than 150 PhDs and data scientists, along with more than 4,000 AI practitioners and engineers. We harness the power of an integrated solution ecosystem—comprising industry‑leading partnerships and 1.8 million vertical domain experts in more than 230 markets—to create contextual, multilingual, pre‑trained datasets; fine‑tuned, industry‑specific LLMs; and RAG pipelines supported by vector databases. Our zero‑distance innovation solutions for GenAI can reduce GenAI costs by up to 80% and bring solutions to market 50% faster. Our mission is to bridge the gap between AI creators and industry leaders by bringing best practices in GenAI to unicorn innovators and enterprise customers, helping them unlock significant business value by deploying GenAI at scale to maintain a competitive edge. What You'll Do Design simulation environments and digital twins for enterprise workflows Post‑train LLM agents using RLHF, DPO, GRPO, PPO, and emerging methods Build pipelines that convert human‑labeled traces and verifiable signals into training data Architect multi‑turn, tool‑using agents with closed learning loops Design reward functions and verifiers that resist reward hacking and reflect real task outcomes Set the technical bar across the team — architecture, code review, engineering standards Mentor researchers and engineers; drive technical direction through influence Translate research into production; contribute to publications Required Qualifications 7+ years in ML/AI research or engineering, with 3+ years at senior or staff level MS or PhD in Computer Science, Machine Learning, or related field (or equivalent) 5+ years hands‑on RL—environment design, reward engineering, policy optimization—with at least one production deployment 3+ years fine‑tuning LLMs with hands‑on RL post‑training (RLHF, DPO, GRPO, PPO) Expert‑level implementation of RLHF pipelines, reward modeling (Bradley‑Terry), DPO, and KTO Working knowledge of modern post‑training and rollout‑serving libraries (TRL, veRL, OpenRLHF, SkyRL) Experience building LLM‑based agents: tool use, multi‑turn reasoning, trajectory evaluation Strong Python and software engineering skills—comfortable building production pipelines, not just notebooks Deep expertise in MDPs, policy gradient methods (PPO, SAC), and temporal difference learning Hands‑on experience with Gymnasium‑based environments and reward engineering (sparse vs. dense) Preferred Qualifications Publications at NeurIPS, ICML, ICLR, ACL, COLM, or similar venues Open‑source contributions to post‑training or agent frameworks (TRL, veRL, OpenRLHF, SkyRL) Experience with Offline RL (CQL, IQL), Model‑based RL / World Models, or Hierarchical RL Background in synthetic data generation, simulation, or world models Domain experience in healthcare, finance, logistics, or compliance Distributed training on GPU clusters Why Join Centific Lead the frontier and shape a new discipline at the intersection of post‑training, simulation, and enterprise AI Ship your science and see your research power real systems across healthcare, finance, and safety‑critical operations Collaborate with leaders and work alongside NVIDIA, Microsoft, and the global AI community Build governed, compliant AI systems enterprises can trust EEO Statement Centific is an equal‑opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, ancestry, citizenship status, age, mental or physical disability, medical condition, sex (including pregnancy), gender identity or expression, sexual orientation, marital status, familial status, veteran status, or any other characteristic protected by applicable law. We consider qualified applicants regardless of criminal histories, consistent with legal requirements. #J-18808-Ljbffr

Vacancy posted 11 hours ago
Similar jobs that could be interesting for youBased on the Senior Staff Research Scientist, Reinforcement Learning in East Palo Alto, CA vacancy
  • ## Senior Staff Research Scientist, Agentic AI & RLApplylocations: East Palo Alto, CAtime type: Full timeposted on: Posted Todayjob requisition...  ...hiring a **Staff Research Scientist, Agentic AI & Reinforcement Learning** — a dual-track role designed for exceptional... 
    Senior

    Centific Global Solutions, Inc.

    Palo Alto, CA
    10 hours ago
  • $193.93k - $352.29k

     ...Senior/Staff Machine Learning Research Scientist: Generative Modeling for Planning Mountain View, California (HQ) Who We Are Nuro is a self...  ...modality generation. Optimize generative models using reinforcement learning to improve interactive reasoning. Explore... 
    Senior

    Nuro

    Mountain View, CA
    2 days ago
  • $202.35k - $303.05k

     ...hardest autonomy problems. Here you will explore novel and advanced machine learning methods to solve practical real-world challenging problems in autonomous driving. You will be focusing on researching and developing state of the art generative models, with an emphasis on... 
    Senior

    Icehouseventures

    Mountain View, CA
    10 hours ago
  •  ...Intuit's AI Research team develops novel AI/ML solutions that power intelligent...  ...customer value. We are looking for a Senior Staff AI Research Scientist to shape and drive the research agenda...  ...systems, neuro-symbolic AI, deep learning, and/or LLM-based reasoning for business... 
    Senior

    Intuit

    Mountain View, CA
    10 hours ago
  • $175k - $250k

     ...intelligence by combining machine learning with large-scale...  ...sits at the frontier of AI research, and we believe the highest-...  ...Role We are hiring a Research Scientist to advance the learning algorithms...  ...models, action models, reinforcement learning, and learning from... 
    Suggested
    Full time
    Visa sponsorship
    Flexible hours

    Metamorphic

    Palo Alto, CA
    9 hours ago
  •  ...We are now looking for an Applied Deep Learning Research Scientist, Efficiency! Join our ADLR – Efficiency team to make deep learning faster...  ...deployment. Topics include quantization/sparsity/optimizers/reinforcement learning, efficient architectures and pre‑training. Our... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...Member Technical Staff- Reinforcement Learning & Open-Ended Learning. Full-time, permanent opportunity in San Francisco. We're representing an early-stage applied research lab building AI capable of open-ended learning , systems that keep getting better by discovering... 
    Permanent employment
    Full time

    Brahma Consulting Group

    Fremont, CA
    1 day ago
  •  ...We are building next-generation end-to-end autonomous driving systems powered by reinforcement learning. You will work on applying RL in closed-loop, safety-critical environments , leveraging large-scale simulation and real-world driving data to improve safety, comfort... 

    DeepRoute

    Fremont, CA
    10 hours ago
  • $150k

     ...Foundation Models We are a dedicated research lab for building,...  ...world‑class researchers, data scientists, and engineers, tackling the...  ...performance computing in deep learning, driving impactful...  ...Research Scientist within our Reinforcement Learning team, you will play... 
    Visa sponsorship
    Shift work

    Institute of Foundation Models

    Sunnyvale, CA
    1 day ago
  • $262k - $365k

    Senior Staff Research Scientist, Google Cloud AI Research Google Sunnyvale, CA, USA Requirements PhD in Computer Science, a related field, or equivalent...  ...breadth of computer science, such as machine (and deep) learning, data mining, natural language processing, hardware and... 
    Senior
    Full time
    Worldwide

    Google Inc.

    Sunnyvale, CA
    3 days ago
  • $222.78k - $367.18k

     ...AI‑first self‑driving software. Built to learn and improve through data, the Nuro...  ...distribution detection, imitation learning, and reinforcement learning. If you love solving...  ...come join us! About the Work Apply and research novel and advanced machine‑learning techniques... 
    Work experience placement

    Icehouseventures

    Mountain View, CA
    2 days ago
  •  ..., CA, and focuses on building the agent learning platform and SLM AI flywheel that powers...  ...techniques such as PEFT, SFT, distillation, and reinforcement learning (RLHF, DPO). Drive model...  ...that influence the roadmap. Mentor senior and mid‑level engineers on experimentation... 
    Senior
    Full time
    Immediate start

    Uniphore Technologies North America Inc

    Palo Alto, CA
    10 hours ago
  • $222k - $300k

     ...for innovative and hands-on Senior Staff AI Scientist to join the Intuit AI team...  ...AI scientists and machine learning engineers and build models...  ..., Bayesian Learning, Reinforcement Learning, or Deep Learning...  ...DS solutions. Proactively researches, explores, and enables new... 
    Senior

    Intuit

    Mountain View, CA
    1 day ago
  • $232.9k - $320.25k

     ...category-defining product - building the agent learning platform and SLM AI flywheel that powers...  ...such as PEFT, SFT, distillation, and reinforcement learning (RLHF, DPO). ~ Drive model...  ...influence the roadmap. ~ Mentor senior and mid-level engineers on experimentation... 
    Senior
    Full time
    Immediate start

    Uniphore

    Palo Alto, CA
    1 day ago
  • Centific Global Solutions, Inc. is seeking a Senior Staff Research Scientist in Agentic AI & Reinforcement Learning to lead cutting-edge AI initiatives. This role emphasizes significant technical influence, mentoring, and design responsibilities in building governed RL... 
    Senior

    Centific Global Solutions, Inc.

    Palo Alto, CA
    1 day ago
  • $184k - $356.5k

     ...A leading technology company in California seeks a Senior Research Scientist specializing in efficient deep learning. The role focuses on designing and implementing innovative research methods while collaborating with various teams. A Ph.D. in a related field and 3+ years... 
    Senior

    NVIDIA

    Santa Clara, CA
    9 hours ago
  • $184k - $299k

     ...Senior Research Scientist, Efficient Deep Learning NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning to join our learning and perception research team. We are passionate about research that pushes boundaries and has impact in... 
    Senior

    NVIDIA

    Santa Clara, CA
    11 hours ago
  • $183.83k - $275.98k

     ...Perception & Behavior team, leveraging the cutting edge of machine learning research to solve challenging real-world robotics problems. This...  ...End-to-End Autonomous Driving Models, Imitation Learning, Reinforcement Learning, or World Models. You have strong problem-solving... 
    Senior

    Icehouseventures

    Mountain View, CA
    9 hours ago
  • $140k - $195k

     ...About the Team Our AI Research team, reporting directly...  ...cutting-edge machine learning can transform the way...  ...scale. The Role As a Senior Machine Learning Engineer...  ...a world-class team of scientists and engineers, and...  ...applications Experience with reinforcement learning for robotic... 
    Senior
    Work at office
    Relocation package
    Flexible hours

    General Motors

    Mountain View, CA
    10 hours ago
  •  ...MongoDB is looking for a Staff Research Scientist to join their Voyage AI team in Palo Alto or remotely in the U.S. This role involves conducting...  ...PhD in Computer Science and a strong background in machine learning and deep learning. Join MongoDB to tackle complex... 
    Remote work

    I did my part and supported the Regular Toilet

    Palo Alto, CA
    10 hours ago
  • $130k - $260k

     ...and Great Careers.Job Title: Staff Applied Research ScientistWe are seeking a...  ...strategic Staff Applied Research Scientist to lead the technical...  ...Building: Mentor junior and senior scientists, elevate...  ...Computer Science, Machine Learning, Statistics, Mathematics, or... 
    Hourly pay
    Work experience placement
    Local area
    Flexible hours

    GEICO

    Palo Alto, CA
    10 hours ago
  • $169.8k - $212.3k

     ...POSITION SUMMARY: Natera is seeking a Senior Machine Learning Scientist to join our AI team, an advanced R...  ...and Chain-of-Thought, alongside reinforcement fine-tuning (RFT) to ensure agents...  ...or more years of experience in AI research or engineering, with a proven track... 
    Senior
    Work at office
    Immediate start
    Remote work
    Worldwide

    Natera

    San Carlos, CA
    4 days ago
  •  ...Research Scientists at Riot combine deep technical expertise across machine learning, artificial intelligence (AI), experimental design, and...  ...experience games. As a Staff Research Scientist, you push...  ...representation learning, model-based reinforcement learning, or learned... 
    Local area
    Flexible hours

    Riot Games

    Redwood City, CA
    2 days ago
  •  ...Machine Learning Research Scientist At Autoscience Institute, we create AI systems that autonomously conduct AI research. Recently, we announced...  ...machine learning models, including deep learning, reinforcement learning or genetic algorithms. This does not include building... 
    Full time
    Flexible hours

    Autoscience Institute

    Menlo Park, CA
    3 days ago
  • Staff Research Scientist, Women's Health Sensing Google • Mountain View, CA, USA Qualifications PhD degree in Computer Science, a related field...  ...solutions. Publish work in top clinical and machine learning venues, representing Google in the community. Equal Opportunity... 
    Immediate start

    Google Inc.

    Mountain View, CA
    2 days ago
  • $222.78k - $367.18k

    Nuro is seeking a seasoned machine learning expert to enhance its autonomy systems in Mountain View, California. You will apply advanced methods in machine learning to tackle perception, prediction, and planning issues. Ideal candidates have over 6 years of experience... 

    Icehouseventures

    Mountain View, CA
    2 days ago
  • $151k - $297k

     ...It is backed by a strong team of AI researchers from Stanford, MIT, Berkeley, Princeton...  ...Position Overview We are seeking a Staff Research Scientist to join our team and contribute to the...  ...problems at the intersection of machine learning research and practical deployment of... 
    Work at office
    Local area
    Remote work
    Worldwide

    I did my part and supported the Regular Toilet

    Palo Alto, CA
    1 day ago
  • $167.2k - $250.8k

     ...level motion planner combining machine learning and classic robotics approaches. It also...  ..., parking lots). About the Work Research and Implement state of the art algorithms...  ...Search, Monte Carlo Tree Search (MCTS), Reinforcement Learning, Machine Learning for Planning... 

    Icehouseventures

    Mountain View, CA
    1 day ago
  • $126k - $423k

     ...Stockholm; Bangalore; Seoul; and Tokyo. Learn more at applied.co. We are an in-...  ...We are looking for multiple passionate Research Scientists to join the Research Group at Applied...  ...Intuition, you will: Conduct research on reinforcement learning (RL) related topics including... 
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Immediate start
    Remote work
    Day shift

    Applied Intuition

    Sunnyvale, CA
    21 days ago
  •  ...scale - including candidate generation, learning-to-rank, retrieval, and ranking...  ...uplift modeling, multi-armed bandits, Reinforcement Learning, Online Learning, and Deep Learning...  ...proactively in a fast-paced environment. Quickly research, explore, and enable new ML,... 
    Senior

    Intuit

    Mountain View, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Staff Research Scientist, Reinforcement Learning. Be the first to apply!