Senior Staff Research Scientist, Reinforcement Learning

Centific Global Solutions, Inc.

About Centific Centific is a frontier AI data foundry that curates diverse, high‑quality data, using our purpose‑built technology platforms to empower the Magnificent Seven and our enterprise clients with safe, scalable AI deployment. Our team includes more than 150 PhDs and data scientists, along with more than 4,000 AI practitioners and engineers. We harness the power of an integrated solution ecosystem—comprising industry‑leading partnerships and 1.8 million vertical domain experts in more than 230 markets—to create contextual, multilingual, pre‑trained datasets; fine‑tuned, industry‑specific LLMs; and RAG pipelines supported by vector databases. Our zero‑distance innovation solutions for GenAI can reduce GenAI costs by up to 80% and bring solutions to market 50% faster. Our mission is to bridge the gap between AI creators and industry leaders by bringing best practices in GenAI to unicorn innovators and enterprise customers, helping them unlock significant business value by deploying GenAI at scale to maintain a competitive edge. What You'll Do Design simulation environments and digital twins for enterprise workflows Post‑train LLM agents using RLHF, DPO, GRPO, PPO, and emerging methods Build pipelines that convert human‑labeled traces and verifiable signals into training data Architect multi‑turn, tool‑using agents with closed learning loops Design reward functions and verifiers that resist reward hacking and reflect real task outcomes Set the technical bar across the team — architecture, code review, engineering standards Mentor researchers and engineers; drive technical direction through influence Translate research into production; contribute to publications Required Qualifications 7+ years in ML/AI research or engineering, with 3+ years at senior or staff level MS or PhD in Computer Science, Machine Learning, or related field (or equivalent) 5+ years hands‑on RL—environment design, reward engineering, policy optimization—with at least one production deployment 3+ years fine‑tuning LLMs with hands‑on RL post‑training (RLHF, DPO, GRPO, PPO) Expert‑level implementation of RLHF pipelines, reward modeling (Bradley‑Terry), DPO, and KTO Working knowledge of modern post‑training and rollout‑serving libraries (TRL, veRL, OpenRLHF, SkyRL) Experience building LLM‑based agents: tool use, multi‑turn reasoning, trajectory evaluation Strong Python and software engineering skills—comfortable building production pipelines, not just notebooks Deep expertise in MDPs, policy gradient methods (PPO, SAC), and temporal difference learning Hands‑on experience with Gymnasium‑based environments and reward engineering (sparse vs. dense) Preferred Qualifications Publications at NeurIPS, ICML, ICLR, ACL, COLM, or similar venues Open‑source contributions to post‑training or agent frameworks (TRL, veRL, OpenRLHF, SkyRL) Experience with Offline RL (CQL, IQL), Model‑based RL / World Models, or Hierarchical RL Background in synthetic data generation, simulation, or world models Domain experience in healthcare, finance, logistics, or compliance Distributed training on GPU clusters Why Join Centific Lead the frontier and shape a new discipline at the intersection of post‑training, simulation, and enterprise AI Ship your science and see your research power real systems across healthcare, finance, and safety‑critical operations Collaborate with leaders and work alongside NVIDIA, Microsoft, and the global AI community Build governed, compliant AI systems enterprises can trust EEO Statement Centific is an equal‑opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, ancestry, citizenship status, age, mental or physical disability, medical condition, sex (including pregnancy), gender identity or expression, sexual orientation, marital status, familial status, veteran status, or any other characteristic protected by applicable law. We consider qualified applicants regardless of criminal histories, consistent with legal requirements. #J-18808-Ljbffr

Apply

Vacancy posted 11 hours ago

Similar jobs that could be interesting for youBased on the Senior Staff Research Scientist, Reinforcement Learning in East Palo Alto, CA vacancy

Senior Staff Research Scientist, Reinforcement Learning
## Senior Staff Research Scientist, Agentic AI & RLApplylocations: East Palo Alto, CAtime type: Full timeposted on: Posted Todayjob requisition... ...hiring a **Staff Research Scientist, Agentic AI & Reinforcement Learning** — a dual-track role designed for exceptional...
Senior
Centific Global Solutions, Inc.
Palo Alto, CA
10 hours ago
Senior/Staff Machine Learning Research Scientist: Generative Modeling for Planning
$193.93k - $352.29k
...Senior/Staff Machine Learning Research Scientist: Generative Modeling for Planning Mountain View, California (HQ) Who We Are Nuro is a self... ...modality generation. Optimize generative models using reinforcement learning to improve interactive reasoning. Explore...
Senior
Nuro
Mountain View, CA
2 days ago
Senior/Staff Machine Learning Research Scientist: Generative Modeling for Planning
$202.35k - $303.05k
...hardest autonomy problems. Here you will explore novel and advanced machine learning methods to solve practical real-world challenging problems in autonomous driving. You will be focusing on researching and developing state of the art generative models, with an emphasis on...
Senior
Icehouseventures
Mountain View, CA
10 hours ago
Senior Staff AI Research Scientist
...Intuit's AI Research team develops novel AI/ML solutions that power intelligent... ...customer value. We are looking for a Senior Staff AI Research Scientist to shape and drive the research agenda... ...systems, neuro-symbolic AI, deep learning, and/or LLM-based reasoning for business...
Senior
Intuit
Mountain View, CA
10 hours ago
ML Research Scientist (Embodied AI & Reinforcement Learning)
$175k - $250k
...intelligence by combining machine learning with large-scale... ...sits at the frontier of AI research, and we believe the highest-... ...Role We are hiring a Research Scientist to advance the learning algorithms... ...models, action models, reinforcement learning, and learning from...
Suggested
Full time
Visa sponsorship
Flexible hours
Metamorphic
Palo Alto, CA
9 hours ago
Senior Applied Deep Learning Research Scientist, Efficiency
...We are now looking for an Applied Deep Learning Research Scientist, Efficiency! Join our ADLR – Efficiency team to make deep learning faster... ...deployment. Topics include quantization/sparsity/optimizers/reinforcement learning, efficient architectures and pre‑training. Our...
Senior
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Research Scientist- Reinforcement Learning (Open-Endedness)
...Member Technical Staff- Reinforcement Learning & Open-Ended Learning. Full-time, permanent opportunity in San Francisco. We're representing an early-stage applied research lab building AI capable of open-ended learning , systems that keep getting better by discovering...
Permanent employment
Full time
Brahma Consulting Group
Fremont, CA
1 day ago
Research Scientist, Reinforcement Learning
...We are building next-generation end-to-end autonomous driving systems powered by reinforcement learning. You will work on applying RL in closed-loop, safety-critical environments , leveraging large-scale simulation and real-world driving data to improve safety, comfort...
DeepRoute
Fremont, CA
10 hours ago
Research Scientist - Reinforcement Learning
$150k
...Foundation Models We are a dedicated research lab for building,... ...world‑class researchers, data scientists, and engineers, tackling the... ...performance computing in deep learning, driving impactful... ...Research Scientist within our Reinforcement Learning team, you will play...
Visa sponsorship
Shift work
Institute of Foundation Models
Sunnyvale, CA
1 day ago
Senior Staff Research Scientist, Google Cloud AI
$262k - $365k
Senior Staff Research Scientist, Google Cloud AI Research Google Sunnyvale, CA, USA Requirements PhD in Computer Science, a related field, or equivalent... ...breadth of computer science, such as machine (and deep) learning, data mining, natural language processing, hardware and...
Senior
Full time
Worldwide
Google Inc.
Sunnyvale, CA
3 days ago
Staff ML Research Scientist
$222.78k - $367.18k
...AI‑first self‑driving software. Built to learn and improve through data, the Nuro... ...distribution detection, imitation learning, and reinforcement learning. If you love solving... ...come join us! About the Work Apply and research novel and advanced machine‑learning techniques...
Work experience placement
Icehouseventures
Mountain View, CA
2 days ago
Senior Staff AI Scientist
..., CA, and focuses on building the agent learning platform and SLM AI flywheel that powers... ...techniques such as PEFT, SFT, distillation, and reinforcement learning (RLHF, DPO). Drive model... ...that influence the roadmap. Mentor senior and mid‑level engineers on experimentation...
Senior
Full time
Immediate start
Uniphore Technologies North America Inc
Palo Alto, CA
10 hours ago
Senior Staff AI Scientist
$222k - $300k
...for innovative and hands-on Senior Staff AI Scientist to join the Intuit AI team... ...AI scientists and machine learning engineers and build models... ..., Bayesian Learning, Reinforcement Learning, or Deep Learning... ...DS solutions. Proactively researches, explores, and enables new...
Senior
Intuit
Mountain View, CA
1 day ago
Senior Staff AI Scientist
$232.9k - $320.25k
...category-defining product - building the agent learning platform and SLM AI flywheel that powers... ...such as PEFT, SFT, distillation, and reinforcement learning (RLHF, DPO). ~ Drive model... ...influence the roadmap. ~ Mentor senior and mid-level engineers on experimentation...
Senior
Full time
Immediate start
Uniphore
Palo Alto, CA
1 day ago
Senior Staff Scientist, Agentic AI & RL — Lead AI Systems
Centific Global Solutions, Inc. is seeking a Senior Staff Research Scientist in Agentic AI & Reinforcement Learning to lead cutting-edge AI initiatives. This role emphasizes significant technical influence, mentoring, and design responsibilities in building governed RL...
Senior
Centific Global Solutions, Inc.
Palo Alto, CA
1 day ago
Senior Research Scientist, Efficient Deep Learning Pruning & NAS
$184k - $356.5k
...A leading technology company in California seeks a Senior Research Scientist specializing in efficient deep learning. The role focuses on designing and implementing innovative research methods while collaborating with various teams. A Ph.D. in a related field and 3+ years...
Senior
NVIDIA
Santa Clara, CA
9 hours ago
Senior Research Scientist, Efficient Deep Learning
$184k - $299k
...Senior Research Scientist, Efficient Deep Learning NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning to join our learning and perception research team. We are passionate about research that pushes boundaries and has impact in...
Senior
NVIDIA
Santa Clara, CA
11 hours ago
Senior ML Research Scientist, End-to-End Autonomous Driving
$183.83k - $275.98k
...Perception & Behavior team, leveraging the cutting edge of machine learning research to solve challenging real-world robotics problems. This... ...End-to-End Autonomous Driving Models, Imitation Learning, Reinforcement Learning, or World Models. You have strong problem-solving...
Senior
Icehouseventures
Mountain View, CA
9 hours ago
Sr. Artificial Intelligence and Machine Learning Scientist, Early Career
$140k - $195k
...About the Team Our AI Research team, reporting directly... ...cutting-edge machine learning can transform the way... ...scale. The Role As a Senior Machine Learning Engineer... ...a world-class team of scientists and engineers, and... ...applications Experience with reinforcement learning for robotic...
Senior
Work at office
Relocation package
Flexible hours
General Motors
Mountain View, CA
10 hours ago
Staff Research Scientist AI Models to Production (Remote)
...MongoDB is looking for a Staff Research Scientist to join their Voyage AI team in Palo Alto or remotely in the U.S. This role involves conducting... ...PhD in Computer Science and a strong background in machine learning and deep learning. Join MongoDB to tackle complex...
Remote work
I did my part and supported the Regular Toilet
Palo Alto, CA
10 hours ago
Staff Applied Research Scientist
$130k - $260k
...and Great Careers.Job Title: Staff Applied Research ScientistWe are seeking a... ...strategic Staff Applied Research Scientist to lead the technical... ...Building: Mentor junior and senior scientists, elevate... ...Computer Science, Machine Learning, Statistics, Mathematics, or...
Hourly pay
Work experience placement
Local area
Flexible hours
GEICO
Palo Alto, CA
10 hours ago
Senior Machine Learning Scientist, Agentic AI
$169.8k - $212.3k
...POSITION SUMMARY: Natera is seeking a Senior Machine Learning Scientist to join our AI team, an advanced R... ...and Chain-of-Thought, alongside reinforcement fine-tuning (RFT) to ensure agents... ...or more years of experience in AI research or engineering, with a proven track...
Senior
Work at office
Immediate start
Remote work
Worldwide
Natera
San Carlos, CA
4 days ago
Staff Research Scientist, Foundation Models - Tech Research
...Research Scientists at Riot combine deep technical expertise across machine learning, artificial intelligence (AI), experimental design, and... ...experience games. As a Staff Research Scientist, you push... ...representation learning, model-based reinforcement learning, or learned...
Local area
Flexible hours
Riot Games
Redwood City, CA
2 days ago
Machine Learning Research Scientist
...Machine Learning Research Scientist At Autoscience Institute, we create AI systems that autonomously conduct AI research. Recently, we announced... ...machine learning models, including deep learning, reinforcement learning or genetic algorithms. This does not include building...
Full time
Flexible hours
Autoscience Institute
Menlo Park, CA
3 days ago
Staff Research Scientist, Women's Health Sensing
Staff Research Scientist, Women's Health Sensing Google • Mountain View, CA, USA Qualifications PhD degree in Computer Science, a related field... ...solutions. Publish work in top clinical and machine learning venues, representing Google in the community. Equal Opportunity...
Immediate start
Google Inc.
Mountain View, CA
2 days ago
Staff ML Research Scientist — Autonomy & Robotics
$222.78k - $367.18k
Nuro is seeking a seasoned machine learning expert to enhance its autonomy systems in Mountain View, California. You will apply advanced methods in machine learning to tackle perception, prediction, and planning issues. Ideal candidates have over 6 years of experience...
Icehouseventures
Mountain View, CA
2 days ago
Staff Research Scientist
$151k - $297k
...It is backed by a strong team of AI researchers from Stanford, MIT, Berkeley, Princeton... ...Position Overview We are seeking a Staff Research Scientist to join our team and contribute to the... ...problems at the intersection of machine learning research and practical deployment of...
Work at office
Local area
Remote work
Worldwide
I did my part and supported the Regular Toilet
Palo Alto, CA
1 day ago
Machine Learning Research Scientist: Generative Modeling for Planning
$167.2k - $250.8k
...level motion planner combining machine learning and classic robotics approaches. It also... ..., parking lots). About the Work Research and Implement state of the art algorithms... ...Search, Monte Carlo Tree Search (MCTS), Reinforcement Learning, Machine Learning for Planning...
Icehouseventures
Mountain View, CA
1 day ago
Research Scientist - Reinforcement Learning, Robotics
$126k - $423k
...Stockholm; Bangalore; Seoul; and Tokyo. Learn more at applied.co. We are an in-... ...We are looking for multiple passionate Research Scientists to join the Research Group at Applied... ...Intuition, you will: Conduct research on reinforcement learning (RL) related topics including...
Full time
For contractors
For subcontractor
Casual work
Work at office
Immediate start
Remote work
Day shift
Applied Intuition
Sunnyvale, CA
21 days ago
Sr Staff AI Scientist, Agentic AI & Recommendations
...scale - including candidate generation, learning-to-rank, retrieval, and ranking... ...uplift modeling, multi-armed bandits, Reinforcement Learning, Online Learning, and Deep Learning... ...proactively in a fast-paced environment. Quickly research, explore, and enable new ML,...
Senior
Intuit
Mountain View, CA
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Staff Research Scientist, Reinforcement Learning. Be the first to apply!