Research Scientist - Post Training
$250k
Ersilia
About Us
We build training data and evaluation infrastructure that frontier AI labs use to improve their models. We partner with the world's leading labs to design high‑signal datasets and run rigorous evaluations that go beyond static benchmarks. We're a small, early team (post‑Series A) where individual contributors have direct impact on how the next generation of models learns and improves.

The Role
We're building out our post‑training research team and hiring 2–3 Research Scientists to work together on this mission. Your job is to prove that our data works. You'll design and run training experiments that isolate the impact of our datasets on model behavior, including SFT and RL‑based post‑training, to measure how different data sources shift capability, generalization, and alignment. Working closely with partner labs, you'll turn our datasets into clear, defensible evidence that the data improves performance under these conditions. It's experimental, high‑leverage work at the edge of model development.

What You'll Do
- Run controlled SFT and RL experiments to measure the impact of our datasets on model performance.
- Quantify lift across capabilities: reasoning, tool use, long‑horizon tasks, and domain‑specific workflows.
- Share findings directly with partner labs to deepen relationships and drive sales.
- Collaborate with internal SPLs to iterate on data quality based on your results.
- Work closely with the other Research Scientists on this team to build shared experimental infrastructure and benchmarks.

What We're Looking For
- Strong familiarity with LLM training and evaluation methodologies (SFT, RL post‑training).
- Genuine obsession with how data structure, selection, and quality drive model behavior.
- Ability to design lightweight experiments, move fast, and extract actionable insights from messy results.
- Comfort working across domains: finance, software engineering, policy, and more.
- A bias toward building over theorizing.

Nice‑to‑Have Requirements
- Prior work or internship at an RL environment company, AI safety org, or benchmarking org.
- Experience running controlled training experiments end‑to‑end.
- Published research on model evaluation, post‑training, or data curation.
- Strong software engineering skills alongside research instincts.

Compensation
US$250K–$450K total compensation + equity.
