Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Research Scientist - Post Training

$250k

Ersilia

About Us We build training data and evaluation infrastructure that frontier AI labs use to improve their models. We partner with the world's leading labs to design high‑signal datasets and run rigorous evaluations that go beyond static benchmarks. We're a small, early team (post‑Series A) where individual contributors have direct impact on how the next generation of models learns and improves. The Role We're building out our post‑training research team and hiring 2–3 Research Scientists to work together on this mission. Your job is to prove that our data works. You'll design and run training experiments that isolate the impact of our datasets on model behavior, including SFT and RL‑based post‑training, to measure how different data sources shift capability, generalization, and alignment. Working closely with partner labs, you'll turn our datasets into clear, defensible evidence that the data improves performance under these conditions. It's experimental, high‑leverage work at the edge of model development. What You'll Do Run controlled SFT and RL experiments to measure the impact of our datasets on model performance. Quantify lift across capabilities—reasoning, tool use, long‑horizon tasks, and domain‑specific workflows. Share findings directly with partner labs to deepen relationships and drive sales. Collaborate with internal SPLs to iterate on data quality based on your results. Work closely with the other Research Scientists on this team to build shared experimental infrastructure and benchmarks. What We're Looking For Strong familiarity with LLM training and evaluation methodologies (SFT, RL post‑training). Genuine obsession with how data structure, selection, and quality drive model behavior. Ability to design lightweight experiments, move fast, and extract actionable insights from messy results. Comfort working across domains—finance, software engineering, policy, and more. A bias toward building over theorizing. Nice‑to‑Have Requirements Prior work or internship at an RL environment company, AI safety org, or benchmarking org. Experience running controlled training experiments end‑to‑end. Published research on model evaluation, post‑training, or data curation. Strong software engineering skills alongside research instincts. Compensation US$250K–$450K total compensation + equity. #J-18808-Ljbffr Ersilia

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Research Scientist - Post Training in San Francisco, CA vacancy
  • A dynamic AI platform provider in California is seeking a Research Lead to drive machine learning innovation. The role involves defining a research agenda, conducting rigorous experiments, and collaborating with teams to enhance product development. Candidates should hold... 
    Training

    Baseten

    San Francisco, CA
    2 days ago
  • $340k - $425k

    A tech research firm in San Francisco is looking for a Research Engineer to join their Pre-training team. The ideal candidate will have an advanced degree in Computer Science and strong software engineering skills, particularly in Python. Responsibilities include conducting... 
    Training

    Menlo Ventures

    San Francisco, CA
    4 days ago
  • $300k

     ...seeking a Member of Technical Staff to develop new reinforcement learning algorithms for post-training language models in San Francisco. The successful candidate will own a research agenda, establish baselines for evaluating efficiency, and collaborate with peers to... 
    Training

    Vmax

    San Francisco, CA
    20 hours ago
  • Jack & Jill, a stealth physical AI startup in San Francisco, is seeking a Founding Research Scientist to lead the research function. This role involves training manipulation policies using cutting-edge techniques and building a validation pipeline to enhance real-world... 
    Training

    Jack & Jill

    San Francisco, CA
    4 days ago
  • $350k

    Thinking Machines Lab in San Francisco is seeking a Pre-Training Researcher to advance AI methodologies and conduct experiments. The ideal candidate will have a Bachelor's degree in Computer Science or related fields and experience with deep learning frameworks. The role... 
    Training
    Visa sponsorship

    Thinking Machines Lab

    San Francisco, CA
    3 days ago
  •  ...exploration in the real world. We're looking for research scientists with strong foundations in...  ...representation learning, or large-scale model training. Qualifications: You've worked on one...  ..., multimodal, image, or video) RL post-training, reasoning, or tool use Robotics... 
    Training

    Pantograph

    San Francisco, CA
    3 days ago
  •  ...upside. Make high-conviction bets - Try and fail. But succeed an unfair amount. Job: Our first dedicated research hire - you will answer the question: how to train and scale a model that can serve a web index? You: Have deep intuition on modern models and training.... 
    Training

    Parallel Web Systems

    San Francisco, CA
    1 day ago
  • $400k

     ...working prototype and early commercial traction across several high‑profile industry verticals. Role As a Senior Research Scientist, your focus is post‑training — curating data, fine‑tuning pre‑trained speech models, and building the evaluation infrastructure that... 
    Training
    Relocation package
    Shift work

    Trades Workforce Solutions

    San Francisco, CA
    2 days ago
  •  ...massively accelerates certain kinds of probabilistic inference. Our ML team works on the science of training models in the thermodynamic paradigm, and we are looking for senior research and engineering talent to derive probabilistic ML theory, empirically demonstrate its... 
    Training

    Extropic Corp

    San Francisco, CA
    4 days ago
  • $225k - $300k

    Research Scientist About Latent Health Healthcare today is only truly personalized for two groups: those with wealth and access, and those...  ...work on: Verifiable reinforcement learning at scale Mid-training and post-training of foundation models Novel objectives derived... 
    Training
    Work at office
    Immediate start

    Latent

    San Francisco, CA
    1 day ago
  • $160k - $280k

     ...office. About the Role We’re looking for early members of our research team. You’ll work closely with the founding team and have...  ...how we build and deploy our state of the art ML models trained with an H100/scientist ratio of >100x. Check out our Suno version of the job... 
    Training
    Work at office
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    2 days ago
  •  ...founding team brought together leading researchers in this space and top silicon valley operators...  .... About the role As an AI Research Scientist, you will conduct groundbreaking...  ...and deep-learning frameworks Experience training and evaluating large models on protein,... 
    Training

    Chaidiscovery

    San Francisco, CA
    2 days ago
  • $250k

    About AfterQuery AfterQuery builds the training data and evaluation infrastructure that frontier...  ...benchmarks. We are a small, early team (post Series A) where individual contributors...  ...and measured. Working directly with research teams at top AI labs, you’ll experiment with... 
    Training

    AfterQuery

    San Francisco, CA
    3 days ago
  • Member of Technical Staff, ML Research Scientist Omnifold trains custom AI models for each customer's supply chain - purpose-built systems that forecast demand, optimize decisions, and adapt continuously to a changing world. The research team is responsible for the core... 
    Training
    Shift work

    Omnifold

    San Francisco, CA
    20 hours ago
  • Founding Research Scientist, Human Simulation TL;DR: Listen is building the human layer of AI: a preference model trained on millions of real human conversations. We're hiring a founding researcher...  ..., Nestlé, P&G, and Sweetgreen. Post‑PMF growth: 20x year‑over‑year revenue... 
    Training
    Flexible hours
    Shift work

    AI Chopping Block, Inc.

    San Francisco, CA
    3 days ago
  •  ...ll Work On Reasoning via reinforcement learning: Designing and training reasoning systems using RLHF, RLAIF, and reward modeling...  .... Scalable oversight: Contributing to alignment and oversight research - figuring out how to reliably supervise models on geological tasks... 
    Training
    Full time
    Internship

    Xterraai

    San Francisco, CA
    2 days ago
  • Job Title Founding Research Scientist Company Description Stealth physical AI startup Job Description You will lead the research function...  ...-backed startup solving the data bottleneck in robotics. By training end-to-end manipulation policies and designing evaluation frameworks... 
    Training

    Jack & Jill

    San Francisco, CA
    4 days ago
  • $170k - $220k

    Introduction The Center for AI Safety is a research and field-building nonprofit located in...  ...and technical research. As a research scientist, you will pursue a variety of research...  ...HuggingFace). Have experience launching and training distributed ML jobs. Communicate... 
    Training
    Work at office

    Center for AI Safety

    San Francisco, CA
    20 hours ago
  • Research Scientist, Real-Time Interactivity / Inference San Francisco · Research · Full Time Real-time interactivity can come from inference...  ..., and we aim to both get more out of what they've already trained and shape how the next generation of models is designed. We'... 
    Training
    Full time
    Visa sponsorship
    Relocation package

    Reactor

    San Francisco, CA
    4 days ago
  • $225k - $400k

    ABOUT THE ROLE This is a research-driven, high-impact role for ML researchers who want to push the boundaries of real-time AI. As...  ...Background - You’ve worked on advanced ML problems (e.g., LLM pre‑training and post training, transcription model training, text to speech model... 
    Training
    H1b
    Relocation

    Retell AI

    San Francisco, CA
    1 day ago
  • $300k - $320k

     ...is a quickly growing group of committed researchers, engineers, policy experts, and...  ...We're seeking an exceptional Research Scientist to join our Life Sciences team at Anthropic...  ...capabilities on scientific tasks through post‑training, evaluation design, and RL environment... 
    Training
    Visa sponsorship

    Menlo Ventures

    San Francisco, CA
    20 hours ago
  • $150k - $250k

     ...grasping and more dexterous behaviors in unstructured environments ● Research and implement state-of-the-art robot learning policies,...  ...production robot fleets ● Optimize robot policies for distributed training at scale and real-time edge deployment ● Ship production... 
    Training

    Deft AI, Inc.

    San Francisco, CA
    2 days ago
  • $200k - $250k

     ...Center for AI Safety (CAIS) is a leading research and advocacy organization focused on...  ...Safety Action Fund. As a Senior Research Scientist here, you will lead and execute high‑impact...  ...models, build the tooling needed to train and evaluate models at scale, and turn results... 
    Training
    Work at office
    Local area

    Center for AI Safety

    San Francisco, CA
    4 days ago
  •  ...OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we...  ...agenda. You will pursue open problems at the intersection of post‑training methodology and performant inference and then collaborate... 
    Training
    Immediate start
    Flexible hours

    Baseten

    San Francisco, CA
    2 days ago
  • $54 - $60 per hour

     ...problems lie in enterprise domains, behind closed doors. Our research team's goal is to push the frontier of "domain adaptation" -...  ...Compose multiple methods to create new recipes for efficient post‑training. Evaluate LLMs and AI systems. Qualifications & Requirements... 
    Training
    Hourly pay
    Internship

    Databricks Inc.

    San Francisco, CA
    3 days ago
  • Overview: Hedra is building a world-class Physical AI research team to push the boundaries of action-conditioned world models and generative...  ...modeling for embodied systems Design novel architectures, training objectives, and evaluation frameworks for VLMs, VLAs, and world... 
    Training
    Work at office

    Hedra, Inc

    San Francisco, CA
    1 day ago
  • Research Scientist / Machine Learning Scientist Location:SF Bay Area/Hybrid / Remote Type:Full-Time About the Role: The Clientis seeking a...  ...meaningful home here. We’re looking for: • Hands-on experience training large-scale models, including reward models, preference... 
    Training
    Full time
    Remote work

    Lead Allies Inc

    San Francisco, CA
    3 days ago
  • $200k - $325k

    About the Role We're looking for a Research Scientist to collaborate with partners and lead the development...  ...art datasets that drive frontier model training and evaluation based on current model...  ...externally through publications, blog posts, conference talks, and customer... 
    Training
    Local area

    Neura Market

    San Francisco, CA
    3 days ago
  •  ...realize all of our product goals. As a Machine Learning Scientist at Sesame, you are a research-oriented person with experience in NLP, Speech, and/or...  ...model architectures, data curation, model evaluation, training & inference infrastructure, research, and experimentation... 
    Training
    Full time
    Contract work
    Flexible hours

    Sesame

    San Francisco, CA
    20 hours ago
  •  ...systems . This role is for an experienced scientist who thrives both in innovating...  ...reasoning, and deep content extraction. Research, evaluate, and integrate the latest vision...  ...knowledge of quantization/LoRA/efficient training. Proficiency with deep learning frameworks... 
    Training

    Tensorlake Inc.

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Scientist - Post Training. Be the first to apply!