Research, Post-Training Data

$350k

Thinking Machines

Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence. We're building a future where everyone has access to the knowledge and tools to make AI work for their unique needs and goals. We are scientists, engineers, and builders who’ve created some of the most widely used AI products, including ChatGPT and Character.ai, open‑weights models like Mistral, as well as popular open source projects like PyTorch, OpenAI Gym, Fairseq, and Segment Anything.

About the Role

The role of post‑training researchers sits at the core of our roadmap. This is the critical bridge between raw model intelligence and a system that is actually useful, safe, and collaborative for humans.

Post‑training data research work sits at the intersection of human insight and machine learning. Our work combines human and synthetic data techniques, along with other innovative approaches, to capture the nuances of human behavior and use them to steer models. We research and model the mechanisms that create value for people to explain, predict, and optimize for human preferences, behaviors, and satisfaction.

Our goal is to turn research ideas into data by scoping well‑run data labeling or collection campaigns, and understanding the science behind what makes the data high quality and useful to train our models. We also develop and evaluate quantitative metrics that measure the success and impact of our data and training interventions. Beyond execution, we explore new paradigms for human‑AI interaction and scalable oversight, experimenting with how humans can best supervise, guide, and collaborate with models. It’s interdisciplinary work that blends research, data operations, and technical implementation to advance the frontier of aligned, human‑centered AI systems.

This role blends fundamental research and practical engineering, as we do not distinguish between the two roles internally. You will be expected to write high‑performance code and read technical reports. It’s an excellent fit for someone who enjoys both deep theoretical exploration and hands‑on experimentation, and who wants to shape the foundations of how AI learns.

What You’ll Do

- Design and execute data collection and synthesis strategies for post‑training by combining human feedback, preference data, and synthetic examples to guide model behavior.
- Develop pipelines and frameworks for scalable, high‑quality human labeling, model‑assisted labeling, and synthetic data generation.
- Research and model human preferences and behavior, creating data‑driven methods to improve reasoning, truthfulness, and helpfulness.
- Iterate on evals: post‑training involves a never‑ending loop of defining a set of evaluations, optimizing them, and then realizing your existing evals don’t capture what matters. You’ll be responsible for both making numbers go up, and making sure the numbers are meaningful.
- Design and evaluate metrics and benchmarks that measure data quality, alignment, and the real‑world impact of post‑training interventions.
- Scale and explore: post‑training will involve a combination of scaling the existing methodologies and developing new ones.
- Publish and present research that moves the entire community forward. Share code, datasets, and insights that accelerate progress across industry and academia.

Skills and Qualifications

- Strong engineering skills, ability to contribute code and debug in complex codebases.
- Experience with data curation, human feedback, or synthetic data generation for large language models or similar systems.
- Ability to design, run, and interpret experiments with scientific rigor and clarity.
- Proficiency in Python and familiarity with at least one deep learning framework (e.g., PyTorch, TensorFlow, or JAX).
- Comfortable with debugging distributed training and writing code that scales.
- Bachelor’s degree or equivalent experience in Computer Science, Machine Learning, Physics, Mathematics, or a related discipline with strong theoretical and empirical grounding.
- Clarity in communication, an ability to explain complex technical concepts in writing.

Preferred Qualifications

- A strong grasp of probability, statistics, and ML fundamentals. You can look at experimental data and distinguish between real effects, noise, and bugs.
- Prior experience with RLHF, RLAIF, preference modeling, or reward learning for large models.
- Experience managing or analyzing human data collection campaigns or large‑scale annotation workflows.
- Research or engineering contributions in alignment, data‑centric AI, or human‑AI collaboration.
- Familiarity with synthetic data pipelines, active learning, or model‑assisted labeling.
- PhD in Computer Science, Machine Learning, Physics, Mathematics, or a related discipline with strong theoretical and empirical grounding; or, equivalent industry research experience.

Logistics

- Location: This role is based in San Francisco, California.
- Compensation: Depending on background, skills and experience, the expected annual salary range for this position is $350,000 - $475,000 USD.
- Visa sponsorship: We sponsor visas. While we can’t guarantee success for every candidate or role, if you’re the right fit, we’re committed to working through the visa process together.
- Benefits: Thinking Machines offers generous health, dental, and vision benefits, unlimited PTO, paid parental leave, and relocation support as needed.

As set forth in Thinking Machines' Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law. Thinking Machines Lab will consider for employment qualified applicants with criminal histories in a manner consistent with the requirements of the California Fair Chance Act, the San Francisco Fair Chance Ordinance, and any other applicable state or local fair chance ordinance or law.

Vacancy posted 17 hours ago
