Research, Post-Training Data
$350kThinking Machines
Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence. We're building a future where everyone has access to the knowledge and tools to make AI work for their unique needs and goals. We are scientists, engineers, and builders who’ve created some of the most widely used AI products, including ChatGPT and Character.ai, open‑weights models like Mistral, as well as popular open source projects like PyTorch, OpenAI Gym, Fairseq, and Segment Anything. About the Role The role of post‑training researchers sits at the core of our roadmap. This is the critical bridge between raw model intelligence and a system that is actually useful, safe, and collaborative for humans. Post‑training data research work sits at the intersection of human insight and machine learning. Our work combines human and synthetic data techniques, along with other innovative approaches, to capture the nuances of human behavior and use them to steer models. We research and model the mechanisms that create value for people to explain, predict, and optimize for human preferences, behaviors, and satisfaction. Our goal is to turn research ideas into data by scoping well‑run data labeling or collection campaigns, and understanding the science behind what makes the data high quality and useful to train our models. We also develop and evaluate quantitative metrics that measure the success and impact of our data and training interventions. Beyond execution, we explore new paradigms for human‑ai interaction and scalable oversight, experimenting with how humans can best supervise, guide, and collaborate with models. It’s interdisciplinary work that blends research, data operations, and technical implementation to advance the frontier of aligned, human‑centered AI systems. This role blends fundamental research and practical engineering, as we do not distinguish between the two roles internally. You will be expected to write high‑performance code and read technical reports. It’s an excellent fit for someone who enjoys both deep theoretical exploration and hands‑on experimentation, and who wants to shape the foundations of how AI learns. What You’ll Do Design and execute data collection and synthesis strategies for post‑training by combining human feedback, preference data, and synthetic examples to guide model behavior. Develop pipelines and frameworks for scalable, high‑quality human labeling, model‑assisted labeling, and synthetic data generation. Research and model human preferences and behavior, creating data‑driven methods to improve reasoning, truthfulness, and helpfulness. Iterate on evals: post‑training involves a never‑ending loop of defining a set of evaluations, optimizing them, and then realizing your existing evals don’t capture what matters. You’ll be responsible for both making numbers go up, and making sure the numbers are meaningful. Design and evaluate metrics and benchmarks that measure data quality, alignment, and the real‑world impact of post‑training interventions. Scale and explore: post‑training will involve a combination of scaling the existing methodologies and developing new ones. Publish and present research that moves the entire community forward. Share code, datasets, and insights that accelerate progress across industry and academia. Skills and Qualifications Strong engineering skills, ability to contribute code and debug in complex codebases. Experience with data curation, human feedback, or synthetic data generation for large language models or similar systems. Ability to design, run, and interpret experiments with scientific rigor and clarity. Proficiency in Python and familiarity with at least one deep learning framework (e.g., PyTorch, TensorFlow, or JAX). Comfortable with debugging distributed training and writing code that scales. Bachelor’s degree or equivalent experience in Computer Science, Machine Learning, Physics, Mathematics, or a related discipline with strong theoretical and empirical grounding. Clarity in communication, an ability to explain complex technical concepts in writing. Preferred qualifications A strong grasp of probability, statistics, and ML fundamentals. You can look at experimental data and distinguish between real effects, noise, and bugs. Prior experience with RLHF, RLAIF, preference modeling, or reward learning for large models. Experience managing or analyzing human data collection campaigns or large‑scale annotation workflows. Research or engineering contributions in alignment, data‑centric AI, or human‑AI collaboration. Familiarity with synthetic data pipelines, active learning, or model‑assisted labeling. PhD in Computer Science, Machine Learning, Physics, Mathematics, or a related discipline with strong theoretical and empirical grounding; or, equivalent industry research experience. Logistics Location: This role is based in San Francisco, California. Compensation: Depending on background, skills and experience, the expected annual salary range for this position is $350,000 - $475,000 USD. Visa sponsorship: We sponsor visas. While we can’t guarantee success for every candidate or role, if you’re the right fit, we’re committed to working through the visa process together. Benefits: Thinking Machines offers generous health, dental, and vision benefits, unlimited PTO, paid parental leave, and relocation support as needed. As set forth in Thinking Machines' Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law. Thinking Machines Lab will consider for employment qualified applicants with criminal histories in a manner consistent with the requirements of the California Fair Chance Act, the San Francisco Fair Chance Ordinance, and any other applicable state or local fair chance ordinance or law. #J-18808-Ljbffr Thinking Machines
$150k - $250k
...David Joseph & Company is seeking a Research Scientist in San Francisco, focusing on applied research to enhance frontier AI systems.... ...possess quantitative research skills and familiarity with LLM training processes. The comprehensive benefits package includes a competitive...DataTraining- ...David Joseph & Company is recruiting for AfterQuery, seeking a Research Scientist – Post Training in San Francisco. The role is focused on conducting hands-on experiments to validate the impact of datasets on model performance. Candidates should have a solid background...DataTraining
$150k - $250k
...Research Scientist – Post Training — AfterQuery Location: San Francisco, CA (Onsite) Compensation: $150,000 – $250,000 base | $250,000 – $450,000 total... ...is an AI infrastructure company building training data and evaluation systems used by leading frontier AI labs....DataTraining$400k
...prototype and early commercial traction across several high‑profile industry verticals. Role As a Senior Research Scientist, your focus is post‑training — curating data, fine‑tuning pre‑trained speech models, and building the evaluation infrastructure that validates it...DataTrainingRelocation packageShift work- ...About Phonic Phonic is a product and research lab focused on powering the most realistic, human... ..., latency, and quality Curate massive training datasets and execute rigorous experiments to determine exactly how data quality shapes model behavior and performance...DataTrainingInternshipWork at office
$110.4k - $235.4k
...the vision for matters related to policy, research studies, and protocols. This role works... ...related area) and/or equivalent experience/training 10+ years of directly relevant experience... ...and consistency Expert knowledge of data analysis, research and reporting, particularly...DataTraining- ...A remote work company is seeking Data Entry Clerks to join their team. This role allows... ...flexible schedule and offers comprehensive training. Ideal for individuals with various... ...skills are a plus. You will participate in research studies that help create meaningful outcomes...DataTrainingRemote workWork from homeFlexible hours
- Trajectory is looking for a Member of Technical Staff (Research) to design and build the post-training stack that enables continual learning. In this role,... ...will manage the entire experimentation process, from data and training to evaluation, allowing our models to learn...DataTraining
- ...Overview We are Genmo, a research lab dedicated to building open, state-of-the-art models... ...join our team, focusing on alignment and post-training techniques for large-scale video... ...and output quality Create and optimize data collection pipelines for human feedback...DataTrainingRelocation
- ...Member of Technical Staff, Research Scientist About Fleet Fleet studies how environments... ...looks like. We work with frontier labs on post-training across modalities — building benchmarks... ...class specialized models through better data and training recipes How We Work You carry...DataTraining
$150k - $250k
...Research Scientist – Frontier Data — AfterQuery Location: San Francisco, CA (Onsite) Compensation: $150,00... ...About AfterQuery AfterQuery builds training data infrastructure and evaluation systems... ...AI systems. The company is post-Series A with a small, high-impact technical...DataTrainingFull timeVisa sponsorship$180k - $250k
...inference. Our ML team works on the science of training models in the thermodynamic paradigm, and we are looking for senior research and engineering talent to derive... ...production models for domain experts using customer data. Required Qualifications Experience in scientific...DataTraining- ...sandboxes, durable execution and unstructured data ingestion APIs to developers in... ...reasoning, and deep content extraction. Research, evaluate, and integrate the latest vision... ...knowledge of quantization/LoRA/efficient training. Proficiency with deep learning frameworks...DataTraining
- ...Assist with daily laboratory maintenance tasks. Assist in the training of Embryologists, Embryology Associates and trainee... ...communication skills and a team player Expert knowledge of data analysis, research and reporting particularly in Embryology Advanced organizational...DataTrainingTraineeshipLocal area
$350k
...Research, Post-Training Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence. We're building... ..., and ML fundamentals. You can look at experimental data and distinguish between real effects, noise, and bugs. Prior...DataTrainingLocal areaImmediate startVisa sponsorshipWork visaRelocation package$90 per hour
...creative and technical talent with leading AI research labs. Headquartered in San Francisco, our... ...physics problems to enhance AI model training and innovation. Evaluate the quality and... ...with AI research teams to improve training data and model outputs. Work independently and...DataTrainingHourly payContract workSummer workRemote work$150k - $250k
...behaviors in unstructured environments ● Research and implement state-of-the-art robot learning... ...performance ● Design and maintain robust data collection and curation pipelines for... ...Optimize robot policies for distributed training at scale and real-time edge deployment ●...DataTraining- ...who thrives at the intersection of applied research and real-world products. You'll push the... ...-on Execution: Own implementation of data products at Adaption, addressing novel challenges... ...Solid understanding of data modeling for training and how curation decisions shape model...DataTrainingFlexible hours
$250k
...About AfterQuery AfterQuery builds the training data and evaluation infrastructure that frontier... ...benchmarks. We are a small, early team (post Series A) where individual contributors... ...and measured. Working directly with research teams at top AI labs, you’ll experiment...DataTraining$150k - $300k
..., sandboxes, and high-performance training into a single full-stack system for post-training at frontier scale, from RL... ...at the intersection of frontier research and real infrastructure. We recently... ...RL/post-training methods, applied data, and agent systems. You'll have a...DataTrainingRemote workVisa sponsorshipRelocation packageFlexible hours$200k - $325k
...the Role We're looking for a Research Scientist to collaborate with... ...startups operate across research, data operations, and commercial... ...datasets that drive frontier model training and evaluation based on... ...externally through publications, blog posts, conference talks, and...DataTrainingLocal area- ...This position will carry out grant-funded research to study computational imaging and... ...algorithms and machine learning models for data analysis, providing data visualization of... ...period. Salary and rank will be commensurate with the applicants experience and training....DataTrainingImmediate start
$197.3k - $313.7k
...efforts. Job Category Data Job Details About... ...Computer Science, Operations Research, or similar field. Experience... ...tools and opt out options. Posting Statement Salesforce is an... ...compensation, promotion, benefits, training, assessment of job...DataTraining- ...via reinforcement learning: Designing and training reasoning systems using RLHF, RLAIF, and... ...Contributing to alignment and oversight research - figuring out how to reliably supervise... ...problems and ship real experiments on real data. At More Senior Levels, We’d Also Expect...DataTrainingFull timeInternship
- ...infrastructure necessary for models to learn from production workflows continually. You will manage end-to-end experiments related to data, training, and system evaluation, working closely with the company's founders. The ideal candidate will have a strong background in large...DataTraining
$150k - $300k
..., sandboxes, and high-performance training into a single full-stack system for post-training at frontier scale, from RL... ...at the intersection of frontier research and real infrastructure. We recently... ...at scale (benchmarks, synthetic data generation, model grading)...DataTrainingRemote workVisa sponsorshipRelocation packageFlexible hours- ...Job Title Founding Research Scientist Company Description Stealth physical AI startup Job Description You will lead the... ...research function at a venture-backed startup solving the data bottleneck in robotics. By training end-to-end manipulation policies and designing...DataTraining
- ...competitive programmers, former founders, and researchers from the frontier of AI, including... ..., and others. Role Mission Post-training is the critical bridge between raw model... ...Exploration: Measure how performance scales with data and compute, and develop new...DataTrainingShift work
$170 per hour
...OverviewSubject Matter Expert: Legal & Regulat?ory Research (Remote Advisory) – ProlificAbout... ...the biggest pool of quality human data in the world. Over 35,000 AI developers,... ...knowledge, and skills.What you'll bringAI Training task skills and verifiable professional experience...DataTrainingHourly payRemote workWork from homeFlexible hours$150k - $300k
..., sandboxes, and high-performance training into a single full-stack system for post-training at frontier scale, from RL... ...at the intersection of frontier research and real infrastructure. We recently... ...LLM evaluations and/or synthetic data generation. Experience...DataTrainingRemote workVisa sponsorshipRelocation packageFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Research, Post-Training Data. Be the first to apply!
- data officer San Francisco, CA
- data network cabling San Francisco, CA
- data auditor San Francisco, CA
- test data management San Francisco, CA
- data mining San Francisco, CA
- minimum data set coordinator San Francisco, CA
- data capturer San Francisco, CA
- data tech San Francisco, CA
- sap data migration San Francisco, CA
- provider data management San Francisco, CA

