Research, Post-Training
Cognition Corp
Who We Are
We are an applied AI lab building end-to-end software agents. We're the team behind Devin, the first AI software engineer, and Windsurf, an AI-native IDE. These products represent our vision for AI that doesn't just assist engineers, but works alongside them as a genuine teammate. Our team is small and talent-dense: world-class competitive programmers, former founders, and researchers from the frontier of AI, including Scale AI, Palantir, Cursor, Google DeepMind, and others.
Role Mission
Post-training is the critical bridge between raw model capability and a system that is actually useful, safe, and effective in the real world. You will shape how our agents learn by iterating on training recipes, evaluations, and alignment methods that directly determine what Devin and our future systems can do. This role blends deep research and hands-on engineering. We don't distinguish between the two.
What You'll Accomplish
- Post-Training Recipe Development: Iterate on the full stack of datasets, training stages, and hyperparameters that determine model behavior. Measure how choices compound across evals and production performance, not just isolated benchmarks.
- Evaluation Design and Integrity: Build evals that actually capture what matters. The loop never ends: define, optimize, realize the gaps, and rebuild. You'll be responsible for making numbers go up and making sure the numbers mean something.
- Deep Understanding: When training produces results that don't make sense, you dig until you understand why. The goal isn't just to fix it; it's to carry that understanding forward to the next problem.
- Alignment and Agent Behavior: Apply and advance techniques like RLHF, RLAIF, and constitutional approaches to shape how agents reason, act, and collaborate with humans in long-horizon tasks.
- Scaling and Exploration: Measure how performance scales with data and compute, and develop new methodologies when existing ones hit ceilings. We expect both rigor and invention.
- A track record of advancing ML systems through post-training, alignment, or related methods: RLHF, RLAIF, preference modeling, reward learning, or equivalent
- Strong fundamentals in probability, statistics, and ML theory. The ability to look at experimental data and distinguish real effects from noise and bugs
- Evidence of original contributions: publications at top venues, open-source impact, or equivalent industry results
- Experience with large-scale distributed training and the debugging that comes with it
- Systems-level thinking: not just model optimization, but understanding how training pipelines, data, and evaluation interact
- Comfort with ambiguity and fast-moving research environments where priorities shift quickly
- We care more about demonstrated capability than credentials. A PhD is one signal among many.
- Small, highly selective team where research and product move together; prototypes reach real deployment quickly
- Compute is not a constraint: large allocations with training jobs routinely running across thousands of GPUs from day one
- The environment rewards speed, autonomy, and technical depth with minimal process overhead; this is one of the most competitive and fast-moving problems in AI
- Everything needed to operate at frontier scale from day one.


