Research Engineer, Post-Training
$231k - $340kHarvey
Why Harvey At Harvey, we’re transforming how legal and professional services operate. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise, we’re reshaping how critical knowledge work gets done for decades to come. This is a rare chance to help build a generational company at a true inflection point. With 1500+ customers in 60+ countries, strong product-market fit, and world-class investor support, we’re scaling fast and defining a new category in real time. The work is ambitious, the bar is high, and the opportunity for growth — personal, professional, and financial — is unmatched. Our team moves fast, takes ownership, and is deeply committed to the mission — operating with intensity, staying close to our customers, and pushing each other for excellence. We live by three values: Decisiveness, Simplicity, and Job's Not Finished. We act quickly on clear judgment over perfect information, we believe simplicity is what scales, and we're never satisfied with where we are. If you want to do the best work of your career alongside people who share that drive, we'd love to build with you. At Harvey, the future of professional services is being written today — and we’re just getting started. Role Overview Post-training is how Harvey turns expert feedback and agent traces into models that are meaningfully better at legal work. We are looking for a research engineer who can help scale that loop: defining and running model training experiments, interpreting results, and working with internal and external research partners to build better data, environments, graders, and training recipes. This role is for someone who can self-manage model training and applied research projects. You will work closely with internal and external research collaborators on post-training efforts that matter to our product roadmap. The ideal candidate has extensive hands-on experience training open weight models, either in a research or production setting, and enough engineering depth to run and debug experiments efficiently. What You'll Do Drive post-training experiments, pushing agent performance while navigating the Pareto frontier of cost, latency, security, and governance. Optimize agent harnesses, including domain-specific skills, tools, subagents, retrieval strategies, and validation loops that improve quality on long-horizon legal work. Design and develop grading and reward systems that are reliable enough for evaluation, efficient enough for iteration, and strict enough for high-stakes legal work. Study agent behavior, identifying patterns that correlate with successful work product, and converting those findings into training data, evals, or harness changes. Work with Harvey researchers and external research partners to define experiments, evaluate methodology, review results, and keep projects moving toward concrete model improvements. What You Have Hands-on experience with post-training or model-training work, such as SFT, preference optimization, RLHF/RLAIF, reward modeling, distillation, or adapting open-weight models to specialized domains. Strong judgment about model behavior: you can read traces, inspect outputs, identify failure modes, and reason about whether a metric is measuring the thing that matters. Strong Python and research-engineering ability. You can write clean code, debug experiments, and build the simple but reliable systems needed to make research move faster. Ability to self-manage ambiguous applied research projects and communicate clearly with researchers, engineers, product teams, domain experts, and external partners. Nice to Have Experience building data or evaluation infrastructure for ML workflows, such as dataset curation pipelines, model-output processing, experiment tracking, evaluation dashboards, or regression analysis tooling. Experience with distributed training, inference systems, GPU workloads, or large-scale ML experimentation. Research publications, open-source contributions, or shipped industry work in LLMs, agents, evaluation, or ML systems. Compensation $231,000 - $340,000 Depending on your location, an Applicant Privacy Notice may apply to you. You can find all of our Applicant Privacy Notices [here].
#LI-AK1
Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law. We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made by emailing View email address on click.appcast.io- ...Space Models or SSMs, a new primitive for training efficient, large‑scale foundation models... ...in model innovation and systems engineering paired with a design‑minded product engineering... ...building scalable systems that bridge research and production. What We Offer Lunch...TrainingWork at officeRelocation package
- General Analysis is a security research lab focused on adversarial simulations, evaluations... ...Francisco and work across research, engineering, and product. About the role As a... ...Engineer, you will be responsible for post-training models for adversarial capabilities using...Training
- ...will fine-tune models for extraction and updates, and implement research findings while ensuring high reliability and low latency. The... ...through trials. Key qualifications include experience in RAG, model training, proficiency in Python, and a strong understanding of PyTorch...Training
- ...that learn through exploration in the real world. We're looking for research engineers with strong foundations in reinforcement learning, multimodal representation learning, or large-scale model training. Qualifications: You’ve worked on large GPU clusters, and are...Training
- A pioneering generative modeling company in San Francisco is seeking a Research Engineer for their Physical AI team. This role focuses on designing pre-training and post-training pipelines for AI models, collaborating with industrial partners, and evaluating model performance...TrainingWork at office
- Eventual, based in San Francisco, is seeking a Research Engineer for the Visual Understanding team. In this role, you'll manage visual understanding... ...capabilities to transform vast video datasets into usable training data. Responsibilities include training multimodal models,...TrainingWork at office
$200k - $350k
...deeply curious—building at the intersection of research, product, and creativity . The Role As a Machine Learning Research Engineer , you’ll own end-to-end research cycles—designing, running, and analyzing post‑training experiments that help models evaluate style...Training$100k - $120k
...'dreams' of robot trajectories which can be collected for training. You will become a part of Coda's founding team and lead the... ...Robotics' world models. Responsibilities Lead a team of researchers and engineers. Develop more efficient ways to generate high quality...Training$350k
...a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build... ...language models Building scalable RL infrastructure and training methodologies Enhancing model reasoning capabilities We...TrainingWork at officeVisa sponsorshipFlexible hours- ...to-end lifecycle of memory features—from research to production. You’ll fine-tune models... ...benchmark ideas from papers; and ship with Engineering to SOTA latency, reliability, and cost .... ...quality. What You’ll Do Fine-tune and train models for memory extraction, updates,...Training
- ...stealth team of elite founders and AI researchers, with backgrounds spanning Stanford, OpenAI... ..., the lab ships vibes. With one, every training run, every prompt change, every agent... ...we measure is what we want Product engineers, by instrumenting real-user behavior on...TrainingRelocation package
- As a research engineer, you will work on open-ended research problems in the domain of RL, agents and data, and share your work with the public... ...you might tackle on a day-to-day basis: Modifying our RL training stack to support end-to-end training with agent harnesses...TrainingWork at office
- About the Role As an applied research engineer at Sieve, you’ll build high performance building... ...performance out of them through clever pre/post-processing, parallelism, pipelining,... ...building end-to-end products—not just training models Able to break problems down from...Training
- ...company based in San Francisco, California. The Role: As a Research Engineer - Agency and Reasoning , you will be a core contributor to... ...with performing novel research in reinforcement learning, post-training, and human preference learning, and applying your ideas at...TrainingWork at officeRelocation package
$175k - $350k
Adept thrives at the intersection of research and product. As a research engineer, you will play a crucial role in developing and implementing AI models,... ...them Build, operate, and maintain large-scale data and training pipelines Design and run experiments to improve our...TrainingWork at officeRemote work- About Human Archive Human Archive is a research lab backed by Y Combinator focused on modeling... ...join us. The Opportunity As a Research Engineer, you’ll work on multimodal sensing... ...synchronized, fused, and used for downstream VLA training. You’ll research emerging sensing...TrainingShift work
$315k
As a Research Engineer or Research Scientist in Applied Finetuning, you will directly train the models we launch to the public via Claude.AI and our API. In this role, you will design and iterate on state‑of‑the‑art finetuning techniques, such as Constitutional AI and...TrainingWork at officeHome officeVisa sponsorshipRelocation package$295k
Research Engineer - Speech & Realtime Models B2B Applications - San Francisco About the Team OpenAI is at the forefront of artificial intelligence... ...engineering principles. Are familiar with methods of training and fine‑tuning large language models, such as distillation,...TrainingInternship- Introduction The Center for AI Safety is a research and field-building nonprofit located in... ...and technical research. As a research engineer here, you will pursue a variety of research... ...). Have experience launching and training distributed ML jobs. Communicate clearly...TrainingWork at office
- ...understand how their systems are behaving post-deployment. Instead of reactive... ...and others. The Role: We are looking for Research Engineers to build AI systems that use agent interaction... ...behaviors Design and implement post‑training and optimization workflows to improve...TrainingImmediate start
- The role As a research scientist, you will design, implement, and optimize the large-scale training infrastructure that powers our frontier reinforcement... ...to bring frontier post-training capabilities into... ...Who we are : We’re a team of engineers, researchers, and operators....TrainingWork at officeVisa sponsorshipRelocation package
- ...) Industry: AI infrastructure / Reinforcement Learning (RL) training data & evaluations Compensation: Competitive (range not provided... ...and craftsmanship. The Opportunity Our partner is hiring a Research Engineer to help scale the quality assurance (QA) systems behind...TrainingRemote work
- ...breakthrough AI models at leading research labs and enterprises. Since 2... ...to produce high-quality training data at scale Frontier Data Labeling... ...As an Applied Research Engineer, you will be at the forefront... ...documentation, blog posts, and educational content that...TrainingFlexible hours
- ...the quality of AI across everything Gamma creates. As our Research Engineer, you'll design evaluation frameworks that measure AI output... ...and iterate toward quality improvements. Experience with post‑training techniques for LLMs including reinforcement learning and supervised...TrainingWork at officeWork from home
- ...detect and remediate critical software vulnerabilities. We are training and scaling security AI agents to discover zero-days... ...Infrastructure. About this role We’re seeking an experienced Research Engineer to join our effort in building and training AI agents for vulnerability...TrainingFull timeWork at office
- ...company based in San Francisco, California. The Role: As a Research Engineer - Brain Computer Interface Models , you will be a core contributor... ...and preprocessing to designing novel architectures and training methodologies. You’ll Work Across: Large-scale EEG model training...TrainingWork at officeRelocation package
- At Camfer, our research engineers are training models to intelligently interpret and edit parametric CAD designs in 3D space. This is a cutting-edge challenge that requires innovative problem-solving and creative approaches to tackle complex, highly-technical problems....TrainingWork at office
- ...a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build... ...Team The Universes team within Research is responsible for training AI models to perform complex, difficult, long-horizon...TrainingWork at officeRemote workVisa sponsorshipShift work
$300k - $400k
...a team. About the Team The Research team develops the model and decision... ...-the-art techniques in model training, prompting, orchestration,... ...Role As a Senior Research Engineer, you’ll be responsible for building... ...research. Prior experience post-training and deploying LLMs...TrainingWork at office$200k - $350k
...person applied AI team spanning Ex-Moonshot (post-training & agents, diffusion model training), second-time technical founders, engineers that made 100+ games for Voodoo,... ...engaging games & 3D environments. Our current research spans: Distributed multi-agent orchestration...TrainingVisa sponsorshipRelocation package
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Research Engineer, Post-Training. Be the first to apply!
- research assistant engineering San Francisco, CA
- ai research engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- research engineer San Francisco, CA
- research programmer San Francisco, CA
- deep learning research engineer San Francisco, CA
- research software engineer San Francisco, CA
- senior research engineer San Francisco, CA
- assistant research professor San Francisco, CA
- research and development engineer San Francisco, CA


