Research Engineer, Post-Training

$231k - $340k

Full-time

Harvey

Why Harvey

At Harvey, we’re transforming how legal and professional services operate. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise, we’re reshaping how critical knowledge work gets done for decades to come.

This is a rare chance to help build a generational company at a true inflection point. With 1500+ customers in 60+ countries, strong product-market fit, and world-class investor support, we’re scaling fast and defining a new category in real time. The work is ambitious, the bar is high, and the opportunity for growth (personal, professional, and financial) is unmatched.

Our team moves fast, takes ownership, and is deeply committed to the mission: operating with intensity, staying close to our customers, and pushing each other for excellence. We live by three values: Decisiveness, Simplicity, and Job's Not Finished. We act quickly on clear judgment over perfect information, we believe simplicity is what scales, and we're never satisfied with where we are. If you want to do the best work of your career alongside people who share that drive, we'd love to build with you.

At Harvey, the future of professional services is being written today, and we’re just getting started.

Role Overview

Post-training is how Harvey turns expert feedback and agent traces into models that are meaningfully better at legal work. We are looking for a research engineer who can help scale that loop: defining and running model training experiments, interpreting results, and working with internal and external research partners to build better data, environments, graders, and training recipes.

This role is for someone who can self-manage model training and applied research projects. You will work closely with internal and external research collaborators on post-training efforts that matter to our product roadmap.

The ideal candidate has extensive hands-on experience training open-weight models, either in a research or production setting, and enough engineering depth to run and debug experiments efficiently.

What You'll Do

Drive post-training experiments, pushing agent performance while navigating the Pareto frontier of cost, latency, security, and governance.

Optimize agent harnesses, including domain-specific skills, tools, subagents, retrieval strategies, and validation loops that improve quality on long-horizon legal work.

Design and develop grading and reward systems that are reliable enough for evaluation, efficient enough for iteration, and strict enough for high-stakes legal work.

Study agent behavior, identifying patterns that correlate with successful work product, and converting those findings into training data, evals, or harness changes.

Work with Harvey researchers and external research partners to define experiments, evaluate methodology, review results, and keep projects moving toward concrete model improvements.

What You Have

Hands-on experience with post-training or model-training work, such as SFT, preference optimization, RLHF/RLAIF, reward modeling, distillation, or adapting open-weight models to specialized domains.

Strong judgment about model behavior: you can read traces, inspect outputs, identify failure modes, and reason about whether a metric is measuring the thing that matters.

Strong Python and research-engineering ability. You can write clean code, debug experiments, and build the simple but reliable systems needed to make research move faster.

Ability to self-manage ambiguous applied research projects and communicate clearly with researchers, engineers, product teams, domain experts, and external partners.

Nice to Have

Experience building data or evaluation infrastructure for ML workflows, such as dataset curation pipelines, model-output processing, experiment tracking, evaluation dashboards, or regression analysis tooling.

Experience with distributed training, inference systems, GPU workloads, or large-scale ML experimentation.

Research publications, open-source contributions, or shipped industry work in LLMs, agents, evaluation, or ML systems.

Compensation

$231,000 - $340,000

Depending on your location, an Applicant Privacy Notice may apply to you. You can find all of our Applicant Privacy Notices [here].

Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law. We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made by email.

Vacancy posted 2 days ago
