Research Engineer, Post-Training

$231k - $340k
Full-time

Harvey

Why Harvey

At Harvey, we’re transforming how legal and professional services operate. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise, we’re reshaping how critical knowledge work gets done for decades to come.

This is a rare chance to help build a generational company at a true inflection point. With 1500+ customers in 60+ countries, strong product-market fit, and world-class investor support, we’re scaling fast and defining a new category in real time. The work is ambitious, the bar is high, and the opportunity for growth (personal, professional, and financial) is unmatched.

Our team moves fast, takes ownership, and is deeply committed to the mission, operating with intensity, staying close to our customers, and pushing each other for excellence. We live by three values: Decisiveness, Simplicity, and Job's Not Finished. We act quickly on clear judgment over perfect information, we believe simplicity is what scales, and we're never satisfied with where we are. If you want to do the best work of your career alongside people who share that drive, we'd love to build with you.

At Harvey, the future of professional services is being written today, and we’re just getting started.

Role Overview

Post-training is how Harvey turns expert feedback and agent traces into models that are meaningfully better at legal work. We are looking for a research engineer who can help scale that loop: defining and running model training experiments, interpreting results, and working with internal and external research partners to build better data, environments, graders, and training recipes.

This role is for someone who can self-manage model training and applied research projects. You will work closely with internal and external research collaborators on post-training efforts that matter to our product roadmap. The ideal candidate has extensive hands-on experience training open-weight models, either in a research or production setting, and enough engineering depth to run and debug experiments efficiently.

What You'll Do

  • Drive post-training experiments, pushing agent performance while navigating the Pareto frontier of cost, latency, security, and governance.
  • Optimize agent harnesses, including domain-specific skills, tools, subagents, retrieval strategies, and validation loops that improve quality on long-horizon legal work.
  • Design and develop grading and reward systems that are reliable enough for evaluation, efficient enough for iteration, and strict enough for high-stakes legal work.
  • Study agent behavior, identifying patterns that correlate with successful work product, and converting those findings into training data, evals, or harness changes.
  • Work with Harvey researchers and external research partners to define experiments, evaluate methodology, review results, and keep projects moving toward concrete model improvements.

What You Have

  • Hands-on experience with post-training or model-training work, such as SFT, preference optimization, RLHF/RLAIF, reward modeling, distillation, or adapting open-weight models to specialized domains.
  • Strong judgment about model behavior: you can read traces, inspect outputs, identify failure modes, and reason about whether a metric is measuring the thing that matters.
  • Strong Python and research-engineering ability. You can write clean code, debug experiments, and build the simple but reliable systems needed to make research move faster.
  • Ability to self-manage ambiguous applied research projects and communicate clearly with researchers, engineers, product teams, domain experts, and external partners.

Nice to Have

  • Experience building data or evaluation infrastructure for ML workflows, such as dataset curation pipelines, model-output processing, experiment tracking, evaluation dashboards, or regression analysis tooling.
  • Experience with distributed training, inference systems, GPU workloads, or large-scale ML experimentation.
  • Research publications, open-source contributions, or shipped industry work in LLMs, agents, evaluation, or ML systems.

Compensation

$231,000 - $340,000

Depending on your location, an Applicant Privacy Notice may apply to you. You can find all of our Applicant Privacy Notices [here].

Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law. We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made by emailing View email address on click.appcast.io

Vacancy posted 2 days ago
Similar jobs that could be interesting for you
Based on the Research Engineer, Post-Training in San Francisco, CA vacancy
  •  ...Space Models or SSMs, a new primitive for training efficient, large‑scale foundation models...  ...in model innovation and systems engineering paired with a design‑minded product engineering...  ...building scalable systems that bridge research and production. What We Offer Lunch... 
    Training
    Work at office
    Relocation package

    Cartesia

    San Francisco, CA
    4 days ago
  • General Analysis is a security research lab focused on adversarial simulations, evaluations...  ...Francisco and work across research, engineering, and product. About the role As a...  ...Engineer, you will be responsible for post-training models for adversarial capabilities using... 
    Training

    General Analysis

    San Francisco, CA
    10 hours ago
  •  ...will fine-tune models for extraction and updates, and implement research findings while ensuring high reliability and low latency. The...  ...through trials. Key qualifications include experience in RAG, model training, proficiency in Python, and a strong understanding of PyTorch... 
    Training

    Mem0 Official Documentation

    San Francisco, CA
    4 days ago
  •  ...that learn through exploration in the real world. We're looking for research engineers with strong foundations in reinforcement learning, multimodal representation learning, or large-scale model training. Qualifications: You’ve worked on large GPU clusters, and are... 
    Training

    Pantograph

    San Francisco, CA
    10 hours ago
  • A pioneering generative modeling company in San Francisco is seeking a Research Engineer for their Physical AI team. This role focuses on designing pre-training and post-training pipelines for AI models, collaborating with industrial partners, and evaluating model performance... 
    Training
    Work at office

    Hedra, Inc

    San Francisco, CA
    4 days ago
  • Eventual, based in San Francisco, is seeking a Research Engineer for the Visual Understanding team. In this role, you'll manage visual understanding...  ...capabilities to transform vast video datasets into usable training data. Responsibilities include training multimodal models,... 
    Training
    Work at office

    Eventual

    San Francisco, CA
    1 day ago
  • $200k - $350k

     ...deeply curious—building at the intersection of research, product, and creativity . The Role As a Machine Learning Research Engineer , you’ll own end-to-end research cycles—designing, running, and analyzing post‑training experiments that help models evaluate style... 
    Training

    Coders Connect

    San Francisco, CA
    10 hours ago
  • $100k - $120k

     ...'dreams' of robot trajectories which can be collected for training. You will become a part of Coda's founding team and lead the...  ...Robotics' world models. Responsibilities Lead a team of researchers and engineers. Develop more efficient ways to generate high quality... 
    Training

    Coda Robotics

    San Francisco, CA
    4 days ago
  • $350k

     ...a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build...  ...language models Building scalable RL infrastructure and training methodologies Enhancing model reasoning capabilities We... 
    Training
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    10 hours ago
  •  ...to-end lifecycle of memory features—from research to production. You’ll fine-tune models...  ...benchmark ideas from papers; and ship with Engineering to SOTA latency, reliability, and cost ....  ...quality. What You’ll Do Fine-tune and train models for memory extraction, updates,... 
    Training

    Mem0

    San Francisco, CA
    4 days ago
  •  ...stealth team of elite founders and AI researchers, with backgrounds spanning Stanford, OpenAI...  ..., the lab ships vibes. With one, every training run, every prompt change, every agent...  ...we measure is what we want Product engineers, by instrumenting real-user behavior on... 
    Training
    Relocation package

    AGI, Inc.

    San Francisco, CA
    4 days ago
  • As a research engineer, you will work on open-ended research problems in the domain of RL, agents and data, and share your work with the public...  ...you might tackle on a day-to-day basis: Modifying our RL training stack to support end-to-end training with agent harnesses... 
    Training
    Work at office

    Proximal

    San Francisco, CA
    1 day ago
  • About the Role As an applied research engineer at Sieve, you’ll build high performance building...  ...performance out of them through clever pre/post-processing, parallelism, pipelining,...  ...building end-to-end products—not just training models Able to break problems down from... 
    Training

    Sieve

    San Francisco, CA
    2 days ago
  •  ...company based in San Francisco, California. The Role: As a Research Engineer - Agency and Reasoning , you will be a core contributor to...  ...with performing novel research in reinforcement learning, post-training, and human preference learning, and applying your ideas at... 
    Training
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    4 days ago
  • $175k - $350k

    Adept thrives at the intersection of research and product. As a research engineer, you will play a crucial role in developing and implementing AI models,...  ...them Build, operate, and maintain large-scale data and training pipelines Design and run experiments to improve our... 
    Training
    Work at office
    Remote work

    I did my part and supported the Regular Toilet

    San Francisco, CA
    4 days ago
  • About Human Archive Human Archive is a research lab backed by Y Combinator focused on modeling...  ...join us. The Opportunity As a Research Engineer, you’ll work on multimodal sensing...  ...synchronized, fused, and used for downstream VLA training. You’ll research emerging sensing... 
    Training
    Shift work

    Human Archive

    San Francisco, CA
    3 days ago
  • $315k

    As a Research Engineer or Research Scientist in Applied Finetuning, you will directly train the models we launch to the public via Claude.AI and our API. In this role, you will design and iterate on state‑of‑the‑art finetuning techniques, such as Constitutional AI and... 
    Training
    Work at office
    Home office
    Visa sponsorship
    Relocation package

    Anthropic

    San Francisco, CA
    2 days ago
  • $295k

    Research Engineer - Speech & Realtime Models B2B Applications - San Francisco About the Team OpenAI is at the forefront of artificial intelligence...  ...engineering principles. Are familiar with methods of training and fine‑tuning large language models, such as distillation,... 
    Training
    Internship

    OpenAI

    San Francisco, CA
    1 day ago
  • Introduction The Center for AI Safety is a research and field-building nonprofit located in...  ...and technical research. As a research engineer here, you will pursue a variety of research...  ...). Have experience launching and training distributed ML jobs. Communicate clearly... 
    Training
    Work at office

    Center for AI Safety

    San Francisco, CA
    2 days ago
  •  ...understand how their systems are behaving post-deployment. Instead of reactive...  ...and others. The Role: We are looking for Research Engineers to build AI systems that use agent interaction...  ...behaviors Design and implement post‑training and optimization workflows to improve... 
    Training
    Immediate start

    Judgment Labs Inc.

    San Francisco, CA
    10 hours ago
  • The role As a research scientist, you will design, implement, and optimize the large-scale training infrastructure that powers our frontier reinforcement...  ...to bring frontier post-training capabilities into...  ...Who we are : We’re a team of engineers, researchers, and operators.... 
    Training
    Work at office
    Visa sponsorship
    Relocation package

    Applied Compute Inc.

    San Francisco, CA
    1 day ago
  •  ...) Industry: AI infrastructure / Reinforcement Learning (RL) training data & evaluations Compensation: Competitive (range not provided...  ...and craftsmanship. The Opportunity Our partner is hiring a Research Engineer to help scale the quality assurance (QA) systems behind... 
    Training
    Remote work

    talentpluto

    San Francisco, CA
    10 hours ago
  •  ...breakthrough AI models at leading research labs and enterprises. Since 2...  ...to produce high-quality training data at scale Frontier Data Labeling...  ...As an Applied Research Engineer, you will be at the forefront...  ...documentation, blog posts, and educational content that... 
    Training
    Flexible hours

    HRB

    San Francisco, CA
    3 days ago
  •  ...the quality of AI across everything Gamma creates. As our Research Engineer, you'll design evaluation frameworks that measure AI output...  ...and iterate toward quality improvements. Experience with post‑training techniques for LLMs including reinforcement learning and supervised... 
    Training
    Work at office
    Work from home

    gamma.app

    San Francisco, CA
    1 day ago
  •  ...detect and remediate critical software vulnerabilities. We are training and scaling security AI agents to discover zero-days...  ...Infrastructure. About this role We’re seeking an experienced Research Engineer to join our effort in building and training AI agents for vulnerability... 
    Training
    Full time
    Work at office

    DepthFirst

    San Francisco, CA
    10 hours ago
  •  ...company based in San Francisco, California. The Role: As a Research Engineer - Brain Computer Interface Models , you will be a core contributor...  ...and preprocessing to designing novel architectures and training methodologies. You’ll Work Across: Large-scale EEG model training... 
    Training
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    2 days ago
  • At Camfer, our research engineers are training models to intelligently interpret and edit parametric CAD designs in 3D space. This is a cutting-edge challenge that requires innovative problem-solving and creative approaches to tackle complex, highly-technical problems.... 
    Training
    Work at office

    Camfer

    San Francisco, CA
    3 days ago
  •  ...a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build...  ...Team The Universes team within Research is responsible for training AI models to perform complex, difficult, long-horizon... 
    Training
    Work at office
    Remote work
    Visa sponsorship
    Shift work

    Menlo Ventures

    San Francisco, CA
    10 hours ago
  • $300k - $400k

     ...a team. About the Team The Research team develops the model and decision...  ...-the-art techniques in model training, prompting, orchestration,...  ...Role As a Senior Research Engineer, you’ll be responsible for building...  ...research. Prior experience post-training and deploying LLMs... 
    Training
    Work at office

    Decagon

    San Francisco, CA
    1 day ago
  • $200k - $350k

     ...person applied AI team spanning Ex-Moonshot (post-training & agents, diffusion model training), second-time technical founders, engineers that made 100+ games for Voodoo,...  ...engaging games & 3D environments. Our current research spans: Distributed multi-agent orchestration... 
    Training
    Visa sponsorship
    Relocation package

    Roam

    San Francisco, CA
    3 days ago
