Head of Evaluation

$300k - $385k

Harvey

Harvey is a secure AI platform for professionals in law, tax, and finance that augments productivity and automates complex workflows. Harvey uses algorithms with reasoning-adept LLMs that have been customized by our expert team of lawyers, engineers and research scientists. We’ve found product market fit and are scaling our team very quickly. Some reasons to join Harvey are:

Exceptional product market fit: We have partnered with the largest law firms and professional service providers in the world like A&O, PwC, and many others.

Strategic investors: Raised over $100 million from strategic investors including Sequoia, Kleiner Perkins, and the OpenAI Startup Fund.

World-class team: Harvey is hiring the best technical and non-technical talent from DeepMind, Google Brain, Stripe, FAIR, Tesla Autopilot, Superhuman, Glean, etc.

Partnerships: Our engineers and researchers work directly with OpenAI to build the future of generative AI and redefine professional services.

Value: Top of market cash and equity compensation.

Role

We are looking for a technical lead who can own the development of our evaluation platform. In this role, you will:

Build a team of 10-20 researchers and engineers with experience evaluating LLMs and large-scale AI systems.

Lead research and development of novel model-based evaluation methods and language model programs for evaluating complex tasks in legal and professional services.

Design and implement a red-teaming pipeline for our custom models and collaborate with other research teams to fine-tune models from human feedback.

Train reward models that accurately reflect the preferences of top-tier domain experts.

Experiment with synthetic data generation and LLM-based data augmentation to complement human-generated eval benchmarks.

Impact:

Lead research and development of Harvey’s evaluation platform.

Contribute to a product that transforms the nature of professional services.

Help define what it means for LLMs to effectively perform complex knowledge work tasks.

Work directly with our founders, research, and product teams, as well as foundation model providers like OpenAI.

Tackle unsolved research and engineering problems, including the hardest in the world relevant to LLMs in production.

Qualifications

5+ years experience leading highly-technical teams composed of both researchers and engineers.

Experience evaluating large-scale AI systems in high-stakes settings.

Technical: can serve as a tech lead and contribute substantially to our codebase as necessary.

Ability to communicate complex technical outcomes to diverse stakeholders.

Strong conviction in setting technical direction.

Compensation

The expected range of compensation for this role is between $300,000 and $385,000. Additionally, this role is eligible to participate in our equity plan. The successful candidate’s starting salary will be determined based on non-discriminatory factors such as skills, experience, and geographic location.

#J-18808-Ljbffr

Apply

Vacancy posted 3 days ago

Similar jobs that could be interesting for youBased on the Head of Evaluation in San Francisco, CA vacancy

Head of Computer Vision
...creating exceptional impact for e-commerce businesses worldwide. Head of Computer Vision Compensation: Competitive salary +... ...for our ambitious growth phase, building comprehensive evaluation frameworks and creating space for calculated risk-taking alongside...
Suggested
Work at office
Local area
Remote work
Worldwide
Home office
Visa sponsorship
Relocation package
Flexible hours
Photoroom
San Francisco, CA
15 days ago
Head of Disputes
$140k - $190k
...Head of Disputes At Cardless, we're building a credit card and loyalty platform that consumer businesses use to engage their customers... ...rules, regulatory timelines, fraud pattern analysis, evidence evaluation — good disputes work isn’t rote. It rewards people who think...
Suggested
Work at office
Flexible hours
Cardless
San Francisco, CA
4 days ago
Head of Applied AI
$275k - $325k
...therapeutic lifecycle. The Role & Your Mission We're looking for a Head of Applied AI that can manage and lead our Applied AI team at... ...and their derivatives (e.g., agents, RAG) in order to build, evaluate, and deploy AI features that are valuable, controllable, and...
Suggested
Work at office
Remote work
Flexible hours
Weave Bio
San Francisco, CA
3 days ago
Head of Lab Platform
...The opportunity We are seeking a Head of Lab Platform to join our team working at the interface of generative AI and synthetic... ...capabilities and progress towards full lab autonomy. Identify, evaluate, and integrate new automation hardware and software. Identify...
Suggested
Flexible hours
Latent Labs
San Francisco, CA
1 day ago
Head of Admissions, 2 Hour Learning (Remote) - $200,000/year USD
$200k
...families perceive us, how thoroughly they grasp the model, and whether their enrollment decisions stem from sound reasoning. You will evaluate compatibility, facilitate meaningful discussions, and advise against enrollment when appropriate. The objective is not to maximize...
Suggested
Full time
Remote work
Relocation
Visa sponsorship
Relocation package
Crossover for Work
San Francisco, CA
3 days ago
Head of Credit
$220k - $240k
...Fintech 50 2022 Role Overview We are looking to hire a Head of Credit to expand and improve our underwriting capabilities and... ...by leveraging bank transactional data, allowing us to both evaluate new applicants and continuously monitor the performance of our outstanding...
Work experience placement
Local area
Flexible hours
Brigit
San Francisco, CA
3 days ago
Head of Capacity
...systems at Cockroach Labs. Our GTM is led by Jon Boyer, formerly Head of Sales at Zapier. We’re now extending the same CI... ...Lead hardware procurement and infrastructure expansion efforts.Evaluate new suppliers, hardware platforms, and deployment opportunities....
Blacksmith
San Francisco, CA
4 days ago
Head of GTM, AI Inference
...Head Of GTM, AI Inference Hybrid At Cloudflare, we are on a mission to help build a better Internet. Today the company runs... ...global market. You'll work directly with Workers AI prospects evaluating Cloudflare as a source of GPU capacity, partner with product leadership...
Temporary work
Flexible hours
Shift work
Cloudflare Inc
San Francisco, CA
10 days ago
Head of Audience
...drinks ~401K plan ~ Unlimited PTO About Us Founding team: The core methodology behind this platform comes from NLP evaluation research we had done at Stanford. We raised a $5M seed from some of the top institutional and angel investors in the valley. Our...
Work experience placement
Relocation package
Shift work
Vals AI
San Francisco, CA
3 days ago
Head of Video Policy & Regional High Harm - Trust and Safety
...regulatory and cultural realities of every region. We are hiring a Head of Video Policy & Regional High Harm to lead this team of... ...moderation systems, including policy guidance, labeling standards, and evaluation frameworks. * Define and implement quality frameworks to...
Tik Tok
San Francisco, CA
1 day ago
Head of Machine Learning
...States, EMEA, and APAC. Role Overview We are looking for a Head of Machine Learning to lead the development of the next... ...services. Work directly in the codebase to prototype models, evaluate approaches, and ship production systems. Build and lead the machine...
RZR Global Inc.
San Francisco, CA
1 day ago
Head of AI & Machine Learning
...Head Of Ai & Machine Learning As the Head of AI & Machine Learning, you will lead the development of transformative AI systems,... ...including documents, market signals, and user inputs. Design novel evaluation frameworks to measure performance, trust, and qualitative...
Pivotal Solutions Inc
San Francisco, CA
2 days ago
Head of System Architecture, Frontier AI Hardware
...Modeled and reasoned about GPU collective operations (e.g., NCCL, RCCL, or equivalents) across nodes, racks, and pods. Evaluated how collective communication patterns interact with emerging accelerators and non-GPU compute elements. Worked with software...
Casual work
Visa sponsorship
Sygaldry
San Francisco, CA
3 days ago
Head of Applied AI
...Experience Required 5-8 Years of experience Job Title Head of Applied AI About the company InCommon is hiring on behalf... ..., and being the credible technical voice in enterprise evaluations, investor conversations, and public research. You will work directly...
Work at office
Flexible hours
InCommon LLC
San Francisco, CA
3 days ago
Head of Computer Vision
$230k - $300k
...individual contributors. Responsibilities Develop and train CV models Design end-to-end pipelines for data ingestion, training, evaluation, and inference Scale infrastructure for large training and inference volume while minimizing costs Entire reliably of...
Full time
For contractors
Bobyard
San Francisco, CA
3 days ago
Head of AI
...frontier of physical intelligence. Role Overview As the Head of AI, you will serve as both our chief scientific visionary and... ...example by remaining actively involved in designing, training, and evaluating state‑of‑the‑art foundational models. Scientific Excellence:...
Work at office
Relocation
3 days per week
Alfa AI
San Francisco, CA
3 days ago
Head of Public Affairs and Industry Outreach, SVP
$305k - $340k
...legislative partners. Communications: In partnership with the head of Marketing and External Communications, develop communication... ...national, state, and local public policy organizations. Monitor and Evaluate: Continuously monitor, analyze, and evaluate legislative and...
Work experience placement
Bank staff
Work at office
Local area
Flexible hours
Federal Home Loan Bank of San Francisco
San Francisco, CA
3 days ago
Head of Demand Generation
...data-driven approach to understanding how organizations discover, evaluate, and adopt AI. We partner closely with Sales, Revenue Operations... ...and long-term value. About the Role We’re looking for a Head of Demand Generation to build and lead OpenAI’s enterprise demand...
Work at office
Relocation package
OpenAI
San Francisco, CA
2 days ago
Head of Revenue
...rolling up your sleeves to design and execute the next generation of growth initiatives. What You’ll Do Refresh GTM strategy: Re-evaluate our current growth engines (SEO, paid, social, partnerships) and design a plan to 10x revenue. Experiment & innovate: Stay ahead...
Full time
Work at office
Shift work
53 Stations
San Francisco, CA
3 days ago
Head of System Architecture, Frontier AI Hardware
...hyperscaler scale. Model and reason about GPU collective operations (e.g., NCCL, RCCL, or equivalents) across nodes, racks, and pods. Evaluate how collective communication patterns interact with emerging accelerators and non‑GPU compute elements. Work with software and...
Full time
Casual work
Visa sponsorship
Wheel the World
San Francisco, CA
1 day ago
Head of AI Solutions Architecture & Pre-Sales
$212k
...for long-term customer success. You will own the "Technical Win," serving as the primary authority during the pre-sales phase to evaluate customer needs and scope complex AI data engagements. Your influence extends beyond the sale; you will act as a critical voice in...
Full time
Work at office
Local area
Remote work
Flexible hours
Uber
San Francisco, CA
1 day ago
Head of Artificial Intelligence
$280k - $320k
...,000 + Equity About the Role A fast‑moving startup is seeking a Head of AI to build and lead its artificial intelligence function from... ..., ensuring safety, reliability, robustness, and transparency. Evaluate and integrate emerging AI technologies, including LLMs, agentic...
Harnham
San Francisco, CA
2 days ago
Head of Biology
$250k - $285k
...answer might be. You think about controls, confounds, and interpretability across domains, not just within your own. Experience evaluating scientific work outside your own area of training. This role requires making decisions about programs spanning immunology, translational...
Full time
Contract work
REACH INDUSTRIES
San Francisco, CA
22 hours ago
Head of Analyst Relations
$139.3k - $174.2k
...the world. About the Role Armada is hiring a Head of Analyst Relations to own and grow our analyst relations program with a key focus on participation in key industry evaluations such as Gartner Magic Quadrants, Forrester Waves, and similar reports...
Work at office
Flexible hours
Armada
San Francisco, CA
2 days ago
Head of Developer Relations
...reviewing CVs, qualifications, and assignments by providing additional insights, but it does not make decisions, every application is evaluated at each step by a member of our hiring team. We also use AI to transcribe interviews so our team can stay focused on the...
Remote work
Shift work
Night shift
Luxor Technology Corp
San Francisco, CA
1 day ago
Head of AI Agent Systems
...Head Of Ai Agent Systems San Francisco About Wonderschool Wonderschool builds software and systems that help businesses operate... ...failures, memory loss, and context limitations Design evaluation systems to measure success rates, failure modes, and reliability...
Immediate start
Shift work
Wonderschool
San Francisco, CA
a month ago
Head of Developer Experience
$200k - $270k
...Head Of Developer Experience Los Angeles, San Francisco About HeyGen At HeyGen, our mission is to make visual storytelling... ...agents and developer workflows — and to own how developers discover, evaluate, adopt, and build on our platform. The AI agent ecosystem is...
Work experience placement
Remote work
Heygen
San Francisco, CA
4 days ago
Director, Head of Global Benefits
$184k - $260k
...We’re seeking a strategic and experienced Head of Global Benefits to lead the design, implementation, and scaling of our benefits programs... ...manage benefits budgets, forecasting, and cost optimization Evaluate and benchmark programs to ensure competitiveness in each market...
Local area
Worldwide
Datadog
San Francisco, CA
3 days ago
Senior Director, Head of Pricing
$170k - $320k
...leads) who are outside of VAS but part of Global Finance. The Head of VAS Pricing reports to the VP, Head of VAS Pricing & Deals.... ...Financial acumen-ability to utilize sophisticated financial analyses to evaluate business opportunities and make strategic choices ~ Highly...
Work experience placement
Work at office
Local area
Visa
San Francisco, CA
2 days ago
Senior Director, Head of AI-Native ES
...equally self-directed and agile Employee Success (ES) team. The Head of AI-Native ES will lead this experimental team and serve as... ...playbooks, and pioneer new ways of organizing, compensating, and evaluating AI-native talent. You'll work across the organization with the...
B Capital
San Francisco, CA
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Head of Evaluation. Be the first to apply!