Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Head of Evaluation

$300k - $385k

Harvey

Harvey is a secure AI platform for professionals in law, tax, and finance that augments productivity and automates complex workflows. Harvey uses algorithms with reasoning-adept LLMs that have been customized by our expert team of lawyers, engineers and research scientists. We’ve found product market fit and are scaling our team very quickly. Some reasons to join Harvey are:

  • Exceptional product market fit: We have partnered with the largest law firms and professional service providers in the world like A&O, PwC, and many others.
  • Strategic investors: Raised over $100 million from strategic investors including Sequoia, Kleiner Perkins, and the OpenAI Startup Fund.
  • World-class team: Harvey is hiring the best technical and non-technical talent from DeepMind, Google Brain, Stripe, FAIR, Tesla Autopilot, Superhuman, Glean, etc.
  • Partnerships: Our engineers and researchers work directly with OpenAI to build the future of generative AI and redefine professional services.
  • Value: Top of market cash and equity compensation.

Role

We are looking for a technical lead who can own the development of our evaluation platform. In this role, you will:

  • Build a team of 10-20 researchers and engineers with experience evaluating LLMs and large-scale AI systems.
  • Lead research and development of novel model-based evaluation methods and language model programs for evaluating complex tasks in legal and professional services.
  • Design and implement a red-teaming pipeline for our custom models and collaborate with other research teams to fine-tune models from human feedback.
  • Train reward models that accurately reflect the preferences of top-tier domain experts.
  • Experiment with synthetic data generation and LLM-based data augmentation to complement human-generated eval benchmarks.

Impact:

  • Lead research and development of Harvey’s evaluation platform.
  • Contribute to a product that transforms the nature of professional services.
  • Help define what it means for LLMs to effectively perform complex knowledge work tasks.
  • Work directly with our founders, research, and product teams, as well as foundation model providers like OpenAI.
  • Tackle unsolved research and engineering problems, including the hardest in the world relevant to LLMs in production.

Qualifications

  • 5+ years experience leading highly-technical teams composed of both researchers and engineers.
  • Experience evaluating large-scale AI systems in high-stakes settings.
  • Technical: can serve as a tech lead and contribute substantially to our codebase as necessary.
  • Ability to communicate complex technical outcomes to diverse stakeholders.
  • Strong conviction in setting technical direction.

Compensation

The expected range of compensation for this role is between $300,000 and $385,000. Additionally, this role is eligible to participate in our equity plan. The successful candidate’s starting salary will be determined based on non-discriminatory factors such as skills, experience, and geographic location.

#J-18808-Ljbffr
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Head of Evaluation in San Francisco, CA vacancy
  •  ...creating exceptional impact for e-commerce businesses worldwide. Head of Computer Vision Compensation: Competitive salary +...  ...for our ambitious growth phase, building comprehensive evaluation frameworks and creating space for calculated risk-taking alongside... 
    Suggested
    Work at office
    Local area
    Remote work
    Worldwide
    Home office
    Visa sponsorship
    Relocation package
    Flexible hours

    Photoroom

    San Francisco, CA
    15 days ago
  • $140k - $190k

     ...Head of Disputes At Cardless, we're building a credit card and loyalty platform that consumer businesses use to engage their customers...  ...rules, regulatory timelines, fraud pattern analysis, evidence evaluation — good disputes work isn’t rote. It rewards people who think... 
    Suggested
    Work at office
    Flexible hours

    Cardless

    San Francisco, CA
    4 days ago
  • $275k - $325k

     ...therapeutic lifecycle. The Role & Your Mission We're looking for a Head of Applied AI that can manage and lead our Applied AI team at...  ...and their derivatives (e.g., agents, RAG) in order to build, evaluate, and deploy AI features that are valuable, controllable, and... 
    Suggested
    Work at office
    Remote work
    Flexible hours

    Weave Bio

    San Francisco, CA
    3 days ago
  •  ...The opportunity We are seeking a Head of Lab Platform to join our team working at the interface of generative AI and synthetic...  ...capabilities and progress towards full lab autonomy. Identify, evaluate, and integrate new automation hardware and software. Identify... 
    Suggested
    Flexible hours

    Latent Labs

    San Francisco, CA
    1 day ago
  • $200k

     ...families perceive us, how thoroughly they grasp the model, and whether their enrollment decisions stem from sound reasoning. You will evaluate compatibility, facilitate meaningful discussions, and advise against enrollment when appropriate. The objective is not to maximize... 
    Suggested
    Full time
    Remote work
    Relocation
    Visa sponsorship
    Relocation package

    Crossover for Work

    San Francisco, CA
    3 days ago
  • $220k - $240k

     ...Fintech 50 2022 Role Overview We are looking to hire a Head of Credit to expand and improve our underwriting capabilities and...  ...by leveraging bank transactional data, allowing us to both evaluate new applicants and continuously monitor the performance of our outstanding... 
    Work experience placement
    Local area
    Flexible hours

    Brigit

    San Francisco, CA
    3 days ago
  •  ...systems at Cockroach Labs. Our GTM is led by Jon Boyer, formerly Head of Sales at Zapier. We’re now extending the same CI...  ...Lead hardware procurement and infrastructure expansion efforts.Evaluate new suppliers, hardware platforms, and deployment opportunities.... 

    Blacksmith

    San Francisco, CA
    4 days ago
  •  ...Head Of GTM, AI Inference Hybrid At Cloudflare, we are on a mission to help build a better Internet. Today the company runs...  ...global market. You'll work directly with Workers AI prospects evaluating Cloudflare as a source of GPU capacity, partner with product leadership... 
    Temporary work
    Flexible hours
    Shift work

    Cloudflare Inc

    San Francisco, CA
    10 days ago
  •  ...drinks ~401K plan ~ Unlimited PTO About Us Founding team: The core methodology behind this platform comes from NLP evaluation research we had done at Stanford. We raised a $5M seed from some of the top institutional and angel investors in the valley. Our... 
    Work experience placement
    Relocation package
    Shift work

    Vals AI

    San Francisco, CA
    3 days ago
  •  ...regulatory and cultural realities of every region. We are hiring a Head of Video Policy & Regional High Harm to lead this team of...  ...moderation systems, including policy guidance, labeling standards, and evaluation frameworks. * Define and implement quality frameworks to... 

    Tik Tok

    San Francisco, CA
    1 day ago
  •  ...States, EMEA, and APAC. Role Overview We are looking for a Head of Machine Learning to lead the development of the next...  ...services. Work directly in the codebase to prototype models, evaluate approaches, and ship production systems. Build and lead the machine... 

    RZR Global Inc.

    San Francisco, CA
    1 day ago
  •  ...Head Of Ai & Machine Learning As the Head of AI & Machine Learning, you will lead the development of transformative AI systems,...  ...including documents, market signals, and user inputs. Design novel evaluation frameworks to measure performance, trust, and qualitative... 

    Pivotal Solutions Inc

    San Francisco, CA
    2 days ago
  •  ...Modeled and reasoned about GPU collective operations (e.g., NCCL, RCCL, or equivalents) across nodes, racks, and pods. Evaluated how collective communication patterns interact with emerging accelerators and non-GPU compute elements. Worked with software... 
    Casual work
    Visa sponsorship

    Sygaldry

    San Francisco, CA
    3 days ago
  •  ...Experience Required 5-8 Years of experience Job Title Head of Applied AI About the company InCommon is hiring on behalf...  ..., and being the credible technical voice in enterprise evaluations, investor conversations, and public research. You will work directly... 
    Work at office
    Flexible hours

    InCommon LLC

    San Francisco, CA
    3 days ago
  • $230k - $300k

     ...individual contributors. Responsibilities Develop and train CV models Design end-to-end pipelines for data ingestion, training, evaluation, and inference Scale infrastructure for large training and inference volume while minimizing costs Entire reliably of... 
    Full time
    For contractors

    Bobyard

    San Francisco, CA
    3 days ago
  •  ...frontier of physical intelligence. Role Overview As the Head of AI, you will serve as both our chief scientific visionary and...  ...example by remaining actively involved in designing, training, and evaluating state‑of‑the‑art foundational models. Scientific Excellence:... 
    Work at office
    Relocation
    3 days per week

    Alfa AI

    San Francisco, CA
    3 days ago
  • $305k - $340k

     ...legislative partners. Communications: In partnership with the head of Marketing and External Communications, develop communication...  ...national, state, and local public policy organizations. Monitor and Evaluate: Continuously monitor, analyze, and evaluate legislative and... 
    Work experience placement
    Bank staff
    Work at office
    Local area
    Flexible hours

    Federal Home Loan Bank of San Francisco

    San Francisco, CA
    3 days ago
  •  ...data-driven approach to understanding how organizations discover, evaluate, and adopt AI. We partner closely with Sales, Revenue Operations...  ...and long-term value. About the Role We’re looking for a Head of Demand Generation to build and lead OpenAI’s enterprise demand... 
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    2 days ago
  •  ...rolling up your sleeves to design and execute the next generation of growth initiatives. What You’ll Do Refresh GTM strategy: Re-evaluate our current growth engines (SEO, paid, social, partnerships) and design a plan to 10x revenue. Experiment & innovate: Stay ahead... 
    Full time
    Work at office
    Shift work

    53 Stations

    San Francisco, CA
    3 days ago
  •  ...hyperscaler scale. Model and reason about GPU collective operations (e.g., NCCL, RCCL, or equivalents) across nodes, racks, and pods. Evaluate how collective communication patterns interact with emerging accelerators and non‑GPU compute elements. Work with software and... 
    Full time
    Casual work
    Visa sponsorship

    Wheel the World

    San Francisco, CA
    1 day ago
  • $212k

     ...for long-term customer success. You will own the "Technical Win," serving as the primary authority during the pre-sales phase to evaluate customer needs and scope complex AI data engagements. Your influence extends beyond the sale; you will act as a critical voice in... 
    Full time
    Work at office
    Local area
    Remote work
    Flexible hours

    Uber

    San Francisco, CA
    1 day ago
  • $280k - $320k

     ...,000 + Equity About the Role A fast‑moving startup is seeking a Head of AI to build and lead its artificial intelligence function from...  ..., ensuring safety, reliability, robustness, and transparency. Evaluate and integrate emerging AI technologies, including LLMs, agentic... 

    Harnham

    San Francisco, CA
    2 days ago
  • $250k - $285k

     ...answer might be. You think about controls, confounds, and interpretability across domains, not just within your own. Experience evaluating scientific work outside your own area of training. This role requires making decisions about programs spanning immunology, translational... 
    Full time
    Contract work

    REACH INDUSTRIES

    San Francisco, CA
    22 hours ago
  • $139.3k - $174.2k

     ...the world. About the Role Armada is hiring a Head of Analyst Relations to own and grow our analyst relations program with a key focus on participation in key industry evaluations such as Gartner Magic Quadrants, Forrester Waves, and similar reports... 
    Work at office
    Flexible hours

    Armada

    San Francisco, CA
    2 days ago
  •  ...reviewing CVs, qualifications, and assignments by providing additional insights, but it does not make decisions, every application is evaluated at each step by a member of our hiring team. We also use AI to transcribe interviews so our team can stay focused on the... 
    Remote work
    Shift work
    Night shift

    Luxor Technology Corp

    San Francisco, CA
    1 day ago
  •  ...Head Of Ai Agent Systems San Francisco About Wonderschool Wonderschool builds software and systems that help businesses operate...  ...failures, memory loss, and context limitations Design evaluation systems to measure success rates, failure modes, and reliability... 
    Immediate start
    Shift work

    Wonderschool

    San Francisco, CA
    a month ago
  • $200k - $270k

     ...Head Of Developer Experience Los Angeles, San Francisco About HeyGen At HeyGen, our mission is to make visual storytelling...  ...agents and developer workflows — and to own how developers discover, evaluate, adopt, and build on our platform. The AI agent ecosystem is... 
    Work experience placement
    Remote work

    Heygen

    San Francisco, CA
    4 days ago
  • $184k - $260k

     ...We’re seeking a strategic and experienced Head of Global Benefits to lead the design, implementation, and scaling of our benefits programs...  ...manage benefits budgets, forecasting, and cost optimization Evaluate and benchmark programs to ensure competitiveness in each market... 
    Local area
    Worldwide

    Datadog

    San Francisco, CA
    3 days ago
  • $170k - $320k

     ...leads) who are outside of VAS but part of Global Finance. The Head of VAS Pricing reports to the VP, Head of VAS Pricing & Deals....  ...Financial acumen-ability to utilize sophisticated financial analyses to evaluate business opportunities and make strategic choices ~ Highly... 
    Work experience placement
    Work at office
    Local area

    Visa

    San Francisco, CA
    2 days ago
  •  ...equally self-directed and agile Employee Success (ES) team. The Head of AI-Native ES will lead this experimental team and serve as...  ...playbooks, and pioneer new ways of organizing, compensating, and evaluating AI-native talent. You'll work across the organization with the... 

    B Capital

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Head of Evaluation. Be the first to apply!