Head of Evaluation
$300k - $385k
Harvey
Harvey is a secure AI platform for professionals in law, tax, and finance that augments productivity and automates complex workflows. Harvey uses algorithms with reasoning-adept LLMs that have been customized by our expert team of lawyers, engineers and research scientists. We’ve found product-market fit and are scaling our team very quickly. Some reasons to join Harvey are:
- Exceptional product-market fit: We have partnered with the largest law firms and professional service providers in the world, such as A&O, PwC, and many others.
- Strategic investors: Raised over $100 million from strategic investors including Sequoia, Kleiner Perkins, and the OpenAI Startup Fund.
- World-class team: Harvey is hiring the best technical and non-technical talent from DeepMind, Google Brain, Stripe, FAIR, Tesla Autopilot, Superhuman, Glean, etc.
- Partnerships: Our engineers and researchers work directly with OpenAI to build the future of generative AI and redefine professional services.
- Value: Top of market cash and equity compensation.
Role
We are looking for a technical lead who can own the development of our evaluation platform. In this role, you will:
- Build a team of 10-20 researchers and engineers with experience evaluating LLMs and large-scale AI systems.
- Lead research and development of novel model-based evaluation methods and language model programs for evaluating complex tasks in legal and professional services.
- Design and implement a red-teaming pipeline for our custom models and collaborate with other research teams to fine-tune models from human feedback.
- Train reward models that accurately reflect the preferences of top-tier domain experts.
- Experiment with synthetic data generation and LLM-based data augmentation to complement human-generated eval benchmarks.
Impact
- Lead research and development of Harvey’s evaluation platform.
- Contribute to a product that transforms the nature of professional services.
- Help define what it means for LLMs to effectively perform complex knowledge work tasks.
- Work directly with our founders, research, and product teams, as well as foundation model providers like OpenAI.
- Tackle unsolved research and engineering problems, including the world's hardest problems in running LLMs in production.
Qualifications
- 5+ years of experience leading highly technical teams composed of both researchers and engineers.
- Experience evaluating large-scale AI systems in high-stakes settings.
- Technical: can serve as a tech lead and contribute substantially to our codebase as necessary.
- Ability to communicate complex technical outcomes to diverse stakeholders.
- Strong conviction in setting technical direction.
Compensation
The expected range of compensation for this role is between $300,000 and $385,000. Additionally, this role is eligible to participate in our equity plan. The successful candidate’s starting salary will be determined based on non-discriminatory factors such as skills, experience, and geographic location.