Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Expert Prompt Curators for Advanced AI Evaluation Dataset

CloudDevs

Expert Prompt Curators for Advanced AI Evaluation Dataset This description is a summary of our understanding of the job description. Click on ‘Apply’ button to find out more. Role Description Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. Key Responsibilities Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. Test prompts against advanced AI models and document failures/successes. Provide reasoning steps and solutions for each prompt. Classify prompts into subject domains for dataset organization. Collaborate with reviewers for expert validation and prompt refinement. Qualifications Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. Experience in academic research, benchmarking, or test question design preferred. Attention to detail and ability to provide concise reasoning explanations. Familiarity with AI models and their limitations is a plus. Requirements Remote and asynchronous — set your own hours. Expected commitment: ~10–20 hours/week. Project duration: ~2 months, with possible extensions based on dataset needs. Opportunity to contribute to high-impact AI safety and evaluation research. Compensation & Contract Terms Competitive hourly compensation based on expertise. Independent contractor engagement. Payments for services rendered processed weekly via Stripe Connect. Application Process Submit your resume or CV highlighting your subject matter expertise. Complete a brief questionnaire about your background and areas of specialization. Selected applicants may be asked to draft a short test prompt. You’ll receive follow-up within a few days regarding next steps. #J-18808-Ljbffr

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Expert Prompt Curators for Advanced AI Evaluation Dataset in New York, NY vacancy
  • A leading AI research firm is seeking Expert Prompt Curators to design challenging prompts for evaluating advanced AI models. The role requires advanced knowledge in diverse fields and offers flexible hours, remote work, and a competitive hourly wage. Ideal candidates... 
    Suggested
    Remote job
    Hourly pay
    Temporary work
    Flexible hours

    CloudDevs

    New York, NY
    2 days ago
  • $20 - $23 per hour

     ...business process outsourcing, advanced professional services,...  ...in writing and evaluation (location is not a factor...  ...assigned. Work may include prompt creation, response evaluation...  ...Position We are looking for AI Writing Evaluators (Domain Experts) with strong analytical... 
    Suggested
    Hourly pay
    Full time
    Freelance
    Immediate start
    Remote work
    Monday to Friday
    3 days per week

    Volga Partners

    New York, NY
    2 days ago
  • $150 per hour

     ...Modern MedEd is hiring Psychiatry experts to design clinical scenarios and evaluate AI-generated model outputs in healthcare. This role requires board certification...  .... Responsibilities include creating realistic prompts, grading responses, and providing feedback for model... 
    Suggested
    Remote work
    Flexible hours

    Modern MedEd

    New York, NY
    2 days ago
  • $40 per hour

    A technology firm is seeking an advanced mathematician with an advanced degree to help train AI models. Responsibilities include solving complex mathematical problems, evaluating AI performance, and ensuring quality. Candidates should have fluency in English, strong detail... 
    Suggested
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    New York, NY
    1 day ago
  • A leading educational organization is seeking a Linguistics Expert for a remote position to research and curate advanced linguistics materials into datasets for AI training. This role focuses on scientific accuracy and academic rigor, requiring a strong background in linguistics... 
    Suggested
    Remote job
    Hourly pay
    10 hours per week

    Crossing Hurdles

    New York, NY
    2 days ago
  • $8 - $65 per hour

     ...a Bash Coding Specialist for a freelance AI Trainer project based in the United States...  ...technical writing. You will challenge advanced language models using Bash to support AI...  ...verification, error assessment, and improving prompt engineering. Ideal candidates have... 
    Remote job
    Hourly pay
    Freelance

    Meridial

    New York, NY
    4 days ago
  • Prolific in New York, NY, is seeking Chemistry Experts and Chemical Engineers to join its...  .... In this role, you will help train and evaluate AI models using your chemical expertise....  ...reactions. Ideal candidates hold a relevant advanced degree and possess deep chemistry... 
    Remote job
    Work from home
    Flexible hours

    Prolific

    New York, NY
    4 days ago
  •  ...education and research sector is seeking a Social Sciences PhD Expert for a remote part-time role, requiring strong analytical...  ...work. The position involves creating historically relevant prompts, evaluating AI outputs, and contributing to research initiatives. Ideal candidates... 
    Remote job
    Part time
    Flexible hours

    Crossing Hurdles

    New York, NY
    2 days ago
  •  ...SME Careers is seeking a remote Data Scientist to contribute to AI training content and ensure model integrity. The role involves developing AI prompts, evaluating AI responses, and testing for model reliability. Ideal candidates will have a degree in a quantitative field... 
    Hourly pay
    For contractors
    Remote work

    SME Careers

    New York, NY
    14 hours ago
  •  ...A leading AI training platform is seeking Advanced Japanese Speakers to train and evaluate AI models. Candidates will complete tasks such as analyzing and writing in Japanese...  .... Successful applicants join as Domain Expert participants, earning competitive pay for completed... 
    Remote work
    Work from home

    Prolific - UK Job Board?

    New York, NY
    2 days ago
  • $30 per hour

    A leading AI training platform is seeking Advanced Japanese Speakers for a remote AI Trainer role. You will train and evaluate AI models by completing tasks in Japanese, with competitive pay rates around $30/hr. Ideal candidates should have advanced Japanese language skills... 
    Remote job
    Flexible hours

    Prolific - UK Job Board?

    New York, NY
    2 days ago
  • $175k - $250k

     ...changing that. We're building the AI-native financial platform...  ...health systems and medical groups evaluate risk, price contracts,...  ...Build proprietary benchmarks and datasets to evaluate models and AI Agents...  ...exposure to AI/ML concepts or prompt engineering Experience at a high... 
    Full time
    Contract work
    Work at office
    Shift work

    Translucent AI

    New York, NY
    3 days ago
  • $175k - $250k

     ...changing that. We're building the agentic AI platform designed exclusively for...  ...workflows Build proprietary benchmarks and datasets to evaluate models and AI Agents against real-world...  ...Prior exposure to AI/ML concepts or prompt engineering Anticipated compensation: $... 
    Full time
    Work at office

    Translucent AI

    New York, NY
    3 days ago
  • $8 - $65 per hour

     ...Prolific is hiring Mental Health Professionals in New York to train and evaluate AI models. As a Domain Expert, you will be responsible for reviewing AI-generated responses, completing tasks related to psychology, and improving AI models based on your expertise. Pay rates... 
    Hourly pay
    Remote work
    Work from home
    Flexible hours

    Prolific

    New York, NY
    4 days ago
  • $100 - $120 per hour

     ...motivated Insurance Underwriting AI Expert to join our growing team....  ...domain expertise and advanced analytics, focusing on transforming...  ..., improving accuracy in evaluating applicant profiles, exposures...  ...historical data and external datasets to identify risk patterns, trends... 
    Hourly pay
    Contract work
    Part time
    Remote work

    Weekday AI (YC W21)

    New York, NY
    2 days ago
  • $5,000 per month

     ...in touch. More about this AI for Sales course Sales is...  ...any code. You’ll practice prompt patterns for sales tasks, evaluate outputs for accuracy and tone...  ...what they need to advance in their field Record authentic...  ...content created by industry experts (that’s you!) featuring... 

    Ziplines

    New York, NY
    2 days ago
  • Freelancer - Biology Expert for GenAI Prompts Review About the Role: We are seeking...  ...involving Generative AI (GenAI) by creating biology-...  ...dangerous CBRN-related outputs). Evaluate AI-generated responses to...  ...evaluations. Requirements Advanced degree in Biology (Ph.D. preferred... 
    Freelance
    Remote work

    ActiveFence

    New York, NY
    2 days ago
  • Obsidian is seeking experienced retail banking professionals to evaluate and enhance AI systems focused on consumer banking operations. You will review AI outputs, create scenarios based on banking workflows, and collaborate with research teams. The ideal candidate has... 

    Obsidian

    New York, NY
    2 days ago
  • $40 per hour

    A technology company is seeking a Physics Expert to train AI models and evaluate their outputs. This role requires expertise in physics and a strong understanding of classical mechanics, thermodynamics, and quantum concepts. The position is remote and offers flexible work... 
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    Brooklyn, NY
    3 days ago
  • A leading research services provider is seeking a Chemistry Expert (PhD) to evaluate complex chemistry problems and review AI-generated outputs for accuracy. This remote, hourly contract role requires deep subject-matter expertise and excellent communication skills. Candidates... 
    Remote job
    Hourly pay
    Contract work
    Flexible hours

    Crossing Hurdles

    New York, NY
    2 days ago
  • A leading AI data platform is seeking individuals with Computer Science expertise to work as self-employed AI Trainers. This role involves completing tasks related to training and evaluating AI models. Successful applicants will have a strong understanding of programming... 
    Remote job
    Hourly pay
    Self employment
    Flexible hours

    Prolific - UK Job Board?

    New York, NY
    2 days ago
  •  ...A tech company focusing on AI research is looking for experienced Krita users for a flexible, project-based contract opportunity. This role allows you to earn while evaluating AI-generated content related to digital painting and concept art. Candidates should have at... 
    Contract work
    Remote work
    Flexible hours

    Handshake

    New York, NY
    1 day ago
  • A technology firm in Pennsylvania is seeking a Physics Expert to train AI models by providing complex physics challenges and evaluating the outputs of these AI chatbots. You must have an expert-level knowledge of physics, particularly in areas like classical mechanics... 
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    New York, NY
    2 days ago
  • Transperfect is looking for doctoral-level experts in STEM disciplines, especially Physics, to contribute to a high-impact AI training project. You will design complex prompts aimed at testing Large Language Models (LLMs) and generate thorough solutions while utilizing... 

    Transperfect

    New York, NY
    3 days ago
  • A leading AI training company is seeking a Biology Instructor to improve AI models by evaluating responses to complex biology queries. The position allows for full-time or part-time hours, with the flexibility to choose projects and work remotely. Ideal candidates should... 
    Remote job
    Hourly pay
    Full time
    Contract work
    Part time

    DataAnnotation

    New York, NY
    4 days ago
  • $8 - $65 per hour

     ...Overview Are you a Nahuatl language expert eager to shape the future of AI? Large‑scale language models are evolving from...  ...error traces, and suggest improvements to prompt engineering and evaluation metrics. Challenge advanced language models on contextual interpretation... 
    Hourly pay
    Contract work
    For contractors
    Immediate start
    Remote work

    Meridial Marketplace, by Invisible

    New York, NY
    2 days ago
  • $80 - $150 per hour

    Prolific in New York, NY, is seeking Medical Doctors to join our Expert Network for training and evaluating AI models. As a Domain Expert participant, you will review and rate AI-generated clinical responses, providing crucial feedback for AI development. You must hold... 
    Remote job
    Hourly pay
    Work from home
    Flexible hours

    Prolific

    New York, NY
    5 days ago
  • $55 per hour

    Freelance Mathematics Expert - AI Trainer 1 week ago Be among the first...  ...might typically: Generate prompts that challenge AI. Define comprehensive...  ...scoring criteria to evaluate the accuracy of the AI's...  ...theory. Your level of English is advanced (C1) or above. You are ready... 
    Part time
    Freelance
    Remote work

    Mindrift

    New York, NY
    2 days ago
  • $55 per hour

     ...ethically shape the future of AI. What We Do The Mindrift platform...  ...Responsibilities Generate prompts that challenge AI. Define...  ...comprehensive scoring criteria to evaluate the accuracy of the AI’s...  ...Mechanics. Your level of English is advanced (C1) or above. You are ready... 
    Part time
    Freelance
    Remote work

    Mindrift

    New York, NY
    2 days ago
  • A leading data collection platform is looking for AI Trainers fluent in Korean to evaluate AI models. Responsibilities include analyzing, editing, and providing feedback on tasks performed in Korean. The opportunity offers competitive rates, the ability to work remotely... 
    Remote job
    Flexible hours

    Prolific - UK Job Board?

    New York, NY
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Expert Prompt Curators for Advanced AI Evaluation Dataset. Be the first to apply!