Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Expert Prompt Curators for Advanced AI Evaluation Dataset

CloudDevs

Expert Prompt Curators for Advanced AI Evaluation Dataset This description is a summary of our understanding of the job description. Click on ‘Apply’ button to find out more. Role Description Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. Key Responsibilities Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. Test prompts against advanced AI models and document failures/successes. Provide reasoning steps and solutions for each prompt. Classify prompts into subject domains for dataset organization. Collaborate with reviewers for expert validation and prompt refinement. Qualifications Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. Experience in academic research, benchmarking, or test question design preferred. Attention to detail and ability to provide concise reasoning explanations. Familiarity with AI models and their limitations is a plus. Requirements Remote and asynchronous — set your own hours. Expected commitment: ~10–20 hours/week. Project duration: ~2 months, with possible extensions based on dataset needs. Opportunity to contribute to high-impact AI safety and evaluation research. Compensation & Contract Terms Competitive hourly compensation based on expertise. Independent contractor engagement. Payments for services rendered processed weekly via Stripe Connect. Application Process Submit your resume or CV highlighting your subject matter expertise. Complete a brief questionnaire about your background and areas of specialization. Selected applicants may be asked to draft a short test prompt. You’ll receive follow-up within a few days regarding next steps. #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Expert Prompt Curators for Advanced AI Evaluation Dataset in New York, NY vacancy
  • $20 - $23 per hour

     ...business process outsourcing, advanced professional services,...  ...in writing and evaluation (location is not a factor...  ...assigned. Work may include prompt creation, response evaluation...  ...Position We are looking for AI Writing Evaluators (Domain Experts) with strong analytical... 
    Suggested
    Hourly pay
    Full time
    Freelance
    Immediate start
    Remote work
    Monday to Friday
    3 days per week

    Volga Partners

    New York, NY
    1 day ago
  • $40 per hour

    A data and AI solutions company is seeking an advanced mathematician to join their team in a remote position. The role involves training AI models by evaluating their logic and solving complex mathematical problems. Candidates should have expertise in various fields like... 
    Suggested
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    New York, NY
    3 days ago
  • $30 per hour

     ...A leading AI data platform is seeking Advanced Urdu Speakers to participate in training and evaluating AI models. Successful candidates will complete tasks such as analyzing and writing in Urdu and assessing AI performance. This role offers competitive pay of $30 per... 
    Suggested
    Hourly pay
    Remote work
    Work from home
    Flexible hours

    Prolific

    New York, NY
    1 day ago
  •  ...education and research sector is seeking a Social Sciences PhD Expert for a remote part-time role, requiring strong analytical...  ...work. The position involves creating historically relevant prompts, evaluating AI outputs, and contributing to research initiatives. Ideal candidates... 
    Suggested
    Part time
    Remote work
    Flexible hours

    Crossing Hurdles

    New York, NY
    1 day ago
  •  ...A leading AI data services provider is seeking highly qualified mathematics experts for a remote role focused on evaluating and annotating mathematical content. The ideal candidate should...  ...own schedule, contributing to the advancement of AI capabilities in mathematics. #J... 
    Suggested
    Remote work
    Flexible hours

    HumanSignal

    New York, NY
    1 day ago
  • $30 per hour

     ...A data-driven AI research firm is seeking Advanced Mandarin Speakers to evaluate and train AI models. This remote role offers the opportunity to work flexible hours and earn approximately $30 per hour. Candidates must have a fluent command of Mandarin, possess strong... 
    Hourly pay
    Remote work
    Flexible hours

    Prolific

    New York, NY
    1 day ago
  • $30 per hour

     ...A leading AI training platform is seeking Advanced Japanese Speakers for a remote AI Trainer role. You will train and evaluate AI models by completing tasks in Japanese, with competitive pay rates around $30/hr. Ideal candidates should have advanced Japanese language... 
    Remote work
    Flexible hours

    Prolific - UK Job Board?

    New York, NY
    1 day ago
  •  ...Prolific Academic Ltd is looking for an AI Trainer with advanced French fluency to evaluate and enhance AI models. The role involves analyzing and editing tasks in French, and participants are compensated for their work. Ideal candidates will need a reliable internet connection... 
    Remote work
    Flexible hours

    Prolific Academic Ltd

    New York, NY
    1 day ago
  •  ...A leading AI training platform is seeking Advanced Japanese Speakers to train and evaluate AI models. Candidates will complete tasks such as analyzing and writing in Japanese...  .... Successful applicants join as Domain Expert participants, earning competitive pay for completed... 
    Remote work
    Work from home

    Prolific - UK Job Board?

    New York, NY
    1 day ago
  • $30 per hour

     ...A leading AI data platform is looking for Advanced Mandarin Speakers to help train and evaluate AI models. Successful candidates will complete tasks such as analyzing and writing in Mandarin. This role offers competitive pay at around $30/hr and allows for flexible hours... 
    Remote work
    Work from home
    Flexible hours

    Prolific

    New York, NY
    1 day ago
  • $100 - $120 per hour

     ...motivated Insurance Underwriting AI Expert to join our growing team....  ...domain expertise and advanced analytics, focusing on transforming...  ..., improving accuracy in evaluating applicant profiles, exposures...  ...historical data and external datasets to identify risk patterns, trends... 
    Hourly pay
    Contract work
    Part time
    Remote work

    Weekday AI (YC W21)

    New York, NY
    1 day ago
  •  ...Mercor is seeking a Hematology/Oncology Expert to work remotely on a contract basis. Key responsibilities will include designing clinical prompts, grading AI-generated responses, and providing high-quality feedback. Ideal candidates include board-certified attending physicians... 
    Contract work
    Remote work

    Mercor Inc

    New York, NY
    1 day ago
  • $40 per hour

     ...A technology company is seeking a Physics Expert to train AI models and evaluate their outputs. This role requires expertise in physics and a strong understanding of classical mechanics, thermodynamics, and quantum concepts. The position is remote and offers flexible... 
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    Brooklyn, NY
    4 days ago
  • $5,000 per month

     ...in touch. More about this AI for Sales course Sales is...  ...any code. You’ll practice prompt patterns for sales tasks, evaluate outputs for accuracy and tone...  ...what they need to advance in their field Record authentic...  ...content created by industry experts (that’s you!) featuring... 

    Ziplines

    New York, NY
    1 day ago
  •  ...A leading AI data platform is seeking individuals with Computer Science expertise to work as self-employed AI Trainers. This role involves completing tasks related to training and evaluating AI models. Successful applicants will have a strong understanding of programming... 
    Hourly pay
    Self employment
    Remote work
    Flexible hours

    Prolific - UK Job Board?

    New York, NY
    1 day ago
  • $70 - $75 per hour

     ...for a talented AWS Generative AI Expert This is a 06+ Months...  ...etc. Develop and optimize prompt engineering strategies for various...  ...Implement automated model evaluation frameworks. Design and maintain...  ...~ Essential Knowledge Areas-Advanced understanding of prompt... 
    Contract work
    Local area
    Immediate start

    Pyramid Consulting

    Jersey City, NJ
    3 days ago
  •  ...Are you passionate about AI and skilled at analyzing both text and multimedia content...  ...projects, including tasks such as prompt evaluation, video content understanding, text review...  ...around the world. Earn money. Have fun. Advance human knowledge. Work on diverse projects... 
    Freelance
    Local area
    Remote work

    LILT (Production)

    New York, NY
    1 day ago
  •  ...A technology firm in Pennsylvania is seeking a Physics Expert to train AI models by providing complex physics challenges and evaluating the outputs of these AI chatbots. You must have an expert-level knowledge of physics, particularly in areas like classical mechanics... 
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    New York, NY
    4 days ago
  •  ...A tech company focusing on AI research is looking for experienced Krita users for a flexible, project-based contract opportunity. This role allows you to earn while evaluating AI-generated content related to digital painting and concept art. Candidates should have at... 
    Contract work
    Remote work
    Flexible hours

    Handshake

    New York, NY
    3 hours ago
  • $73 per hour

     ...ethically shape the future of AI. What We Do The...  ...systems are tested and evaluated? What the role looks...  ...hand‑holding. Real expert complexity only. You’re...  ...From creating training prompts to refining model responses...  .... Work on advanced AI projects and gain valuable... 
    Permanent employment
    Part time
    Freelance
    Remote work
    Flexible hours

    Mind Rift

    Brooklyn, NY
    4 days ago
  • $73 per hour

     ...Quantitative Statistics Expert - Freelance AI Trainer 1 day ago Be among...  ...might typically: Generate prompts that challenge AI. Define...  ...scoring criteria to evaluate the accuracy of the AI's answers...  ...modeling. Level of English is advanced (C1) or above. Strong... 
    Part time
    Freelance
    Remote work

    Mind Rift

    New York, NY
    3 days ago
  • $8 - $65 per hour

     ...Are you a Mapudungun language expert eager to shape the future of AI? Large‑scale language models are...  ...cultural references. You’ll challenge advanced language models on topics such...  ...suggest improvements to our prompt engineering and evaluation metrics. A master’s degree in... 
    Hourly pay
    For contractors
    Freelance
    Immediate start
    Remote work

    Invisible Agency

    New York, NY
    1 day ago
  •  ...A leading data collection platform is looking for AI Trainers fluent in Korean to evaluate AI models. Responsibilities include analyzing, editing, and providing feedback on tasks performed in Korean. The opportunity offers competitive rates, the ability to work remotely... 
    Remote work
    Flexible hours

    Prolific - UK Job Board?

    New York, NY
    1 day ago
  • $100 - $120 per hour

     ...detail-oriented Insurance Claims Management AI Expert to join our growing team. This role sits...  ...claims processes using AI and advanced analytics. Your primary responsibilities...  ...automation Experience working with large datasets and building predictive models Preferred... 
    Hourly pay
    Contract work
    Part time
    Remote work

    Weekday AI

    New York, NY
    1 day ago
  • $30 per hour

     ...Prolific is looking for Advanced Mandarin Speakers to join as Domain Expert participants. You will assist in training and evaluating AI models while being part of a groundbreaking platform. Responsibilities include analyzing and editing Mandarin content and assessing AI... 
    Remote work
    Flexible hours

    Prolific

    New York, NY
    1 day ago
  •  ...seeking candidates for the Rubric Academy Fellowship, a remote opportunity that involves completing a self-paced Academy focused on AI evaluation principles. Ideal candidates will possess strong analytical skills and deep subject-matter expertise, enabling them to work... 
    Remote work

    Crossing Hurdles

    New York, NY
    1 day ago
  •  ...Position Physics Expert Type Hourly contract Location...  ...week Role Responsibilities Evaluate, curate, and annotate complex physics content for AI training datasets. Create precise prompts, responses, and...  ...training materials to reflect advancements in physics and AI practices... 
    Hourly pay
    Contract work
    Remote work

    Crossing Hurdles

    New York, NY
    1 day ago
  •  ...one of the world’s fastest-growing AI companies accelerating the advancement and deployment of powerful AI...  ...researchers to align problems with evaluation goals, especially in areas where models...  ...community knowledge in a new way. Experts add insights directly into each... 
    Hourly pay
    Contract work
    For contractors
    Freelance
    Remote work

    Turing Inc

    New York, NY
    1 day ago
  •  ...leveraging expertise in scientific visualization and fluid dynamics to evaluate AI-generated content. Ideal candidates have at least 3 years of...  ...for concurrent projects. Join a year-round program focused on advancing AI understanding in computational science. #J-18808-Ljbffr... 
    Contract work
    Remote work
    Flexible hours

    Handshake

    New York, NY
    1 day ago
  • $210k - $300k

     ...Description This Opportunity WSP is seeking an Advanced Air Mobility (AAM ) and Uncrewed Aircraft Systems (UAS) Expert to serve as a national technical authority,...  ...operations, and multimodal integration. Evaluate infrastructure readiness (vertiports, energy,... 
    Work at office
    Local area
    Flexible hours

    WSP

    New York, NY
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Expert Prompt Curators for Advanced AI Evaluation Dataset. Be the first to apply!