Expert Prompt Curators for Advanced AI Evaluation Dataset

CloudDevs

Role Description
Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation.

Key Responsibilities
Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution).
Ensure prompts are objective, self-contained, and yield clear, unambiguous answers.
Test prompts against advanced AI models and document failures and successes.
Provide reasoning steps and solutions for each prompt.
Classify prompts into subject domains for dataset organization.
Collaborate with reviewers for expert validation and prompt refinement.

Qualifications
Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.).
Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references.
Experience in academic research, benchmarking, or test question design preferred.
Attention to detail and ability to provide concise reasoning explanations.
Familiarity with AI models and their limitations is a plus.

Requirements
Remote and asynchronous; set your own hours.
Expected commitment: ~10–20 hours/week.
Project duration: ~2 months, with possible extensions based on dataset needs.
Opportunity to contribute to high-impact AI safety and evaluation research.

Compensation & Contract Terms
Competitive hourly compensation based on expertise.
Independent contractor engagement.
Payments for services rendered are processed weekly via Stripe Connect.

Application Process
Submit your resume or CV highlighting your subject matter expertise.
Complete a brief questionnaire about your background and areas of specialization.
Selected applicants may be asked to draft a short test prompt.
You’ll receive a follow-up within a few days regarding next steps.

Vacancy posted 2 days ago

Similar jobs that could be interesting for you
Based on the Expert Prompt Curators for Advanced AI Evaluation Dataset in New York, NY vacancy

Remote Expert Prompt Curator for AI Evaluation Dataset
A leading AI research firm is seeking Expert Prompt Curators to design challenging prompts for evaluating advanced AI models. The role requires advanced knowledge in diverse fields and offers flexible hours, remote work, and a competitive hourly wage. Ideal candidates...
Suggested
Remote job
Hourly pay
Temporary work
Flexible hours
CloudDevs
New York, NY
2 days ago
AI Writing Evaluators (Domain Experts) - English Expertise
$20 - $23 per hour
...business process outsourcing, advanced professional services,... ...in writing and evaluation (location is not a factor... ...assigned. Work may include prompt creation, response evaluation... ...Position We are looking for AI Writing Evaluators (Domain Experts) with strong analytical...
Suggested
Hourly pay
Full time
Freelance
Immediate start
Remote work
Monday to Friday
3 days per week
Volga Partners
New York, NY
2 days ago
Remote Psychiatry Expert for AI Model Evaluation
$150 per hour
...Modern MedEd is hiring Psychiatry experts to design clinical scenarios and evaluate AI-generated model outputs in healthcare. This role requires board certification... .... Responsibilities include creating realistic prompts, grading responses, and providing feedback for model...
Suggested
Remote work
Flexible hours
Modern MedEd
New York, NY
2 days ago
Remote AI Mathematics Expert — Train & Evaluate Bots
$40 per hour
A technology firm is seeking an advanced mathematician with an advanced degree to help train AI models. Responsibilities include solving complex mathematical problems, evaluating AI performance, and ensuring quality. Candidates should have fluency in English, strong detail...
Suggested
Remote job
Hourly pay
Flexible hours
DataAnnotation
New York, NY
1 day ago
Remote Linguistics Expert for AI Dataset Curation
A leading educational organization is seeking a Linguistics Expert for a remote position to research and curate advanced linguistics materials into datasets for AI training. This role focuses on scientific accuracy and academic rigor, requiring a strong background in linguistics...
Suggested
Remote job
Hourly pay
10 hours per week
Crossing Hurdles
New York, NY
2 days ago
Remote Bash Expert for AI Training & Evaluation
$8 - $65 per hour
...a Bash Coding Specialist for a freelance AI Trainer project based in the United States... ...technical writing. You will challenge advanced language models using Bash to support AI... ...verification, error assessment, and improving prompt engineering. Ideal candidates have...
Remote job
Hourly pay
Freelance
Meridial
New York, NY
4 days ago
Remote Chemistry Expert — AI Training & Evaluation
Prolific in New York, NY, is seeking Chemistry Experts and Chemical Engineers to join its... .... In this role, you will help train and evaluate AI models using your chemical expertise.... ...reactions. Ideal candidates hold a relevant advanced degree and possess deep chemistry...
Remote job
Work from home
Flexible hours
Prolific
New York, NY
4 days ago
Remote PhD Social Sciences Expert — AI Prompt Specialist
...education and research sector is seeking a Social Sciences PhD Expert for a remote part-time role, requiring strong analytical... ...work. The position involves creating historically relevant prompts, evaluating AI outputs, and contributing to research initiatives. Ideal candidates...
Remote job
Part time
Flexible hours
Crossing Hurdles
New York, NY
2 days ago
Remote Data Scientist: AI Training & Evaluation Expert
...SME Careers is seeking a remote Data Scientist to contribute to AI training content and ensure model integrity. The role involves developing AI prompts, evaluating AI responses, and testing for model reliability. Ideal candidates will have a degree in a quantitative field...
Hourly pay
For contractors
Remote work
SME Careers
New York, NY
14 hours ago
Remote AI Trainer (Advanced Japanese) Domain Expert
...A leading AI training platform is seeking Advanced Japanese Speakers to train and evaluate AI models. Candidates will complete tasks such as analyzing and writing in Japanese... .... Successful applicants join as Domain Expert participants, earning competitive pay for completed...
Remote work
Work from home
Prolific - UK Job Board?
New York, NY
2 days ago
Remote AI Trainer (Advanced Japanese) - Domain Expert
$30 per hour
A leading AI training platform is seeking Advanced Japanese Speakers for a remote AI Trainer role. You will train and evaluate AI models by completing tasks in Japanese, with competitive pay rates around $30/hr. Ideal candidates should have advanced Japanese language skills...
Remote job
Flexible hours
Prolific - UK Job Board?
New York, NY
2 days ago
Healthcare Actuarial Science Domain Expert, Applied AI
$175k - $250k
...changing that. We're building the AI-native financial platform... ...health systems and medical groups evaluate risk, price contracts,... ...Build proprietary benchmarks and datasets to evaluate models and AI Agents... ...exposure to AI/ML concepts or prompt engineering Experience at a high...
Full time
Contract work
Work at office
Shift work
Translucent AI
New York, NY
3 days ago
Healthcare FP&A Domain Expert, Applied AI
$175k - $250k
...changing that. We're building the agentic AI platform designed exclusively for... ...workflows Build proprietary benchmarks and datasets to evaluate models and AI Agents against real-world... ...Prior exposure to AI/ML concepts or prompt engineering Anticipated compensation: $...
Full time
Work at office
Translucent AI
New York, NY
3 days ago
Remote Mental Health Expert for AI Training & Evaluation
$8 - $65 per hour
...Prolific is hiring Mental Health Professionals in New York to train and evaluate AI models. As a Domain Expert, you will be responsible for reviewing AI-generated responses, completing tasks related to psychology, and improving AI models based on your expertise. Pay rates...
Hourly pay
Remote work
Work from home
Flexible hours
Prolific
New York, NY
4 days ago
Insurance Underwriting AI Expert
$100 - $120 per hour
...motivated Insurance Underwriting AI Expert to join our growing team.... ...domain expertise and advanced analytics, focusing on transforming... ..., improving accuracy in evaluating applicant profiles, exposures... ...historical data and external datasets to identify risk patterns, trends...
Hourly pay
Contract work
Part time
Remote work
Weekday AI (YC W21)
New York, NY
2 days ago
Subject Matter Expert (SME) - AI for Sales (Early 2026)
$5,000 per month
...in touch. More about this AI for Sales course Sales is... ...any code. You’ll practice prompt patterns for sales tasks, evaluate outputs for accuracy and tone... ...what they need to advance in their field Record authentic... ...content created by industry experts (that’s you!) featuring...
Ziplines
New York, NY
2 days ago
Freelancer - CBRN Biology Expert for GenAI Prompts Review
Freelancer - Biology Expert for GenAI Prompts Review About the Role: We are seeking... ...involving Generative AI (GenAI) by creating biology-... ...dangerous CBRN-related outputs). Evaluate AI-generated responses to... ...evaluations. Requirements Advanced degree in Biology (Ph.D. preferred...
Freelance
Remote work
ActiveFence
New York, NY
2 days ago
Retail Banking Compliance Expert: AI Data Evaluation
Obsidian is seeking experienced retail banking professionals to evaluate and enhance AI systems focused on consumer banking operations. You will review AI outputs, create scenarios based on banking workflows, and collaborate with research teams. The ideal candidate has...
Obsidian
New York, NY
2 days ago
Remote Physics Expert for AI Training & Evaluation
$40 per hour
A technology company is seeking a Physics Expert to train AI models and evaluate their outputs. This role requires expertise in physics and a strong understanding of classical mechanics, thermodynamics, and quantum concepts. The position is remote and offers flexible work...
Remote job
Hourly pay
Flexible hours
DataAnnotation
Brooklyn, NY
3 days ago
Remote Chemistry Expert for AI Research and Evaluation
A leading research services provider is seeking a Chemistry Expert (PhD) to evaluate complex chemistry problems and review AI-generated outputs for accuracy. This remote, hourly contract role requires deep subject-matter expertise and excellent communication skills. Candidates...
Remote job
Hourly pay
Contract work
Flexible hours
Crossing Hurdles
New York, NY
2 days ago
Remote AI Domain Expert: Train & Evaluate Models
A leading AI data platform is seeking individuals with Computer Science expertise to work as self-employed AI Trainers. This role involves completing tasks related to training and evaluating AI models. Successful applicants will have a strong understanding of programming...
Remote job
Hourly pay
Self employment
Flexible hours
Prolific - UK Job Board?
New York, NY
2 days ago
Krita Expert & AI Content Evaluator - Remote
...A tech company focusing on AI research is looking for experienced Krita users for a flexible, project-based contract opportunity. This role allows you to earn while evaluating AI-generated content related to digital painting and concept art. Candidates should have at...
Contract work
Remote work
Flexible hours
Handshake
New York, NY
1 day ago
Remote Physics Expert for AI Model Evaluation
A technology firm in Pennsylvania is seeking a Physics Expert to train AI models by providing complex physics challenges and evaluating the outputs of these AI chatbots. You must have an expert-level knowledge of physics, particularly in areas like classical mechanics...
Remote job
Hourly pay
Flexible hours
DataAnnotation
New York, NY
2 days ago
PhD Physics Expert — AI Prompt Architect for Hard Problems
Transperfect is looking for doctoral-level experts in STEM disciplines, especially Physics, to contribute to a high-impact AI training project. You will design complex prompts aimed at testing Large Language Models (LLMs) and generate thorough solutions while utilizing...
Transperfect
New York, NY
3 days ago
Remote Biology Expert - AI Model Evaluation & Training
A leading AI training company is seeking a Biology Instructor to improve AI models by evaluating responses to complex biology queries. The position allows for full-time or part-time hours, with the flexibility to choose projects and work remotely. Ideal candidates should...
Remote job
Hourly pay
Full time
Contract work
Part time
DataAnnotation
New York, NY
4 days ago
Nahuatl Language Expert - AI Trainer
$8 - $65 per hour
...Overview Are you a Nahuatl language expert eager to shape the future of AI? Large‑scale language models are evolving from... ...error traces, and suggest improvements to prompt engineering and evaluation metrics. Challenge advanced language models on contextual interpretation...
Hourly pay
Contract work
For contractors
Immediate start
Remote work
Meridial Marketplace, by Invisible
New York, NY
2 days ago
Medical Doctor - AI Training & Evaluation Expert (Remote)
$80 - $150 per hour
Prolific in New York, NY, is seeking Medical Doctors to join our Expert Network for training and evaluating AI models. As a Domain Expert participant, you will review and rate AI-generated clinical responses, providing crucial feedback for AI development. You must hold...
Remote job
Hourly pay
Work from home
Flexible hours
Prolific
New York, NY
5 days ago
Freelance Mathematics Expert - AI Trainer
$55 per hour
Freelance Mathematics Expert - AI Trainer 1 week ago Be among the first... ...might typically: Generate prompts that challenge AI. Define comprehensive... ...scoring criteria to evaluate the accuracy of the AI's... ...theory. Your level of English is advanced (C1) or above. You are ready...
Part time
Freelance
Remote work
Mindrift
New York, NY
2 days ago
Freelance Physics Expert - AI Trainer
$55 per hour
...ethically shape the future of AI. What We Do The Mindrift platform... ...Responsibilities Generate prompts that challenge AI. Define... ...comprehensive scoring criteria to evaluate the accuracy of the AI’s... ...Mechanics. Your level of English is advanced (C1) or above. You are ready...
Part time
Freelance
Remote work
Mindrift
New York, NY
2 days ago
Remote AI Trainer - Advanced Korean (Domain Expert)
A leading data collection platform is looking for AI Trainers fluent in Korean to evaluate AI models. Responsibilities include analyzing, editing, and providing feedback on tasks performed in Korean. The opportunity offers competitive rates, the ability to work remotely...
Remote job
Flexible hours
Prolific - UK Job Board?
New York, NY
2 days ago
