Expert Prompt Curators for Advanced AI Evaluation Dataset
CloudDevs
Expert Prompt Curators for Advanced AI Evaluation Dataset This description is a summary of our understanding of the job description. Click on ‘Apply’ button to find out more. Role Description Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. Key Responsibilities Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. Test prompts against advanced AI models and document failures/successes. Provide reasoning steps and solutions for each prompt. Classify prompts into subject domains for dataset organization. Collaborate with reviewers for expert validation and prompt refinement. Qualifications Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. Experience in academic research, benchmarking, or test question design preferred. Attention to detail and ability to provide concise reasoning explanations. Familiarity with AI models and their limitations is a plus. Requirements Remote and asynchronous — set your own hours. Expected commitment: ~10–20 hours/week. Project duration: ~2 months, with possible extensions based on dataset needs. Opportunity to contribute to high-impact AI safety and evaluation research. Compensation & Contract Terms Competitive hourly compensation based on expertise. Independent contractor engagement. Payments for services rendered processed weekly via Stripe Connect. Application Process Submit your resume or CV highlighting your subject matter expertise. Complete a brief questionnaire about your background and areas of specialization. Selected applicants may be asked to draft a short test prompt. You’ll receive follow-up within a few days regarding next steps. #J-18808-Ljbffr
- A leading AI research firm is seeking Expert Prompt Curators to design challenging prompts for evaluating advanced AI models. The role requires advanced knowledge in diverse fields and offers flexible hours, remote work, and a competitive hourly wage. Ideal candidates...SuggestedRemote jobHourly payTemporary workFlexible hours
$20 - $23 per hour
...business process outsourcing, advanced professional services,... ...in writing and evaluation (location is not a factor... ...assigned. Work may include prompt creation, response evaluation... ...Position We are looking for AI Writing Evaluators (Domain Experts) with strong analytical...SuggestedHourly payFull timeFreelanceImmediate startRemote workMonday to Friday3 days per week$150 per hour
...Modern MedEd is hiring Psychiatry experts to design clinical scenarios and evaluate AI-generated model outputs in healthcare. This role requires board certification... .... Responsibilities include creating realistic prompts, grading responses, and providing feedback for model...SuggestedRemote workFlexible hours$40 per hour
A technology firm is seeking an advanced mathematician with an advanced degree to help train AI models. Responsibilities include solving complex mathematical problems, evaluating AI performance, and ensuring quality. Candidates should have fluency in English, strong detail...SuggestedRemote jobHourly payFlexible hours- A leading educational organization is seeking a Linguistics Expert for a remote position to research and curate advanced linguistics materials into datasets for AI training. This role focuses on scientific accuracy and academic rigor, requiring a strong background in linguistics...SuggestedRemote jobHourly pay10 hours per week
$8 - $65 per hour
...a Bash Coding Specialist for a freelance AI Trainer project based in the United States... ...technical writing. You will challenge advanced language models using Bash to support AI... ...verification, error assessment, and improving prompt engineering. Ideal candidates have...Remote jobHourly payFreelance- Prolific in New York, NY, is seeking Chemistry Experts and Chemical Engineers to join its... .... In this role, you will help train and evaluate AI models using your chemical expertise.... ...reactions. Ideal candidates hold a relevant advanced degree and possess deep chemistry...Remote jobWork from homeFlexible hours
- ...education and research sector is seeking a Social Sciences PhD Expert for a remote part-time role, requiring strong analytical... ...work. The position involves creating historically relevant prompts, evaluating AI outputs, and contributing to research initiatives. Ideal candidates...Remote jobPart timeFlexible hours
- ...SME Careers is seeking a remote Data Scientist to contribute to AI training content and ensure model integrity. The role involves developing AI prompts, evaluating AI responses, and testing for model reliability. Ideal candidates will have a degree in a quantitative field...Hourly payFor contractorsRemote work
- ...A leading AI training platform is seeking Advanced Japanese Speakers to train and evaluate AI models. Candidates will complete tasks such as analyzing and writing in Japanese... .... Successful applicants join as Domain Expert participants, earning competitive pay for completed...Remote workWork from home
$30 per hour
A leading AI training platform is seeking Advanced Japanese Speakers for a remote AI Trainer role. You will train and evaluate AI models by completing tasks in Japanese, with competitive pay rates around $30/hr. Ideal candidates should have advanced Japanese language skills...Remote jobFlexible hours$175k - $250k
...changing that. We're building the AI-native financial platform... ...health systems and medical groups evaluate risk, price contracts,... ...Build proprietary benchmarks and datasets to evaluate models and AI Agents... ...exposure to AI/ML concepts or prompt engineering Experience at a high...Full timeContract workWork at officeShift work$175k - $250k
...changing that. We're building the agentic AI platform designed exclusively for... ...workflows Build proprietary benchmarks and datasets to evaluate models and AI Agents against real-world... ...Prior exposure to AI/ML concepts or prompt engineering Anticipated compensation: $...Full timeWork at office$8 - $65 per hour
...Prolific is hiring Mental Health Professionals in New York to train and evaluate AI models. As a Domain Expert, you will be responsible for reviewing AI-generated responses, completing tasks related to psychology, and improving AI models based on your expertise. Pay rates...Hourly payRemote workWork from homeFlexible hours$100 - $120 per hour
...motivated Insurance Underwriting AI Expert to join our growing team.... ...domain expertise and advanced analytics, focusing on transforming... ..., improving accuracy in evaluating applicant profiles, exposures... ...historical data and external datasets to identify risk patterns, trends...Hourly payContract workPart timeRemote work$5,000 per month
...in touch. More about this AI for Sales course Sales is... ...any code. You’ll practice prompt patterns for sales tasks, evaluate outputs for accuracy and tone... ...what they need to advance in their field Record authentic... ...content created by industry experts (that’s you!) featuring...- Freelancer - Biology Expert for GenAI Prompts Review About the Role: We are seeking... ...involving Generative AI (GenAI) by creating biology-... ...dangerous CBRN-related outputs). Evaluate AI-generated responses to... ...evaluations. Requirements Advanced degree in Biology (Ph.D. preferred...FreelanceRemote work
- Obsidian is seeking experienced retail banking professionals to evaluate and enhance AI systems focused on consumer banking operations. You will review AI outputs, create scenarios based on banking workflows, and collaborate with research teams. The ideal candidate has...
$40 per hour
A technology company is seeking a Physics Expert to train AI models and evaluate their outputs. This role requires expertise in physics and a strong understanding of classical mechanics, thermodynamics, and quantum concepts. The position is remote and offers flexible work...Remote jobHourly payFlexible hours- A leading research services provider is seeking a Chemistry Expert (PhD) to evaluate complex chemistry problems and review AI-generated outputs for accuracy. This remote, hourly contract role requires deep subject-matter expertise and excellent communication skills. Candidates...Remote jobHourly payContract workFlexible hours
- A leading AI data platform is seeking individuals with Computer Science expertise to work as self-employed AI Trainers. This role involves completing tasks related to training and evaluating AI models. Successful applicants will have a strong understanding of programming...Remote jobHourly paySelf employmentFlexible hours
- ...A tech company focusing on AI research is looking for experienced Krita users for a flexible, project-based contract opportunity. This role allows you to earn while evaluating AI-generated content related to digital painting and concept art. Candidates should have at...Contract workRemote workFlexible hours
- A technology firm in Pennsylvania is seeking a Physics Expert to train AI models by providing complex physics challenges and evaluating the outputs of these AI chatbots. You must have an expert-level knowledge of physics, particularly in areas like classical mechanics...Remote jobHourly payFlexible hours
- Transperfect is looking for doctoral-level experts in STEM disciplines, especially Physics, to contribute to a high-impact AI training project. You will design complex prompts aimed at testing Large Language Models (LLMs) and generate thorough solutions while utilizing...
- A leading AI training company is seeking a Biology Instructor to improve AI models by evaluating responses to complex biology queries. The position allows for full-time or part-time hours, with the flexibility to choose projects and work remotely. Ideal candidates should...Remote jobHourly payFull timeContract workPart time
$8 - $65 per hour
...Overview Are you a Nahuatl language expert eager to shape the future of AI? Large‑scale language models are evolving from... ...error traces, and suggest improvements to prompt engineering and evaluation metrics. Challenge advanced language models on contextual interpretation...Hourly payContract workFor contractorsImmediate startRemote work$80 - $150 per hour
Prolific in New York, NY, is seeking Medical Doctors to join our Expert Network for training and evaluating AI models. As a Domain Expert participant, you will review and rate AI-generated clinical responses, providing crucial feedback for AI development. You must hold...Remote jobHourly payWork from homeFlexible hours$55 per hour
Freelance Mathematics Expert - AI Trainer 1 week ago Be among the first... ...might typically: Generate prompts that challenge AI. Define comprehensive... ...scoring criteria to evaluate the accuracy of the AI's... ...theory. Your level of English is advanced (C1) or above. You are ready...Part timeFreelanceRemote work$55 per hour
...ethically shape the future of AI. What We Do The Mindrift platform... ...Responsibilities Generate prompts that challenge AI. Define... ...comprehensive scoring criteria to evaluate the accuracy of the AI’s... ...Mechanics. Your level of English is advanced (C1) or above. You are ready...Part timeFreelanceRemote work- A leading data collection platform is looking for AI Trainers fluent in Korean to evaluate AI models. Responsibilities include analyzing, editing, and providing feedback on tasks performed in Korean. The opportunity offers competitive rates, the ability to work remotely...Remote jobFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Expert Prompt Curators for Advanced AI Evaluation Dataset. Be the first to apply!

