Expert Prompt Curators for Advanced AI Evaluation Dataset
CloudDevs
Expert Prompt Curators for Advanced AI Evaluation Dataset This description is a summary of our understanding of the job description. Click on ‘Apply’ button to find out more. Role Description Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. Key Responsibilities Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. Test prompts against advanced AI models and document failures/successes. Provide reasoning steps and solutions for each prompt. Classify prompts into subject domains for dataset organization. Collaborate with reviewers for expert validation and prompt refinement. Qualifications Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. Experience in academic research, benchmarking, or test question design preferred. Attention to detail and ability to provide concise reasoning explanations. Familiarity with AI models and their limitations is a plus. Requirements Remote and asynchronous — set your own hours. Expected commitment: ~10–20 hours/week. Project duration: ~2 months, with possible extensions based on dataset needs. Opportunity to contribute to high-impact AI safety and evaluation research. Compensation & Contract Terms Competitive hourly compensation based on expertise. Independent contractor engagement. Payments for services rendered processed weekly via Stripe Connect. Application Process Submit your resume or CV highlighting your subject matter expertise. Complete a brief questionnaire about your background and areas of specialization. Selected applicants may be asked to draft a short test prompt. You’ll receive follow-up within a few days regarding next steps. #J-18808-Ljbffr
$20 - $23 per hour
...business process outsourcing, advanced professional services,... ...in writing and evaluation (location is not a factor... ...assigned. Work may include prompt creation, response evaluation... ...Position We are looking for AI Writing Evaluators (Domain Experts) with strong analytical...SuggestedHourly payFull timeFreelanceImmediate startRemote workMonday to Friday3 days per week$40 per hour
A data and AI solutions company is seeking an advanced mathematician to join their team in a remote position. The role involves training AI models by evaluating their logic and solving complex mathematical problems. Candidates should have expertise in various fields like...SuggestedHourly payRemote workFlexible hours$30 per hour
...A leading AI data platform is seeking Advanced Urdu Speakers to participate in training and evaluating AI models. Successful candidates will complete tasks such as analyzing and writing in Urdu and assessing AI performance. This role offers competitive pay of $30 per...SuggestedHourly payRemote workWork from homeFlexible hours- ...education and research sector is seeking a Social Sciences PhD Expert for a remote part-time role, requiring strong analytical... ...work. The position involves creating historically relevant prompts, evaluating AI outputs, and contributing to research initiatives. Ideal candidates...SuggestedPart timeRemote workFlexible hours
- ...A leading AI data services provider is seeking highly qualified mathematics experts for a remote role focused on evaluating and annotating mathematical content. The ideal candidate should... ...own schedule, contributing to the advancement of AI capabilities in mathematics. #J...SuggestedRemote workFlexible hours
$30 per hour
...A data-driven AI research firm is seeking Advanced Mandarin Speakers to evaluate and train AI models. This remote role offers the opportunity to work flexible hours and earn approximately $30 per hour. Candidates must have a fluent command of Mandarin, possess strong...Hourly payRemote workFlexible hours$30 per hour
...A leading AI training platform is seeking Advanced Japanese Speakers for a remote AI Trainer role. You will train and evaluate AI models by completing tasks in Japanese, with competitive pay rates around $30/hr. Ideal candidates should have advanced Japanese language...Remote workFlexible hours- ...Prolific Academic Ltd is looking for an AI Trainer with advanced French fluency to evaluate and enhance AI models. The role involves analyzing and editing tasks in French, and participants are compensated for their work. Ideal candidates will need a reliable internet connection...Remote workFlexible hours
- ...A leading AI training platform is seeking Advanced Japanese Speakers to train and evaluate AI models. Candidates will complete tasks such as analyzing and writing in Japanese... .... Successful applicants join as Domain Expert participants, earning competitive pay for completed...Remote workWork from home
$30 per hour
...A leading AI data platform is looking for Advanced Mandarin Speakers to help train and evaluate AI models. Successful candidates will complete tasks such as analyzing and writing in Mandarin. This role offers competitive pay at around $30/hr and allows for flexible hours...Remote workWork from homeFlexible hours$100 - $120 per hour
...motivated Insurance Underwriting AI Expert to join our growing team.... ...domain expertise and advanced analytics, focusing on transforming... ..., improving accuracy in evaluating applicant profiles, exposures... ...historical data and external datasets to identify risk patterns, trends...Hourly payContract workPart timeRemote work- ...Mercor is seeking a Hematology/Oncology Expert to work remotely on a contract basis. Key responsibilities will include designing clinical prompts, grading AI-generated responses, and providing high-quality feedback. Ideal candidates include board-certified attending physicians...Contract workRemote work
$40 per hour
...A technology company is seeking a Physics Expert to train AI models and evaluate their outputs. This role requires expertise in physics and a strong understanding of classical mechanics, thermodynamics, and quantum concepts. The position is remote and offers flexible...Hourly payRemote workFlexible hours$5,000 per month
...in touch. More about this AI for Sales course Sales is... ...any code. You’ll practice prompt patterns for sales tasks, evaluate outputs for accuracy and tone... ...what they need to advance in their field Record authentic... ...content created by industry experts (that’s you!) featuring...- ...A leading AI data platform is seeking individuals with Computer Science expertise to work as self-employed AI Trainers. This role involves completing tasks related to training and evaluating AI models. Successful applicants will have a strong understanding of programming...Hourly paySelf employmentRemote workFlexible hours
$70 - $75 per hour
...for a talented AWS Generative AI Expert This is a 06+ Months... ...etc. Develop and optimize prompt engineering strategies for various... ...Implement automated model evaluation frameworks. Design and maintain... ...~ Essential Knowledge Areas-Advanced understanding of prompt...Contract workLocal areaImmediate start- ...Are you passionate about AI and skilled at analyzing both text and multimedia content... ...projects, including tasks such as prompt evaluation, video content understanding, text review... ...around the world. Earn money. Have fun. Advance human knowledge. Work on diverse projects...FreelanceLocal areaRemote work
- ...A technology firm in Pennsylvania is seeking a Physics Expert to train AI models by providing complex physics challenges and evaluating the outputs of these AI chatbots. You must have an expert-level knowledge of physics, particularly in areas like classical mechanics...Hourly payRemote workFlexible hours
- ...A tech company focusing on AI research is looking for experienced Krita users for a flexible, project-based contract opportunity. This role allows you to earn while evaluating AI-generated content related to digital painting and concept art. Candidates should have at...Contract workRemote workFlexible hours
$73 per hour
...ethically shape the future of AI. What We Do The... ...systems are tested and evaluated? What the role looks... ...hand‑holding. Real expert complexity only. You’re... ...From creating training prompts to refining model responses... .... Work on advanced AI projects and gain valuable...Permanent employmentPart timeFreelanceRemote workFlexible hours$73 per hour
...Quantitative Statistics Expert - Freelance AI Trainer 1 day ago Be among... ...might typically: Generate prompts that challenge AI. Define... ...scoring criteria to evaluate the accuracy of the AI's answers... ...modeling. Level of English is advanced (C1) or above. Strong...Part timeFreelanceRemote work$8 - $65 per hour
...Are you a Mapudungun language expert eager to shape the future of AI? Large‑scale language models are... ...cultural references. You’ll challenge advanced language models on topics such... ...suggest improvements to our prompt engineering and evaluation metrics. A master’s degree in...Hourly payFor contractorsFreelanceImmediate startRemote work- ...A leading data collection platform is looking for AI Trainers fluent in Korean to evaluate AI models. Responsibilities include analyzing, editing, and providing feedback on tasks performed in Korean. The opportunity offers competitive rates, the ability to work remotely...Remote workFlexible hours
$100 - $120 per hour
...detail-oriented Insurance Claims Management AI Expert to join our growing team. This role sits... ...claims processes using AI and advanced analytics. Your primary responsibilities... ...automation Experience working with large datasets and building predictive models Preferred...Hourly payContract workPart timeRemote work$30 per hour
...Prolific is looking for Advanced Mandarin Speakers to join as Domain Expert participants. You will assist in training and evaluating AI models while being part of a groundbreaking platform. Responsibilities include analyzing and editing Mandarin content and assessing AI...Remote workFlexible hours- ...seeking candidates for the Rubric Academy Fellowship, a remote opportunity that involves completing a self-paced Academy focused on AI evaluation principles. Ideal candidates will possess strong analytical skills and deep subject-matter expertise, enabling them to work...Remote work
- ...Position Physics Expert Type Hourly contract Location... ...week Role Responsibilities Evaluate, curate, and annotate complex physics content for AI training datasets. Create precise prompts, responses, and... ...training materials to reflect advancements in physics and AI practices...Hourly payContract workRemote work
- ...one of the world’s fastest-growing AI companies accelerating the advancement and deployment of powerful AI... ...researchers to align problems with evaluation goals, especially in areas where models... ...community knowledge in a new way. Experts add insights directly into each...Hourly payContract workFor contractorsFreelanceRemote work
- ...leveraging expertise in scientific visualization and fluid dynamics to evaluate AI-generated content. Ideal candidates have at least 3 years of... ...for concurrent projects. Join a year-round program focused on advancing AI understanding in computational science. #J-18808-Ljbffr...Contract workRemote workFlexible hours
$210k - $300k
...Description This Opportunity WSP is seeking an Advanced Air Mobility (AAM ) and Uncrewed Aircraft Systems (UAS) Expert to serve as a national technical authority,... ...operations, and multimodal integration. Evaluate infrastructure readiness (vertiports, energy,...Work at officeLocal areaFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Expert Prompt Curators for Advanced AI Evaluation Dataset. Be the first to apply!
- subject matter expert senior New York, NY
- technology expert New York, NY
- sql expert New York, NY
- fulfillment expert New York, NY
- subject matter expert New York, NY
- guest service support expert New York, NY
- subject matter expert work from home New York, NY
- curator New York, NY
- music curator New York, NY
- data curator New York, NY

