Lead AI Research Trainer for Frontier-Model Evaluation
Mercor
Mercor is looking for advanced STEM researchers to join a leading AI lab’s research team. This position requires a PhD in a relevant STEM field and 3+ years of experience. You will guide research teams, design rigorous domain tasks, and collaborate with other experts while working 40 hours per week. The role involves evaluating tasks and providing technical feedback to enhance AI model accuracy. This is a W-2 employment position through Cincinnatus LLC, offering benefits and compliance support.
$50 - $60 per hour
...DataAnnotation is committed to creating high-quality AI. Join our team to help train the... ...new area – teaching AI Assistant models to be a more useful tool for finance professionals... ...diverse and complex problems and evaluate their outputs Evaluate the quality produced...
Suggested, Hourly pay, Full time, Contract work, Part time, Work experience placement, Remote work
$50 - $60 per hour
...Annotation is committed to creating high-quality AI. Join our team to help train the... ...exciting new area – teaching AI Assistant models to be a more useful tool for finance professionals... ...diverse and complex problems and evaluate their outputs Evaluate the quality produced...
Suggested, Hourly pay, Full time, Contract work, Part time, Work experience placement, Remote work
$50 per hour
...seeking a Spanish (Spain) Audio Generalist Evaluator Expert to contribute to a high-impact audio AI research project with a leading research lab. In this role, you will work on... ...help train and benchmark advanced language models. This is a short-term, structured...
Suggested, Remote job, Hourly pay, Temporary work, 10 hours per week
$400 per month
Mercor is seeking experienced frontend engineers for a unique opportunity to work with AI coding agents for complex tasks. The role focuses on evaluating model-generated frontend implementations and requires 2+ years of frontend engineering experience, particularly with...
Suggested, Remote job, Flexible hours
- Mercor is seeking advanced LLM power users to help assess how well AI systems manage personalized real-world tasks. This role requires... ...will have experience using AI for personal workflows and strong evaluative skills, particularly with respect to identifying AI outputs'...
Suggested, Part time
- Mercor is looking for advanced LLM power users in California to evaluate AI systems' handling of personal tasks. This role involves creating written responses for complex scenarios and judging AI outputs on their usefulness and personalization. Applicants should have significant...
- Mercor is looking for advanced LLM power users in Perris, California, to evaluate AI tools in real-world tasks. This role requires experience with AI products and the ability to craft responses for complex personal tasks. You will judge outputs for practicality and write...
$130 - $170 per hour
...recruiting UK-based Legal Experts for a research project with a leading foundational model AI lab. Ideal candidates have... ...high-quality training data for frontier AI systems. You are available to... ...history screen Short assessment evaluating law expertise...
Remote job, Hourly pay, Odd job
$100 - $150 per hour
...the Role Mercor is partnering with a leading AI lab to train frontier models on high-quality legal reasoning... ...design realistic workplace scenarios, evaluate model outputs against established... ...rubrics Provide written feedback the research team uses to improve model behavior...
Remote job, Hourly pay, Work at office, Immediate start
- Mercor is seeking health insurance professionals to evaluate AI systems in various areas such as plan design and coverage determination. The role includes reviewing AI-generated outputs and providing structured feedback based on health insurance regulations and workflows...
Remote job, Hourly pay
- Mercor is seeking AI & Data Science SMEs to develop advanced Large Language Models. This W-2 position with Cincinnatus LLC requires expertise... ...week during weekdays. You will guide AI research teams, design complex tasks, and evaluate AI outputs. The ideal candidate has a...
Weekday work
$50 - $60 per hour
...DataAnnotation is committed to creating high-quality AI. Join our team to help train the... ...new area – teaching AI Assistant models to be a more useful tool for finance professionals... ...diverse and complex problems and evaluate their outputs Evaluate the quality produced...
Hourly pay, Full time, Contract work, Part time, Work experience placement, Remote work
- Mercor is looking for experienced telecommunications professionals to improve AI systems in network architecture and telecom operations. The role involves evaluating AI outputs, creating realistic telecom scenarios, and providing structured feedback on accuracy. The ideal...
$70 - $100 per hour
Mercor is seeking an experienced AI Trainer for STEM Computational Scientific Software & Evaluation Design in Seismology & Geophysics. The role involves designing computational problems that require advanced use of specific software libraries. Candidates should possess...
Remote job, Hourly pay
- ...the safe, and provides change as requested to cash registers. Models and delivers a distinctive and delightful customer experience.... ...empty package). Assists manager or assistant store manager in evaluating and developing displays, including promotional, seasonal, super...
Work experience placement, Seasonal work, Local area, Flexible hours, Shift work, Afternoon shift
$1,150 - $1,450 per month
...Overview Mercor is partnering with leading AI labs to engage experienced... ...enterprise communications to evaluate and enhance AI performance on... ...asynchronously with research teams to refine evaluation frameworks... ...to an hourly compensation model based on sustained quality and...
Remote job, Hourly pay, Night shift
- ...Engineer for a focused 3-4 week remote engagement assessing the feasibility of a computer vision system. This role includes benchmarking model performance and translating findings for a non-technical audience. The ideal candidate has 5+ years of experience in computer vision...
Remote job
- Mercor is seeking a professional to evaluate Assamese AI-generated responses and identify areas for improvement. The role requires native fluency... ...Ideal candidates will have experience using large language models and strong analytical skills to ensure high-quality...
Contract work
- ...applying to job listings, taking AI‑led interviews, receiving... ..., separate UX friction, model edge cases, and system... ...intersection of labor markets and AI research. We partner with leading AI labs and enterprises to... ...vast talent network trains frontier AI models in the same way...
Remote job, Hourly pay
$400 per month
Mercor is looking for contributors to support a Frontier Code Agents project with a leading AI research lab. The role focuses on evaluating and improving AI coding models through structured technical assessments. The contributors will work on infrastructure engineering...
- ...Job Description Lead Teacher I Summary: The Lead Teacher I... ...always ready for the children Review and evaluate the performance of the Associate Teacher(... ...conduct themselves with integrity and model ethical conduct for themselves and others...
Work at office, Flexible hours
- Mercor is seeking AI & Data Science experts to join a cutting-edge team... ...of advanced Large Language Models. You will guide research teams, develop evaluation frameworks, and enhance the quality... ...40 hours per week, working at a leading AI lab within a collaborative team...
- Mercor is looking for an expert to join their AI lab's GenAI team. This role involves guiding research, designing tasks in machine learning, and evaluating AI agent outputs. Candidates should have over 3 years of experience in a STEM field, and strong written communication...
Full time
$50 - $60 per hour
...experienced finance professionals to join as independent contractors. This role involves teaching AI models about financial principles, reviewing AI Assistant answers, and evaluating outputs from complex financial scenarios. The ideal candidate will have a Master's or PhD in...
Remote job, Hourly pay, For contractors
$21 - $24.25 per hour
...Catering Lead At Panera, our people come first. If you’re looking for a place where you can grow, feel supported, be yourself, enjoy great perks, and have a little fun along the way, you’re in the right spot. We’re here to help you succeed every day, in every way. Get...
Full time, Local area, Flexible hours, Shift work, Night shift
- ...and preprocess diverse data sets to ensure integrity and readiness for analysis. # Develop, validate, and implement statistical models to extract actionable insights from complex data. # Conduct exploratory data analysis to identify trends, patterns, and opportunities...
For contractors, Remote work
$50 per hour
About the Opportunity A leading AI research organization is seeking advanced LLM power users to evaluate how well AI systems handle personalized, real‑world life tasks. This... ...rubrics and quality criteria to assess model performance Use your own LLM experience to evaluate...
Remote job, Hourly pay
$20 per hour
...Make your move. See a day in the life of a Guest Experience Lead at McDonald's Requirements: We believe in letting you do... ...order for that organization to reach out to me and process and evaluate my application. I acknowledge that McDonald’s USA will not receive...
Full time, Part time, Local area, Flexible hours
- ...Experts for remote roles on a foundational model AI lab research project. Ideal candidates include those... ...high‑quality training data for frontier AI systems with a commitment of 40 hours... ...per week. Successful contributions may lead to selection for future projects with Mercor...
Remote job
- ...customer service approaches and techniques by modeling excellent customer service for example... ...achieved assisting the store manager in leading the management team in controlling... ...expectations and conducting regular performance evaluations providing recognition and rewards...
Hourly pay, Full time, Temporary work, Part time, Work experience placement, Local area


