AI Agent Evaluation Analyst (Freelance)

$60 per hour

Mindrift

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

What We Do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

Who We're Looking For

We're looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil's advocate. Are you comfortable with ambiguity and complexity? Does an asynchronous, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?

This is a flexible, project-based opportunity well-suited for:

- Analysts, researchers, or consultants with strong critical thinking skills
- Students (senior undergrads / grad students) looking for an intellectually interesting gig
- People open to a part-time and non-permanent opportunity

About the Project

We're on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you'll have to balance quality assurance, research, and logical problem-solving.

This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases. You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups.

If you've ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking, you might be a great fit.

What You'll Be Doing

- Reviewing evaluation tasks and scenarios for logic, completeness, and realism
- Identifying inconsistencies, missing assumptions, or unclear decision points
- Helping define clear expected behaviors (gold standards) for AI agents
- Annotating cause-effect relationships, reasoning paths, and plausible alternatives
- Thinking through complex systems and policies as a human would to ensure agents are tested properly
- Working closely with QA, writers, or developers to suggest refinements or edge case coverage

How to Get Started

Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements

- Excellent analytical thinking: can reason about complex systems, scenarios, and logical implications
- Strong attention to detail: can spot contradictions, ambiguities, and vague requirements
- Familiarity with structured data formats: can read, not necessarily write, JSON/YAML
- Ability to assess scenarios holistically: what's missing, what's unrealistic, what might break?
- Good communication and clear writing (in English) to document your findings

We Also Value Applicants Who Have

- Experience with policy evaluation, logic puzzles, case studies, or structured scenario design
- Background in consulting, academia, olympiads (e.g., logic/math/informatics), or research
- Exposure to LLMs, prompt engineering, or AI-generated content
- Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”)
- Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.)

Benefits

- Get paid for your expertise, with rates that can go up to $60/hour depending on your skills, experience, and project needs
- Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
- Participate in an advanced AI project and gain valuable experience to enhance your portfolio
- Influence how future AI models understand and communicate in your field of expertise

Vacancy posted 4 days ago

