AI Agent Evaluation Analyst (Freelance)
$60 per hour
Mindrift
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

What We Do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

Who We're Looking For

We're looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil's advocate. Are you comfortable with ambiguity and complexity? Does an asynchronous, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?

This is a flexible, project-based opportunity well-suited for:

- Analysts, researchers, or consultants with strong critical thinking skills
- Students (senior undergrads / grad students) looking for an intellectually interesting gig
- People open to a part-time and non-permanent opportunity

About the Project

We're on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you'll have to balance quality assurance, research, and logical problem-solving.

This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases. You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups.
If you've ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking, you might be a great fit.

What You'll Be Doing

- Reviewing evaluation tasks and scenarios for logic, completeness, and realism
- Identifying inconsistencies, missing assumptions, or unclear decision points
- Helping define clear expected behaviors (gold standards) for AI agents
- Annotating cause-effect relationships, reasoning paths, and plausible alternatives
- Thinking through complex systems and policies as a human would to ensure agents are tested properly
- Working closely with QA, writers, or developers to suggest refinements or edge case coverage

How to Get Started

Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements

- Excellent analytical thinking: can reason about complex systems, scenarios, and logical implications
- Strong attention to detail: can spot contradictions, ambiguities, and vague requirements
- Familiarity with structured data formats: can read, not necessarily write, JSON/YAML
- Ability to assess scenarios holistically: what's missing, what's unrealistic, what might break?
- Good communication and clear writing (in English) to document your findings

We Also Value Applicants Who Have

- Experience with policy evaluation, logic puzzles, case studies, or structured scenario design
- Background in consulting, academia, olympiads (e.g., logic/math/informatics), or research
- Exposure to LLMs, prompt engineering, or AI-generated content
- Familiarity with QA or test-case thinking (edge cases, failure modes, "what could go wrong")
- Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.)
Benefits

- Get paid for your expertise, with rates that can go up to $60/hour depending on your skills, experience, and project needs
- Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
- Participate in an advanced AI project and gain valuable experience to enhance your portfolio
- Influence how future AI models understand and communicate in your field of expertise

