AI Agent Evaluation Analyst (Freelance)
$60 per hourMindrift
About Mindrift At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What We Do The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe. Who We’re Looking For We’re looking for curious and intellectually proactive contributors, the kind of person who double‑checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated? This is a flexible, project‑based opportunity well‑suited for: Analysts, researchers, or consultants with strong critical‑thinking skills Students (senior undergrads / grad students) looking for an intellectually interesting gig People open to a part‑time and non‑permanent opportunity About the Project We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem‑solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases. You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit. What You’ll Be Doing Reviewing evaluation tasks and scenarios for logic, completeness, and realism Identifying inconsistencies, missing assumptions, or unclear decision points Helping define clear expected behaviors (gold standards) for AI agents Annotating cause‑effect relationships, reasoning paths, and plausible alternatives Thinking through complex systems and policies as a human would to ensure agents are tested properly Working closely with QA, writers, or developers to suggest refinements or edge‑case coverage How to Get Started Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone. Requirements Excellent analytical thinking: Can reason about complex systems, scenarios, and logical implications Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements Familiarity with structured data formats: Can read, not necessarily write JSON/YAML Ability to assess scenarios holistically: What’s missing, what’s unrealistic, what might break? Good communication and clear writing (in English) to document your findings. We also value applicants who have: Experience with policy evaluation, logic puzzles, case studies, or structured scenario design Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research Exposure to LLMs, prompt engineering, or AI‑generated content Familiarity with QA or test‑case thinking (edge cases, failure modes, "what could go wrong") Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.) Benefits Get paid for your expertise, with rates that can go up to $60/hour depending on your skills, experience, and project needs Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments Participate in an advanced AI project and gain valuable experience to enhance your portfolio Influence how future AI models understand and communicate in your field of expertise Seniority level Internship Employment type Part‑time Job function Other Industries IT Services and IT Consulting #J-18808-Ljbffr Mindrift
$80 per hour
...and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What This Opportunity...FreelancePermanent employmentTemporary work- ...Join to apply for the Online Data Analyst - Urdu role at TELUS Digital AI Data Solutions . Are you a detail-... ...national and local geography? This freelance opportunity allows you to work at... ...worldwide. Completing research and evaluation tasks in a web‑based environment...FreelancePart timeLocal areaWorldwide
$60 per hour
...contributors for a part-time QA project focused on autonomous AI agents. This flexible remote opportunity requires strong analytical and... ...with structured data formats. Candidates will review evaluation tasks, identify inconsistencies, and help define expected AI behaviors...SuggestedPart timeRemote workFlexible hours$75 per hour
...looking for legal consultants to participate in project-based opportunities focused on testing and improving AI systems. Contributors will generate prompts, evaluate AI outputs, and ensure accuracy based on established legal principles. Ideal candidates have a law degree...FreelanceHourly pay10 hours per week$40 per hour
...A leading AI development company is seeking experienced quantitative professionals to evaluate AI-generated analyses and provide impactful feedback. The role is fully remote, allowing flexibility in project selection and scheduling. Candidates with strong backgrounds...SuggestedHourly payRemote work- Mercor is seeking a Bilingual Evaluator for a contract role. You will assess AI-generated responses in Assamese and provide feedback for improvement. This position requires native proficiency in Assamese and strong skills in English writing. The ideal candidate holds a...Contract work
- ...A leading AI data solutions company seeks a detail-oriented Online Data Analyst proficient in Urdu and English for a part-time freelance role. Responsibilities include enhancing digital maps and conducting research to verify data accuracy. This remote opportunity is flexible...FreelancePart timeRemote workWork from homeFlexible hours
- A leading AI consultancy firm is seeking legal consultants for project-based opportunities involving testing and improving AI systems. Candidates should have a degree in law and at least 2 years of legal experience within US jurisdiction. Strong written English is required...FreelancePermanent employmentRemote work10 hours per week
$80 per hour
A leading AI innovation company is seeking experienced Python engineers for a remote part-time freelance project. You will focus on developing Model Context Protocol servers... ...cross-functional teams to enhance agent behavior evaluation. Ideal candidates should possess 4+...FreelancePart timeRemote work$55 per hour
...and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What This Opportunity...FreelancePermanent employmentTemporary workPart time10 hours per week$40 per hour
...A leading cybersecurity firm is seeking experienced cybersecurity professionals to evaluate AI-generated security content and solve technical problems. This position is fully remote and offers flexibility in project selection and scheduling. Candidates should have at least...Hourly payRemote work$40 per hour
...We are looking for experienced cybersecurity professionals to join our team to help train AI models. In this role, you will evaluate AI-generated security content, solve technical cybersecurity problems, and provide feedback to improve how AI systems reason about real...Hourly payFull timePart timeRemote work$55 per hour
A forward-thinking AI company is seeking experienced Civil Engineers with Python skills to train and evaluate AI models on realistic civil engineering problems. This role emphasizes... .... The position offers flexible, part-time freelance work with competitive compensation up to $...FreelanceRemote jobPart timeFlexible hours$50k - $60k
...IT Analyst 1 Hybrid — Overland Park, KS About The Organizations IT at Nextworld operates... ...have, and automate tedious work with AI — all without ripping out the ERP and systems... ...and enhance productivity Research and evaluate new technologies to support...Temporary workLive inWork at officeFlexible hours$55 per hour
An innovative tech company seeks a Freelance Civil Engineering Expert to contribute to AI projects from anywhere. As part of the team, you will generate prompts for AIs, evaluate responses, and collaborate on specialized challenges in engineering. This part-time role offers...FreelanceRemote jobHourly payPart timeFlexible hours$55 per hour
A leading AI innovation firm is looking for experienced Mechanical Engineers with strong Python skills to train and evaluate AI models on complex mechanical engineering problems. The role... ...This flexible, fully remote, part-time freelance position pays up to $55/hour,...FreelanceRemote jobPart timeFlexible hours$23 per hour
...curious people from around the world with freelance online tasks that train and improve... ...Annotators connects individuals with Generative AI projects from leading tech innovators.... ...projects such as rating AI-generated content, evaluating factual accuracy, or comparing responses...FreelanceHourly payPart timeRemote work$60 per hour
...and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What This...FreelancePermanent employmentTemporary workPart time10 hours per week- ...Precision Medicine Data Lead (AI Training) About the Role What if your expertise... ...actionable insights - helping train and evaluate cutting-edge AI models at the frontier of... ...- work when and where it suits you Freelance autonomy with the structure of meaningful...FreelanceHourly payOngoing contractContract workRemote workWorldwideFlexible hours
$20 per hour
...DataAnnotation is committed to creating quality AI. Join our team to help train AI chatbots... ..., detail-oriented small business owners, freelancers, and independent contractors to teach AI... ...responses to demonstrate excellence, and evaluate different model outputs based on accuracy...FreelanceHourly payFull timeContract workPart timeFor contractorsSelf employmentRemote work$55 per hour
...intelligence to ethically shape the future of AI. What We Do The Mindrift platform... ...Engineers with Python skills to train and evaluate AI models on realistic civil engineering... ...expertise Fully remote, flexible, part-time freelance work Contribute to how future AI systems...FreelancePart timeRemote workFlexible hours$40 per hour
...A leading AI development firm is looking for experienced quantitative professionals to join their fully remote team. This role includes evaluating AI-generated quantitative analysis and providing feedback to shape future AI models. Candidates should have at least 2 years...Hourly payRemote workFlexible hours- ...Hospital Health Data Governance Lead (AI Training) About the Role What if your... ...professionals to help train and evaluate cutting-edge AI models - ensuring they understand... ...- work when and where it suits you Freelance autonomy with the structure of meaningful...FreelanceHourly payOngoing contractContract workRemote workFlexible hours
$73 per hour
...ethically shape the future of AI. What We Do The... ...systems are tested and evaluated? This is a flexible, project... ...well‐suited for: Analysts, researchers, experienced... ...that push frontier AI agents to their limits. Think... ...in a part‐time, remote, freelance project that fits around...FreelancePermanent employmentPart timeRemote workFlexible hours- Accenture is seeking an Advanced AI Architect in Overland Park, Kansas. You will design and deliver full stack AI architecture for... ...variety of platforms. Responsibilities include leading workshops, evaluating technologies, and implementing AI governance and security. The...
$55 per hour
Freelance Civil Engineering Expert - AI Trainer 1 day ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features... ...AI. Define comprehensive scoring criteria to evaluate the accuracy of the AI’s answers. Correct the model’s...FreelancePart timeRemote work$60 per hour
...A cutting-edge AI company seeks quantitative professionals to evaluate AI-generated quantitative work, design quantitative problems, and provide impactful feedback. This flexible, fully remote role offers up to $60 USD/hour and is ideal for those with a background in data...Remote workFlexible hours$40 per hour
...A leading AI development company is seeking experienced quantitative professionals to evaluate AI-generated work and design quantitative problems. This role allows for a fully remote flexible schedule while contributing to the development of advanced AI systems. Candidates...Hourly payRemote workFlexible hours$40 per hour
...the DataAnnotation team and contribute to developing cutting-edge AI systems, while enjoying the flexibility of remote work and... ...you'll work closely with state-of-the-art AI models on tasks like evaluating AI-generated quantitative analysis, solving technical problems,...Hourly payFull timeRemote workFlexible hours$69 per hour
...and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What This...FreelanceHourly payPermanent employmentTemporary workPart time10 hours per week
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Agent Evaluation Analyst (Freelance). Be the first to apply!
- special agent Kansas City, MO
- operations agent Kansas City, MO
- agent Kansas City, MO
- airport agent Kansas City, MO
- telemarketer - state farm agent team member Kansas City, MO
- fbi agent Kansas City, MO
- freight broker agent Kansas City, MO
- cruise agent Kansas City, MO
- state farm agent Kansas City, MO
- commissioning agent Kansas City, MO

