AI Agent Evaluation Analyst (Freelance)
Mindrift
$60 per hour
About Mindrift
At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

What We Do
The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe.

Who We’re Looking For
We’re looking for curious and intellectually proactive contributors, the kind of person who double‑checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?

This is a flexible, project‑based opportunity well‑suited for:
- Analysts, researchers, or consultants with strong critical‑thinking skills
- Students (senior undergrads / grad students) looking for an intellectually interesting gig
- People open to a part‑time and non‑permanent opportunity

About the Project
We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem‑solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.

You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking, you might be a great fit.

What You’ll Be Doing
- Reviewing evaluation tasks and scenarios for logic, completeness, and realism
- Identifying inconsistencies, missing assumptions, or unclear decision points
- Helping define clear expected behaviors (gold standards) for AI agents
- Annotating cause‑effect relationships, reasoning paths, and plausible alternatives
- Thinking through complex systems and policies as a human would to ensure agents are tested properly
- Working closely with QA, writers, or developers to suggest refinements or edge‑case coverage

How to Get Started
Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements
- Excellent analytical thinking: Can reason about complex systems, scenarios, and logical implications
- Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements
- Familiarity with structured data formats: Can read, not necessarily write JSON/YAML
- Ability to assess scenarios holistically: What’s missing, what’s unrealistic, what might break?
- Good communication and clear writing (in English) to document your findings

We also value applicants who have:
- Experience with policy evaluation, logic puzzles, case studies, or structured scenario design
- Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research
- Exposure to LLMs, prompt engineering, or AI‑generated content
- Familiarity with QA or test‑case thinking (edge cases, failure modes, "what could go wrong")
- Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.)

Benefits
- Get paid for your expertise, with rates that can go up to $60/hour depending on your skills, experience, and project needs
- Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
- Participate in an advanced AI project and gain valuable experience to enhance your portfolio
- Influence how future AI models understand and communicate in your field of expertise

Seniority level: Internship
Employment type: Part‑time
Job function: Other
Industries: IT Services and IT Consulting

