
AI Agent Evaluation Analyst (Freelance)

$60 per hour

Mindrift

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

What We Do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

Who We're Looking For

We're looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil's advocate. Are you comfortable with ambiguity and complexity? Does an asynchronous, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?

This is a flexible, project-based opportunity well-suited for:
  • Analysts, researchers, or consultants with strong critical thinking skills
  • Students (senior undergrads / grad students) looking for an intellectually interesting gig
  • People open to a part-time and non-permanent opportunity

About the Project

We're on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you'll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.

You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you've ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking, you might be a great fit.

What You'll Be Doing
  • Reviewing evaluation tasks and scenarios for logic, completeness, and realism
  • Identifying inconsistencies, missing assumptions, or unclear decision points
  • Helping define clear expected behaviors (gold standards) for AI agents
  • Annotating cause-effect relationships, reasoning paths, and plausible alternatives
  • Thinking through complex systems and policies as a human would to ensure agents are tested properly
  • Working closely with QA, writers, or developers to suggest refinements or edge case coverage

How to Get Started

Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements
  • Excellent analytical thinking: can reason about complex systems, scenarios, and logical implications
  • Strong attention to detail: can spot contradictions, ambiguities, and vague requirements
  • Familiarity with structured data formats: can read, not necessarily write, JSON/YAML
  • Ability to assess scenarios holistically: what's missing, what's unrealistic, what might break?
  • Good communication and clear writing (in English) to document your findings

We Also Value Applicants Who Have
  • Experience with policy evaluation, logic puzzles, case studies, or structured scenario design
  • Background in consulting, academia, olympiads (e.g., logic/math/informatics), or research
  • Exposure to LLMs, prompt engineering, or AI-generated content
  • Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”)
  • Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.)

Benefits
  • Get paid for your expertise, with rates that can go up to $60/hour depending on your skills, experience, and project needs
  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio
  • Influence how future AI models understand and communicate in your field of expertise

Vacancy posted 4 days ago
Similar jobs that could be interesting for you
Based on the AI Agent Evaluation Analyst (Freelance) in Raleigh, NC vacancy
  • $80 per hour

     ...servers and internal tools for running and evaluating agent behavior. You'll implement base methods...  ...needs Take part in a flexible, remote, freelance project that fits around your primary professional... 
    Freelance
    Part time
    Remote work
    Flexible hours

    Mind Rift

    Raleigh, NC
    1 day ago
  • $55 per hour

    A leading AI innovation firm is seeking QAs for autonomous AI agents to improve evaluation frameworks. Candidates should possess excellent analytical thinking and attention to detail to review tasks and define clear standards. This remote, flexible opportunity offers rates... 
    Suggested
    Remote job
    Part time
    Flexible hours

    Mindrift

    Raleigh, NC
    4 days ago
  •  ...national and local geography? This freelance opportunity allows you to...  ...the Life of an Online Data Analyst: In this role, you will be working...  ...Completing research and evaluation tasks in a web-based environment...  ...in the world! TELUS Digital AI Community Our global AI... 
    Freelance
    Part time
    Local area
    Worldwide

    TELUS Digital

    Raleigh, NC
    11 days ago
  • $60 per hour

    A leading technology firm based in the United States is seeking legal consultants for project-based AI opportunities. The role involves generating prompts, evaluating AI solutions, and improving AI reasoning. Candidates should have a law degree and at least two years of... 
    Freelance
    Flexible hours

    Mindrift

    Raleigh, NC
    5 days ago
  • $80 per hour

     ...-time opportunity focused on quality assurance for autonomous AI agents. You will analyze complex systems, review tasks for logic, and...  ...analytical and detail-oriented skills, with experience in policy evaluation or logic puzzles preferred. Compensation can reach up to $80/... 
    Suggested
    Part time
    Remote work
    Flexible hours

    Mind Rift

    Raleigh, NC
    1 day ago
  • $60 per hour

    Mindrift is offering an exciting opportunity for professionals to engage in evaluating AI-generated auto insurance claims. Candidates will need a degree in related fields and 3+ years of relevant experience. The role requires thorough evaluation and documentation of claims... 
    Freelance
    Hourly pay
    10 hours per week
    Flexible hours

    Mindrift

    Raleigh, NC
    4 days ago
  • $60 per hour

     ...innovation company is seeking QAs for autonomous AI agents to validate and improve task structures and evaluate logic. The role requires excellent analytical thinking...  ...rates up to $60/hour. This position is ideal for analysts or students looking to contribute meaningfully to... 
    Remote job
    Flexible hours

    Mindrift

    Raleigh, NC
    4 days ago
  •  ...solutions company is seeking an Online Data Analyst to work on enhancing the quality of digital maps used worldwide. This part-time freelance role allows you to work from home at your...  ...team and contribute to building better AI models within an inclusive environment.
    Freelance
    Remote job
    Part time
    Work from home
    Worldwide

    TELUS Digital AI Data Solutions

    Raleigh, NC
    5 days ago
  • A global technology company is seeking an Online Data Analyst in the United States to enhance digital map content and quality. This part-time, long-term freelance role includes conducting online research and completing tasks related to maps and data. Candidates should... 
    Freelance
    Remote job
    Part time

    TELUS Digital

    Raleigh, NC
    5 days ago
  • A technology company is seeking a freelance Online Data Analyst to enhance digital map content. This part-time opportunity allows remote work, requiring proficiency in Urdu and English, and involves verifying geographical data. Candidates must have familiarity with US cultural... 
    Freelance
    Remote job
    Part time

    TELUS Digital

    Raleigh, NC
    3 days ago
  • $55 per hour

    A leading AI development company in North Carolina is seeking an AI Tutor in Accounting for a part-time, remote role. This position involves generating challenging AI prompts, defining evaluation criteria, and correcting AI responses in your field of expertise. Applicants... 
    Freelance
    Remote job
    Hourly pay
    Part time

    Mindrift

    Raleigh, NC
    2 days ago
  • $20 per hour

    A technology company specializing in AI is seeking individuals to train chatbots remotely. This role involves developing prompts, writing responses, and evaluating AI performance. Ideal for freelance professionals, the position offers flexibility in schedule and project... 
    Freelance
    Remote job

    DataAnnotation

    Raleigh, NC
    3 days ago
  • $55 per hour

    A dynamic AI platform is seeking a Freelance Biology Expert with Python to generate advanced AI training prompts and evaluate AI responses. Applicants should possess a degree in Biology and relevant professional experience. This remote role offers flexible hours and competitive... 
    Freelance
    Remote job
    Hourly pay
    Flexible hours

    Mindrift

    Raleigh, NC
    5 days ago
  • $60 per hour

     ...and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What This... 
    Freelance
    Permanent employment
    Temporary work
    Part time
    10 hours per week

    Mindrift

    Raleigh, NC
    1 day ago
  • $73 per hour

     ...ethically shape the future of AI. What We Do The Mindrift platform...  ...AI systems are tested and evaluated? About the Project You will create...  ...tasks that push frontier AI agents to their limits. Think...  ...part in a part‑time, remote, freelance project that fits around your... 
    Freelance
    Part time
    Remote work
    Flexible hours

    Mind Rift

    Raleigh, NC
    1 day ago
  • $55 per hour

     ...intelligence to ethically shape the future of AI. What We Do The Mindrift platform...  ...guidelines. Auditing Work: Review and evaluate tasks completed by other experts,...  ...challenging, complex guidelines. Our freelance role is fully remote, so you just need a... 
    Freelance
    Part time
    Remote work

    Mind Rift

    Raleigh, NC
    1 day ago
  • RWS Group is seeking AI Data Specialists to enhance AI-generated content in English. This freelance position allows you to work from home in North Carolina, offering flexible...  ...week). The role involves data collection, evaluation, annotation, and object tagging across... 
    Freelance
    Remote job
    Part time
    Work from home
    10 hours per week
    Flexible hours

    RWS Group

    Raleigh, NC
    5 days ago
  • $55 per hour

     ...and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What This... 
    Freelance
    Hourly pay
    Permanent employment
    Temporary work
    Part time
    10 hours per week

    Mindrift

    Raleigh, NC
    5 days ago
  • $23 per hour

     ...curious people from around the world with freelance online tasks that train and improve...  ...Annotators connects individuals with Generative AI projects from leading tech innovators....  ...projects such as rating AI-generated content, evaluating factual accuracy, or comparing responses... 
    Freelance
    Hourly pay
    Part time
    Remote work

    Toloka Annotators

    Raleigh, NC
    3 days ago
  • The AI Solutions Analyst supports the Data & AI organization in developing and deploying artificial intelligence and machine learning solutions...  ...detection, and scoring workflows. Participate in model evaluation, validation, monitoring, and lifecycle maintenance.... 
    Work at office

    Grifols, S.A

    Raleigh, NC
    1 day ago
  • $40 per hour

    A technology company is seeking experienced cybersecurity professionals to join their REMOTE team. The role involves evaluating AI-generated security content and solving technical cybersecurity problems. Candidates should have 2+ years in cybersecurity with some coding... 
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    Raleigh, NC
    5 days ago
  • $118.1k - $328.8k

    Job Summary Within IQVIA’s AI & Technology Solutions (ATS) organization, the Architecture...  ...: Published and broadly adopted AI and agent reference architectures Increased reuse...  ..., RAG, tool‑use, MCP servers, HITL, evaluation frameworks, monitoring, observability, and... 
    Full time
    Part time
    Immediate start

    Dormont Manufacturing Co

    Raleigh, NC
    1 day ago
  • $50 - $60 per hour

    A legal consulting firm in North Carolina is seeking a Legal Specialist to evaluate AI models by providing complex legal problems. Candidates must hold a law degree and have 5+ years of experience in various legal fields. This role allows flexibility in project selection... 
    Hourly pay
    For contractors
    Flexible hours

    DataAnnotation

    Raleigh, NC
    2 days ago
  • $55 per hour

    Freelance Biology Expert with Python - AI Trainer. This opportunity is only for candidates currently residing...  ...AI. Define comprehensive scoring criteria to evaluate the accuracy of the AI's answers. Correct the model's... 
    Freelance
    Part time
    Remote work

    Mindrift

    Raleigh, NC
    5 days ago
  • $20 per hour

     ...DataAnnotation is committed to creating quality AI. Join our team to help train AI chatbots...  ..., detail-oriented small business owners, freelancers, and independent contractors to teach AI...  ...responses to demonstrate excellence, and evaluate different model outputs based on accuracy... 
    Freelance
    Hourly pay
    Full time
    Contract work
    Part time
    For contractors
    Self employment
    Remote work

    DataAnnotation

    Raleigh, NC
    2 days ago
  • $20 per hour

    DataAnnotation is committed to creating quality AI. Join our team to help train AI chatbots...  ..., detail‑oriented small business owners, freelancers, and independent contractors to join our...  ...responses to demonstrate excellence, and evaluate different model outputs based on accuracy... 
    Freelance
    Hourly pay
    Full time
    Contract work
    Part time
    For contractors
    Self employment
    Remote work

    DataAnnotation

    Raleigh, NC
    2 days ago
  • $40 per hour

    A leading AI development team is seeking experienced quantitative professionals for a flexible remote role involving evaluation of AI-generated work. Ideal candidates have over 2 years of experience in quantitative analysis, strong coding skills, and a background in fields... 
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    Raleigh, NC
    4 days ago
  •  ...solutions firm is seeking an Online Data Analyst for a fully remote part-time position. Candidates...  ...map content through online research and evaluation tasks. This entry-level role offers...  ...team making a difference in the world of AI and data solutions.
    Remote job
    Part time
    Flexible hours

    TELUS Digital AI Data Solutions

    Raleigh, NC
    5 days ago
  • $60 per hour

     ...A pioneering AI development organization is seeking quantitative professionals to evaluate AI-generated analyses and conduct statistical work. You will work remotely, selecting projects at your convenience, with competitive pay up to $60/hour. Ideal candidates have at... 
    Remote work

    DataAnnotation

    Raleigh, NC
    1 day ago
  • $40 per hour

    A cutting-edge AI company is looking for experienced quantitative professionals to evaluate AI-generated quantitative work. You will analyze statistical models, solve quantitative problems, and help validate AI outputs. Candidates should have 2+ years of experience in... 
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    Raleigh, NC
    3 days ago
