AI Agent Evaluation Analyst (Freelance)

$60 per hour

Mindrift

Location & Eligibility This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. About Mindrift At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What We Do The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe. Who We’re Looking For We’re looking for curious and intellectually proactive contributors—people who double‑check assumptions and play devil’s advocate. If you thrive in ambiguity, enjoy remote asynchronous work, and want to learn how modern AI systems are tested and evaluated, we want to hear from you. Project Overview We are seeking QA experts for autonomous AI agents in a project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you will balance quality assurance, research, and logical problem‑solving. Responsibilities Review evaluation tasks and scenarios for logic, completeness, and realism. Identify inconsistencies, missing assumptions, or unclear decision points. Define clear expected behaviours (gold standards) for AI agents. Annotate cause‑effect relationships, reasoning paths, and plausible alternatives. Think through complex systems and policies as a human would to ensure agents are tested properly. Collaborate with QA, writers, or developers to suggest refinements or edge‑case coverage. Requirements Excellent analytical thinking: ability to reason about complex systems, scenarios, and logical implications. Strong attention to detail: spot contradictions, ambiguities, and vague requirements. Familiarity with structured data formats: read (not necessarily write) JSON/YAML. Ability to assess scenarios holistically: identify what’s missing, unrealistic, or potentially breaking. Good communication and clear writing (in English) to document findings. We also value applicants who have: Experience with policy evaluation, logic puzzles, case studies, or structured scenario design. Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research. Exposure to LLMs, prompt engineering, or AI‑generated content. Familiarity with QA or test‑case thinking (edge cases, failure modes, "what could go wrong"). Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.). Benefits Competitive pay up to $60/hour depending on skills, experience, and project needs. Flexible, remote, freelance project that fits around your primary professional or academic commitments. Advanced AI project experience to enhance your portfolio. Opportunity to influence how future AI models understand and communicate in your field of expertise. #J-18808-Ljbffr Mindrift

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the AI Agent Evaluation Analyst (Freelance) in Austin, TX vacancy

Remote QA Analyst for Autonomous AI Agent Evaluation
$60 per hour
...A leading AI firm in Austin is looking for QA experts to validate... ...AI systems. This remote, freelance role requires strong analytical... .... Candidates will review AI evaluation tasks, identify... ...define expected behaviors for agents. Ideal applicants have experience...
Freelance
Remote work
Mind Rift
Austin, TX
3 days ago
Online Data Analyst Odia
...Join to apply for the Online Data Analyst Odia role at TELUS Digital AI Data Solutions Are you a detail-oriented... ...national and local geography? This freelance opportunity allows you to work at... ...worldwide Completing research and evaluation tasks in a web-based environment...
Freelance
Part time
Local area
Worldwide
TELUS Digital AI Data Solutions
Austin, TX
3 days ago
Freelance US Law Attorney AI Project Evaluator
$60 per hour
...firm is seeking legal consultants with US law experience for part-time, project-based opportunities. You will generate prompts for AI, evaluate solutions, and improve reasoning standards. Ideal candidates have a law degree and 2+ years of legal experience. Strong written...
Freelance
Part time
Mind Rift
Austin, TX
2 days ago
AI Evaluation Platform Lead: Scale Agent Quality
...Manufacturing Co is seeking an experienced Engineering Manager for its Evaluation Platform team within Procore’s Construction Intelligence... ...evaluation frameworks and delivering tools for assessing AI agent quality. The ideal candidate should have 5+ years of management...
Suggested
Dormont Manufacturing Co
Austin, TX
2 days ago
Compensation Analyst - Market Data & Job Evaluation
Central Health is seeking a Compensation Analyst - Core Compensation in Austin, Texas. This role focuses on supporting compensation programs through market analysis and job evaluation, ensuring competitive and equitable pay practices across the organization. The ideal...
Suggested
Central Health
Austin, TX
1 day ago
Freelance Physicist & Python Expert - AI Trainer
$55 per hour
...and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What This...
Freelance
Permanent employment
Temporary work
Part time
10 hours per week
Mind Rift
Austin, TX
4 days ago
ONA AI Agent Intern (Logistics Focus) - OVIP
$30 per hour
...industry's broadest and deepest suite of AI-powered cloud applications. The following... ...We are seeking a highly motivated AI Agent Intern to join Oracle's Supply Chain Applications... ...scripts. Research & Analysis Evaluate logistics AI use cases (forecasting, resilience...
Hourly pay
Temporary work
Internship
Flexible hours
Oracle
Austin, TX
1 day ago
Remote Freelance Statistics AI Tutor
$73 per hour
A leading AI consultancy is seeking a Quantitative Statistics Expert to work flexibly as a freelance AI Trainer. This remote role requires a Bachelor's degree in Statistics and... ...include generating AI prompts and evaluating model accuracy, allowing you to impact the...
Freelance
Remote job
Mindrift
Austin, TX
3 days ago
Remote Biology AI Tutor with Python
$55 per hour
A leading AI firm is seeking a Freelance Biology Expert with proficiency in Python to contribute to advanced AI projects from the comfort of your home. The role involves generating prompts, evaluating AI responses, and leveraging your expertise in Biology. Candidates should...
Freelance
Remote job
Mindrift
Austin, TX
2 days ago
Freelance Legal Attorney (US Law) - AI Trainer
$60 per hour
...indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What This...
Freelance
Permanent employment
Temporary work
Part time
10 hours per week
Mind Rift
Austin, TX
2 days ago
Senior Engineering Manager - AI Agents
...planning challenges. Recognized by industry analysts like Forrester and G2 and backed by top‑... ...revenue performance. About the Role AI agents are becoming central to how CaptivateIQ... ...agent SDK, LLM orchestration layer, and the evaluation and observability infrastructure that...
Work at office
Remote work
Flexible hours
Shift work
3 days per week
Dormont Manufacturing Company
Austin, TX
1 day ago
Remote Pure Mathematics Specialist AI Reasoning Trainer
$35 - $65 per hour
Invisible Agency is seeking a Pure Mathematics Specialist for a Freelance AI Trainer Project. This remote position requires a theoretical mathematics expert to construct and evaluate complex proofs, ensuring rigor and correctness. Ideal candidates are fluent in Lean 4...
Freelance
Remote job
Hourly pay
Invisible Agency
Austin, TX
5 days ago
AI Agent PM — End-to-End HealthTech Deployments
$110k - $160k
Hellopatient is seeking a technical AI Agent Product Manager in Austin, Texas, to lead the delivery of AI agents tailored for healthcare... ...and continuously enhance agent performance through structured evaluation and real-world feedback. The position offers a competitive...
Hellopatient
Austin, TX
2 days ago
Luxury Brand Experience Evaluator
A global leader in customer experience is seeking a Freelance Luxury Brand Evaluator in Austin, TX. In this role, you will assess customer experiences with high-end brands by visiting stores or evaluating online. Enjoy flexible assignments and compensation based on your...
Freelance
Flexible hours
CXG
Austin, TX
1 day ago
Remote LaTeX Specialist for AI Training & Editing
$8 - $30 per hour
Invisible Agency is seeking a LaTeX Specialist for a freelance AI Trainer Project to audit and refine AI-generated text. The ideal candidate... ...LaTeX expressions, rewriting text, and applying consistent evaluation rubrics. The pay ranges from $8 to $30 per hour, depending on...
Freelance
Remote job
Hourly pay
Invisible Agency
Austin, TX
4 days ago
Remote M&A Attorney & AI Transaction Strategist
...& Acquisitions Attorney to support the development of AI tools. This senior-level freelance role requires at least 4 years of hands-on experience in... ...corporate law. The attorney will review AI-generated content, evaluate financial and legal reasoning, and work closely with a...
Freelance
Remote job
Hourly pay
For contractors
Invisible Agency
Austin, TX
4 days ago
Project Perseus | Speech & Voice AI Analyst - Czech Speakers
$26 - $28 per hour
...Data Labeling Analyst Welo Data is looking for detail-oriented and reliable individuals to... ...Labeling Analysts, supporting speech and voice AI systems. This is a high-impact production... ...this role is more execution-focused than evaluation-heavy roles, it still requires strong...
Full time
Work experience placement
Remote work
Visa sponsorship
Welocalize
Austin, TX
4 days ago
Remote PHP AI Trainer — Freelance Coding Specialist
Invisible Agency is looking for a PHP Coding Specialist for a freelance AI Trainer project. In this role, you will apply your PHP expertise to shape the future of AI by evaluating code quality, conversing with AI on engineering tasks, and suggesting improvements. Ideal...
Freelance
Remote job
Hourly pay
Contract work
Invisible Agency
Austin, TX
5 days ago
Remote AI Trainer Civil Eng & Python Pro
$55 per hour
A leading innovative tech firm seeks a Freelance AI Trainer specializing in Civil Engineering and Python. This part-time, fully remote role involves designing and evaluating AI models on civil engineering challenges. Candidates should have a Bachelor’s, Master’s, or PhD...
Freelance
Remote job
Hourly pay
Part time
Mind Rift
Austin, TX
2 days ago
English Language Specialist (US Only) - Freelance AI Trainer Project
$20 per hour
...English Language Specialist (US Only) - Freelance AI Trainer Project World Wide - Remote Are you eager to shape the future of AI? Large-... ...’ll work hands‑on with advanced AI tools, test model outputs, evaluate responses for accuracy and clarity, and provide structured feedback...
Freelance
Hourly pay
Contract work
For contractors
Remote work
Invisible Agency
Austin, TX
1 day ago
AI Trainer - Freelance Annotator (Portuguese)
$23 per hour
...curious people from around the world with freelance online tasks that train and improve... ...Annotators connects individuals with Generative AI projects from leading tech innovators.... ...projects such as rating AI-generated content, evaluating factual accuracy, or comparing responses...
Freelance
Part time
Remote work
Toloka Annotators
Austin, TX
3 days ago
Remote Audio AI Trainer - Freelance Project
$11 - $30.65 per hour
Invisible Agency is looking for an Audio Specialist as a Freelance AI Trainer to evaluate advanced audio models. In this remote role, you will create scenarios that simulate real customer service interactions and assess model performance based on various criteria. The...
Freelance
Remote job
Hourly pay
Invisible Agency
Austin, TX
1 day ago
Remote Chemistry AI Trainer — Freelance Expert Network
...Network Join our Chemist Expert Network to connect with leading AI labs and companies seeking your expertise. This is an open application... .... Projects Experts in our network contribute to: Training and evaluating AI models in Chemistry Creating tasks and deliverables based on...
Freelance
Remote job
Contract work
Mercor
Austin, TX
3 days ago
AI Trainer - Freelance Annotator (English)
$18 per hour
...Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is... ...and useful feedback that helps improve model and agent performance Following style and quality standards...
Freelance
Hourly pay
Permanent employment
Temporary work
Part time
10 hours per week
Toloka Annotators
Austin, TX
2 days ago
Project Perseus | Speech & Voice AI Analyst - Spanish Speakers
$26 - $28 per hour
...Data Labeling Analyst Welo Data is looking for detail-oriented and reliable individuals... ...Labeling Analysts, supporting speech and voice AI systems. This is a high-impact... ...this role is more execution-focused than evaluation-heavy roles, it still requires strong judgment...
Full time
Work experience placement
Remote work
Visa sponsorship
Welocalize
Austin, TX
1 day ago
Project Perseus | Speech & Voice AI Analyst - French Speakers
$26 - $28 per hour
...Data Labeling Analyst Welo Data is looking for detail-oriented and reliable individuals... ...Labeling Analysts, supporting speech and voice AI systems. This is a high-impact... ...this role is more execution-focused than evaluation-heavy roles, it still requires strong judgment...
Full time
Work experience placement
Remote work
Visa sponsorship
Welocalize
Austin, TX
1 day ago
Freelance Software Developer (Ruby) - AI Trainer
$60 per hour
...Freelance Software Developer (Ruby) - AI Trainer Location : Remote, full‑time, freelance Overview At Mindrift, we connect AI projects with specialists... ...challenge AI models. Define comprehensive scoring criteria to evaluate AI responses. Correct model outputs based on domain...
Freelance
Full time
Part time
Remote work
Worldwide
Mind Rift
Austin, TX
3 days ago
Freelance AI Trainer - Civil Engineering & Python
$55 per hour
...Freelance AI Trainer - Civil Engineering & Python 1 day ago Be among the first 25 applicants This opportunity is only for candidates... ...seeking experienced Civil Engineers with Python skills to train and evaluate AI models on realistic civil engineering problems. This role...
Freelance
Part time
Remote work
Flexible hours
Mind Rift
Austin, TX
2 days ago
AI Agent Product Manager
$110k - $160k
About the Role Hello Patient is hiring a technical, high-agency AI Agent Product Manager to own the end-to-end delivery of AI agents in... ...ll continuously iterate on agent performance using structured evaluation, testing, and real customer feedback. What You'll Do Own...
Hello Patient
Austin, TX
4 days ago
STEM Specialist - Freelance AI Trainer Project
$8 - $65 per hour
STEM Specialist - AI Trainer (Contract, Remote) Location: United States of America (remote... ...improvements to prompt engineering and evaluation metrics. You will also document failure... ..., and aiding engineers, scientists, and analysts. Responsibilities Converse with the model...
Freelance
Hourly pay
Contract work
Remote work
Invisible Agency
Austin, TX
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Agent Evaluation Analyst (Freelance). Be the first to apply!