Freelance Agent Evaluation Engineer
$80 per hour
Mindrift
Please submit your CV in English and indicate your level of English proficiency.

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What This Opportunity Involves
- Review and refine realistic coding tasks based on provided production codebases with realistic scope, requirements and information sources
- Write comprehensive functional tests that validate actual end-to-end behavior and edge cases, not just superficial checks
- Craft "fair but hard" challenges where the AI has all the context it needs, but has to work for it (information scattered across files and external sources, complex reasoning required)
- Analyze AI failures to understand what the model struggles with vs. what it masters
- Iterate based on feedback from expert QA reviewers who score your work on 7 quality criteria

What We Look For
- Degree in Computer Science, Software Engineering or related fields
- 5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)
- Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systems
- Experience writing tests (functional, integration - not just running them)
- Docker containers (running evaluations locally in containers)
- CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)
- English proficiency - B2

How It Works
Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid

Effort estimate
Tasks for this project are estimated to take 20 hours to complete, depending on complexity. This is an estimate and not a schedule requirement; you choose when and how to work. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted.

Payment
- Paid contributions, with rates up to $80/hour*
- Fixed project rate or individual rates, depending on the project
- Some projects include incentive payments

Note: Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non-core project phases. Payment details are shared per project.

