Freelance Agent Evaluation Engineer
$80 per hourMind Rift
Please submit your CV in English and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What This Opportunity Involves Review and refine realistic coding tasks based on provided production codebases with realistic scope, requirements and information sources Write comprehensive functional tests that validate actual end-to-end behavior and edge‑cases, not just superficial checks Craft "fair but hard" challenges where the AI has all the context it needs, but has to work for it (information scattered across files and external sources, complex reasoning required) Analyze AI failures to understand what the model struggles with vs. what it masters Iterate based on feedback from expert QA reviewers who score your work on 7 quality criteria What We Look For Degree in Computer Science, Software Engineering or related fields 5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations) Background in Full‑Stack development, with an equal focus on building React‑based interfaces and robust Back‑end systems Experience writing tests (functional, integration - not just running them) Docker containers (running evaluations locally in containers) CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results) English proficiency - B2 How It Works Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid Effort estimate Tasks for this project are estimated to take 20 hours to complete, depending on complexity. This is an estimate and not a schedule requirement; you choose when and how to work. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted. Payment Paid contributions, with rates up to $80/hour* Fixed project rate or individual rates, depending on the project Some projects include incentive payments Note: Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non‑core project phases. Payment details are shared per project #J-18808-Ljbffr
$80 per hour
...For Calling all security researchers, engineers, and penetration testers with a strong... ...servers and internal tools for running and evaluating agent behavior. You'll implement base... ...needs Take part in a flexible, remote, freelance project that fits around your primary...FreelancePart timeRemote workFlexible hours$55 per hour
...opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based,... ...involves unique tasks, contributors may: Design rigorous energy engineering problems reflecting professional practice; Evaluate AI...FreelanceHourly payPermanent employmentTemporary workPart time10 hours per week$80 per hour
...companies, focused on testing, evaluating, and improving AI systems.... ...of experience as a Software Engineer (primarily Python) ~ Deep experience... ...~ Prior experience with agent evaluation platforms and MCP... ..., gcov, kcov. Benefits Freelance project‑based collaboration via...FreelancePermanent employmentTemporary workRemote workFlexible hours$69 per hour
...opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based,... ...unique tasks, contributors may: Design original computational engineering problems that simulate real engineering workflows; Create problems...FreelanceHourly payPermanent employmentTemporary workPart time10 hours per week$55 per hour
...Mechanical Engineer with Python Experience – Freelance AI Trainer 2 days ago Be among the first 25 applicants This opportunity is only for candidates currently... ...Engineers with strong Python skills to train and evaluate AI models on complex, real‑world mechanical...FreelancePart timeRemote workFlexible hours$55 per hour
...Freelance Mechanical Engineering Expert with Python Expertise - AI Trainer 2 days ago Be among the first 25 applicants This opportunity is only... ...AI. Define comprehensive scoring criteria to evaluate the accuracy of the AI's answers. Correct the model's...FreelancePart timeRemote work$80 per hour
A leading AI opportunities platform is seeking a remote Senior Python Developer to join project-based collaborations. The role involves creating functional tests for large codebases and managing Docker environments to ensure reproducible builds. Applicants should have ...FreelanceHourly payRemote workFlexible hours$40 per hour
An innovative tech organization is seeking a Contract QA Engineer to contribute to AI development. This role offers the flexibility of... ...design coding problems for AI training, write clear code, and evaluate AI outputs. Proficiency in programming languages such as JavaScript...Remote jobHourly payContract work$55 per hour
...A leading AI opportunity platform in New York seeks experienced energy engineers for project-based AI evaluations. Contributors will design engineering problems, validate AI solutions, and require strong Python and English skills. Ideal candidates have a degree in Energy...Hourly payPart time10 hours per week$60 per hour
A leading AI development company is seeking proficient programmers to contribute to cutting-edge AI systems remotely. In this role, you will tackle diverse coding challenges, work with advanced AI models, and enjoy a flexible schedule. Compensation is competitive at $60...Hourly payRemote workFlexible hours$55 per hour
...company is looking for experienced Civil Engineers with Python skills to train AI models on realistic problems. This part-time freelance role allows for flexible hours and pays up... ...Python proficiency. You will design and evaluate engineering problems, ensuring AI outputs...FreelancePart timeRemote workFlexible hours- ...services. Our contact centers are powered by both on-site and remote agents, leveraging advanced technologies to enhance customer journeys,... ...to maintain regular attendance and punctuality The ability to evaluate, troubleshoot, and follow up on customer issues An aptitude for...Full timeContract workTemporary workCasual workWork at officeLocal areaRemote workMonday to FridayShift workWeekend work
$23 per hour
A leading AI annotation company is seeking freelance annotators to work remotely on exciting projects involving the evaluation and classification of AI-generated content. Candidates must hold a degree or be currently studying, with proficiency in Portuguese and advanced...FreelanceHourly payPart timeRemote workFlexible hours$75 per hour
...English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What This Opportunity Involves Generate...FreelanceHourly payPermanent employmentTemporary workPart time10 hours per week- Are you tired of being unsure how much your delivery/driver job will pay? Will the customer tip? We have a great side hustle job for you! Our jobs are preplanned with a flexible schedule, and the faster you get at the job the quicker you are in and out! Flexible when you...Hourly payExtra incomeTemporary workPart timeSecond jobFlexible hoursShift work
- A data-driven AI company is seeking an Urdu language expert for a freelance remote position. You'll evaluate AI-generated content for accuracy and cultural relevance and rewrite responses to ensure high-quality standards. Ideal candidates should possess native level Urdu...FreelanceRemote jobFlexible hours
$60 per hour
...proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What this opportunity involves While each project...FreelancePermanent employmentTemporary workPart time10 hours per week$45 per hour
...prompts that challenge AI. Define comprehensive scoring criteria to evaluate the accuracy of the AI's answers. Correct the model's... ...and sometimes work with challenging, complex guidelines. Our freelance role is fully remote so you just need a laptop, internet connection...FreelancePart timeRemote work$20 per hour
...looking for analytical, detail-oriented small business owners, freelancers, and independent contractors to teach AI chatbots. You will... ..., write high-quality responses to demonstrate excellence, and evaluate different model outputs based on accuracy and style guidelines...FreelanceHourly payFull timeContract workPart timeFor contractorsSelf employmentRemote work- ...TrainAI is seeking AI Data Specialists to work part-time on a freelance basis, focusing on improving AI-generated content in English.... ...involves various data-related tasks, including data collection, evaluation, and annotation, and offers a flexible work schedule that...FreelanceRemote jobPart timeWork from homeFlexible hours
$23 per hour
..., we connect smart, curious people from around the world with freelance online tasks that train and improve artificial intelligence.... ...part in online projects such as rating AI-generated content, evaluating factual accuracy, or comparing responses — when projects are available...FreelancePart timeRemote work$15 per hour
...expert with a passion for writing and quality? What You’ll Do Evaluate AI‑generated content for accuracy, fluency, and cultural... ...(flexible) Location: Remote (India-based candidates) Type: Freelance Requirements Native / near-native Urdu proficiency Strong writing...FreelanceRemote jobImmediate startFlexible hours$40 per hour
...join our team to help train AI models. In this role, you will evaluate AI-generated security content, solve technical cybersecurity problems... ...testing, red teaming, incident response, detection engineering, DFIR, malware analysis, threat intelligence, or similar) Some...Hourly payFull timePart timeRemote work- ...oriented individuals with a passion for research to join their freelance team. This role allows you to work from home flexibly while... ...that many rely on. Responsibilities include conducting web-based evaluations for data accuracy and verifying mapping content. Candidates...FreelanceRemote jobLocal areaWork from home
$73 per hour
...modern AI systems are tested and evaluated, this may be the role for... ...tasks that push frontier AI agents to their limits. Think... ...Exposure to LLMs, prompt engineering, or AI‑generated content.... ...part in a part‑time, remote, freelance project that fits around your...FreelancePart timeRemote workFlexible hours$20 per hour
...looking for analytical, detail-oriented small business owners, freelancers, and independent contractors to join our team and teach AI... ..., write high-quality responses to demonstrate excellence, and evaluate different model outputs based on accuracy and style guidelines...FreelanceHourly payFull timeContract workPart timeFor contractorsSelf employmentRemote work$18 per hour
...AI Trainer - Freelance Annotator (Korean) 4 days ago Be among the first 25 applicants At Toloka, we connect smart, curious people... ...invited to online projects such as rating AI-generated content, evaluating factual accuracy, or comparing responses – when projects are...FreelancePart timeRemote work$220k - $240k
...OPPORTUNITY We are searching for a highly skilled Customer Success Engineer to play a key role in optimizing our customers' utilization... ...team. Conducting Quarterly Business Reviews with customers, evaluating progress, and identifying areas for further improvement and collaboration...Temporary workWork experience placementWork at officeRemote work$55 per hour
...Freelance AI Trainer - Civil Engineering & Python 1 week agoBe among the first 25 applicants This opportunity is only for candidates currently residing... ...experienced Civil Engineers with Python skills to train and evaluate AI models on realistic civil engineering problems. This...FreelancePart timeRemote workFlexible hours$73 per hour
...learn how modern AI systems are tested and evaluated? Project description You will create... ..., realistic tasks that push frontier AI agents to their limits. Think scattered data, conditional... ...needs Take part in a part‑time, remote, freelance project that fits around your primary...FreelancePermanent employmentPart timeRemote workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Freelance Agent Evaluation Engineer. Be the first to apply!
- commissioning agent Florida, NY
- work from home chat agent Florida, NY
- remote chat agent Florida, NY
- airport agent Florida, NY
- agent Florida, NY
- executive protection agent Florida, NY
- import export agent Florida, NY
- state farm agent Florida, NY
- cruise agent Florida, NY
- telemarketer - state farm agent team member Florida, NY

