Freelance Agent Evaluation Engineer
$80 per hourMindrift
Please submit your CV in English and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What This Opportunity Involves Review and refine realistic coding tasks based on provided production codebases with realistic scope, requirements and information sources Write comprehensive functional tests that validate actual end-to-end behavior and edge‑cases, not just superficial checks Craft "fair but hard" challenges where the AI has all the context it needs, but has to work for it (information scattered across files and external sources, complex reasoning required) Analyze AI failures to understand what the model struggles with vs. what it masters Iterate based on feedback from expert QA reviewers who score your work on 7 quality criteria What We Look For Degree in Computer Science, Software Engineering or related fields 5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations) Background in Full‑Stack development, with an equal focus on building React‑based interfaces and robust Back‑end systems Experience writing tests (functional, integration - not just running them) Docker containers (running evaluations locally in containers) CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results) English proficiency - B2 How It Works Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid Effort estimate Tasks for this project are estimated to take 20 hours to complete, depending on complexity. This is an estimate and not a schedule requirement; you choose when and how to work. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted. Payment Paid contributions, with rates up to $80/hour* Fixed project rate or individual rates, depending on the project Some projects include incentive payments Note: Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non‑core project phases. Payment details are shared per project #J-18808-Ljbffr Mindrift
$80 per hour
...For Calling all security researchers, engineers, and penetration testers with a strong... ...servers and internal tools for running and evaluating agent behavior. You’ll implement base... ...needs Take part in a flexible, remote, freelance project that fits around your primary...FreelanceHourly payPart timeRemote workFlexible hours$60 per hour
...modern AI systems are tested and evaluated? This is a flexible,... ...hunt for QAs for autonomous AI agents for a new project focused on... ...Exposure to LLMs, prompt engineering, or AI‑generated content Familiarity... ...part in a flexible, remote, freelance project that fits around...FreelancePermanent employmentPart timeInternshipRemote workFlexible hours$55 per hour
...innovation firm is looking for experienced Mechanical Engineers with strong Python skills to train and evaluate AI models on complex mechanical engineering... ...mechanics. This flexible, fully remote, part-time freelance position pays up to $55/hour, allowing engineers to...FreelanceRemote jobPart timeFlexible hours$55 per hour
...forward-thinking AI company is seeking experienced Civil Engineers with Python skills to train and evaluate AI models on realistic civil engineering problems.... ...analysis. The position offers flexible, part-time freelance work with competitive compensation up to $55/hour,...FreelanceRemote jobPart timeFlexible hours$55 per hour
Freelance Civil Engineering Expert - AI Trainer 1 day ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive... ...that challenge AI. Define comprehensive scoring criteria to evaluate the accuracy of the AI’s answers. Correct the model’s...FreelancePart timeRemote work$55 per hour
...opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based,... ...contributors may: Design graduate- and industry-level automotive engineering problems grounded in real practice; Evaluate AI-generated...FreelancePermanent employmentTemporary workPart time10 hours per week$55 per hour
...globe. The Role We are seeking experienced Mechanical Engineers with strong Python skills to train and evaluate AI models on complex, real-world mechanical... ...your expertise Fully remote, flexible, part-time freelance work Contribute to how future AI systems reason about...FreelancePart timeRemote workFlexible hours$69 per hour
...opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based,... ...What This Opportunity Involves Design original computational engineering problems that simulate real engineering workflows; Create problems...FreelanceHourly payPermanent employmentTemporary workPart time10 hours per week$55 per hour
...the globe. The Role We are seeking experienced Civil Engineers with Python skills to train and evaluate AI models on realistic civil engineering problems.... ...for your expertise Fully remote, flexible, part-time freelance work Contribute to how future AI systems reason about...FreelancePart timeRemote workFlexible hours$60 per hour
...contributors for a part-time QA project focused on autonomous AI agents. This flexible remote opportunity requires strong analytical... ...familiarity with structured data formats. Candidates will review evaluation tasks, identify inconsistencies, and help define expected AI...Remote jobPart timeFlexible hours$80 per hour
Mindrift is seeking a Senior Python Developer for a freelance project in the United States, Missouri. The role involves creating functional black box tests and managing Docker environments for effective testing. Candidates should have over 5 years of experience primarily...FreelanceRemote jobHourly pay- A leading AI consultancy firm is seeking legal consultants for project-based opportunities involving testing and improving AI systems. Candidates should have a degree in law and at least 2 years of legal experience within US jurisdiction. Strong written English is required...FreelancePermanent employmentRemote work10 hours per week
$40 per hour
A cybersecurity technology firm is seeking experienced professionals to evaluate AI-generated security content and solve cybersecurity problems. Candidates should have 2+ years in the field, coding experience, and strong writing skills. This position is remote, offering...Remote jobHourly payFlexible hours$80 per hour
...innovation company is seeking experienced Python engineers for a remote part-time freelance project. You will focus on developing Model Context... ...while working with cross-functional teams to enhance agent behavior evaluation. Ideal candidates should possess 4+ years in...FreelanceRemote jobPart time$55 per hour
An innovative tech company seeks a Freelance Civil Engineering Expert to contribute to AI projects from anywhere. As part of the team, you will generate prompts for AIs, evaluate responses, and collaborate on specialized challenges in engineering. This part-time role offers...FreelanceRemote jobHourly payPart timeFlexible hours- Qode is looking for a skilled AI / Agent Engineer to build and operate intelligent systems for their fintech platform. You'll develop workflows, integrate feature management tools, and maintain observability for AI systems. The ideal candidate has at least 8 years of software...
$35 - $75 per hour
...~ Prior experience with data annotation, data quality, or evaluation systems Why Join Us: Competitive pay and flexible... ...cutting-edge digital health and data-driven care models. Freelance perks: autonomy, flexibility, and global collaboration. Potential...FreelanceHourly payContract workRemote workFlexible hours$60 per hour
...proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What This Opportunity Involves Generate prompts...FreelancePermanent employmentTemporary workPart time10 hours per week$18 per hour
...opportunities for career progression SUMMARY OVERVIEW The Dock Agent is the information epicenter of the organization. This position... ...employer. All applicants and employees are considered and evaluated for positions at PrimeFlight Aviation Service, Inc. without...Hourly payWork at officeLocal areaFlexible hoursNight shift- ...actionable insights - helping train and evaluate cutting-edge AI models at the frontier... ...functionally with clinical, research, and engineering teams in regulated or highly technical... ...- work when and where it suits you Freelance autonomy with the structure of...FreelanceHourly payOngoing contractContract workRemote workWorldwideFlexible hours
- ...Description Job Description Foresite is looking for a Customer Engineer to be the mastermind behind our clients' security... ...concept (POCs), and demos. Continuous Optimization: Research and evaluate new technologies to optimize security architectures and contribute...Temporary work
- ...bedside. We're looking for Nursing Informatics Specialists to help evaluate and improve AI systems being trained on healthcare and... ...Fully remote and flexible - work when and where it suits you Freelance perks: autonomy, variety, and the chance to collaborate globally...FreelanceHourly payOngoing contractContract workRemote workFlexible hours
- ...experienced health data governance professionals to help train and evaluate cutting-edge AI models - ensuring they understand the... ...Fully remote and flexible - work when and where it suits you Freelance autonomy with the structure of meaningful, expert-driven work...FreelanceHourly payOngoing contractContract workRemote workFlexible hours
- ...re looking for experienced Health Informatics Analysts to help evaluate and improve AI systems being developed for healthcare... ...remote and flexible — work on your own schedule, anywhere Freelance perks: autonomy, variety, and global collaboration Apply your...FreelanceHourly payOngoing contractContract workRemote workFlexible hours
$23 per hour
..., we connect smart, curious people from around the world with freelance online tasks that train and improve artificial intelligence.... ...part in online projects such as rating AI-generated content, evaluating factual accuracy, or comparing responses — when projects are available...FreelancePart timeRemote work$73 per hour
...tasks that push frontier AI agents to their limits. Think scattered... ...in a part‑time, remote, freelance project that fits around your... ...exposure to LLMs, prompt engineering, or AI‑generated content with... ...understanding of how scoring or evaluation works in agent testing (...FreelancePart timeRemote work- ...Have Prior experience with data annotation, data quality evaluation, or AI training data pipelines Familiarity with frameworks... ...structure your work around your life, not the other way around Freelance autonomy with the substance of genuinely meaningful, high-...FreelanceHourly payOngoing contractContract workRemote workFlexible hours
$17 per hour
...Freight/Warehouse Agent- Kansas City International Airport (MCI) Job Category: Airport Operations Supervisor: Victor Nforgwei Requisition... ...employer. All applicants and employees are considered and evaluated for positions at PrimeFlight Aviation Service, Inc. without...Hourly payFull timePart timeLocal areaFlexible hoursNight shift- ...and organizational efficiency Apply your domain expertise to evaluate and improve how AI models interpret and reason about clinical... ...remote and flexible - work when and where it suits you Freelance autonomy with the structure and purpose of meaningful, high-impact...FreelanceHourly payOngoing contractContract workRemote workFlexible hours
- Work at Home Contact Center Agent (Full-Time & Part-Time) (MO) Join to apply for the Work at Home Contact Center Agent (Full-Time &... ...to maintain regular attendance and punctuality The ability to evaluate, troubleshoot, and follow-up on customer issues An aptitude for...Hourly payFull timeContract workTemporary workPart timeCasual workWork at officeLocal areaRemote workWork from homeFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Freelance Agent Evaluation Engineer. Be the first to apply!
- freight agent no experience Kansas City, MO
- state farm agent Kansas City, MO
- work from home chat agent Kansas City, MO
- freight broker agent Kansas City, MO
- special agent Kansas City, MO
- fbi agent Kansas City, MO
- commissioning agent Kansas City, MO
- executive protection agent Kansas City, MO
- cruise agent Kansas City, MO
- telemarketer - state farm agent team member Kansas City, MO


