AI QA Trainer - LLM Evaluation - Freelance Project
Invisible Expert Marketplace
Are you an AI QA expert eager to shape the future of AI? Large-scale language models are evolving from clever chatbots into enterprise-grade platforms. With rigorous evaluation data, tomorrow’s AI can democratize world-class education, keep pace with cutting-edge research, and streamline workflows for teams everywhere. That quality begins with you: we need your expertise to harden model reasoning and reliability.

Responsibilities
On a typical day you will converse with the model on real-world scenarios and evaluation prompts, verify factual accuracy and logical soundness, design and run test plans and regression suites, build clear rubrics and pass/fail criteria, capture reproducible error traces with root-cause hypotheses, and suggest improvements to prompt engineering, guardrails, and evaluation metrics (e.g., precision/recall, faithfulness, toxicity, and latency SLOs). You’ll also partner on adversarial red-teaming, automation (Python/SQL), and dashboarding to track quality deltas over time.

Qualifications
A bachelor’s, master’s, or PhD in computer science, data science, computational linguistics, statistics, or a related field is ideal. Shipped QA for ML/AI systems, safety/red-team experience, test automation frameworks (e.g., PyTest), and hands-on work with LLM eval tooling (e.g., OpenAI Evals, RAG evaluators, W&B) signal fit. Skills that stand out include evaluation rubric design, adversarial testing/red-teaming, regression testing at scale, bias/fairness auditing, grounding verification, prompt and system-prompt engineering, test automation (Python/SQL), and high-signal bug reporting. Clear, metacognitive communication (“showing your work”) is essential.

Pay & Benefits
We offer a pay range of $6 to $65 per hour, with the exact rate determined after evaluating your experience, expertise, and geographic location. Final offer amounts may vary from the pay range listed above. As a contractor you’ll supply a secure computer and high-speed internet; company-sponsored benefits such as health insurance and PTO do not apply.

Employment Type: Contract
Workplace: Remote
Seniority Level: Mid-Senior Level

