AI QA Trainer - LLM Evaluation - Freelance Project

Invisible Expert Marketplace

Are you an AI QA expert eager to shape the future of AI? Large-scale language models are evolving from clever chatbots into enterprise-grade platforms. With rigorous evaluation data, tomorrow’s AI can democratize world-class education, keep pace with cutting-edge research, and streamline workflows for teams everywhere. That quality begins with you: we need your expertise to harden model reasoning and reliability.

Responsibilities

On a typical day you will:
  • Converse with the model on real-world scenarios and evaluation prompts
  • Verify factual accuracy and logical soundness
  • Design and run test plans and regression suites
  • Build clear rubrics and pass/fail criteria
  • Capture reproducible error traces with root-cause hypotheses
  • Suggest improvements to prompt engineering, guardrails, and evaluation metrics (e.g., precision/recall, faithfulness, toxicity, and latency SLOs)

You’ll also partner on adversarial red-teaming, automation (Python/SQL), and dashboarding to track quality deltas over time.

Qualifications

A bachelor’s, master’s, or PhD in computer science, data science, computational linguistics, statistics, or a related field is ideal. Shipped QA for ML/AI systems, safety/red-team experience, test automation frameworks (e.g., PyTest), and hands-on work with LLM eval tooling (e.g., OpenAI Evals, RAG evaluators, W&B) signal fit.

Skills that stand out include evaluation rubric design, adversarial testing/red-teaming, regression testing at scale, bias/fairness auditing, grounding verification, prompt and system-prompt engineering, test automation (Python/SQL), and high-signal bug reporting. Clear, metacognitive communication (“showing your work”) is essential.

Pay & Benefits

We offer a pay range of $6 to $65 per hour, with the exact rate determined after evaluating your experience, expertise, and geographic location. Final offer amounts may vary from the pay range listed above.

As a contractor you’ll supply a secure computer and high-speed internet; company-sponsored benefits such as health insurance and PTO do not apply.

Employment Type: Contract
Workplace: Remote
Seniority Level: Mid-Senior Level

Vacancy posted 1 day ago