Remote AI QA Trainer: LLM Evaluation & Reliability

Invisible Expert Marketplace

New York, NY
  • Remote job

A technology consulting firm in Canada seeks an AI QA Trainer for LLM evaluation. This remote role involves evaluating large-scale language models through testing and improvement processes. The ideal candidate will have experience in QA for ML systems, strong programming skills, and a solid educational background in computer science or a related field. Pay ranges from $6 to $65 per hour, depending on experience.

Vacancy posted 3 days ago
Similar jobs that could be interesting for you
Based on the Remote AI QA Trainer: LLM Evaluation & Reliability vacancy in New York, NY
  • AI QA Trainer - LLM Evaluation Are you an AI QA expert eager to shape the future of AI? Large-scale language...  ...to harden model reasoning and reliability. Responsibilities On a typical day...  ...Employment Type Contract Workplace Remote Seniority Level Mid‑Senior Level #J... 
    Remote work
    Hourly pay
    Contract work
    For contractors
    Freelance

    Invisible Expert Marketplace

    New York, NY
    3 days ago
  • A leading AI evaluation firm in the United Kingdom is seeking an AI QA Trainer to enhance the reliability of large-scale language models. This mid-senior level contractor role involves designing and executing test plans while ensuring factual accuracy in model assessments... 
    Remote job
    Hourly pay
    For contractors

    Invisible Expert Marketplace

    New York, NY
    3 days ago
  • Invisible Agency is looking for an AI QA Trainer for a freelance project. The role involves evaluating language models by detecting hallucinations, assessing factual...  ...with QA for ML/AI systems. The position is remote, with a pay range of $6 to $65 per hour, depending... 
    Remote job
    Hourly pay
    Contract work
    Freelance

    Invisible Agency

    Austin, TX
    4 days ago
  • $80 - $120 per hour

     ...creative and technical talent with leading AI research labs. Headquartered in San...  .... Position: Incident management / reliability / SRE Evaluator Type: Contract Compensation: $80–$120/hour Location: Remote Role Responsibilities Evaluate AI-... 
    Remote work
    Contract work
    Summer work
    Work at office

    Mercor

    Chicago, IL
    3 days ago
  •  ...Applied Data Scientist, LLM Evaluation Introduction At Driver, we're building systems...  ...builds the context layer for employees and AI agents alike to use in developing...  ...Scientist, LLM Evaluation Location: Remote or Austin, TX Our value is directly tied... 
    Remote work
    Flexible hours

    Driver AI Inc.

    United States
    3 days ago
  • $40 per hour

     ...seeking experienced Python Developers to join as AI Trainers in Boston. This remote role involves training and evaluating AI models, requiring strong Python skills and attention to detail. Candidates must have reliable internet access and complete a skills verification... 
    Remote job
    Work from home

    Prolific Academic Ltd

    Boston, MA
    1 day ago
  • CNTXT AI is seeking native American English speakers for a fully remote contractor role in AI data and language projects. Responsibilities include content generation, data annotation, LLM evaluation, and localization QA. Candidates must exhibit excellent editorial judgment... 
    Remote job
    For contractors

    CNTXT AI

    New York, NY
    4 days ago
  • $30 per hour

     ...Advanced Arabic Speakers to participate in training and evaluating AI models remotely. This role involves analyzing, editing, and writing tasks...  ...extended periods. Additionally, candidates should have a reliable internet connection and a PayPal account to receive payments... 
    Remote job
    Hourly pay

    Prolific Academic Ltd

    New York, NY
    3 days ago
  •  ...Background** Datatricks AI specializes in building...  ...Large Language Model (LLM) training. Our global network...  ...to technical spatial evaluation. **Roles and...  ...complex instructions. * A reliable high-speed internet connection...  .... **Applicant’s Location** Remote United States... 
    Remote work
    Hourly pay
    Long term contract
    Contract work
    Freelance
    Flexible hours

    Datatricks

    Toccoa, GA
    7 days ago
  • $50 per hour

     ...seeking a Cantonese Audio Generalist Evaluator Expert to contribute to a high-impact audio AI research project with a leading...  ...Assurance Participate in QA and review cycles to ensure tasks...  ...quality bar. Maintain consistency and reliability before datasets are integrated... 
    Remote job
    Hourly pay
    Temporary work
    10 hours per week

    Mercor

    Santa Clarita, CA
    5 days ago
  •  ...Summary This is a fully remote, hourly contractor role supporting AI data and language...  ...AI training datasets. LLM evaluation: reviewing AI-generated...  ...appropriateness. Localization QA: ensuring terminology,...  ..., names, dates) using reliable sources and consistent... 
    Remote work
    Hourly pay
    Temporary work
    For contractors
    Flexible hours

    CNTXT AI

    Brooklyn, NY
    1 day ago
  • $125k - $170k

     ...AI Data Scientist, Evaluation & Insights Join to apply for the AI Data Scientist...  ...improvement of our agentic and ML or LLM-based systems through data-...  ...this is possible without reliable and accurate data. This...  ...Engineer, AI (FULLY REMOTE) Seattle, WA $176,600.00... 
    Remote work
    Full time
    Contract work

    IRONCLAD COMPANY

    Seattle, WA
    4 days ago
  • $166.8k

     ...and the world. The AI and Data Analytics Division...  ...strategies) and evaluation (T&E, robustness) for key...  ...data modalities such as remote sensing imagery, sensor...  ...Hands on experience with LLM/LVM/Foundation Model...  ...determine trustworthiness, reliability, and loyalty to the... 
    Remote work
    For contractors
    Work experience placement
    Work at office
    Local area
    Relocation package
    Flexible hours

    Pacific Northwest National Laboratory

    Seattle, WA
    2 days ago
  • $20 - $50 per hour

    TELUS Digital is looking for a data scientist to ensure data quality through diagnostic testing and evaluation. You will architect frameworks, perform data-wrangling for training sets, and visualize trends in model failures. Required qualifications include a Bachelor’s... 
    Remote job
    Hourly pay
    Flexible hours

    TELUS Digital

    New Bremen, OH
    3 days ago
  • $60 per hour

    A leading AI development firm seeks experienced quantitative professionals to evaluate AI-generated work and design problems for AI systems. The role offers the flexibility of remote work, allowing you to choose your projects and schedule. Ideal candidates have a background... 
    Remote work
    Hourly pay

    DataAnnotation

    Phoenix, AZ
    5 days ago
  • $40 per hour

    A cybersecurity company is seeking experienced professionals to evaluate AI security content and solve technical problems. This role allows you to work remotely and choose your projects with flexibility in scheduling. Ideal candidates should have at least 2 years of hands... 
    Remote job
    Hourly pay

    DataAnnotation

    Louisiana, MO
    3 days ago
  • $60 per hour

    A leading AI development company is seeking experienced quantitative professionals to evaluate and improve AI-generated analyses. This fully remote role allows you to set your own schedule and work from various countries including the US, Canada, and Europe. Qualified candidates... 
    Remote work

    DataAnnotation

    Brooklyn, NY
    5 days ago
  • $60 per hour

    A leading AI development firm is seeking experienced quantitative professionals to assist in advancing AI development. This remote role involves evaluating AI-generated analyses and providing critical feedback to shape future models. Ideal candidates have over 2 years of... 
    Remote work
    Flexible hours

    DataAnnotation

    Nashville, TN
    5 days ago
  •  ...is seeking a qualified candidate for a role focused on improving answer quality across Perplexity's products. You will architect evaluation pipelines, design methods to measure tool impact on answers, and develop visual evaluation solutions. Ideal candidates should possess... 

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    4 days ago
  • $60 per hour

    An innovative AI development company is looking for quantitative professionals to evaluate AI-generated analyses and solve complex problems. Successful candidates will have...  ...like data science or economics. This fully remote role offers competitive hourly pay up to $60,... 
    Remote work
    Hourly pay

    DataAnnotation

    New York, NY
    5 days ago
  •  ...SIEMENS NX Electro-Mechanical CAD & Design Automation Engineer(Remote) Overath, North Rhine-Westphalia, Germany Actively Hiring 1...  ...Westphalia, Germany Actively Hiring 2 weeks ago Maintenance & Reliability Engineer (m/w/d) Minden, North Rhine-Westphalia, Germany 4 days... 
    Remote job

    Mindrift

    New Bremen, OH
    2 days ago
  • $60 per hour

    A leading AI development firm is currently seeking experienced quantitative professionals to join their team. This fully remote position allows you to evaluate AI-generated quantitative analyses and solve complex problems while enjoying a flexible schedule. Ideal candidates... 
    Remote work
    Flexible hours

    DataAnnotation

    Denver, CO
    5 days ago
  • $20 per hour

     ...company is seeking a Retail Support Associate to help train AI models. This role involves evaluating chatbot performance and content while maintaining high...  ...position is flexible, offering full-time or part-time remote work with hourly pay starting at $20. Applicants must... 
    Remote job
    Hourly pay
    Full time
    Part time
    Flexible hours

    DataAnnotation

    New York, NY
    2 days ago
  • $70 per hour

     ...connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our...  ...Contract Compensation: $70/hour Location: Remote Role Responsibilities Evaluate AI-generated responses to ensure accuracy and depth in... 
    Remote work
    Contract work
    Summer work

    Mercor

    Boston, MA
    3 days ago
  •  ...Title: AI Engineer - NLP/LLM Data Specialist...  ...Location: Houston, Texas - Remote Duration: 6 months...  ...for an experienced AI Engineer specializing...  ...Technology Selection: Evaluate and recommend AI...  ...to ensure accuracy and reliability. Implementation... 
    Remote work

    Saviance

    Houston, TX
    4 days ago
  • YO IT Consulting is looking for a Hebrew AI Data Trainer to join their team remotely. The ideal candidate will evaluate AI-generated responses in Hebrew and English, ensuring accuracy and clarity. This role demands a Bachelor’s degree in a relevant field and native or... 
    Remote job

    YO IT Consulting

    New York, NY
    3 days ago
  • Innodata Inc. is seeking a freelance AI Trainer to create and evaluate training data for AI models. The role, performed remotely, requires a Bachelor's degree in a relevant field and strong communication skills. Responsibilities include annotating data, reviewing AI outputs... 
    Remote job
    Hourly pay
    Temporary work
    Freelance

    Innodata Inc.

    New Bremen, OH
    3 days ago
  • YO IT Consulting is looking for a Hebrew AI Data Trainer, a remote contractor role focused on enhancing the quality and accuracy of AI systems. Responsibilities include evaluating AI responses in Hebrew and English, ensuring clarity, correctness, and logical reasoning.... 
    Remote job
    For contractors

    YO IT Consulting

    Chicago, IL
    5 days ago
  • Hebrew AI Data Trainer - Remote Location: Remote This is a fully remote...  ...AI systems. You will evaluate AI‑generated responses...  ...information using reliable sources when required...  ...localisation, editorial QA, linguistic review, or...  ...in AI training data, LLM evaluation, or prompt... 
    Remote job
    Hourly pay
    For contractors

    YO IT Consulting

    Chicago, IL
    18 hours ago
  • $50 - $75 per hour

    A leading AI development company seeks proficient programmers for a fully remote role focused on advancing AI systems. You'll design and solve coding problems, write high-quality code, and evaluate AI-generated content. Candidates should be fluent in English and ideally... 
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    Florida, NY
    1 day ago
