Remote AI QA Trainer: LLM Evaluation & Reliability

Invisible Expert Marketplace

Remote job

A technology consulting firm in Canada seeks an AI QA Trainer for LLM Evaluation. This remote role involves evaluating large-scale language models through testing and improvement processes. The ideal candidate will have experience in QA for ML systems, strong programming skills, and a solid educational background in computer science or a related field. Pay ranges from $6–$65 per hour, depending on experience. #J-18808-Ljbffr Invisible Expert Marketplace

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the Remote AI QA Trainer: LLM Evaluation & Reliability in New York, NY vacancy

AI QA Trainer - LLM Evaluation - Freelance Project
AI QA Trainer - LLM Evaluation Are you an AI QA expert eager to shape the future of AI? Large-scale language... ...to harden model reasoning and reliability. Responsibilities On a typical day... ...Employment Type Contract Workplace Remote Seniority Level Mid‑Senior Level #J...
Remote work
Hourly pay
Contract work
For contractors
Freelance
Invisible Expert Marketplace
New York, NY
1 day ago
Remote AI QA Trainer for LLM Evaluation
A leading AI evaluation firm in the United Kingdom is seeking an AI QA Trainer to enhance the reliability of large-scale language models. This mid-senior level contractor role involves designing and executing test plans while ensuring factual accuracy in model assessments...
Remote job
Hourly pay
For contractors
Invisible Expert Marketplace
New York, NY
1 day ago
Senior LLM Evaluation Data Scientist - Remote
Driverai is seeking an Applied Data Scientist with expertise in LLM evaluation to join its innovative team in Austin, TX. This role focuses on building the evaluation function from scratch and requires a strong background in statistics and machine learning. The successful...
Remote job
Driverai
Austin, TX
1 day ago
Applied Data Scientist, LLM Evaluation United States (Remote) View Role
$175k - $275k
Full-Time in Austin, TX Remote (any location) - Senior - Product & Engineering - $175k - $275k Applied Data Scientist, LLM Evaluation Introduction At Driver, we’re building systems that turn... ...the context layer for employees and AI agents alike to use in developing software...
Remote job
Full time
Flexible hours
Driverai
Austin, TX
1 day ago
Finance AI Trainer & Evaluator (Remote)
$50 - $60 per hour
...committed to creating high-quality AI. We are looking for a Sales &... ...enjoying the flexibility of remote work and the freedom to set... ...version of the AI smarter and more reliable. To succeed in this... ...diverse and complex problems and evaluate their outputs Evaluate the...
Remote work
Hourly pay
Full time
Contract work
Part time
Work experience placement
Flexible hours
DataAnnotation
Brooklyn, NY
4 days ago
AI Trainer and Evaluator
...Supporting AI data and language projects, the hourly contractor AI Trainer and Evaluator will work remotely on a flexible basis, focusing on content generation, data annotation,... ...in fact-checking localized content using reliable sources Reliable, self-directed, and able...
Remote work
Hourly pay
For contractors
Flexible hours
Virtual Vocations Inc
United States
2 days ago
FP&A Evaluator & AI Trainer - Finance Research
$140k
...a leading foundational model AI lab. You are a good fit if you... ...positions. $15.00 hourly Remote Part‑time +1 Product Testers... ...0 hourly Remote Part‑time +1 Evaluators in Finance Operations / Audit... ...and slide decks) for accuracy, reliability, and compliance. $100,000.00...
Remote work
Hourly pay
Full time
Part time
For contractors
Work from home
Mercor Inc
Murrieta, CA
1 day ago
Remote AI Data Annotator & LLM Trainer
...Background** Datatricks AI specializes in building... ...Large Language Model (LLM) training. Our global network... ...to technical spatial evaluation. **Roles and... ...complex instructions. * A reliable high-speed internet connection... .... **Applicant’s Location** Remote United States...
Remote work
Hourly pay
Long term contract
Contract work
Freelance
Flexible hours
Datatricks
Toccoa, GA
a month ago
Remote Basque Audio Generalist Evaluator Expert - AI Trainer ($50-$50 per hour)
$50 per hour
...seeking a Basque Audio Generalist Evaluator Expert to contribute to a high-impact audio AI research project with a leading... ...Quality Assurance Participate in QA and review cycles to ensure tasks... ...quality bar. Maintain consistency and reliability before datasets are integrated...
Remote job
Hourly pay
Temporary work
10 hours per week
Mercor
Denton, TX
4 days ago
Dutch AI Trainer & Evaluator (Remote)
$30 per hour
...Prolific is seeking Advanced Dutch Speakers in Chicago, IL to train AI models. You will complete AI tasks and assess AI performance in... ...home with competitive rates and direct payment through PayPal. Pass the evaluation, and you can start within 15 minutes. #J-18808-Ljbffr...
Remote work
Work from home
Prolific
Chicago, IL
2 days ago
Staff Data Scientist, LLM Evaluation & QA Architect
...is seeking a qualified candidate for a role focused on improving answer quality across Perplexity's products. You will architect evaluation pipelines, design methods to measure tool impact on answers, and develop visual evaluation solutions. Ideal candidates should possess...
Gravity Engineering Services Pvt Ltd.
San Francisco, CA
2 days ago
English (U.S. Native) AI Trainer & Evaluator (Remote, Hourly Contractor)
...Summary This is a fully remote, hourly contractor role supporting AI data and language... ...AI training datasets. LLM evaluation: reviewing AI-generated... ...appropriateness. Localization QA: ensuring terminology,... ..., names, dates) using reliable sources and consistent...
Remote work
Hourly pay
Temporary work
For contractors
Flexible hours
CNTXT AI
Brooklyn, NY
1 day ago
Remote Physics AI Trainer for Chatbot Evaluation
$40 per hour
DataAnnotation is seeking a Physics AI Training Specialist in Maryland to train AI models by measuring their progress and evaluating logic. This role allows you to work from home on your own schedule, with hourly pay starting at $40+. A strong understanding of physics...
Remote job
Hourly pay
Work from home
DataAnnotation
Annapolis, MD
1 day ago
Remote Mandarin AI Trainer & Model Evaluator
$30 per hour
...A prominent AI data company is seeking Advanced Mandarin Speakers for evaluating AI models and completing training tasks. This remote role offers competitive pay of $30/hr for tasks requiring one hour of focused work. Candidates should be fluent in Mandarin with the ability...
Remote work
Work from home
Flexible hours
Prolific - UK Job Board?
New York, NY
1 day ago
Biology AI QA Trainer - Remote & Flexible Hours
$40 per hour
DataAnnotation is seeking a Biology Instructor to train AI models while working from home. This role requires deep expertise in biology, with responsibilities including evaluating the logic of AI chatbots and assessing the quality of their outputs. You can choose your projects...
Remote job
Hourly pay
Work from home
Flexible hours
DataAnnotation
New York, NY
1 day ago
Mechanical Engineering QA Specialist - AI Trainer
$90 per hour
...technical talent with leading AI research labs.... ...0/hour Location: Remote Role Responsibilities... ...feedback. Apply consistent evaluation standards across a... ...ensure consistency and reliability. Contribute to... ...grading, tutoring, or QA. Application Process...
Remote work
Contract work
Summer work
Mercor
Detroit, MI
29 days ago
AI Evaluation Engineer
$180k
...professionals towards their dream careers AI Evaluation Engineer $180,000 Remote (US-based) Are you passionate... ...shaping how AI is deployed safely, reliably, and at scale? This is a rare... ...and guardrails that make advanced LLM features safe to ship. You’ll collaborate...
Remote work
Full time
DeepRec.ai
Denver, CO
3 days ago
Remote AI Chatbot Trainer & Quality Evaluator
$20 per hour
...technology company is seeking a Customer Support Specialist to train AI models and evaluate chatbot outputs. The ideal candidate will have native-level... ...$20 per hour, with bonuses for high-quality work. This is a REMOTE position open only to applicants in the United States. #J-18...
Remote work
Hourly pay
Flexible hours
DataAnnotation
Providence, RI
3 days ago
Biology AI Trainer & Evaluator (Remote)
$40 per hour
...for a Biology Expert to join our team to train AI models. You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the... ...Benefits This is a full-time or part-time REMOTE position You’ll be able to choose which projects...
Remote work
Hourly pay
Full time
Contract work
Part time
DataAnnotation
Santa Fe, NM
4 days ago
Data Scientist - LLM Evaluation & Survey Design
$141.8k - $258.6k
...multimodal capabilities. You will design and manage data annotation processes, work with ML Engineers, and develop LLM auto-judges for AI model evaluation. The ideal candidate has a BA/Master’s in a relevant field and at least 2 years of experience in survey design and...
Apple Inc.
Cupertino, CA
1 day ago
Biology AI Trainer & Evaluator (Remote)
$40 per hour
A technology company specializing in AI training is looking for a Biology Expert based... ...providing complex biology questions and evaluating their maturity and performance. Candidates... .... This flexible position is offered remotely with project-based hourly pay starting at...
Remote work
Hourly pay
Flexible hours
DataAnnotation
New York, NY
4 days ago
Nurse AI Trainer & Evaluator Remote, Flexible Hours
...Prolific is seeking Registered Nurses for the role of AI Trainer. In this capacity, you will evaluate AI-generated clinical responses for accuracy and appropriateness. Candidates should possess a valid nursing license, clinical experience, and the ability to focus on complex...
Remote work
Work from home
Flexible hours
Prolific
New York, NY
2 days ago
Remote Bio AI Trainer & Evaluator — Flexible Hours
$60 per hour
Prolific is looking for Biology Experts and Life Science Professionals in Las Vegas, NV to evaluate AI-generated science. Successful candidates will work flexibly from home, earning up to $60 per hour for paid tasks. Responsibilities include reviewing scientific data accuracy...
Remote job
Hourly pay
Work from home
Flexible hours
Prolific
Las Vegas, NV
1 day ago
Quantitative AI Trainer & Evaluation Scientist
$60 per hour
A leading AI development firm is currently seeking experienced quantitative professionals to join their team. This fully remote position allows you to evaluate AI-generated quantitative analyses and solve complex problems while enjoying a flexible schedule. Ideal candidates...
Remote work
Flexible hours
DataAnnotation
Denver, CO
3 days ago
Clinical Evaluator - Domain Expert - AI Trainer
$80 - $120 per hour
...Position: Clinical / biomedical / pharma Evaluator Type: Contract Compensation: $80–$120/hour Location: Remote Role Responsibilities Evaluate AI-generated artifacts against domain‑specific quality rubrics. Identify factual, aesthetic, and presentation errors in documents...
Remote work
Contract work
Work at office
Mercor Inc
San Francisco, CA
19 hours ago
Biology AI QA Trainer - Remote & Flexible Hours
DataAnnotation is seeking an experienced Biology Instructor to help train AI models by evaluating their logic and outputs. This role allows you to work independently on your own schedule from anywhere in the United States. Ideal candidates will have a background in Biology...
Remote job
Hourly pay
Flexible hours
DataAnnotation
Denver, CO
1 day ago
Remote Android AI Trainer & Code Evaluator
$60 per hour
...A leading technology platform in the United States seeks proficient programmers for remote work in AI development. You'll design coding problems, write high-quality code, and evaluate AI-generated projects. Preferred qualifications include fluency in English, experience...
Remote work
DataAnnotation
Topeka, KS
3 days ago
Remote Android AI Trainer & Code Evaluator
$60 per hour
...A cutting-edge AI development firm is looking for proficient programmers to work remotely on AI projects with flexible scheduling. As a member of this dynamic coding... ...coding problems, write high-quality code, and evaluate AI-generated outputs. Ideal candidates will be...
Remote work
Flexible hours
DataAnnotation
Springfield, IL
4 days ago
Quantitative AI Trainer & Evaluation Scientist
$60 per hour
A leading AI development firm is seeking experienced quantitative professionals to assist in advancing AI development. This remote role involves evaluating AI-generated analyses and providing critical feedback to shape future models. Ideal candidates have over 2 years of...
Remote work
Flexible hours
DataAnnotation
Nashville, TN
3 days ago
Remote Android AI Trainer & Code Evaluator
...A tech-driven AI development firm is looking for proficient programmers to join their virtual coding team. You'll work on diverse... ...challenges related to AI, including mobile app development and code evaluations. This position offers competitive pay, a flexible schedule, and...
Remote work
Flexible hours
DataAnnotation
Bismarck, ND
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Remote AI QA Trainer: LLM Evaluation & Reliability. Be the first to apply!