Remote AI QA Trainer: LLM Evaluation & Reliability
Invisible Expert Marketplace
- Remote job
A technology consulting firm in Canada seeks an AI QA Trainer for LLM Evaluation. This remote role involves evaluating large-scale language models through testing and improvement processes. The ideal candidate will have experience in QA for ML systems, strong programming skills, and a solid educational background in computer science or a related field. Pay ranges from $6–$65 per hour, depending on experience. #J-18808-Ljbffr Invisible Expert Marketplace
- AI QA Trainer - LLM Evaluation Are you an AI QA expert eager to shape the future of AI? Large-scale language... ...to harden model reasoning and reliability. Responsibilities On a typical day... ...Employment Type Contract Workplace Remote Seniority Level Mid‑Senior Level #J...Remote workHourly payContract workFor contractorsFreelance
- A leading AI evaluation firm in the United Kingdom is seeking an AI QA Trainer to enhance the reliability of large-scale language models. This mid-senior level contractor role involves designing and executing test plans while ensuring factual accuracy in model assessments...Remote jobHourly payFor contractors
- Invisible Agency is looking for an AI QA Trainer for a freelance project. The role involves evaluating language models by detecting hallucinations, assessing factual... ...with QA for ML/AI systems. The position is remote, with a pay range of $6 to $65 per hour, depending...Remote jobHourly payContract workFreelance
$80 - $120 per hour
...creative and technical talent with leading AI research labs. Headquartered in San... .... Position: Incident management / reliability / SRE Evaluator Type: Contract Compensation: $80–$120/hour Location: Remote Role Responsibilities Evaluate AI-...Remote workContract workSummer workWork at office- ...Applied Data Scientist, LLM Evaluation Introduction At Driver, we're building systems... ...builds the context layer for employees and AI agents alike to use in developing... ...Scientist, LLM Evaluation Location: Remote or Austin, Tx Our value is directly tied...Remote workFlexible hours
$40 per hour
...seeking experienced Python Developers to join as AI Trainers in Boston. This remote role involves training and evaluating AI models, requiring strong Python skills and attention to detail. Candidates must have reliable internet access and complete a skills verification...Remote jobWork from home- CNTXT AI is seeking native American English speakers for a fully remote contractor role in AI data and language projects. Responsibilities include content generation, data annotation, LLM evaluation, and localization QA. Candidates must exhibit excellent editorial judgment...Remote jobFor contractors
$30 per hour
...Advanced Arabic Speakers to participate in training and evaluating AI models remotely. This role involves analyzing, editing, and writing tasks... ...extended periods. Additionally, candidates should have a reliable internet connection and a PayPal account to receive payments...Remote jobHourly pay- ...Background** Datatricks AI specializes in building... ...Large Language Model (LLM) training. Our global network... ...to technical spatial evaluation. **Roles and... ...complex instructions. * A reliable high-speed internet connection... .... **Applicant’s Location** Remote United States...Remote workHourly payLong term contractContract workFreelanceFlexible hours
$50 per hour
...seeking a Cantonese Audio Generalist Evaluator Expert to contribute to a high-impact audio AI research project with a leading... ...Assurance Participate in QA and review cycles to ensure tasks... ...quality bar. Maintain consistency and reliability before datasets are integrated...Remote jobHourly payTemporary work10 hours per week- ...Summary This is a fully remote, hourly contractor role supporting AI data and language... ...AI training datasets. LLM evaluation: reviewing AI-generated... ...appropriateness. Localization QA: ensuring terminology,... ..., names, dates) using reliable sources and consistent...Remote workHourly payTemporary workFor contractorsFlexible hours
$125k - $170k
...AI Data Scientist, Evaluation & Insights Join to apply for the AI Data Scientist... ...improvement of our agentic and ML or LLM-based systems through data-... ...this is possible without reliable and accurate data. This... ...Engineer, AI (FULLY REMOTE) Seattle, WA $176,600.00...Remote workFull timeContract work$166.8k
...and the world. The AI and Data Analytics Division... ...strategies) and evaluation (T&E, robustness) for key... ...data modalities such as remote sensing imagery, sensor... ...Hands on experience with LLM/LVM/Foundation Model... ...determine trustworthiness, reliability, and loyalty to the...Remote workFor contractorsWork experience placementWork at officeLocal areaRelocation packageFlexible hours$20 - $50 per hour
TELUS Digital is looking for a data scientist to ensure data quality through diagnostic testing and evaluation. You will architect frameworks, perform data-wrangling for training sets, and visualize trends in model failures. Required qualifications include a Bachelor’s...Remote jobHourly payFlexible hours$60 per hour
A leading AI development firm seeks experienced quantitative professionals to evaluate AI-generated work and design problems for AI systems. The role offers the flexibility of remote work, allowing you to choose your projects and schedule. Ideal candidates have a background...Remote workHourly pay$40 per hour
A cybersecurity company is seeking experienced professionals to evaluate AI security content and solve technical problems. This role allows you to work remotely and choose your projects with flexibility in scheduling. Ideal candidates should have at least 2 years of hands...Remote jobHourly pay$60 per hour
A leading AI development company is seeking experienced quantitative professionals to evaluate and improve AI-generated analyses. This fully remote role allows you to set your own schedule and work from various countries including the US, Canada, and Europe. Qualified candidates...Remote work$60 per hour
A leading AI development firm is seeking experienced quantitative professionals to assist in advancing AI development. This remote role involves evaluating AI-generated analyses and providing critical feedback to shape future models. Ideal candidates have over 2 years of...Remote workFlexible hours- ...is seeking a qualified candidate for a role focused on improving answer quality across Perplexity's products. You will architect evaluation pipelines, design methods to measure tool impact on answers, and develop visual evaluation solutions. Ideal candidates should possess...
$60 per hour
An innovative AI development company is looking for quantitative professionals to evaluate AI-generated analyses and solve complex problems. Successful candidates will have... ...like data science or economics. This fully remote role offers competitive hourly pay up to $60,...Remote workHourly pay- ...SIEMENS NX Electro-Mechanical CAD & Design Automation Engineer(Remote) Overath, North Rhine-Westphalia, Germany Actively Hiring 1... ...Westphalia, Germany Actively Hiring 2 weeks ago Maintenance & Reliability Engineer (m/w/d) Minden, North Rhine-Westphalia, Germany 4 days...Remote job
$60 per hour
A leading AI development firm is currently seeking experienced quantitative professionals to join their team. This fully remote position allows you to evaluate AI-generated quantitative analyses and solve complex problems while enjoying a flexible schedule. Ideal candidates...Remote workFlexible hours$20 per hour
...company is seeking a Retail Support Associate to help train AI models. This role involves evaluating chatbot performance and content while maintaining high... ...position is flexible, offering full-time or part-time remote work with hourly pay starting at $20. Applicants must...Remote jobHourly payFull timePart timeFlexible hours$70 per hour
...connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our... ...Contract Compensation: $70/hour Location: Remote Role Responsibilities Evaluate AI-generated responses to ensure accuracy and depth in...Remote workContract workSummer work- ...Title: I Engineer - NLP/LLM Data Specialist... ...Location: Houston, Texas - Remote Duration: 6 months... ...for an experienced AI Engineer specializing... ...Technology Selection: Evaluate and recommend AI... ...to ensure accuracy and reliability. Implementation...Remote work
- YO IT Consulting is looking for a Hebrew AI Data Trainer to join their team remotely. The ideal candidate will evaluate AI-generated responses in Hebrew and English, ensuring accuracy and clarity. This role demands a Bachelor’s degree in a relevant field and native or...Remote job
- Innodata Inc. is seeking a freelance AI Trainer to create and evaluate training data for AI models. The role, performed remotely, requires a Bachelor's degree in a relevant field and strong communication skills. Responsibilities include annotating data, reviewing AI outputs...Remote jobHourly payTemporary workFreelance
- YO IT Consulting is looking for a Hebrew AI Data Trainer, a remote contractor role focused on enhancing the quality and accuracy of AI systems. Responsibilities include evaluating AI responses in Hebrew and English, ensuring clarity, correctness, and logical reasoning....Remote jobFor contractors
- Hebrew AI Data Trainer - Remote Location: Remote This is a fully remote... ...AI systems. You will evaluate AI‑generated responses... ...information using reliable sources when required... ...localisation, editorial QA, linguistic review, or... ...in AI training data, LLM evaluation, or prompt...Remote jobHourly payFor contractors
$50 - $75 per hour
A leading AI development company seeks proficient programmers for a fully remote role focused on advancing AI systems. You'll design and solve coding problems, write high-quality code, and evaluate AI-generated content. Candidates should be fluent in English and ideally...Remote jobHourly payFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Remote AI QA Trainer: LLM Evaluation & Reliability. Be the first to apply!
- ai trainer New York, NY
- remote education consultant New York, NY
- remote nonprofit New York, NY
- remote financial analyst New York, NY
- remote virtual assistant New York, NY
- junior ux designer remote New York, NY
- remote real estate New York, NY
- remote design intern New York, NY
- remote hr assistant New York, NY
- remote legal internship New York, NY


