Remote AI Agent Evaluation Specialist

$80 per hour

Mindrift

Remote job

A leading tech company is seeking contributors for a flexible part-time opportunity focused on quality assurance for autonomous AI agents. You will analyze complex systems, review tasks for logic, and ensure clear expected behaviors for AI. Ideal candidates possess excellent analytical and detail-oriented skills, with experience in policy evaluation or logic puzzles preferred. Compensation can reach up to $80/hour based on expertise and project needs. This role offers valuable experience in an advanced AI project and fits around your primary commitments. #J-18808-Ljbffr Mindrift

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the Remote AI Agent Evaluation Specialist in Raleigh, NC vacancy

QGIS Specialist for AI Research Evaluation
$125 per hour
...Role Overview QGIS Specialists apply QGIS and spatial analysis expertise to evaluate AI-generated GIS outputs, test spatial data workflows, and provide detailed feedback... ...full-time employment position. Work is remote and asynchronous, so you can complete tasks independently...
Remote work
Hourly pay
Full time
Contract work
Part time
Flexible hours
SaidGig
United States
7 days ago
Remote AI Agent Evaluation Specialist
$60 per hour
...contributors for a part-time QA project focused on autonomous AI agents. This flexible remote opportunity requires strong analytical and critical... ...with structured data formats. Candidates will review evaluation tasks, identify inconsistencies, and help define expected...
Remote work
Part time
Flexible hours
Mind Rift
Kansas City, MO
4 days ago
Remote AI Agent Evaluation Specialist
$80 per hour
A technology firm is seeking QAs for autonomous AI agents to validate and improve task structures within a new project. Candidates will... ...thinkers with strong attention to detail and experience in evaluating scenarios. This flexible, project-based role offers competitive...
Remote work
Flexible hours
Mind Rift
Providence, RI
3 days ago
Remote AI Agent Evaluation Specialist
$80 per hour
...-time opportunity focused on quality assurance for autonomous AI agents. You will analyze complex systems, review tasks for logic, and... ...analytical and detail-oriented skills, with experience in policy evaluation or logic puzzles preferred. Compensation can reach up to $80/...
Remote work
Part time
Flexible hours
Mind Rift
Raleigh, NC
3 days ago
Computer Vision Specialist for AI Model Evaluation
$80 - $110 per hour
...Overview Work on the forefront of generative AI by designing and executing real-world... ...executable tests where applicable, run evaluations against a target model, and analyze failures... ...with client teams. Fully remote work within the United States, with an expected...
Remote work
Hourly pay
Part time
Freelance
SaidGig
United States
12 days ago
Remote: AI Agent Evaluation Scenario Designer
...technology company is looking for a detail-oriented individual to design structured evaluation scenarios for AI agents. This entry-level part-time role allows you to contribute remotely, creating test cases for LLM-based agents. The ideal candidate will have a background...
Remote job
Part time
Mindrift
Brooklyn, NY
2 days ago
AI Agent Testing & Evaluation Scenario Architect
$80 per hour
A leading AI consultancy is seeking a detail-oriented individual to design evaluation scenarios for LLM-based agents. This part-time, remote role allows you to create structured test cases while working flexibly around your commitments. The ideal candidate holds a relevant...
Remote work
Part time
Mindrift
Houston, TX
1 day ago
Remote QA Analyst for Autonomous AI Agent Evaluation
$60 per hour
A leading AI firm in Austin is looking for QA experts to validate... ...and improve AI systems. This remote, freelance role requires... ...detail. Candidates will review AI evaluation tasks, identify inconsistencies... ...define expected behaviors for agents. Ideal applicants have experience...
Remote job
Freelance
Mindrift
Austin, TX
1 day ago
SWE Agent Evaluation Specialist
...SWE Agent Evaluation Specialist is a remote engineering review track for evaluating production code, debugging traces, and developer-facing AI outputs against real-world correctness standards. Reviewers reproduce failures, write the unit test the model should have written...
Remote job
Hourly pay
For contractors
10 hours per week
AuraOne Human Data
Remote
3 days ago
LLM Red Team Specialist for AI Model Evaluation
$60 - $90 per hour
...researchers to convert findings into robust evaluation benchmarks. Key Responsibilities... ...research, research engineering, security, or AI evaluation. Proven ability to identify... ...or task-based gig. Position is fully remote within the United States. Typical engagement...
Remote work
Hourly pay
Full time
Freelance
SaidGig
United States
3 days ago
Expert Codebase Evaluation Specialist - Coding Agent Review
...Expert Codebase Evaluation Specialist - Coding Agent Review is a remote evaluation track for reviewing codebase evaluation evaluation prompts and responses against... ...team can use to retrain. Why this role matters AI data reviewers help turn codebase evaluation...
Remote job
Hourly pay
For contractors
10 hours per week
AuraOne Human Data
Remote
2 days ago
Image Evaluation Specialist for AI Training
$20 - $30 per hour
...Role Overview Evaluate images to help train next generation AI systems by providing high quality, real world assessments that shape how models learn and reason. This remote, contractor role focuses on domain knowledge, visual judgment, and clear written reasoning. No...
Remote job
Hourly pay
For contractors
SaidGig
Indiana
1 day ago
Cybersecurity Specialist for AI Security Evaluation
$70 - $90 per hour
...content for security vulnerabilities to help AI models recognize and classify threats.... ...in English. Work Terms Location: Remote. Engagement type: Hourly contract.... ...a short interview and a questionnaire to evaluate domain expertise. If hired, onboarding...
Remote work
Hourly pay
Contract work
Temporary work
SaidGig
United Kingdom
4 days ago
Mathematics Specialist for AI Content Evaluation
$50 per hour
...Role Overview Math Specialists apply advanced mathematical training to evaluate AI-generated mathematics content, design domain-relevant questions, and give detailed... ...-based contract role that can be performed remotely alongside research, teaching, coursework, or industry...
Remote work
Hourly pay
Full time
Contract work
Part time
For contractors
Flexible hours
SaidGig
United States
2 days ago
Excel Specialist - AI Workflow Evaluator
...Excel Specialist - AI Workflow Evaluator is a remote review track for evaluating AI outputs across excel specialist operations workflows. Reviewers grade workflow correctness, policy adherence, and stakeholder fit; flag operational risk; and document the right next step...
Remote job
Hourly pay
For contractors
Work experience placement
10 hours per week
AuraOne Human Data
Remote
2 days ago
Energy Project Estimation Specialist for AI Evaluation
$85 per hour
...estimation, construction planning, and solar design experience to evaluate AI-generated content and create expert-level training data. In... ..., and ability to work asynchronously and independently with remote research teams. No prior AI experience required. Work Terms...
Remote work
Hourly pay
Contract work
Part time
Flexible hours
SaidGig
United States
5 days ago
Cybersecurity Specialist - AI Systems
$1,750 - $2,150 per month
...connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our... ...,750–$2,150 per completed task Location: Remote Role Responsibilities Review and evaluate AI-generated outputs related to threat analysis,...
Remote work
Hourly pay
Full time
Contract work
Summer work
Mercor
Remote
13 hours ago
Arabic Transcription Specialist for AI Training
$10 - $20 per hour
...quality language data that trains and improves AI systems. This contractor role focuses on... ...comfortable working independently in a remote environment. Preferred Qualifications... ...data lab that creates training data and evaluations for frontier AI models. Experts...
Remote work
Hourly pay
For contractors
SaidGig
United States
1 day ago
CAD Tool Specialist for AI Research
$125 per hour
...Role Overview CAD Tool Specialist applies CAD software expertise to AI research projects, providing domain knowledge... ...design workflows. You will work remotely and asynchronously on short-term,... ...for CAD workflows and tooling. Evaluate and score large language model responses...
Remote work
Hourly pay
Full time
Temporary work
Part time
Work experience placement
Flexible hours
SaidGig
United States
5 days ago
Process Improvement Specialist - Evaluator - AI Trainer
$80 - $120 per hour
...elite creative and technical talent with leading AI research labs. Headquartered in San Francisco... .... Position: Process improvement / SOPs Evaluator Type: Contract Compensation: $80–$120/hour Location: Remote Role Responsibilities Evaluate AI-...
Remote work
Contract work
Summer work
Work at office
Mercor
Houston, TX
8 days ago
Software Engineer, Forward Deployed Agent Builder
$152k - $240k
...control spend effortlessly. Brex’s AI-native automation and world-... ...’ll do We're building AI agents to automate and augment... ...four weeks per year of fully remote work! Responsibilities:... ..., and data sources. Define evaluation frameworks, success metrics, and...
Remote work
Full time
Work at office
Work from home
Brex Inc.
New York, NY
13 hours ago
Remote Vietnamese Voice Acting Specialist for AI Training
...is seeking a Vietnamese Voice Acting Specialist for a freelance AI Trainer project. The role is critical... ...emotional expression. This position is remote and designed for individuals with... ...strong voice acting credentials. You will evaluate AI outputs and support the...
Remote work
Hourly pay
Freelance
Meridial
New York, NY
2 days ago
Remote Certified Coding Specialist: Healthcare AI QA
$25 - $30 per hour
...DataAnnotation is seeking a Certified Coding Specialist (CCS) to join their team and help train AI models. This role requires expertise in healthcare to evaluate AI performance and improve model quality. Applicants should have fluency in English and a medical or healthcare...
Remote work
Hourly pay
For contractors
DataAnnotation
Boston, MA
2 days ago
Healthcare Specialist for AI Model Training
$15 - $25 per hour
...Role Overview Apply your clinical knowledge to evaluate and structure healthcare information that helps train next-generation AI systems. This part-time contractor role... ...Ability to multitask and work efficiently in a remote setting, with strong analytical and organizational...
Remote work
Hourly pay
Part time
For contractors
SaidGig
United States
1 day ago
Software Engineer AI Agents
$200k - $320k
...technical depth and a passion for AI-driven product development.... .... This is a full-time remote opportunity, with preference for... ...production-ready LLM pipelines and AI agent systems Develop AI-driven... ...new product initiatives Evaluate and recommend AI architectures...
Remote work
Full time
Visa sponsorship
Virco Talent
Remote
13 hours ago
Chemistry Specialist for AI Model Training
$70 - $90 per hour
...that help train next-generation AI systems for a leading customer... ...and research domain. This remote contractor role focuses on converting... ...integrity. Critically evaluate experimental outcomes and propose... ...information for non-specialist audiences. Contribute domain...
Remote job
Hourly pay
For contractors
SaidGig
Remote
1 day ago
Software Engineer III/Senior, Agent
$202.5k - $247.5k
...sharing localhost or running AI workloads in production. We... ...worth your time. About the Agent Team Our Agent team... ...AWS. Engineers develop by using remote development tools and/or ssh to... ...and actual compensation will be evaluated based on factors including,...
Remote work
Permanent employment
Full time
Work at office
Local area
Immediate start
Home office
Flexible hours
Ngrok
Remote
13 hours ago
Program Evaluation Specialist
$100k - $115k
...is seeking a detail-oriented Program Evaluation Specialist to support program evaluation, implementation... ...-grade platforms and mission-ready AI to federal agencies at commercial speed... ...security clearances, due to the nature of the work. Job Locations US-Remote
Remote work
Contract work
LMI
United States
1 day ago
Cybersecurity Specialist - AI Trainer
$60 - $90 per hour
...creative and technical talent with leading AI research labs. Headquartered in San... ...Compensation: $60–$90/hour Location: Remote Commitment: 15–40 hours/week... ...specialized cybersecurity topics. Evaluate and annotate model responses for technical...
Remote work
Full time
Contract work
Summer work
Immediate start
Mercor
Remote
13 hours ago
Remote Medical Doctor for AI Training & Evaluation
$80 - $150 per hour
...Prolific is seeking Medical Doctors to join as Domain Expert participants in training AI models. Responsibilities include evaluating AI responses for accuracy and writing feedback for improvement. A competitive pay of $80-$150 per hour based on skills is offered. Candidates...
Remote work
Hourly pay
Work from home
Prolific
Denver, CO
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Remote AI Agent Evaluation Specialist. Be the first to apply!