Remote AI Agent Evaluation Specialist
$80 per hourMindrift
- Remote job
A leading tech company is seeking contributors for a flexible part-time opportunity focused on quality assurance for autonomous AI agents. You will analyze complex systems, review tasks for logic, and ensure clear expected behaviors for AI. Ideal candidates possess excellent analytical and detail-oriented skills, with experience in policy evaluation or logic puzzles preferred. Compensation can reach up to $80/hour based on expertise and project needs. This role offers valuable experience in an advanced AI project and fits around your primary commitments. #J-18808-Ljbffr Mindrift
$50 - $70 per hour
...management, agentic workflow evaluation, and structured technical QA.... ...supports current and upcoming remote consulting opportunities focused... ...migration journey evaluation, storage agent testing, transfer planning... ...Ability to evaluate AI-generated technical recommendations...Remote jobHourly payWeekly payJob sharingContract workPart timeFor contractorsFlexible hours$60 per hour
Prolific Academic Ltd is looking for Biology Experts and Life Science Professionals to join their Expert Network. This remote role involves evaluating AI-generated science, fact-checking technical claims, and ensuring ethical alignment in AI responses. Candidates should...Remote jobHourly payWork from home$60 per hour
Computer Sciences - Graduates - AI Training About Prolific Prolific is not just another player in the... ...re looking for We’re looking for Computer Science Specialists to join our Expert Network to help train and evaluate cutting‑edge AI models. If you have a background in...Remote jobWork from homeFlexible hours- YO IT Consulting is seeking a Visual Evaluation Specialist to apply expertise in evaluating visual content remotely. The role involves assessing quality, providing feedback for AI training, and working independently to manage evaluation tasks effectively. Candidates should...Remote jobFor contractors
- Alignerr is seeking a remote Chemistry Specialist with a Master's or PhD to design, solve, and evaluate complex chemistry problems that train AI models. You’ll play a crucial role in shaping AI’s understanding of science while working flexibly and autonomously. This contract...Remote jobContract work
$60 per hour
Prolific Academic Ltd is seeking Computer Science Specialists to join their Expert Network for AI training and evaluation. Candidates should ideally hold a BSc in... ...papers, and ensuring scientific integrity. This remote position offers competitive pay rates, flexibility...Remote jobHourly pay- YO IT Consulting is looking for a Visual Evaluation Specialist to evaluate and annotate visual content, enhancing AI training through high-quality input. Ideal candidates... .... This is a contract position allowing for remote work, requiring a commitment of at least 15 hours...Remote jobContract work
- .... Join Our Team Agentic AI Engineering Intern Engineering... ...Projects Data Center Projects Remote Controls, Security, Network, Compute... ..., Enterprise Systems & Agent Integrations Operational Excellence... ..., Agent Workflow Systems and Evaluation Operational Excellence...Remote workInternshipNight shift
- YO IT Consulting is seeking a Visual Evaluation Specialist to evaluate and annotate visual content for next-generation AI training. This remote position requires a commitment of at least 15 hours per week, leveraging your expertise in visual evaluation to enhance AI performance...Remote job
$80 per hour
A leading AI consultancy is seeking a detail-oriented individual to design evaluation scenarios for LLM-based agents. This part-time, remote role allows you to create structured test cases while working flexibly around your commitments. The ideal candidate holds a relevant...Remote workPart time- YO IT Consulting is looking for a Visual Evaluation Specialist to evaluate and annotate visual content as part of AI training initiatives. This remote role allows you to contribute your domain knowledge and expertise without requiring prior AI experience. Your responsibilities...Remote job
$60 per hour
...innovation company is seeking QAs for autonomous AI agents to validate and improve task structures and evaluate logic. The role requires excellent analytical thinking... .... Successful candidates can work flexibly and remotely, earning rates up to $60/hour. This position is...Remote jobFlexible hours- ...technology company is looking for a detail-oriented individual to design structured evaluation scenarios for AI agents. This entry-level part-time role allows you to contribute remotely, creating test cases for LLM-based agents. The ideal candidate will have a background...Remote jobPart time
$80 per hour
A technology firm is seeking QAs for autonomous AI agents to validate and improve task structures within a new project. Candidates will... ...thinkers with strong attention to detail and experience in evaluating scenarios. This flexible, project-based role offers competitive...Remote jobFlexible hours$60 per hour
...contributors for a part-time QA project focused on autonomous AI agents. This flexible remote opportunity requires strong analytical and critical... ...with structured data formats. Candidates will review evaluation tasks, identify inconsistencies, and help define expected...Remote jobPart timeFlexible hours$60 per hour
...ethically shape the future of AI. What We Do The Mindrift platform... ...thrive in ambiguity, enjoy remote asynchronous work, and want to... ...modern AI systems are tested and evaluated, we want to hear from you.... ...QA experts for autonomous AI agents in a project focused on validating...Remote workFreelanceFlexible hours$60 per hour
...Mindrift AI Coding Agent Evaluation Specialist Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What this...Remote workPermanent employmentTemporary work- A technology company based in Washington is looking for an Analyst to train AI models. The role involves providing complex mathematics problems to AI chatbots and evaluating their outputs for correctness. Candidates should possess expert mathematical reasoning skills with...Remote jobHourly payContract workFlexible hours
$80 per hour
A tech company specializing in AI is seeking QAs for autonomous AI agents to validate and enhance task structures... ...allows contributors to work remotely while engaging in a complex AI project... ...detail, facilitating AI testing and evaluation without needing a coding...Remote jobFlexible hours$60 per hour
A leading AI firm in Austin is looking for QA experts to validate... ...and improve AI systems. This remote, freelance role requires... ...detail. Candidates will review AI evaluation tasks, identify inconsistencies... ...define expected behaviors for agents. Ideal applicants have experience...Remote jobFreelance$55 per hour
A leading AI innovation firm is seeking QAs for autonomous AI agents to improve evaluation frameworks. Candidates should possess excellent analytical thinking and attention... ...tasks and define clear standards. This remote, flexible opportunity offers rates up to $55/...Remote jobPart timeFlexible hours- ...Alignerr, we partner with the world’s leading AI research teams and labs to build and... ...conversion is perfect. Technical Auditing: Evaluate AI-generated simulations and proofs for... ...Why Join Us Competitive pay and flexible remote work. Collaborate with a team working on...Remote workContract workFreelanceFlexible hours
$80 per hour
A leading AI consultancy in the United States is seeking Quality Assurance professionals to validate and improve autonomous AI agents. You will analyze complex systems, evaluate task structures, and ensure logical consistency through detailed reviews. Ideal candidates possess...Remote jobHourly payFlexible hours$55 per hour
A leading AI innovation firm in Dallas is seeking QAs for autonomous AI agents to ensure the quality of complex systems and scenarios. This flexible, remote project is ideal for those with excellent analytical... ...should be adept at evaluating scenarios and documenting findings...Remote jobFlexible hours$80 per hour
...ethically shape the future of AI. What We Do The Mindrift platform connects specialists with AI projects from major... ...realistic and structured evaluation scenarios for LLM-based agents. You'll create test cases... ...Take part in a flexible, remote, freelance project that fits...Remote workPart timeFreelanceFlexible hours$217.57k - $271k
...the intersection of engineering, applied AI, testing and developer experience. You... ...define and lead the discipline of testing AI agents, evaluating LLM behavior, and ensuring the... ...roles — such as field-based sales or other remote-by-design positions — may have different...Remote workFull timeTemporary workWork at officeLocal areaFlexible hours$135.68k - $203.53k
...than 100 miles from the office for the remote option.) Job Summary We are seeking seasoned... ...strategy for our next-generation AI initiatives. This role is designed for a... ...serving as the principal architect for our Agent Evaluation strategies within Google ADK environments...Remote workWork experience placementWork at officeWorldwide$80 per hour
...domain experts with cutting‑edge AI projects from innovative tech clients... ...and maintaining MCP‑compatible evaluation servers. Implementing logic to check agent actions against scenario definitions... ...needs. Take part in a flexible, remote, freelance project that fits around...Remote workPart timeFreelanceFlexible hours- A leading AI company is seeking a Biology Specialist to help fine-tune large language models. Ideal candidates will be pursuing or hold a Ph.D. in Biology... ...problems and collaborating on AI projects in a fully remote setting. Applicants with excellent communication and analytical...Remote job
$125 per hour
QGIS specialists leverage their expertise in geographic information systems to enhance AI research through flexible, project-based work. Utilizing... ...mapping, you will evaluate AI-generated content and provide... ...Part-time, flexible hours. Remote work environment. Ongoing...Remote jobPart timeFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Remote AI Agent Evaluation Specialist. Be the first to apply!


