AI Agent Evaluation Specialist Remote & Flexible
$60 per hourMind Rift
A tech innovation company is seeking QAs for autonomous AI agents to validate and improve task structures and evaluate logic. The role requires excellent analytical thinking, attention to detail, and the ability to assess complex scenarios. Successful candidates can work flexibly and remotely, earning rates up to $60/hour. This position is ideal for analysts or students looking to contribute meaningfully to advanced AI projects while gaining valuable experience. #J-18808-Ljbffr
$80 per hour
...tech company is seeking contributors for a flexible part-time opportunity focused on quality assurance for autonomous AI agents. You will analyze complex systems, review tasks... ...oriented skills, with experience in policy evaluation or logic puzzles preferred. Compensation...Remote workFlexible hoursPart time$80 per hour
A technology firm is seeking QAs for autonomous AI agents to validate and improve task structures within a new project. Candidates... ...with strong attention to detail and experience in evaluating scenarios. This flexible, project-based role offers competitive pay up to $80/...Remote workFlexible hours$60 per hour
...contributors for a part-time QA project focused on autonomous AI agents. This flexible remote opportunity requires strong analytical and critical... ...with structured data formats. Candidates will review evaluation tasks, identify inconsistencies, and help define expected...Remote workFlexible hoursPart time$80 per hour
...company specializing in AI is seeking QAs for autonomous AI agents to validate and enhance task structures. This flexible, project-based opportunity... ...allows contributors to work remotely while engaging in a... ...facilitating AI testing and evaluation without needing a coding...Remote workFlexible hours$80 per hour
...A leading AI consultancy in the United States is seeking Quality Assurance... ...and improve autonomous AI agents. You will analyze complex systems, evaluate task structures, and ensure logical... ...attention to detail. This project offers flexible hours and pay rates up to $80/hour...Remote workFlexible hoursHourly pay$60 per hour
...ethically shape the future of AI. What We Do The... ...thrive in ambiguity, enjoy remote asynchronous work, and... ...systems are tested and evaluated, we want to hear from... ...experts for autonomous AI agents in a project focused on... ..., and project needs. Flexible, remote, freelance...Remote workFlexible hoursFreelance$55 per hour
A leading AI innovation firm is seeking QAs for autonomous AI agents to improve evaluation frameworks. Candidates should possess excellent analytical thinking and attention... ...tasks and define clear standards. This remote, flexible opportunity offers rates up to $55/hour...Remote jobFlexible hoursPart time- ...Alignerr, we partner with the world’s leading AI research teams and labs to build and... ...is perfect. Technical Auditing: Evaluate AI-generated simulations and proofs for... .../SciPy) Why Join Us Competitive pay and flexible remote work. Collaborate with a team working on...Remote workFlexible hoursContract workFreelance
$80 per hour
...domain experts with cutting-edge AI projects from innovative tech... ...and maintaining MCP-compatible evaluation servers.Implementing logic to check agent actions against scenario definitions... ...project needs.Take part in a flexible, remote, freelance project that fits around...Remote workFlexible hoursFreelance$80 per hour
...ethically shape the future of AI. Our platform connects specialists with AI projects from major... ...realistic and structured evaluation scenarios for LLM-based agents. You'll create test cases that... ...project needs Take part in a flexible, remote, freelance project that fits...Remote workFlexible hoursPart timeFreelance$80 per hour
...A leading AI consultancy is seeking individuals for a flexible, remote project focused on QA for autonomous AI agents. Ideal candidates should possess strong analytical thinking, attention to detail, and clear communication skills. You'll review tasks, identify inconsistencies...Remote workFlexible hours$80 per hour
...company is seeking a part-time QA contributor for an AI project to validate autonomous agents. Candidates must have strong analytical thinking, attention... ...and defining expected behaviors for AI agents. This flexible remote opportunity allows you to work on your own schedule...Remote workFlexible hoursPart time$50 - $70 per hour
...management, agentic workflow evaluation, and structured... ...supports current and upcoming remote consulting... ...journey evaluation, storage agent testing, transfer planning... ...Ability to evaluate AI-generated technical recommendations... ...review Work on flexible assignments aligned...Remote jobFlexible hoursHourly payWeekly payJob sharingContract workPart timeFor contractors$60 - $80 per hour
...Authorizer, Checker, and Clerk for a remote, part-time position. The role requires... ...credit operations content for AI training and evaluation. Candidates must have professional experience... ...This is a long-term opportunity with flexible hours and competitive compensation...Remote workFlexible hoursHourly payPart time- ...for skilled ParaView users to support AI research through flexible, hourly contract work. This ongoing... ...scientific visualization and CFD to evaluate AI-generated content. Ideal candidates... ...communication skills. The role is fully remote and open to those residing in the...Remote workFlexible hoursHourly payContract workFor contractors
- ...enterprises who are building AI systems to power... ...semantic search, RAG, and agents. We believe that our... ...training algorithms, and evaluation methods to ensure... ...we also embrace being remote-friendly! For this role... ...improvement ~ Remote-flexible, offices in Toronto, New...Remote workFlexible hoursFull timeWork at office
$145k - $185k
...Francisco, CA, but built as a remote-first company that... ...a new era. Advances in AI are transforming how... ...prepare for engagements, evaluate evidence, and deliver... ...collaboration, and AI agents to help firms operate more... ...meaningful ownership Flexible PTO 401k Wellness benefits...Remote workFlexible hoursImmediate startWork from home- ...enterprises who are building AI systems to power magical experiences... ..., semantic search, RAG, and agents. We believe that our work is... ...managementPrototype and evaluate new technologies, then help decide... ...and workspace improvement?? Remote-flexible, offices in Toronto, New York...Remote workFlexible hoursFull timeWork at office
$186.1k - $292.81k
...Figma, and Stripe with the flexibility and insights needed to drive... ...performance. About the Role AI agents are becoming central to how... ...orchestration layer, and the evaluation and observability infrastructure... ...‑functional partners in a remote‑first environment. High...Remote workFlexible hoursShift work$55 per hour
...edge technology firm is seeking analysts and researchers for a part-time opportunity focused on evaluating AI systems. This remote role pays up to $55/hour, allowing flexibility to fit around other commitments. Ideal candidates possess strong analytical skills and a firm...Remote workPart time- ...looking for skilled BlenderBIM (Bonsai) users to support AI research through flexible, hourly contract work. This ongoing opportunity allows you... ...experience with native IFC editing and OpenBIM workflows to evaluate AI-generated content. Ideal candidates will have at least...Remote workFlexible hoursHourly payContract work
- ...Coconut is looking for an Agent Evals Specialist to evaluate AI-generated outputs, ensuring accuracy and quality in performance. This role requires strong... .... This is an opportunity to work with top-tier remote talent and help shape AI performance. #J-18808-Ljbffr...Remote work
- ...oriented individual to design structured evaluation scenarios for AI agents. This entry-level part-time role allows you to contribute remotely, creating test cases for LLM-based... ...some experience with software testing. Flexibility is key as you engage with innovative AI...Remote workPart time
$43.48 - $55.58 per hour
...classification are responsible for the evaluation and risk assessments of... ...coordination. Work a flexible schedule, which may include evenings... ...for MENTAL HEALTH EVALUATION SPECIALIST After you have... ...more information. Telework/Remote Options: Some departments...Remote workFlexible hoursHourly payPermanent employmentFull timePart timeWork at officeLocal areaWork from homeWork visaShift workNight shiftDay shiftAfternoon shift$60 per hour
...Science Professionals to join their Expert Network. This remote role involves evaluating AI-generated science, fact-checking technical claims, and... ...competitive pay rates up to $60 per hour and the flexibility to work from home. #J-18808-Ljbffr Prolific Academic LtdRemote jobHourly payWork from home$77k - $95k
...) This role combines rigorous evaluation with real community impact. The Senior Evaluation Specialist will support a multi-year clinical... ...traditional (M-F, 8 am-5 pm), flexible (for example, M-F 10 am-7pm),... ...both hybrid and fully remote roles, depending on position responsibilities...Remote workFlexible hoursFull timeContract workTemporary workWork at officeLocal area$72.25k - $89.25k
...Job As a Systems Specialist at Omnidian, you will be... ...offboarding for a global remote staff. Act as a key... ...Security, Frameworks & AI-Assisted Operations... ...documentation. Who You Are Flexible & Resilient: You... ...problem-solver who can evaluate complex scenarios and...Remote workFlexible hoursWork at officeImmediate start3 days per week- ...We're creating an operating system of AI agents that learn, remember, and act across every... ...seamlessly across platforms Design evaluation systems that make our agents... ...you'll get Competitive salary, equity, flexible remote work Premium AI tools (ChatGPT, Claude...Remote workFlexible hoursLive in
- ...ParaView users to support AI research through flexible, hourly contract work. This... ...large dataset analysis to evaluate AI-generated content and... ...Applied Mathematician HPC Specialist or Computational Scientist... ...employee role) Location: Fully remote; work from anywhere with a...Remote workFlexible hoursHourly payFull timeContract workFor contractorsFreelance
- ...for skilled Qucs-S users to support AI research through flexible, hourly contract work. This is not a... ...design, or analog circuit analysis to evaluate AI-generated content and provide feedback... ...‑time employee role) Location: Fully remote; work from anywhere with a reliable...Remote workFlexible hoursHourly payFull timeContract workFor contractorsFreelance
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Agent Evaluation Specialist Remote & Flexible. Be the first to apply!

