Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Remote AI Agent Evaluation Specialist

$80 per hour

Mindrift

Raleigh, NC
  • Remote job

A leading tech company is seeking contributors for a flexible part-time opportunity focused on quality assurance for autonomous AI agents. You will analyze complex systems, review tasks for logic, and ensure clear expected behaviors for AI. Ideal candidates possess excellent analytical and detail-oriented skills, with experience in policy evaluation or logic puzzles preferred. Compensation can reach up to $80/hour based on expertise and project needs. This role offers valuable experience in an advanced AI project and fits around your primary commitments. #J-18808-Ljbffr Mindrift

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Remote AI Agent Evaluation Specialist in Raleigh, NC vacancy
  • $50 - $70 per hour

     ...management, agentic workflow evaluation, and structured technical QA....  ...supports current and upcoming remote consulting opportunities focused...  ...migration journey evaluation, storage agent testing, transfer planning...  ...Ability to evaluate AI-generated technical recommendations... 
    Remote job
    Hourly pay
    Weekly pay
    Job sharing
    Contract work
    Part time
    For contractors
    Flexible hours

    24-MAG

    New York, NY
    3 days ago
  • $60 per hour

    Prolific Academic Ltd is looking for Biology Experts and Life Science Professionals to join their Expert Network. This remote role involves evaluating AI-generated science, fact-checking technical claims, and ensuring ethical alignment in AI responses. Candidates should... 
    Remote job
    Hourly pay
    Work from home

    Prolific Academic Ltd

    Cambridge, MA
    2 days ago
  • $60 per hour

    Computer Sciences - Graduates - AI Training About Prolific Prolific is not just another player in the...  ...re looking for We’re looking for Computer Science Specialists to join our Expert Network to help train and evaluate cutting‑edge AI models. If you have a background in... 
    Remote job
    Work from home
    Flexible hours

    Prolific

    Raleigh, NC
    18 hours ago
  • YO IT Consulting is seeking a Visual Evaluation Specialist to apply expertise in evaluating visual content remotely. The role involves assessing quality, providing feedback for AI training, and working independently to manage evaluation tasks effectively. Candidates should... 
    Remote job
    For contractors

    YO IT Consulting

    Boston, MA
    3 days ago
  • Alignerr is seeking a remote Chemistry Specialist with a Master's or PhD to design, solve, and evaluate complex chemistry problems that train AI models. You’ll play a crucial role in shaping AI’s understanding of science while working flexibly and autonomously. This contract... 
    Remote job
    Contract work

    Alignerr

    New York, NY
    1 day ago
  • $60 per hour

    Prolific Academic Ltd is seeking Computer Science Specialists to join their Expert Network for AI training and evaluation. Candidates should ideally hold a BSc in...  ...papers, and ensuring scientific integrity. This remote position offers competitive pay rates, flexibility... 
    Remote job
    Hourly pay

    Prolific Academic Ltd

    Chicago, IL
    1 day ago
  • YO IT Consulting is looking for a Visual Evaluation Specialist to evaluate and annotate visual content, enhancing AI training through high-quality input. Ideal candidates...  .... This is a contract position allowing for remote work, requiring a commitment of at least 15 hours... 
    Remote job
    Contract work

    YO IT Consulting

    New York, NY
    2 days ago
  •  .... Join Our Team Agentic AI Engineering Intern Engineering...  ...Projects Data Center Projects Remote Controls, Security, Network, Compute...  ..., Enterprise Systems & Agent Integrations Operational Excellence...  ..., Agent Workflow Systems and Evaluation Operational Excellence... 
    Remote work
    Internship
    Night shift

    SB Energy

    Temple, TX
    19 hours ago
  • YO IT Consulting is seeking a Visual Evaluation Specialist to evaluate and annotate visual content for next-generation AI training. This remote position requires a commitment of at least 15 hours per week, leveraging your expertise in visual evaluation to enhance AI performance... 
    Remote job

    YO IT Consulting

    Phoenix, AZ
    2 days ago
  • $80 per hour

    A leading AI consultancy is seeking a detail-oriented individual to design evaluation scenarios for LLM-based agents. This part-time, remote role allows you to create structured test cases while working flexibly around your commitments. The ideal candidate holds a relevant... 
    Remote work
    Part time

    Mindrift

    Houston, TX
    2 days ago
  • YO IT Consulting is looking for a Visual Evaluation Specialist to evaluate and annotate visual content as part of AI training initiatives. This remote role allows you to contribute your domain knowledge and expertise without requiring prior AI experience. Your responsibilities... 
    Remote job

    YO IT Consulting

    Chicago, IL
    3 days ago
  • $60 per hour

     ...innovation company is seeking QAs for autonomous AI agents to validate and improve task structures and evaluate logic. The role requires excellent analytical thinking...  .... Successful candidates can work flexibly and remotely, earning rates up to $60/hour. This position is... 
    Remote job
    Flexible hours

    Mindrift

    Raleigh, NC
    2 days ago
  •  ...technology company is looking for a detail-oriented individual to design structured evaluation scenarios for AI agents. This entry-level part-time role allows you to contribute remotely, creating test cases for LLM-based agents. The ideal candidate will have a background... 
    Remote job
    Part time

    Mindrift

    Brooklyn, NY
    3 days ago
  • $80 per hour

    A technology firm is seeking QAs for autonomous AI agents to validate and improve task structures within a new project. Candidates will...  ...thinkers with strong attention to detail and experience in evaluating scenarios. This flexible, project-based role offers competitive... 
    Remote job
    Flexible hours

    Mindrift

    Providence, RI
    2 days ago
  • $60 per hour

     ...contributors for a part-time QA project focused on autonomous AI agents. This flexible remote opportunity requires strong analytical and critical...  ...with structured data formats. Candidates will review evaluation tasks, identify inconsistencies, and help define expected... 
    Remote job
    Part time
    Flexible hours

    Mindrift

    Kansas City, MO
    2 days ago
  • $60 per hour

     ...ethically shape the future of AI. What We Do The Mindrift platform...  ...thrive in ambiguity, enjoy remote asynchronous work, and want to...  ...modern AI systems are tested and evaluated, we want to hear from you....  ...QA experts for autonomous AI agents in a project focused on validating... 
    Remote work
    Freelance
    Flexible hours

    Mindrift

    Austin, TX
    2 days ago
  • $60 per hour

     ...Mindrift AI Coding Agent Evaluation Specialist Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What this... 
    Remote work
    Permanent employment
    Temporary work

    Mind Rift

    United States
    4 days ago
  • A technology company based in Washington is looking for an Analyst to train AI models. The role involves providing complex mathematics problems to AI chatbots and evaluating their outputs for correctness. Candidates should possess expert mathematical reasoning skills with... 
    Remote job
    Hourly pay
    Contract work
    Flexible hours

    DataAnnotation

    Washington DC
    2 days ago
  • $80 per hour

    A tech company specializing in AI is seeking QAs for autonomous AI agents to validate and enhance task structures...  ...allows contributors to work remotely while engaging in a complex AI project...  ...detail, facilitating AI testing and evaluation without needing a coding... 
    Remote job
    Flexible hours

    Mindrift

    Houston, TX
    8 hours ago
  • $60 per hour

    A leading AI firm in Austin is looking for QA experts to validate...  ...and improve AI systems. This remote, freelance role requires...  ...detail. Candidates will review AI evaluation tasks, identify inconsistencies...  ...define expected behaviors for agents. Ideal applicants have experience... 
    Remote job
    Freelance

    Mindrift

    Austin, TX
    2 days ago
  • $55 per hour

    A leading AI innovation firm is seeking QAs for autonomous AI agents to improve evaluation frameworks. Candidates should possess excellent analytical thinking and attention...  ...tasks and define clear standards. This remote, flexible opportunity offers rates up to $55/... 
    Remote job
    Part time
    Flexible hours

    Mindrift

    Raleigh, NC
    2 days ago
  •  ...Alignerr, we partner with the world’s leading AI research teams and labs to build and...  ...conversion is perfect. Technical Auditing: Evaluate AI-generated simulations and proofs for...  ...Why Join Us Competitive pay and flexible remote work. Collaborate with a team working on... 
    Remote work
    Contract work
    Freelance
    Flexible hours

    Alignerr

    Atlanta, GA
    4 days ago
  • $80 per hour

    A leading AI consultancy in the United States is seeking Quality Assurance professionals to validate and improve autonomous AI agents. You will analyze complex systems, evaluate task structures, and ensure logical consistency through detailed reviews. Ideal candidates possess... 
    Remote job
    Hourly pay
    Flexible hours

    Mindrift

    Providence, RI
    4 days ago
  • $55 per hour

    A leading AI innovation firm in Dallas is seeking QAs for autonomous AI agents to ensure the quality of complex systems and scenarios. This flexible, remote project is ideal for those with excellent analytical...  ...should be adept at evaluating scenarios and documenting findings... 
    Remote job
    Flexible hours

    Mindrift

    Dallas, TX
    1 day ago
  • $80 per hour

     ...ethically shape the future of AI. What We Do The Mindrift platform connects specialists with AI projects from major...  ...realistic and structured evaluation scenarios for LLM-based agents. You'll create test cases...  ...Take part in a flexible, remote, freelance project that fits... 
    Remote work
    Part time
    Freelance
    Flexible hours

    Mindrift

    New York, NY
    2 days ago
  • $217.57k - $271k

     ...the intersection of engineering, applied AI, testing and developer experience. You...  ...define and lead the discipline of testing AI agents, evaluating LLM behavior, and ensuring the...  ...roles — such as field-based sales or other remote-by-design positions — may have different... 
    Remote work
    Full time
    Temporary work
    Work at office
    Local area
    Flexible hours

    ID.me

    Mountain View, CA
    3 days ago
  • $135.68k - $203.53k

     ...than 100 miles from the office for the remote option.) Job Summary We are seeking seasoned...  ...strategy for our next-generation AI initiatives. This role is designed for a...  ...serving as the principal architect for our Agent Evaluation strategies within Google ADK environments... 
    Remote work
    Work experience placement
    Work at office
    Worldwide

    慨正橡扯

    Mount Laurel, NJ
    2 days ago
  • $80 per hour

     ...domain experts with cutting‑edge AI projects from innovative tech clients...  ...and maintaining MCP‑compatible evaluation servers. Implementing logic to check agent actions against scenario definitions...  ...needs. Take part in a flexible, remote, freelance project that fits around... 
    Remote work
    Part time
    Freelance
    Flexible hours

    Mindrift

    Houston, TX
    4 days ago
  • A leading AI company is seeking a Biology Specialist to help fine-tune large language models. Ideal candidates will be pursuing or hold a Ph.D. in Biology...  ...problems and collaborating on AI projects in a fully remote setting. Applicants with excellent communication and analytical... 
    Remote job

    Turing

    Boston, MA
    2 days ago
  • $125 per hour

    QGIS specialists leverage their expertise in geographic information systems to enhance AI research through flexible, project-based work. Utilizing...  ...mapping, you will evaluate AI-generated content and provide...  ...Part-time, flexible hours. Remote work environment. Ongoing... 
    Remote job
    Part time
    Flexible hours

    SaidGig

    Remote
    10 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Remote AI Agent Evaluation Specialist. Be the first to apply!