Remote QA Analyst for Autonomous AI Agent Evaluation

$60 per hour

Mindrift

Austin, TX
  • Remote job

A leading AI firm in Austin is looking for QA experts to validate and improve AI systems. This remote, freelance role requires strong analytical skills and attention to detail. Candidates will review AI evaluation tasks, identify inconsistencies, and define expected behaviors for agents. Ideal applicants have experience in policy evaluation or logic puzzles and good communication skills to document findings. Competitive pay of up to $60/hour based on experience and project needs. A great opportunity to influence future AI models!

Vacancy posted 4 days ago
Similar jobs that could be interesting for you
Based on the Remote QA Analyst for Autonomous AI Agent Evaluation in Austin, TX vacancy
  • $80 per hour

    A leading AI consultancy in the United States is seeking Quality Assurance professionals to validate and improve autonomous AI agents. You will analyze complex systems, evaluate task structures, and ensure logical consistency through detailed reviews. Ideal candidates possess... 
    Remote job
    Hourly pay
    Flexible hours

    Mindrift

    Providence, RI
    6 days ago
  • $55 per hour

    A leading AI innovation firm is seeking QAs for autonomous AI agents to improve evaluation frameworks. Candidates should possess excellent analytical thinking and attention...  ...tasks and define clear standards. This remote, flexible opportunity offers rates up to $55/... 
    Remote job
    Part time
    Flexible hours

    Mindrift

    Raleigh, NC
    4 days ago
  • $80 per hour

    A tech company specializing in AI is seeking QAs for autonomous AI agents to validate and enhance task structures...  ...allows contributors to work remotely while engaging in a complex AI project...  ...detail, facilitating AI testing and evaluation without needing a coding... 
    Remote job
    Flexible hours

    Mind Rift

    Houston, TX
    2 days ago
  • $80 per hour

    A forward-thinking AI company is seeking Quality Analysts for autonomous AI agents. This project-based opportunity is ideal for...  ...include reviewing evaluation tasks, identifying inconsistencies...  ...up to $80/hour, this flexible remote position allows candidates to influence... 
    Remote job
    Flexible hours

    Mindrift

    Phoenix, AZ
    4 days ago
  • $80 per hour

     ...forward-thinking tech company is seeking QAs for autonomous AI agents to validate complex task structures and improve evaluation frameworks. The role requires excellent...  ...$80/hour, this position allows for flexible, remote work while contributing to advanced AI projects... 
    Remote job
    Flexible hours

    Mindrift

    New York, NY
    5 days ago
  • $80 per hour

    A technology company is seeking a part-time QA contributor for an AI project to validate autonomous agents. Candidates must have strong analytical thinking, attention...  ...expected behaviors for AI agents. This flexible remote opportunity allows you to work on your own... 
    Remote job
    Part time
    Flexible hours

    Mindrift

    Phoenix, AZ
    4 days ago
  • $60 per hour

     ...Missouri is seeking contributors for a part-time QA project focused on autonomous AI agents. This flexible remote opportunity requires strong analytical and...  ...structured data formats. Candidates will review evaluation tasks, identify inconsistencies, and help define... 
    Remote work
    Part time
    Flexible hours

    Mind Rift

    Kansas City, MO
    2 days ago
  • $80 per hour

    A technology firm is seeking QAs for autonomous AI agents to validate and improve task structures within a new project. Candidates will work...  ...thinkers with strong attention to detail and experience in evaluating scenarios. This flexible, project-based role offers... 
    Remote work
    Flexible hours

    Mind Rift

    Providence, RI
    1 day ago
  • $80 per hour

     ...flexible part-time opportunity focused on quality assurance for autonomous AI agents. You will analyze complex systems, review tasks for logic,...  ...and detail-oriented skills, with experience in policy evaluation or logic puzzles preferred. Compensation can reach up to $8... 
    Remote work
    Part time
    Flexible hours

    Mind Rift

    Raleigh, NC
    1 day ago
  • $60 per hour

     ...innovation company is seeking QAs for autonomous AI agents to validate and improve task structures and evaluate logic. The role requires...  ...can work flexibly and remotely, earning rates up to $60/hour. This position is ideal for analysts or students looking to contribute... 
    Remote job
    Flexible hours

    Mindrift

    Raleigh, NC
    4 days ago
  • $60 per hour

     ...ethically shape the future of AI. What We Do The Mindrift...  ...in ambiguity, enjoy remote asynchronous work, and want...  ...AI systems are tested and evaluated, we want to hear from you....  ...Project Overview We are seeking QA experts for autonomous AI agents in a project focused on... 
    Remote work
    Freelance
    Flexible hours

    Mindrift

    Austin, TX
    4 days ago
  • $55 per hour

    A leading AI innovation firm in Dallas is seeking QAs for autonomous AI agents to ensure the quality of complex systems and scenarios. This flexible, remote project is ideal for those with excellent analytical...  ...should be adept at evaluating scenarios and documenting findings... 
    Remote job
    Flexible hours

    Mindrift

    Dallas, TX
    3 days ago
  • BMC Software, Inc. is looking for a PhD-level research intern to contribute to the BMC AI Foundation. This role involves designing evaluations for AI agents, coordinating with technical teams, and producing concrete research artifacts within 12 weeks. The ideal candidate... 
    Remote job
    Internship

    BMC Software, Inc.

    New York, NY
    5 days ago
  • $217.57k - $271k

     ...field-based sales or other remote-by-design positions — may have...  ...the thoughtful use of AI tools in our daily work and...  ...the discipline of testing AI agents, evaluating LLM behavior, and ensuring the...  ...prompt-based features to fully autonomous multi-step workflows. *... 
    Remote work
    Full time
    Temporary work
    Work at office
    Local area
    Flexible hours

    ID.me

    Mountain View, CA
    4 days ago
  •  ...technology company is looking for a detail-oriented individual to design structured evaluation scenarios for AI agents. This entry-level part-time role allows you to contribute remotely, creating test cases for LLM-based agents. The ideal candidate will have a background... 
    Remote job
    Part time

    Mindrift

    Brooklyn, NY
    5 days ago
  • $80 per hour

    A leading AI consultancy is seeking a detail-oriented individual to design evaluation scenarios for LLM-based agents. This part-time, remote role allows you to create structured test cases while working...  ...degree and has experience in QA or data analysis, with a strong command... 
    Remote work
    Part time

    Mindrift

    Houston, TX
    4 days ago
  • ModMed in Boca Raton, Florida, is hiring a Machine Learning Engineer to develop AI agents for patient engagement. This role involves designing autonomous systems and optimizing conversational AI for better healthcare outcomes. Ideal candidates will have a strong background... 
    Remote job

    Dormont Manufacturing Co

    Boca Raton, FL
    2 days ago
  • $160k - $190k

    Member of Technical Staff, Agent Workflow Systems and Evaluation The Member of Technical...  ...measures, observes, and scales AI-enabled workflows. This...  ...success metrics. Mentor FDEs, analysts, engineers, and...  ...Area, CA; Denver, CO; or remote option available. The position... 
    Remote work
    Work at office
    Flexible hours

    SB Energy

    California, MO
    6 days ago
  • $400 per month

    Mercor is seeking a Frontier Code Agent to support its partnership with a leading AI research lab. The role involves evaluating and improving AI coding models through technical assessments and handling realistic machine learning workflows. Successful candidates will have... 
    Remote job

    Mercor Inc

    La Mirada, CA
    6 days ago
  • $400 per month

    Mercor is collaborating with a leading AI research lab for a project supporting Frontier Code Agents. Contributors will evaluate AI coding models through technical assessments that focus on practical machine learning engineering workflows. The position requires 2+ years... 
    Remote job

    Mercor

    Corona, CA
    5 days ago
  • $400 per month

    Mercor is looking for contributors to evaluate and improve frontier AI coding models for a research project. In this role, you will perform complex machine learning engineering tasks and evaluate model performance, focusing on realistic workflows. Applicants should have... 
    Remote job

    Mercor

    Alhambra, CA
    6 days ago
  • $400 per month

     ...seeking contributors for a project with a leading AI research lab focused on frontier coding models. The role involves using frontier AI coding agents to tackle complex machine learning tasks, where you will evaluate model outputs and assess performance. With a time commitment... 
    Remote job

    Mercor

    Friendswood, TX
    3 days ago
  • $400 per unit

    Mercor is partnering with a leading AI research lab on a Frontier Code Agents project. The role involves using AI coding agents for evaluating complex ML tasks, identifying bugs, and comparing model outputs. Applicants should have at least 2 years of experience in machine... 
    Remote job

    Mercor

    Santa Clarita, CA
    6 days ago
  • $400 per month

     ...learning engineering role in partnership with a leading AI research lab. This role involves evaluating frontier AI coding models and completing technical...  ...experience in ML engineering and familiarity with AI coding agents such as Cursor and Codex.
    Remote job

    Mercor

    San Leandro, CA
    4 days ago
  • $400 per month

    Mercor is seeking contributors for an exciting role in partnership with a leading AI research lab. You will work on the Frontier Code Agents project, focusing on evaluating AI coding models through technical assessments. The tasks involve realistic machine learning engineering... 
    Remote job

    Mercor Inc

    Moreno Valley, CA
    6 days ago
  • $400 per month

    Mercor is collaborating with a leading AI research lab on the Frontier Code Agents project. Contributors will work on evaluating and improving frontier AI coding models via technical assessments. The tasks involve using AI coding agents to address machine learning engineering... 
    Remote job

    Mercor Inc

    San Diego, CA
    6 days ago
  • $400 per month

    Mercor is collaborating with a leading AI research lab to recruit machine learning engineers for a Frontier Code Agents project. You will work on evaluating and improving AI coding models through technical assessments, focusing on real engineering workflows. The role requires... 
    Remote job

    Mercor

    New Rochelle, NY
    6 days ago
  • $400 per month

    Mercor is seeking machine learning engineers to join a project with a leading AI research lab. You will evaluate and complete complex engineering tasks using frontier AI coding agents. This sprint-based project offers $400 for each accepted task, which typically takes 2... 
    Remote job

    Mercor Inc

    Gresham, OR
    4 days ago
  • $400 per month

    Mercor is hiring for a project involving AI research with a leading lab to support Frontier Code Agents. The role requires substantial machine learning engineering expertise, focusing on evaluating AI coding models through structured assessments. If you have experience... 
    Remote job

    Mercor

    Methuen, MA
    4 days ago
  • $400 per month

    Mercor is collaborating with a leading AI research lab on a project focused on evaluating frontier AI coding models. In this role, you'll use advanced AI coding agents to carry out machine learning engineering tasks, review model-generated implementations, and identify... 
    Remote job

    Mercor

    Saint Clair Shores, MI
    4 days ago
