Remote QA Analyst for Autonomous AI Agent Evaluation

$60 per hour

Mindrift

Remote job

A leading AI firm in Austin is looking for QA experts to validate and improve AI systems. This remote, freelance role requires strong analytical skills and attention to detail. Candidates will review AI evaluation tasks, identify inconsistencies, and define expected behaviors for agents. Ideal applicants have experience in policy evaluation or logic puzzles and good communication skills to document findings. Competitive pay of up to $60/hour based on experience and project needs. A great opportunity to influence future AI models! #J-18808-Ljbffr Mindrift

Apply

Vacancy posted 18 hours ago

Similar jobs that could be interesting for youBased on the Remote QA Analyst for Autonomous AI Agent Evaluation in Austin, TX vacancy

Autonomous AI QA & Evaluation Analyst (Remote)
$80 per hour
...A forward-thinking AI company is seeking Quality Analysts for autonomous AI agents. This project-based opportunity is ideal for... ...include reviewing evaluation tasks, identifying inconsistencies... ...up to $80/hour, this flexible remote position allows candidates to influence...
Remote work
Flexible hours
Mind Rift
Phoenix, AZ
3 days ago
Autonomous AI QA & Evaluation Analyst (Remote)
$80 per hour
...forward-thinking tech company is seeking QAs for autonomous AI agents to validate complex task structures and improve evaluation frameworks. The role requires excellent... ...$80/hour, this position allows for flexible, remote work while contributing to advanced AI projects...
Remote job
Flexible hours
Mindrift
New York, NY
2 days ago
Remote AI Agent Evaluation Specialist
$60 per hour
...Missouri is seeking contributors for a part-time QA project focused on autonomous AI agents. This flexible remote opportunity requires strong analytical and... ...structured data formats. Candidates will review evaluation tasks, identify inconsistencies, and help define...
Remote work
Part time
Flexible hours
Mind Rift
Kansas City, MO
4 days ago
Remote AI Agent Evaluation Specialist
$80 per hour
A technology firm is seeking QAs for autonomous AI agents to validate and improve task structures within a new project. Candidates will work... ...thinkers with strong attention to detail and experience in evaluating scenarios. This flexible, project-based role offers...
Remote job
Flexible hours
Mindrift
Providence, RI
1 day ago
Remote AI Agent Evaluation Specialist
$80 per hour
...flexible part-time opportunity focused on quality assurance for autonomous AI agents. You will analyze complex systems, review tasks for logic,... ...and detail-oriented skills, with experience in policy evaluation or logic puzzles preferred. Compensation can reach up to $8...
Remote job
Part time
Flexible hours
Mindrift
Raleigh, NC
1 day ago
Principal AI Engineer (Autonomous Agent)
$179k - $199k
...human-first and accelerated by AI to create meaningful and... ...Office expectations** For Remote Roles : If this role is remote... ...Summary The AI Engineer – Autonomous Agent will work closely with the Product... .... We use this information to evaluate your candidacy for the posted...
Remote job
Full time
Work at office
Flexible hours
Pointclickcare
Remote
1 day ago
Remote: AI Agent Evaluation Scenario Designer
...technology company is looking for a detail-oriented individual to design structured evaluation scenarios for AI agents. This entry-level part-time role allows you to contribute remotely, creating test cases for LLM-based agents. The ideal candidate will have a background...
Remote work
Part time
Mind Rift
Brooklyn, NY
4 days ago
AI Agent Testing & Evaluation Scenario Architect
$80 per hour
A leading AI consultancy is seeking a detail-oriented individual to design evaluation scenarios for LLM-based agents. This part-time, remote role allows you to create structured test cases while working... ...degree and has experience in QA or data analysis, with a strong command...
Remote work
Part time
Mindrift
Houston, TX
1 day ago
Evaluation Scenario Writer - AI Agent Testing Specialist
$80 per hour
...ethically shape the future of AI. What We Do The Mindrift... ...realistic and structured evaluation scenarios for LLM-based agents. You'll create test cases... ...fields. Background in QA, software testing, data analysis... ...Take part in a flexible, remote, freelance project that...
Remote work
Part time
Freelance
Flexible hours
Mindrift
New York, NY
1 day ago
MCP & Tools Python Developer - Agent Evaluation Infrastructure
$80 per hour
Get AI‑powered advice on this job and more exclusive features.... ...internal tools for running and evaluating agent behavior. You’ll implement base... ...skills – you’ll work with QA and writers We also value applicants... ...Take part in a flexible, remote, freelance project that fits...
Remote work
Freelance
Flexible hours
Mind Rift
Dallas, TX
3 days ago
AI Agent Interaction Specialist - Dialogue & Conversation Evaluation
...AI Agent Interaction Specialist - Dialogue & Conversation Evaluation is a remote evaluation track for reviewing ai agent interaction evaluation prompts and responses against AuraOne's quality rubric. Reviewers compare paired outputs, label edge cases, and write the kind...
Remote job
Hourly pay
For contractors
10 hours per week
AuraOne Human Data
Remote
1 day ago
Software Engineer, Agent
$232k - $348k
...company. We free people and agents to ship what's next.... ...Next.js, v0, and AI SDK, we create products... ...the role is listed as remote. For location-specific... ...agentic systems that autonomously investigate, diagnose,... ...prompt engineering, model evaluation, and retrieval-...
Remote work
Full time
Work at office
Work from home
Worldwide
Monday to Friday
Flexible hours
Shift work
Vercel Corp
San Francisco, CA
9 hours ago
AI Agents Solutions Architect - Finance
...Kraken app. As a fully remote company, we have Krakenites... ...and builder of the AI-native finance operating... ...Finance operations - Evaluate how financial work currently... ..., and MCP or similar agent coordination layers.... ...and the failure modes of autonomous AI workflows in high-...
Remote work
Local area
Kraken
Poland, NY
4 days ago
AI Agent Engineer
$177.1k
...innovative and product-driven AI Agent Engineer to design, build, and operationalize autonomous AI agents. In this role, you will... .... Building automated evaluation harnesses and observability pipelines... ...centered around our offices and remote work environments. The work style...
Remote work
Work at office
Zoom
Seattle, WA
8 hours ago
Senior QA Analyst, AI & Machine Learning Systems
Description Keeper is hiring a talented Senior QA Analyst to join our AI & Threat Analytics team. This is a 100% remote position from select locations, with an... ...systems, develop Python-based automation and evaluation tools, and validate model outputs, guardrails,...
Remote work
Temporary work
Keeper Security, Inc.
Cameron Park, CA
1 day ago
Remote Treasury AI QA Analyst
$50 - $60 per hour
...looking for an experienced Treasury Analyst to assist in training AI models, focusing on financial principles and decision-making. You will evaluate AI chatbot responses on topics such as... ...role offers the flexibility to work remotely with an hourly pay rate ranging between...
Remote work
Hourly pay
DataAnnotation
Nevada, IA
4 days ago
Remote Treasury AI QA Analyst
$50 - $60 per hour
DataAnnotation is seeking a Treasury Analyst to train AI models focused on finance. In this role, you will evaluate the accuracy of AI chatbot responses regarding macro trends and corporate finance. You will work independently, providing feedback on AI reasoning and helping...
Remote job
Hourly pay
Flexible hours
DataAnnotation
Phoenix, AZ
1 day ago
Remote Treasury AI QA Analyst
$50 - $60 per hour
DataAnnotation is seeking a Treasury Analyst to assist in training AI models by evaluating their logic and enhancing their financial decision-making capabilities. This role enables experienced finance professionals to influence AI’s understanding of financial principles...
Remote job
Hourly pay
DataAnnotation
New York, NY
1 day ago
Remote Treasury AI QA Analyst
$50 - $60 per hour
DataAnnotation is looking for a Treasury Analyst to help train AI models in Connecticut. This role involves reviewing and improving AI Assistant responses related to finance, evaluating their logic, and providing structured feedback. Ideal candidates are finance professionals...
Remote job
Hourly pay
DataAnnotation
Hartford, CT
1 day ago
Product Manager, AI Agents
...Squid AI helps enterprises modernize in place and without disruption... ..., turnkey, private AI agents based on their own data and an... ...communication skills (crucial for our remote-friendly team). Can... ...platform enables companies to deploy autonomous agents that can understand,...
Remote work
Squid Cloud, Inc.
New York, NY
4 days ago
Software Engineer, Forward Deployed Agent Builder
$152k - $240k
...control spend effortlessly. Brex’s AI-native automation and world-... ...’ll do We're building AI agents to automate and augment... ...four weeks per year of fully remote work! Responsibilities:... ..., and data sources. Define evaluation frameworks, success metrics, and...
Remote work
Full time
Work at office
Work from home
Brex Inc.
New York, NY
1 day ago
Software Engineer AI Agents
$200k - $320k
...technical depth and a passion for AI-driven product development.... .... This is a full-time remote opportunity, with preference for... ...production-ready LLM pipelines and AI agent systems Develop AI-driven... ...new product initiatives Evaluate and recommend AI architectures...
Remote work
Full time
Visa sponsorship
Virco Talent
Remote
1 day ago
Software engineer, AI Agent
$100k - $250k
...software engineer who specializes in building AI Agents. What are examples of projects you’... ...talk with users, come up with new ideas autonomously, and ship! What additional skills do... .... Is Hercules in-office or remote? Hercules founding team works in-office...
Remote work
Full time
Work at office
Hercules
San Francisco, CA
1 day ago
Software Engineer III/Senior, Agent
$202.5k - $247.5k
...sharing localhost or running AI workloads in production. We... ...worth your time. About the Agent Team Our Agent team... ...AWS. Engineers develop by using remote development tools and/or ssh to... ...and actual compensation will be evaluated based on factors including,...
Remote work
Permanent employment
Full time
Work at office
Local area
Immediate start
Home office
Flexible hours
Ngrok
Remote
1 day ago
Software Engineer - AI Agents
...Agent Engineer We're seeking an Agent Engineer to design and build agentic features in... ...passionate about building agent systems and making AI easy for developers to adopt. The ideal... ...and other high-value features Evaluate and integrate open-source models to power...
Remote work
Worldwide
Flexible hours
FriendliAI Corp
United States
3 days ago
Staff Engineer AI Agents
...pioneering the future of agentic AI in property management. We build AI agents that act as property... ...leasing agent, a fully autonomous AI that engages... ...monitoring, and testing. ~ Remote friendly but needs to fully... ..., model training and evaluation outside of LLM contexts....
Remote work
Zuma
Mundelein, IL
17 days ago
Remote Medical Doctor for AI Training & Evaluation
$80 - $150 per hour
...Prolific is seeking Medical Doctors to join as Domain Expert participants in training AI models. Responsibilities include evaluating AI responses for accuracy and writing feedback for improvement. A competitive pay of $80-$150 per hour based on skills is offered. Candidates...
Remote work
Hourly pay
Work from home
Prolific
Denver, CO
3 days ago
Software Engineer, AI Agents
$200k - $320k
...innovative companies. Our team is 100% remote and we work with teams across the United... ...to help them hire. Software Engineer, AI Agents Location - San Francisco Bay Area... ...lines from concept through production Evaluate emerging AI models, frameworks, and methodologies...
Remote work
Visa sponsorship
Recruiting from Scratch
United States
5 days ago
LLM Red-Teamer for AI Model Evaluation
$40 - $65 per hour
...impact project focused on the evaluation and enhancement of frontier... ...expertise to train next-generation AI systems, shaping how models... ...Demonstrated ability to work autonomously, interpreting and executing... ...a contractor position with remote work flexibility. Experts are...
Remote job
Hourly pay
For contractors
Immediate start
SaidGig
Frontier County, NE
15 days ago
Software Quality Assurance Analyst
Get AI-powered advice on this job and more exclusive... ...Quality Assurance Analyst for a consulting position... ...Indianapolis, Indiana. Remote work is not an option.... ...will be responsible for evaluating and testing the functionality... ...4+ years of QA experience Seniority...
Remote work
Full time
Contract work
Relocation
Squadware Inc
Indianapolis, IN
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Remote QA Analyst for Autonomous AI Agent Evaluation. Be the first to apply!