Remote QA Analyst for Autonomous AI Agent Evaluation
$60 per hourMindrift
- Remote job
A leading AI firm in Austin is looking for QA experts to validate and improve AI systems. This remote, freelance role requires strong analytical skills and attention to detail. Candidates will review AI evaluation tasks, identify inconsistencies, and define expected behaviors for agents. Ideal applicants have experience in policy evaluation or logic puzzles and good communication skills to document findings. Competitive pay of up to $60/hour based on experience and project needs. A great opportunity to influence future AI models! #J-18808-Ljbffr Mindrift
$80 per hour
A leading AI consultancy in the United States is seeking Quality Assurance professionals to validate and improve autonomous AI agents. You will analyze complex systems, evaluate task structures, and ensure logical consistency through detailed reviews. Ideal candidates possess...Remote jobHourly payFlexible hours$80 per hour
A tech company specializing in AI is seeking QAs for autonomous AI agents to validate and enhance task structures... ...allows contributors to work remotely while engaging in a complex AI project... ...detail, facilitating AI testing and evaluation without needing a coding...Remote jobFlexible hours$55 per hour
A leading AI innovation firm is seeking QAs for autonomous AI agents to improve evaluation frameworks. Candidates should possess excellent analytical thinking and attention... ...tasks and define clear standards. This remote, flexible opportunity offers rates up to $55/...Remote jobPart timeFlexible hours$80 per hour
A forward-thinking AI company is seeking Quality Analysts for autonomous AI agents. This project-based opportunity is ideal for... ...include reviewing evaluation tasks, identifying inconsistencies... ...up to $80/hour, this flexible remote position allows candidates to influence...Remote jobFlexible hours$80 per hour
...forward-thinking tech company is seeking QAs for autonomous AI agents to validate complex task structures and improve evaluation frameworks. The role requires excellent... ...$80/hour, this position allows for flexible, remote work while contributing to advanced AI projects...Remote jobFlexible hours$80 per hour
A technology company is seeking a part-time QA contributor for an AI project to validate autonomous agents. Candidates must have strong analytical thinking, attention... ...expected behaviors for AI agents. This flexible remote opportunity allows you to work on your own...Remote jobPart timeFlexible hours$80 per hour
...flexible part-time opportunity focused on quality assurance for autonomous AI agents. You will analyze complex systems, review tasks for logic,... ...and detail-oriented skills, with experience in policy evaluation or logic puzzles preferred. Compensation can reach up to $8...Remote workPart timeFlexible hours$80 per hour
A technology firm is seeking QAs for autonomous AI agents to validate and improve task structures within a new project. Candidates will work... ...thinkers with strong attention to detail and experience in evaluating scenarios. This flexible, project-based role offers...Remote workFlexible hours$60 per hour
...Missouri is seeking contributors for a part-time QA project focused on autonomous AI agents. This flexible remote opportunity requires strong analytical and... ...structured data formats. Candidates will review evaluation tasks, identify inconsistencies, and help define...Remote workPart timeFlexible hours$60 per hour
...innovation company is seeking QAs for autonomous AI agents to validate and improve task structures and evaluate logic. The role requires... ...can work flexibly and remotely, earning rates up to $60/hour. This position is ideal for analysts or students looking to contribute...Remote jobFlexible hours$60 per hour
...ethically shape the future of AI. What We Do The Mindrift... ...in ambiguity, enjoy remote asynchronous work, and want... ...AI systems are tested and evaluated, we want to hear from you.... ...Project Overview We are seeking QA experts for autonomous AI agents in a project focused on...Remote workFreelanceFlexible hours$55 per hour
A leading AI innovation firm in Dallas is seeking QAs for autonomous AI agents to ensure the quality of complex systems and scenarios. This flexible, remote project is ideal for those with excellent analytical... ...should be adept at evaluating scenarios and documenting findings...Remote jobFlexible hours- BMC Software, Inc. is looking for a PhD-level research intern to contribute to the BMC AI Foundation. This role involves designing evaluations for AI agents, coordinating with technical teams, and producing concrete research artifacts within 12 weeks. The ideal candidate...Remote jobInternship
$80 per hour
A leading AI consultancy is seeking a detail-oriented individual to design evaluation scenarios for LLM-based agents. This part-time, remote role allows you to create structured test cases while working... ...degree and has experience in QA or data analysis, with a strong command...Remote workPart time- ...technology company is looking for a detail-oriented individual to design structured evaluation scenarios for AI agents. This entry-level part-time role allows you to contribute remotely, creating test cases for LLM-based agents. The ideal candidate will have a background...Remote jobPart time
$217.57k - $271k
...field-based sales or other remote-by-design positions — may have... ...the thoughtful use of AI tools in our daily work and... ...the discipline of testing AI agents, evaluating LLM behavior, and ensuring the... ...prompt-based features to fully autonomous multi-step workflows. *...Remote workFull timeTemporary workWork at officeLocal areaFlexible hours- ModMed in Boca Raton, Florida, is hiring a Machine Learning Engineer to develop AI agents for patient engagement. This role involves designing autonomous systems and optimizing conversational AI for better healthcare outcomes. Ideal candidates will have a strong background...Remote job
$400 per month
Mercor is collaborating with a leading AI research lab on the Frontier Code Agents project. Contributors will work on evaluating and improving frontier AI coding models via technical assessments. The tasks involve using AI coding agents to address machine learning engineering...Remote job$400 per month
Mercor is looking for contributors to work on a Frontier Code Agents project, evaluating and improving frontier AI coding models. This role focuses on complex machine learning engineering tasks and model evaluations through structured assessments. The project involves...Remote job$400 per unit
Mercor is partnering with an AI research lab for a project assessing frontier AI coding models. The role involves evaluating machine learning engineering workflows through structured... ...learning, familiarity with AI coding agents, and a track record in deploying ML systems...Remote job$160k - $190k
Member of Technical Staff, Agent Workflow Systems and Evaluation The Member of Technical... ...measures, observes, and scales AI-enabled workflows. This... ...success metrics. Mentor FDEs, analysts, engineers, and... ...Area, CA; Denver, CO; or remote option available. The position...Remote workWork at officeFlexible hours- Mercor is seeking contributors to support a Frontier Code Agents project, collaborating with a leading AI research lab. This role involves evaluating and improving AI coding models, emphasizing realistic machine learning engineering and structured assessments. Tasks include...Remote job
$400 per month
Mercor is seeking machine learning engineers to join a project with a leading AI research lab. You will evaluate and complete complex engineering tasks using frontier AI coding agents. This sprint-based project offers $400 for each accepted task, which typically takes 2...Remote job$400 per month
...learning engineering role in partnership with a leading AI research lab. This role involves evaluating frontier AI coding models and completing technical... ...experience in ML engineering and familiarity with AI coding agents such as Cursor and Codex. #J-18808-Ljbffr MercorRemote job$400 per month
Mercor is seeking contributors for an exciting role in partnership with a leading AI research lab. You will work on the Frontier Code Agents project, focusing on evaluating AI coding models through technical assessments. The tasks involve realistic machine learning engineering...Remote job$400 per month
Mercor is looking for contributors to evaluate and improve frontier AI coding models for a research project. In this role, you will perform complex machine learning engineering tasks and evaluate model performance, focusing on realistic workflows. Applicants should have...Remote job$400 per month
...seeking contributors for a project with a leading AI research lab focused on frontier coding models. The role involves using frontier AI coding agents to tackle complex machine learning tasks, where you will evaluate model outputs and assess performance. With a time commitment...Remote job$400 per month
Mercor is collaborating with a leading AI research lab to recruit machine learning engineers for a Frontier Code Agents project. You will work on evaluating and improving AI coding models through technical assessments, focusing on real engineering workflows. The role requires...Remote job$400 per unit
Mercor is partnering with a leading AI research lab on a Frontier Code Agents project. The role involves using AI coding agents for evaluating complex ML tasks, identifying bugs, and comparing model outputs. Applicants should have at least 2 years of experience in machine...Remote job$400 per month
Mercor is seeking a Frontier Code Agent to support its partnership with a leading AI research lab. The role involves evaluating and improving AI coding models through technical assessments and handling realistic machine learning workflows. Successful candidates will have...Remote job
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Remote QA Analyst for Autonomous AI Agent Evaluation. Be the first to apply!
- quality assurance analyst Austin, TX
- qa analyst Austin, TX
- agent assistant Austin, TX
- work from home chat agent Austin, TX
- telemarketer - state farm agent team member Austin, TX
- cruise agent Austin, TX
- import export agent Austin, TX
- remote chat agent Austin, TX
- executive protection agent Austin, TX
- commissioning agent Austin, TX

