Remote AI Agent QA Analyst - Flexible, Impactful Evaluation
$80 per hour
Mind Rift
- Remote job
A tech company specializing in AI is seeking QAs for autonomous AI agents to validate and enhance task structures. This flexible, project-based opportunity allows contributors to work remotely while engaging in a complex AI project. Ideal candidates will possess strong analytical skills and attention to detail, facilitating AI testing and evaluation without needing a coding background. The role offers competitive pay up to $80/hour based on experience and skills, enriching your portfolio while influencing the future of AI.
$55 per hour
A leading AI innovation firm is seeking QAs for autonomous AI agents to improve evaluation frameworks. Candidates should possess excellent analytical thinking and attention... ...tasks and define clear standards. This remote, flexible opportunity offers rates up to $55/hour...
- Remote job
- Flexible hours
- Part time

$80 per hour
A leading AI consultancy in the United States is seeking Quality Assurance... ...and improve autonomous AI agents. You will analyze complex systems, evaluate task structures, and ensure logical... ...attention to detail. This project offers flexible hours and pay rates up to $80/hour...
- Remote job
- Flexible hours
- Hourly pay

$80 per hour
A technology company is seeking a part-time QA contributor for an AI project to validate autonomous agents. Candidates must have strong analytical thinking,... ...defining expected behaviors for AI agents. This flexible remote opportunity allows you to work on your own schedule...
- Remote job
- Flexible hours
- Part time

$60 per hour
...A leading AI firm in Austin is looking for QA experts to validate and improve AI systems. This remote, freelance role requires strong analytical... ...Candidates will review AI evaluation tasks, identify inconsistencies... ...expected behaviors for agents. Ideal applicants have...
- Remote work
- Freelance

$80 per hour
...thinking tech company is seeking QAs for autonomous AI agents to validate complex task structures and improve evaluation frameworks. The role requires excellent... ...Offering up to $80/hour, this position allows for flexible, remote work while contributing to advanced AI...
- Remote work
- Flexible hours

$80 per hour
A forward-thinking AI company is seeking Quality Analysts for autonomous AI agents. This project-based opportunity is ideal... ...include reviewing evaluation tasks, identifying inconsistencies... ...With rates up to $80/hour, this flexible remote position allows candidates to...
- Remote job
- Flexible hours

$80 per hour
A technology firm is seeking QAs for autonomous AI agents to validate and improve task structures within a new project. Candidates... ...with strong attention to detail and experience in evaluating scenarios. This flexible, project-based role offers competitive pay up to $80/...
- Remote work
- Flexible hours

$80 per hour
...tech company is seeking contributors for a flexible part-time opportunity focused on quality assurance for autonomous AI agents. You will analyze complex systems, review tasks... ...oriented skills, with experience in policy evaluation or logic puzzles preferred. Compensation...
- Remote work
- Flexible hours
- Part time

$60 per hour
...Missouri is seeking contributors for a part-time QA project focused on autonomous AI agents. This flexible remote opportunity requires strong analytical and... ...structured data formats. Candidates will review evaluation tasks, identify inconsistencies, and help define...
- Remote work
- Flexible hours
- Part time

$60 per hour
...is seeking QAs for autonomous AI agents to validate and improve task structures and evaluate logic. The role requires... ...Successful candidates can work flexibly and remotely, earning rates up to $60/hour. This position is ideal for analysts or students looking to contribute...
- Remote job
- Flexible hours

$60 per hour
...shape the future of AI. What We Do The Mindrift... ...in ambiguity, enjoy remote asynchronous work,... ...are tested and evaluated, we want to hear from... ...Overview We are seeking QA experts for autonomous AI agents in a project focused... ...and project needs. Flexible, remote, freelance...
- Remote work
- Flexible hours
- Freelance

$400 per month
...contributors for a project with a leading AI research lab focused on evaluating frontier AI coding models. The role involves using frontier coding agents to complete machine learning tasks,... ...client-driven sprints requiring flexible time commitment. Mercor...
- Remote job
- Flexible hours

$400 per month
Mercor is hiring for a role in AI engineering, partnering with a leading research lab on a Frontier Code Agents project. Contributors will evaluate and improve AI coding models through structured... ...is $400 per accepted task, with flexible time commitments based on client...
- Remote job
- Flexible hours

$160k - $190k
...of Technical Staff, Agent Workflow Systems and Evaluation The Member of Technical... ..., and scales AI-enabled workflows. This... ...metrics. Mentor FDEs, analysts, engineers, and... ...CA; Denver, CO; or remote option available. The... ...Disability Coverage Flexible Spending Accounts (FSA...
- Remote work
- Flexible hours
- Work at office

$400 per month
...backend engineers to participate in an AI research project. You will use frontier AI coding agents to tackle complex engineering tasks and evaluate AI-generated code. Ideal candidates... ...each task paying $400. This role offers flexible sprint-based work commitments in Culver...
- Remote job
- Flexible hours

$55 per hour
A leading AI innovation firm in Dallas is seeking QAs for autonomous AI agents to ensure the quality of complex systems and scenarios. This flexible, remote project is ideal for those with excellent analytical... ...should be adept at evaluating scenarios and documenting findings...
- Remote job
- Flexible hours

$400 per unit
...backend engineers for the Frontier Code Agents project. In this role, you will leverage frontier AI coding agents to perform complex engineering tasks and evaluate model-generated code for quality... ...performance. The position allows flexible sprint-based work, with...
- Remote job
- Flexible hours

$400 per month
Mercor is seeking contributors for a project with a leading AI research lab. Work focuses on evaluating frontier AI coding agents by completing and reviewing engineering tasks within a flexible sprint-based schedule. Compensation is $400 per accepted task, typically taking...
- Remote job
- Flexible hours

$60 per hour
...shape the future of AI. About The Role We'... ...realistic and structured evaluation scenarios for LLM-based agents. You'll create test... .... Background in QA, software testing,... ...freelance role is fully remote so you just need a... ...Take part in a flexible, remote, freelance project...
- Remote work
- Flexible hours
- Part time
- Freelance

$80 per hour
...domain experts with cutting‑edge AI projects from innovative... ...tools for running and evaluating agent behavior. You’ll implement base... ...communication skills – you’ll work with QA and writers We also value... ...needs Take part in a flexible, remote, freelance project that fits...
- Remote work
- Flexible hours
- Freelance

$400 per month
...seeking experienced systems engineers to evaluate and support frontier AI coding models through structured assessments. You will use AI coding agents for complex tasks, review system... ...completed in 2-3 hours. This role offers flexible engagement based on client demand and...
- Remote job
- Flexible hours

A leading AI research accelerator is looking for a detail-oriented Business Analyst to support evaluation and annotation workflows. The role involves... ...compensation, flexible working hours, and the... ...opportunity to contribute to impactful research. Remote work is available with...
- Remote job
- Flexible hours

...Agent Engineer We're seeking an Agent Engineer to... ...agent systems and making AI easy for developers to... ...into reliable, high-impact features. Key Responsibilities... ...high-value features Evaluate and integrate open-... ...Benefits Flexible working hours Daily...
- Remote work
- Flexible hours
- Worldwide

$232k - $348k
...We free people and agents to ship what’s... ...Next.js, v0, and AI SDK, we create products... ...role is listed as remote. For location-... ...engineering, model evaluation, and retrieval-augmented... ...on a small, high-impact team. Excellent... ...network and skills. Flexible Time Off. We will...
- Remote work
- Flexible hours
- Full time
- Work at office
- Work from home
- Worldwide
- Monday to Friday
- Shift work

$202.5k - $247.5k
...Engineer III/Senior, Agent ngrok is an all-... ...or running AI workloads in production... ...networking, integrations, remote management) that... ...will be evaluated based on factors including... ...experience, potential impact, and scope of role)... ...you. Actually flexible time off. We say "open...
- Remote work
- Flexible hours
- Permanent employment
- Full time
- Work at office
- Local area
- Immediate start
- Home office

...loyalty solutions, is seeking an AI Engineer to develop and support an internal agent platform. The role involves... ...reliability through evaluations and guardrails. With a flexible work environment and a focus... ...ready to make a significant impact in the field. ...
- Remote job
- Flexible hours

About Level AI Level AI is on a mission to... ...(SLMs) to power AI Agents across the entire... ...conversation analytics for QA, coaching, and... ...Implement evaluation (evals) frameworks... ...performance‑based upside Flexible vacation policy Health... ...distributed, high‑impact team Opportunity...
- Remote job
- Flexible hours

$108k - $170k
...About Us Observe.AI is the AI Agents platform for customer... ...telephony setup, and evaluation frameworks. Client... ...dependents ~ Flexible Paid Time Off: Our unlimited... ...: Support for remote and hybrid work connectivity... ...ambitious, make an impact wherever you go, and...
- Remote work
- Flexible hours
- Full time
- Work at office
- Local area

$2,000 per month
Elastic, the Search AI Company, enables... ...execution for the Elastic Agent Builder. As... .... In this high‑impact role, you will be... ...benchmarking and evaluations of agent capabilities... ...in a fast‑paced, remote‑first environment.... ...your calendar with flexible locations and schedules...
- Remote work
- Flexible hours
- Local area
- Shift work

$133k - $149k
About Us Observe.AI is the AI Agents platform for customer experience... ...smooth, high‑impact, and set up for long‑... ...delivery templates, evaluation frameworks, UAT processes... ...eligible dependents Flexible Paid Time Off: Our... ...Stipend: Support for remote and hybrid work connectivity...
- Remote work
- Flexible hours
- Full time
- Work at office
- Local area

