Remote | AI/ML Technical Evaluation Consultant -- $60-$90/hour

$60 - $90 per hour

24-MAG

Remote job

Job Description

We are sharing a specialised part-time consulting opportunity for AI, machine learning, data science, data engineering, software engineering, and STEM professionals experienced in technical task design, programming, statistical methods, ML modeling, computational reasoning, agentic workflows, and structured evaluation.

This role supports current and upcoming remote consulting opportunities focused on AI/ML task design, agentic technical evaluation, data science and software workflow review, ground truth solution development, technical feedback, rubric creation, and high-quality project execution. Selected professionals will help design challenging tasks, evaluate AI agent outputs, and improve the rigor of technical evaluation materials across AI, data, and STEM domains.

Key Responsibilities

Professionals in this role may contribute to:

Agentic AI & Technical Task Design

Design challenging agentic tasks rooted in real-world machine learning, data science, data engineering, software, and technical workflows
Write accurate, well-documented solutions that serve as ground truth for evaluation
Surface technical nuances, edge cases, and reasoning gaps that distinguish expert-level work from surface-level responses
Create tasks that require strong programming, analytical, statistical, or computational judgment

AI Output Evaluation & Technical Feedback

Evaluate AI agent outputs against reference solutions for correctness, efficiency, reasoning quality, and technical rigor
Review outputs involving programming, data analysis, ML modeling, statistical methods, software reasoning, or computational methods
Identify flawed logic, incomplete solutions, inefficient approaches, weak assumptions, or unsupported technical conclusions
Provide detailed written feedback that clearly explains technical issues and improvement areas

Evaluation Frameworks & Review Consistency

Develop and refine evaluation frameworks and rubrics for assessing agentic behavior on AI and data science tasks
Apply structured review standards across technical domains and task types
Collaborate with other subject-matter experts to support consistency and accuracy
Maintain high standards for clarity, reproducibility, technical correctness, and written explanation

Ideal Profile

Strong candidates may have:

3+ years of research, academic, or industry experience in machine learning, data science, software engineering, computer science, statistics, engineering, mathematics, physics, chemistry, biology, materials science, or another STEM field
Demonstrated technical expertise in at least one of the following: programming, data analysis, ML modeling, statistical methods, or computational methods
Ability to design and evaluate complex technical tasks with strong subject-matter judgment
Prior experience with data annotation, labeling, evaluation, or human feedback collection (a strong plus)
Experience with LLMs, AI systems, or agentic workflows (a plus)
Familiarity with agentic frameworks (a plus)
Strong written communication skills and ability to explain technical decisions clearly
Ability to commit approximately 40 hours per week during weekdays depending on engagement scope

Educational Background

Academic or professional backgrounds in machine learning, data science, computer science, software engineering, statistics, mathematics, engineering, physics, chemistry, biology, materials science, or related STEM fields may be highly relevant
Research, industry, or applied technical experience in programming, modeling, data analysis, computational methods, or technical evaluation may be especially valuable
Equivalent professional experience may be considered depending on project needs

Nice to Have

Experience with Python, R, SQL, data pipelines, ML workflows, software development, notebooks, model evaluation, or data engineering tools
Experience developing benchmark tasks, evaluation frameworks, rubrics, or technical review guidelines
Familiarity with AI agent behavior, tool use, multi-step reasoning, or agentic task execution
Experience reviewing AI-generated technical outputs or human-written technical solutions
Comfort working across multiple technical domains and evaluating complex reasoning quality

Why This Opportunity

Apply AI/ML, data science, software, and STEM expertise to structured remote consulting work
Contribute to high-quality technical task design, agentic evaluation, ground truth solution development, and rubric creation
Work on assignments aligned with your machine learning, data science, software engineering, or STEM background
Use your technical judgment to improve the rigor and clarity of AI and data evaluation materials
Remote structure with competitive hourly compensation

Contract Details

Independent contractor role
Fully remote with weekday availability expected
Depending on project needs, eligible professionals should be based in the United States
Expected commitment of approximately 40 hours per week during weekdays depending on engagement scope
Competitive rates between $60–$90 per hour depending on expertise and project scope
Weekly payments via Stripe or Wise
Projects may be extended, shortened, or adjusted depending on scope and performance
Work will not involve access to confidential or proprietary information from any employer, client, or institution

About the Platform

This opportunity is available through 24-MAG LLC. We connect experienced professionals with remote consulting opportunities across technical, evaluation, and project-based workstreams.

By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy.


Vacancy posted 2 days ago

