Remote | AI/ML Technical Evaluation Consultant -- $60-$90/hour
$60 - $90 per hour24-MAG
- Remote job
Job Description
Job Description
We are sharing a specialised part-time consulting opportunity for AI, machine learning, data science, data engineering, software engineering, and STEM professionals experienced in technical task design, programming, statistical methods, ML modeling, computational reasoning, agentic workflows, and structured evaluation.
This role supports current and upcoming remote consulting opportunities focused on AI/ML task design, agentic technical evaluation, data science and software workflow review, ground truth solution development, technical feedback, rubric creation, and high-quality project execution. Selected professionals will help design challenging tasks, evaluate AI agent outputs, and improve the rigor of technical evaluation materials across AI, data, and STEM domains.
Key Responsibilities
Professionals in this role may contribute to:
Agentic AI & Technical Task Design
- Design challenging agentic tasks rooted in real-world machine learning, data science, data engineering, software, and technical workflows
- Write accurate, well-documented solutions that serve as ground truth for evaluation
- Surface technical nuances, edge cases, and reasoning gaps that distinguish expert-level work from surface-level responses
- Create tasks that require strong programming, analytical, statistical, or computational judgment
AI Output Evaluation & Technical Feedback
- Evaluate AI agent outputs against reference solutions for correctness, efficiency, reasoning quality, and technical rigor
- Review outputs involving programming, data analysis, ML modeling, statistical methods, software reasoning, or computational methods
- Identify flawed logic, incomplete solutions, inefficient approaches, weak assumptions, or unsupported technical conclusions
- Provide detailed written feedback that clearly explains technical issues and improvement areas
Evaluation Frameworks & Review Consistency
- Develop and refine evaluation frameworks and rubrics for assessing agentic behavior on AI and data science tasks
- Apply structured review standards across technical domains and task types
- Collaborate with other subject-matter experts to support consistency and accuracy
- Maintain high standards for clarity, reproducibility, technical correctness, and written explanation
Ideal Profile
Strong candidates may have:
- 3+ years of research, academic, or industry experience in machine learning, data science, software engineering, computer science, statistics, engineering, mathematics, physics, chemistry, biology, materials science, or another STEM field
- Demonstrated technical expertise in at least one of the following: programming, data analysis, ML modeling, statistical methods, or computational methods
- Ability to design and evaluate complex technical tasks with strong subject-matter judgment
- Prior experience with data annotation, labeling, evaluation, or human feedback collection as a strong plus
- Experience with LLMs, AI systems, or agentic workflows as a plus
- Familiarity with agentic frameworks as a plus
- Strong written communication skills and ability to explain technical decisions clearly
- Ability to commit approximately 40 hours per week during weekdays depending on engagement scope
Educational Background
- Academic or professional backgrounds in machine learning, data science, computer science, software engineering, statistics, mathematics, engineering, physics, chemistry, biology, materials science, or related STEM fields may be highly relevant
- Research, industry, or applied technical experience in programming, modeling, data analysis, computational methods, or technical evaluation may be especially valuable
- Equivalent professional experience may be considered depending on project needs
Nice to Have
- Experience with Python, R, SQL, data pipelines, ML workflows, software development, notebooks, model evaluation, or data engineering tools
- Experience developing benchmark tasks, evaluation frameworks, rubrics, or technical review guidelines
- Familiarity with AI agent behavior, tool use, multi-step reasoning, or agentic task execution
- Experience reviewing AI-generated technical outputs or human-written technical solutions
- Comfort working across multiple technical domains and evaluating complex reasoning quality
Why This Opportunity
- Apply AI/ML, data science, software, and STEM expertise to structured remote consulting work
- Contribute to high-quality technical task design, agentic evaluation, ground truth solution development, and rubric creation
- Work on assignments aligned with your machine learning, data science, software engineering, or STEM background
- Use your technical judgment to improve the rigor and clarity of AI and data evaluation materials
- Remote structure with competitive hourly compensation
Contract Details
- Independent contractor role
- Fully remote with weekday availability expected
- Eligible professionals should be based in the United States depending on project needs
- Expected commitment of approximately 40 hours per week during weekdays depending on engagement scope
- Competitive rates between $60–$90 per hour depending on expertise and project scope
- Weekly payments via Stripe or Wise
- Projects may be extended, shortened, or adjusted depending on scope and performance
- Work will not involve access to confidential or proprietary information from any employer, client, or institution
About the Platform
This opportunity is available through 24-MAG LLC. We connect experienced professionals with remote consulting opportunities across technical, evaluation, and project-based workstreams.
By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy:
$60 - $90 per hour
...specialised part-time consulting opportunity for... ...reasoning, technical task design,... ...solving, solution evaluation, and structured... ...and upcoming remote consulting... ...solutions produced by AI systems or... ...approximately 40 hours per week during... ...rates between $60–$90 per hour depending...Remote jobHourly payWeekly payJob sharingContract workPart timeFor contractorsWeekday work$60 per hour
...looking for Biology Experts and Life Science Professionals in Las Vegas, NV to evaluate AI-generated science. Successful candidates will work flexibly from home, earning up to $60 per hour for paid tasks. Responsibilities include reviewing scientific data accuracy, validating...Remote jobHourly payWork from homeFlexible hours$20 - $30 per hour
...specialised part-time consulting opportunity for... ...experienced in AI safety evaluation, red team testing... ...and upcoming remote consulting opportunities... ..., and socio-technical risk patterns at... ...with adversarial ML concepts, jailbreak... ...with competitive hourly compensation Contract...Remote jobHourly payWeekly payJob sharingContract workPart timeFor contractorsFlexible hours$50 - $60 per hour
...committed to creating high-quality AI. Enjoy the flexibility of remote work and the freedom to set your... ...own home Projects are paid hourly starting at USD $50-$60 per hour, with bonus rates... ...diverse and complex problems and evaluate their outputs Evaluate the quality...Remote workHourly payContract workFor contractorsWork experience placement$50 - $60 per hour
...committed to creating high-quality AI. Enjoy the flexibility of remote work and the freedom to set your... ...own home Projects are paid hourly starting at USD $50-$60 per hour, with bonus rates... ...diverse and complex problems and evaluate their outputs Evaluate the quality...Remote workHourly payContract workFor contractorsWork experience placement$50 - $60 per hour
...committed to creating high-quality AI. Join our team to help... ...the flexibility of remote work and the freedom to set... ...schedule Projects are paid hourly starting at USD $50-$60 per hour, with bonuses on... ...diverse and complex problems and evaluate their outputs Evaluate...Remote workHourly payContract workWork experience placementFlexible hours$50 - $60 per hour
...committed to creating high-quality AI. Enjoy the flexibility of remote work and the freedom to set your... ...own home Projects are paid hourly starting at USD $50-$60 per hour, with bonus rates... ...diverse and complex problems and evaluate their outputs Evaluate the quality...Remote workHourly payContract workFor contractorsWork experience placement$60 per hour
...for experienced quantitative professionals to evaluate AI-generated analysis and shape future AI systems. This fully remote position allows you to work from anywhere in... ...flexible schedule and competitive pay of up to $60 per hour. Ideal candidates will have 2+ years of...Remote jobHourly payFlexible hours$60 per hour
A leading AI development company in the United States seeks experienced quantitative professionals to evaluate AI-generated analyses and contribute to the development... ...of AI systems. You'll work remotely, enjoy flexible hours, and can earn up to $60 USD/hour. Candidates should...Remote jobFlexible hours$60 per hour
A leading AI development firm is seeking experienced quantitative professionals... ...cutting-edge AI systems. This fully remote role allows you to evaluate AI-generated analyses and solve... ...methodologies. Enjoy competitive pay of up to $60/hour while impacting the future of AI...Remote job$60 per hour
A leading AI development team is seeking experienced quantitative professionals to evaluate AI-generated quantitative work and design training problems. Work fully remote with a flexible schedule and competitive pay up to $60/hour. Ideal candidates should have 2+ years...Remote jobFlexible hours$60 per hour
A leading AI development firm is seeking experienced quantitative professionals to evaluate AI-generated analytical work and solve complex... ...feedback and insights. This is a remote position, allowing... ...Payment is competitive, up to $60/hour, enhancing AI systems' capabilities...Remote job$60 per hour
A leading AI development firm is seeking quantitative professionals to evaluate and shape AI-generated analytics. Candidates should... ..., and fluency in English. This remote position allows for flexible scheduling and pays up to $60 USD/hour, providing an impactful opportunity...Remote jobFlexible hours$60 per hour
A leading AI development company is seeking quantitative professionals to aid in evaluating and shaping advanced AI systems. This fully remote opportunity allows you to work from various locations... ..., with competitive pay up to $60 per hour. Ideal candidates will have a background...Remote jobHourly pay$60 per hour
A leading AI development company seeks experienced quantitative professionals to evaluate AI-generated work and contribute to developing... ...fields while enjoying a fully remote work setup with a flexible... ...a competitive pay of up to $60 USD/hour. #J-18808-Ljbffr DataAnnotationRemote jobFlexible hours$60 per hour
A forward-thinking AI development company seeks proficient programmers... ...You will write quality code, evaluate AI-generated outputs, and... .... Employees enjoy a fully remote work setup with flexible... ...and competitive pay of up to $60 USD/hour, allowing for meaningful contributions...Remote jobFlexible hours$60 per hour
A leading AI development company is seeking quantitative professionals to evaluate AI-generated outputs and solve complex problems. This fully remote role allows flexibility and offers competitive pay of up to $60 USD/hour. Ideal candidates should have at least 2 years...Remote jobFlexible hours$60 per hour
A leading AI development firm is looking for experienced quantitative professionals to evaluate AI-generated analyses and provide feedback to enhance AI systems. This remote role offers a flexible schedule and competitive pay up to $60 USD/hour, making it suitable for...Remote jobFlexible hours$50 - $60 per hour
A technology firm specializing in AI is seeking an Accountant to evaluate the performance of AI chatbots. Responsibilities... ...in financial analysis. The position is remote, with a flexible schedule and hourly pay starting at $50-$60 USD, plus bonuses for high-quality work...Remote jobHourly payFlexible hours$50 - $60 per hour
...technology company specializing in AI is looking for a Chief... ...This flexible role allows for remote work, accommodating both full... ...mandatory. Responsibilities include evaluating AI chatbot outputs and... ...Compensation starts at $50-$60 per hour, with project bonuses available...Remote jobHourly payFull timePart timeFlexible hours$60 per hour
...Management Consulting Domain Consultant Toloka AI supports frontier... ...environments, tasks, and evaluation frameworks... ..., and applied ML systems,... ...uncertainty No deep technical background is... ...This is a remote, project-based,... ...can earn up to $60 per hour equivalent, depending...Remote workHourly pay$50 per hour
...seeks Product Designers and UX Specialists to help train AI models using your expertise. You'll evaluate AI-generated designs and ensure usability and... ...experience. This position offers competitive pay (up to $50/hour) and requires proficiency in design tools like Figma and...Remote jobWork from homeFlexible hours$50 - $60 per hour
...located in the United States, seeks a Financial Advisor to evaluate AI Assistant models for finance. This role offers... ...model performance. This independent contract position pays hourly rates starting from $50-$60 with the potential for bonuses based on quality and volume...Remote jobHourly payFull timeContract workPart timeFlexible hours$60 per hour
A cutting-edge AI development company is looking for proficient... ...to AI systems. This fully remote role allows you to choose your... ...with competitive pay up to $60 USD/hour. Ideal candidates will have fluency... ..., writing quality code, and evaluating AI-generated code. A bachelor...Remote jobFlexible hours$90 per hour
...sharing a specialised remote consulting opportunity for... ..., production ML judgment, and the ability to evaluate complex machine learning and AI engineering... ...across realistic technical scenarios. This... ...in focused 12–24 hour sprint windows... ...may reach up to $90/hour, depending...Remote jobJob sharingContract workFor contractorsFlexible hours$60 - $80 per hour
...seeking senior HR and People Operations professionals to build evaluation tasks for AI systems operating in Fortune 500 people contexts. The... ...HR enablement authorship is a plus. Compensation Note Hourly Pay: $60 to $80 per hour, set by Mercor based on demonstrated...Remote jobHourly pay$50 - $60 per hour
A technology company focused on AI is seeking a Financial Consultant to contribute to the training of AI models. This flexible role allows... ..., analysis, and modeling. The position offers hourly pay starting at $50-$60, plus bonuses for high-quality work and is available...Remote jobHourly payFull timePart timeFlexible hours$60 - $90 per hour
...About the job Remote | People Operations & HRIS Specialist - $60-$90/hour We are sharing a specialised part-time consulting opportunity for professionals... ...operations scenarios, evaluate policy and documentation... ...opportunities across technical, evaluation, and project...Remote workHourly payWeekly payContract workPart timeFor contractorsFlexible hours$65 - $90 per hour
...seeking senior sales and marketing professionals to build evaluation tasks for AI systems operating in Fortune 500 go-to-market contexts.... ...‑content authorship is a plus. Compensation Note Hourly Pay: $65 to $90 per hour, set by Mercor based on demonstrated expertise...Remote jobHourly pay$150 per hour
...TryApplyNow is seeking a Research Scientist for a fully remote position. The role involves evaluating AI-generated technical content, writing domain-specific prompts, and... ...anywhere in eligible locations with no fixed schedule or minimum hours required. #J-18808-LjbffrRemote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Remote | AI/ML Technical Evaluation Consultant -- $60-$90/hour. Be the first to apply!
- senior technical advisor New York, NY
- oracle apps technical consultant New York, NY
- technical solutions consultant New York, NY
- senior technical consultant New York, NY
- oracle erp cloud technical consultant New York, NY
- technical advisor New York, NY
- technical consultant New York, NY
- remote quality assurance New York, NY
- remote wordpress developer New York, NY
- remote accounts payable New York, NY



