Remote | Machine Learning Systems Evaluation Engineer - Up to $90/hour

$90 per hour

24-MAG LLC

About the job Remote | Machine Learning Systems Evaluation Engineer - Up to $90/hour

We are sharing a specialised remote consulting opportunity for experienced machine learning engineers with strong coding agent experience, production ML judgment, and the ability to evaluate complex machine learning and AI engineering implementations across realistic technical scenarios.

This role supports current and upcoming remote consulting opportunities focused on machine learning system evaluation, coding-agent-assisted technical workflows, ML implementation review, inference system assessment, MLOps evaluation, and LLM application analysis. Selected professionals may use tools such as Cursor, Claude Code, Codex, Windsurf, Gemini CLI, or comparable coding agents to complete, review, and evaluate technical tasks involving model training, deployment infrastructure, inference workflows, AI-powered products, and production machine learning systems.

Key Responsibilities

Professionals in this role may contribute to:

Machine Learning Implementation Review

Use modern coding agents to complete and evaluate complex machine learning and AI engineering tasks
Review generated implementations involving model training, inference systems, MLOps workflows, LLM applications, and AI-powered product features
Assess technical outputs for correctness, quality, maintainability, performance, reliability, and production-readiness
Apply professional machine learning engineering judgment to realistic technical scenarios

MLOps, Deployment & Inference Evaluation

Evaluate ML system workflows involving model deployment, inference infrastructure, monitoring, testing, and production integration
Review implementation choices related to scalability, latency, data flow, model serving, reliability, and system maintainability
Identify bugs, edge cases, performance issues, failure modes, and weak assumptions in ML engineering outputs
Provide structured feedback on MLOps design, deployment patterns, and production ML system quality

Coding Agent Output Assessment

Compare outputs from multiple coding agents and assess their strengths, weaknesses, accuracy, and practical usefulness
Identify where generated solutions succeed, where they fail, and where additional ML engineering judgment is required
Evaluate whether generated machine learning implementations reflect real-world engineering standards
Document technical review findings clearly for project teams and quality evaluation workflows

Technical Documentation & Feedback

Produce clear, structured evaluations of machine learning engineering tasks and generated outputs
Explain reasoning around model training, inference systems, deployment infrastructure, LLM applications, performance, and architectural trade-offs
Support technical assessment workflows by documenting accepted work, improvement areas, and practical engineering conclusions
Help ensure outputs reflect production-scale machine learning engineering expectations

Ideal Profile

Strong candidates may have:

2+ years of professional machine learning engineering experience
Hands-on experience building production ML systems, model deployment infrastructure, LLM applications, or AI-powered products
Regular use of AI coding agents such as Cursor, Claude Code, Codex, Windsurf, Gemini CLI, or comparable tools
Ability to evaluate generated machine learning implementations and identify technical trade-offs, bugs, edge cases, and performance issues
Experience deploying ML systems to production is strongly preferred
Strong understanding of model training, inference workflows, MLOps, data pipelines, evaluation methods, deployment patterns, and system reliability
Clear written communication skills and comfort documenting technical reasoning in a remote, project-based environment

Educational Background

A degree in Computer Science, Machine Learning, Artificial Intelligence, Data Science, Software Engineering, Computer Engineering, Statistics, Mathematics, or a related technical field is helpful
Equivalent professional experience in machine learning engineering, applied AI, MLOps, LLM applications, or production ML systems is also highly relevant

Nice to Have

Experience with Python, PyTorch, TensorFlow, scikit-learn, Hugging Face, LangChain, LlamaIndex, MLflow, Ray, or comparable ML tools
Familiarity with model serving, feature pipelines, vector databases, embeddings, retrieval systems, LLM application architecture, or evaluation frameworks
Experience with cloud platforms, Docker, Kubernetes, CI/CD pipelines, observability tooling, or production deployment workflows
Background in technical code review, ML architecture review, model performance evaluation, or large-scale AI product engineering
Strong comfort working in sprint-based project environments with focused technical assessment windows

Why This Opportunity

Remote consulting work aligned with machine learning engineering, coding agent, and technical evaluation expertise
Opportunity to evaluate realistic ML engineering workflows involving model training, inference systems, MLOps, LLM applications, and production AI systems
Suitable for engineers who enjoy technical assessment, tool-assisted coding workflows, ML implementation review, and practical system-level problem-solving
Sprint-based project work that can align with focused availability and remote schedules

Contract Details

Independent contractor engagement
Fully remote and flexible scheduling
Sprint-based, project-based availability
Some project work may run in focused 12-24 hour sprint windows depending on project requirements
Compensation may reach up to $90/hour, depending on project scope, experience, and accepted work structure
Some projects may use accepted-task compensation depending on the specific workflow
Payments are made weekly via Stripe or Wise based on services rendered
Projects may be extended, shortened, adjusted, or concluded based on project needs and performance
Candidates requiring H1-B or STEM OPT sponsorship support are not eligible at this time
Work must not involve sharing confidential or proprietary information from any employer, client, or institution

About the Platform

This opportunity is available through 24-MAG LLC. We connect experienced professionals with remote consulting opportunities across technical, evaluation, and project-based workstreams.

By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy:

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the Remote | Machine Learning Systems Evaluation Engineer - Up to $90/hour in United States vacancy

Remote | Systems Architecture Evaluation Engineer - Up to $90/hour
$90 per hour
...About the job Remote | Systems Architecture Evaluation Engineer - Up to $90/hour We are sharing a specialised remote consulting opportunity for experienced systems engineers with strong coding agent experience, practical systems-level engineering judgment, and...
Remote work
Contract work
For contractors
Flexible hours
24-MAG LLC
United States
1 day ago
Remote | Fraud & Risk Systems Evaluation Engineer — Up to $90/hour
$90 per hour
...sharing a specialised remote consulting... ...and trust & safety engineering professionals with... ...across fraud detection systems, risk scoring... ...risk engineering evaluation, coding-agent-assisted... ...in focused 12–24 hour sprint windows depending... ...may reach up to $90/hour, depending...
Remote job
Contract work
For contractors
Flexible hours
24-MAG
New York, NY
5 days ago
Remote | MLOps & ML Systems Engineer - $60-$100/hour
$60 - $100 per hour
...About the job Remote | MLOps & ML Systems Engineer - $60-$100/hour We are sharing a specialised part-time... ...programming, and structured technical evaluation. This role supports current... ...feedback across advanced machine learning engineering workflows. Key Responsibilities...
Remote work
Hourly pay
Weekly pay
Contract work
Part time
For contractors
Flexible hours
Weekday work
24-MAG LLC
United States
2 days ago
Remote | Data Science Evaluation Consultant — $60-$90/hour
$60 - $90 per hour
...modeling, applied machine learning, data analysis... ...task design, evaluation frameworks,... ...and upcoming remote consulting opportunities... ...with AI systems, LLMs, or... ...mathematics, economics, engineering, machine... ...40 hours per week during... ...between $60–$90 per hour depending...
Remote job
Hourly pay
Weekly pay
Contract work
Part time
For contractors
Weekday work
24-MAG
New York, NY
6 days ago
Remote | Supply Chain & Manufacturing Evaluation Consultant - $90-$120/hour
$90 - $120 per hour
...About the job Remote | Supply Chain & Manufacturing Evaluation Consultant - $90-$120/hour We are sharing a specialised part-... ...chain management, industrial engineering, manufacturing engineering, logistics... ...CPIM, CSCP, CLTD, ERP/MRP systems, SAP, Oracle, NetSuite,...
Remote work
Hourly pay
Weekly pay
Contract work
Part time
For contractors
Flexible hours
Weekday work
24-MAG LLC
United States
1 day ago
Remote | Cloud Infrastructure Evaluation Engineer - Up to $90/hour
$90 per hour
...About the job Remote | Cloud Infrastructure Evaluation Engineer - Up to $90/hour We are sharing a specialised part-time consulting opportunity for experienced... ...infrastructure engineering, Kubernetes, CI/CD systems, observability, automation tooling, and AI...
Remote work
Contract work
Part time
For contractors
Flexible hours
24-MAG LLC
United States
2 days ago
Systems Engineer Level 5 (AI Systems & Trust Frameworks)
$175k - $205k
...Systems Engineer Level 5 (AI Systems & Trust Frameworks)... ...Posted Date 3 hours ago(6/4/2026 12:03 PM... ...development, integration, evaluation, and... ...Artificial Intelligence (AI), Machine Learning (ML), autonomous systems... ...Flexible work schedules and remote work options....
Remote work
Full time
Flexible hours
Shift work
Wood Consulting
Maryland
2 days ago
Remote CA-based SLP for AAC Evaluations - Flexible Hours
...Pathologist for the 2026-2027 school year. This fully remote position requires California residency, full state licensure, and experience with AAC evaluations. You'll provide flexible teletherapy during school hours, allowing therapists to manage their schedule effectively...
Remote work
Work from home
Flexible hours
TalkPath Live
Fontana, CA
3 days ago
Senior Design System Engineer II
$159.2k - $238.8k
...your recruiter to learn more. Base pay range... ...a Senior Design System Engineer II to join our amazing... ...part of a growing remote-first company with... ...expected to close 90 days from the... ...Serving Systems, Machine Learning Platform... ....00-$100,000.00 6 hours ago United States...
Remote work
Full time
Internship
Local area
Immediate start
Articulate
New York, NY
2 days ago
Remote | Software, AI, IT & Data Evaluation Consultant - $70-$110/hour
$70 - $110 per hour
...About the job Remote | Software, AI, IT & Data Evaluation Consultant - $70-$110/hour We are sharing a specialised... ...spreadsheets, slide decks, system summaries, data analyses,... ...professional experience in software engineering, AI, machine learning, IT systems, data analysis...
Remote work
Hourly pay
Weekly pay
Contract work
Part time
For contractors
Work at office
Flexible hours
24-MAG LLC
United States
1 day ago
Remote Systems Engineer (Coding Agent Experience) - AI Trainer ($85-$85 per hour)
$85 per hour
...Code Agents project. Contributors help evaluate and improve frontier AI coding models through... .... The work focuses on realistic systems engineering workflows and model evaluation. Spots are... ...based project that runs in 12-24 hour stretches based on client requirement....
Remote job
Hourly pay
Mercor
Kent, WA
1 day ago
Senior or Staff ML Systems Engineer, LLMs
$200k - $240k
...world for all. The AI Engineering Team is chartered... ...LLMs) and agentic systems. Our mission is to... ...involved in evaluating and integrating cutting... ...impact. Learn about TRM Speed in... ...issues in minutes to hours, not weeks. We create... ...entire team—onsite and remote—has full...
Remote work
Worldwide
TRM Labs
San Francisco, CA
3 days ago
AI Model Evaluation Specialist - Remote & Flexible Hours
...Specialists to join their Expert Network, helping to train and evaluate cutting-edge AI models. Candidates will take a quick skills assessment... ...and reviewing research papers. The role offers competitive pay rates and flexible work hours from home. #J-18808-Ljbffr...
Remote work
Work from home
Flexible hours
Prolific
Nacogdoches, TX
2 days ago
Remote | Open Source Code Evaluation Engineer — Up to $90/hour
$90 per hour
...open-source software engineers with strong public contribution... ...current and upcoming remote consulting... ...software engineering evaluation, coding task design, test... ...that accurately evaluate system behaviour across diverse... ...may reach up to $90/hour, depending on project...
Remote job
Contract work
Part time
For contractors
Flexible hours
24-MAG
New York, NY
5 days ago
Sr. Manager, Cloud Center of Excellence & Systems Engineering
$200k - $270k
...Excellence (CCoE) to drive engineering efficiency and... ...program and a Systems Engineering team... ...observability playbooks • Evaluate emerging tools and... ...• Run office hours, workshops, and... ...on post-mortem learnings; translate findings... ...Success Looks Like • 90 days: Cloud...
Remote work
Work at office
Arlo Technologies, Inc.
United States
2 days ago
Remote AI Systems Engineer: Frontier Code Agent Evaluator
$400 per month
...role involves using AI coding agents to evaluate complex engineering tasks and assess model-generated... ...have at least 2 years of professional systems engineering experience and proficiency... ...per accepted task, with tasks typically taking 2-3 hours. #J-18808-Ljbffr Mercor
Remote job
Mercor
Mesquite, TX
2 days ago
Systems Engineer, Platform Requirements and Verification
$140k - $190k
...how to describe and evaluate them efficiently and... ...teams at Waabi including Systems Engineering, Hardware and... ...identify and apply lessons learned and ensure a detailed... ...Experience in robotics or machine learning... ...Vacation. Flexible hours and Work from Home support...
Remote work
Full time
Work at office
Work from home
Flexible hours
Waabi
United States
1 day ago
Remote AI Systems Engineer: Frontier Code Agent Evaluator
$400 per month
...is looking for Frontier Code Agents to evaluate and improve coding models based on structured... ...Ideal candidates will have 2+ years of systems engineering experience and knowledge of AI coding... ...task, with typical tasks requiring 2-3 hours of work. #J-18808-Ljbffr Mercor
Remote job
Mercor
Hanover Park, IL
2 days ago
Remote | STEM PhD Research Evaluation Consultant — $60-$90/hour
$60 - $90 per hour
...solving, solution evaluation, and structured feedback... ...and upcoming remote consulting... ...mathematical, computational, engineering, or analytical... ...solutions produced by AI systems or other... ...approximately 40 hours per week during weekdays... ...between $60–$90 per hour depending...
Remote job
Hourly pay
Weekly pay
Contract work
Part time
For contractors
Weekday work
24-MAG
New York, NY
6 days ago
Remote AI Systems Engineer: Frontier Code Agent Evaluator
$400 per month
...project. This role involves evaluating and improving frontier AI coding... ...using AI coding agents for engineering tasks, reviewing generated code... ...will have 2+ years of systems engineering experience and familiarity... ...with projects in sprints of 12-24 hours. #J-18808-Ljbffr Mercor
Remote job
Mercor
Kent, WA
2 days ago
Remote Bio AI Trainer & Evaluator — Flexible Hours
$60 per hour
...looking for Biology Experts and Life Science Professionals in Las Vegas, NV to evaluate AI-generated science. Successful candidates will work flexibly from home, earning up to $60 per hour for paid tasks. Responsibilities include reviewing scientific data accuracy, validating...
Remote job
Hourly pay
Work from home
Flexible hours
Prolific
Las Vegas, NV
2 days ago
Remote AI Systems Engineer: Frontier Code Agent Evaluator
$400 per month
...collaboration with an AI research lab. Applicants will evaluate AI coding models through structured assessments, focusing on systems engineering workflows. Candidates should have at least... ...00 per accepted task, typically taking 2-3 hours. If you possess relevant experience in...
Remote job
Mercor
Littleton, CO
2 days ago
Machine Learning Ops Engineer | Remote | $90 -$140/hr
$90 - $140 per hour
...advancing next-generation AI systems through large-scale ML... ..., and framework-level engineering. The work involves... ...Full-time Compensation: $90-$140/hour Location: United States (Remote) Role Responsibilities... ...structured technical solutions Evaluate ML systems outputs and...
Remote job
Full time
Weekday work
Call For Referral
New York, NY
2 days ago
AI Design Evaluator (Remote · Flexible Hours)
$50 per hour
...Specialists to help train AI models using your expertise. You'll evaluate AI-generated designs and ensure usability and accessibility... ...professional experience. This position offers competitive pay (up to $50/hour) and requires proficiency in design tools like Figma and Adobe...
Remote job
Work from home
Flexible hours
Prolific
Sacramento, CA
3 days ago
Systems Engineer, Mid
$61.9k - $141k
...Number: R0239753 Systems Engineer, Mid The Opportunity... ..., test and evaluation of system performance... ...working at least 20 hours a week on a regular... ...encourage you to learn more about our total... ...will close within 90 days from the... ...Date. Work Model Remote: If this position...
Remote work
Full time
Contract work
Part time
Work at office
Local area
Phase2 Technology
Dahlgren, VA
4 days ago
Remote Legal Expert for AI Training | Flexible Hours
$100 per hour
...use their legal expertise to evaluate and improve these models. Researchers pay up to $100 per hour, depending on skills and experience... ...independently on ad‑hoc, remote assignments that fit around... ...between human expertise and machine learning. Key Technologies Prolific Platform...
Remote work
Hourly pay
Work from home
Flexible hours
Prolific
Louisville, KY
4 days ago
AI/ML Systems Engineer
$132.9k - $207.75k
...Artificial Intelligence/Machine Learning Systems Engineer to join our team of professionals... ...Include: Evaluation of advanced AI/ML concepts... ...a drug-free workplace. Remote/hybrid status is subject to... ...employees working at least 20 hours a week on a regular basis...
Remote work
Full time
Contract work
Part time
Work experience placement
Work at office
Local area
Noblis
Chantilly, Loudoun County, VA
1 day ago
Remote Network Engineer - Data for Autonomous Systems annotation - AI Trainer ($50-$70 per hour)
$50 - $70 per hour
...3 / Tier 3 network support engineer interested in data science and... ...integrated networking systems and using the data they generate... ...how raw infra data becomes machine learning input Want to contribute to... ...are expected to work 30-40 hours/week. #J-18808-Ljbffr Mercor
Remote job
Hourly pay
Immediate start
Mercor
Allen, TX
1 day ago
Combat Data Systems Engineer
$62k - $141k
...Combat Data Systems Engineer The Opportunity:... ...be vital as you evaluate and modernize F-3... ...Knowledge of AI and machine learning approaches to multi... ...at least 20 hours a week on a regular... ...will close within 90 days from the Posting... ...in person or remotely. If this position...
Remote work
Full time
Contract work
Part time
Local area
Booz Allen Hamilton
Arlington, VA
more than 2 months ago
Senior Systems Engineer - Partial Telework
$170k - $260k
...clearance. Are you a Senior Systems Engineer who is ready for a new... ...phase, then dropped off on a remote contract and never seen or... ...Cloud Technologies, Analytics, Machine Learning, AI, and DevOps... ...Partial Telework (up to 16 hours per week) Salary Range: The...
Remote work
16 hours
Full time
Contract work
Work from home
Relocation package
GliaCell Technologies LLC
Annapolis Junction, MD
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Remote | Machine Learning Systems Evaluation Engineer - Up to $90/hour. Be the first to apply!