Remote | Machine Learning Systems Evaluation Engineer — Up to $90/hour

$90 per hour

24-MAG

New York, NY
  • Remote job

We are sharing a specialised remote consulting opportunity for experienced machine learning engineers with strong coding agent experience, production ML judgment, and the ability to evaluate complex machine learning and AI engineering implementations across realistic technical scenarios.

This role supports current and upcoming remote consulting opportunities focused on machine learning system evaluation, coding-agent-assisted technical workflows, ML implementation review, inference system assessment, MLOps evaluation, and LLM application analysis. Selected professionals may use tools such as Cursor, Claude Code, Codex, Windsurf, Gemini CLI, or comparable coding agents to complete, review, and evaluate technical tasks involving model training, deployment infrastructure, inference workflows, AI-powered products, and production machine learning systems.

Key Responsibilities

Professionals in this role may contribute to:

Machine Learning Implementation Review

  • Use modern coding agents to complete and evaluate complex machine learning and AI engineering tasks
  • Review generated implementations involving model training, inference systems, MLOps workflows, LLM applications, and AI-powered product features
  • Assess technical outputs for correctness, quality, maintainability, performance, reliability, and production-readiness
  • Apply professional machine learning engineering judgment to realistic technical scenarios

MLOps, Deployment & Inference Evaluation

  • Evaluate ML system workflows involving model deployment, inference infrastructure, monitoring, testing, and production integration
  • Review implementation choices related to scalability, latency, data flow, model serving, reliability, and system maintainability
  • Identify bugs, edge cases, performance issues, failure modes, and weak assumptions in ML engineering outputs
  • Provide structured feedback on MLOps design, deployment patterns, and production ML system quality

Coding Agent Output Assessment

  • Compare outputs from multiple coding agents and assess their strengths, weaknesses, accuracy, and practical usefulness
  • Identify where generated solutions succeed, where they fail, and where additional ML engineering judgment is required
  • Evaluate whether generated machine learning implementations reflect real-world engineering standards
  • Document technical review findings clearly for project teams and quality evaluation workflows

Technical Documentation & Feedback

  • Produce clear, structured evaluations of machine learning engineering tasks and generated outputs
  • Explain reasoning around model training, inference systems, deployment infrastructure, LLM applications, performance, and architectural trade-offs
  • Support technical assessment workflows by documenting accepted work, improvement areas, and practical engineering conclusions
  • Help ensure outputs reflect production-scale machine learning engineering expectations

Ideal Profile

Strong candidates may have:

  • 2+ years of professional machine learning engineering experience
  • Hands-on experience building production ML systems, model deployment infrastructure, LLM applications, or AI-powered products
  • Regular use of AI coding agents such as Cursor, Claude Code, Codex, Windsurf, Gemini CLI, or comparable tools
  • Ability to evaluate generated machine learning implementations and identify technical trade-offs, bugs, edge cases, and performance issues
  • Experience deploying ML systems to production (strongly preferred)
  • Strong understanding of model training, inference workflows, MLOps, data pipelines, evaluation methods, deployment patterns, and system reliability
  • Clear written communication skills and comfort documenting technical reasoning in a remote, project-based environment

Educational Background

  • A degree in Computer Science, Machine Learning, Artificial Intelligence, Data Science, Software Engineering, Computer Engineering, Statistics, Mathematics, or a related technical field is helpful
  • Equivalent professional experience in machine learning engineering, applied AI, MLOps, LLM applications, or production ML systems is also highly relevant

Nice to Have

  • Experience with Python, PyTorch, TensorFlow, scikit-learn, Hugging Face, LangChain, LlamaIndex, MLflow, Ray, or comparable ML tools
  • Familiarity with model serving, feature pipelines, vector databases, embeddings, retrieval systems, LLM application architecture, or evaluation frameworks
  • Experience with cloud platforms, Docker, Kubernetes, CI/CD pipelines, observability tooling, or production deployment workflows
  • Background in technical code review, ML architecture review, model performance evaluation, or large-scale AI product engineering
  • Comfort working in sprint-based project environments with focused technical assessment windows

Why This Opportunity

  • Remote consulting work aligned with machine learning engineering, coding agent, and technical evaluation expertise
  • Opportunity to evaluate realistic ML engineering workflows involving model training, inference systems, MLOps, LLM applications, and production AI systems
  • Suitable for engineers who enjoy technical assessment, tool-assisted coding workflows, ML implementation review, and practical system-level problem-solving
  • Sprint-based project work that can align with focused availability and remote schedules

Contract Details

  • Independent contractor engagement
  • Fully remote and flexible scheduling
  • Sprint-based, project-based availability
  • Some project work may run in focused 12–24 hour sprint windows depending on project requirements
  • Compensation may reach up to $90/hour, depending on project scope, experience, and accepted work structure
  • Some projects may use accepted-task compensation depending on the specific workflow
  • Payments are made weekly via Stripe or Wise based on services rendered
  • Projects may be extended, shortened, adjusted, or concluded based on project needs and performance
  • Candidates requiring H-1B or STEM OPT sponsorship support are not eligible at this time
  • Work must not involve sharing confidential or proprietary information from any employer, client, or institution

About the Platform

This opportunity is available through 24-MAG LLC. We connect experienced professionals with remote consulting opportunities across technical, evaluation, and project-based workstreams.

By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy.

Vacancy posted 2 days ago