
Remote ML Engineer: Frontier Code Agent Evaluator

$400 per accepted task

Mercor Inc

Ontario, CA
  • Remote job

Mercor is partnering with a leading AI research lab for a project focused on evaluating frontier AI coding models. Contributors will help improve machine learning workflows through structured assessments. The role requires machine learning engineering expertise and familiarity with frontier AI coding agents like Cursor and Claude. Compensation is $400 per accepted task, typically taking 2-3 hours. Ideal candidates will have 2+ years of experience and a strong background in ML systems.

Vacancy posted 2 days ago
Similar jobs that could be interesting for you
Based on the Remote ML Engineer: Frontier Code Agent Evaluator vacancy in Ontario, CA
  • $400 per unit

     ...partnering with a leading AI research lab on a Frontier Code Agents project. The role involves using AI coding agents for evaluating complex ML tasks, identifying bugs, and comparing...  ...years of experience in machine learning engineering and familiarity with deploying ML... 
    Remote job

    Mercor

    Santa Clarita, CA
    1 day ago
  • $400 per month

    Obsidian is looking for contributors to support a Frontier Code Agents project with a leading AI research lab. The role involves using AI coding agents to evaluate and improve machine learning and AI engineering tasks. Ideal candidates should have 2+ years of machine learning... 
    Remote job

    Obsidian

    New York, NY
    1 day ago
  • $400 per month

     ...is seeking skilled professionals to support its Frontier Code Agents project in Chicago. The role involves evaluating and improving frontier AI coding models through...  ..., tackling complex machine learning and AI engineering tasks. Responsibilities include reviewing AI-generated... 
    Suggested

    Obsidian

    Chicago, IL
    3 days ago
  • $400 per unit

    Mercor is partnering with a leading AI research lab on a project involving Frontier Code Agents. The role focuses on evaluating AI coding models and performing complex engineering tasks using advanced coding agents. Applicants should have 2+ years of systems engineering... 
    Remote job

    Mercor

    Fontana, CA
    1 day ago
  • $400 per month

     ...is looking for contributors to support a Frontier Code Agents project in partnership with a leading AI research lab. You’ll evaluate and improve frontier AI coding models through...  ...least 2 years of experience in systems engineering and familiarity with AI coding agents.... 
    Remote job

    Mercor

    Pearland, TX
    3 days ago
  • $85 per hour

     ...leading AI research lab to support a Frontier Code Agents project. Contributors help evaluate and improve frontier AI coding...  ...on realistic machine learning engineering workflows and model evaluation. Spots...  ...judgment to realistic ML engineering scenarios. Time Commitment... 
    Remote job
    Hourly pay

    Mercor

    Weymouth, MA
    2 days ago
  • $400 per month

    Mercor is looking for contributors to evaluate and improve AI coding models through technical assessments. This role involves completing complex engineering tasks using frontier AI coding agents, with tasks typically taking 2-3 hours. Applicants should have at least 2... 
    Remote job

    Mercor

    Monterey Park, CA
    1 day ago
  • $400 per month

    Mercor is seeking experienced backend engineers in New Jersey to join a project evaluating frontier AI coding models. You'll use AI coding agents for complex tasks and evaluate model-generated code for quality and performance. This role requires over 2 years of backend... 
    Remote job

    Mercor

    Passaic, NJ
    1 day ago
  • $400 per unit

    Mercor is seeking contributors for a project evaluating frontier AI coding models. The role involves using AI coding agents for security engineering tasks, reviewing code for vulnerabilities, and providing technical assessments. Applicants should have over 2 years of experience... 
    Remote job

    Mercor

    Hawthorne, CA
    1 day ago
  • $400 per month

    Mercor is seeking contributors to evaluate and enhance frontier AI coding models through structured assessments...  .... The focus is on realistic mobile engineering workflows, where tasks can be...  ...experience with Swift and AI coding agents, apply to join this cutting-edge project... 

    Mercor

    Azusa, CA
    3 days ago
  • $400 per month

    Mercor is seeking contributors for a project that evaluates frontier AI coding models. Ideal candidates will have over 2 years of professional iOS engineering experience and be adept at conducting structured technical assessments. The role focuses on realistic mobile engineering... 

    Mercor

    Passaic, NJ
    1 day ago
  •  ...looking for contributors to support AI coding models in a dynamic project with a leading...  ...have a strong background in security engineering, specifically in application, cloud, and...  ...infrastructure security. Your role includes evaluating AI-generated code for vulnerabilities... 
    Remote job

    Mercor

    San Gabriel, CA
    1 day ago
  • $85 per hour

     ..., and Jack Dorsey. Position: ML Engineer (Coding Agent Experience). Type: Contract. Compensation: $85/hour. Location: Remote. Role Responsibilities: Use frontier AI coding agents to complete and evaluate complex machine learning and AI engineering...
    Remote work
    Contract work
    Summer work

    Mercor

    New York, NY
    10 days ago
  • $250k - $350k

     ...Ambience delivers real-time coding-aware documentation...  ...The Role: As a Staff ML Engineer on the Frontier AI team at Ambience,...  ...scribing, voice agents, long-context chart understanding...  ..., benchmarks, or evaluation frameworks....  ...3% of base salary A remote-friendly culture (with... 
    Remote work
    Work at office
    Immediate start
    Flexible hours
    3 days per week

    Ambience Healthcare, Inc.

    San Francisco, CA
    2 days ago
  • Turing is searching for a Software Engineering evaluator based in Seattle, United States. You will work on AI model training by curating code examples and evaluating AI-generated code, ensuring efficiency and reliability. This role requires strong full-stack application... 
    Remote job
    For contractors
    10 hours per week
    Flexible hours

    Turing

    Seattle, WA
    3 days ago
  • $400 per month

    TryApplyNow is seeking a mid-level Backend Engineer specializing in AI coding agents. This fully remote role requires at least 2 years of backend engineering experience...  .... The ideal candidate will work independently to evaluate and implement software solutions while... 
    Remote job
    Flexible hours

    TryApplyNow

    San Francisco, CA
    17 hours ago
  • $180k - $210k

     ...Description What You’ll Do Own the graph-ML roadmap end-to-end—turn research into production...  ...technical artifacts as graphs (e.g., code/IR or telemetry) and learn over them for analysis...  ...solve problems no one has cracked before. Remote & Flexible - Work from anywhere with a... 
    Remote work
    Permanent employment
    Full time
    Temporary work
    Work experience placement
    Flexible hours

    Integrated Research

    Denver, CO
    1 day ago
  • $85 per hour

     ...Jack Dorsey. Position: Security Engineer (Coding Agent Experience). Type: Contract. Compensation: $85/hour. Location: Remote. Role Responsibilities: Use frontier AI coding agents to complete and evaluate complex security engineering tasks...
    Remote work
    Contract work
    Summer work

    Mercor

    San Francisco, CA
    10 days ago
  • UnitedHealth Group seeks a knowledgeable AI Engineering Manager to define and oversee the AI strategy, focused on enhancing software development processes. Candidates should have extensive experience in technology initiatives and a strong grasp of AI applications in business... 
    Remote job

    UnitedHealth Group

    Schaumburg, IL
    2 days ago
  •  ...Arizona, seeks a Machine Learning Engineer / Data Scientist to design algorithms for agent harness. You'll build evaluation benchmarks and conduct...  ...engineering background and hands-on ML experience, candidates will...  ...to work both on-site and remotely.
    Remote work

    Intel

    Phoenix, AZ
    4 days ago
  • $400 per month

     ...seeking contributors for a project with a leading AI research lab focused on evaluating frontier AI coding models. You'll complete and assess complex data engineering tasks using AI coding agents. The role requires at least 2 years of experience in data engineering and... 
    Remote job

    Obsidian

    San Francisco, CA
    1 day ago
  • $400 per accepted task

    Mercor is seeking data engineers for a project supporting AI coding models. Contributors will utilize frontier AI tools to assess and improve data engineering implementations...  ...building ETL pipelines and using AI coding agents. Compensation is $400 for each accepted task... 
    Remote job

    Mercor

    West Covina, CA
    3 days ago
  • $400 per month

    Mercor is seeking contributors for a data engineering project in collaboration with a top AI...  ...lab. The role involves utilizing frontier AI coding models to execute and assess complex tasks. You will be responsible for evaluating implementations, identifying issues, and... 
    Remote job

    Mercor

    La Mesa, CA
    1 day ago
  •  ...Motors is seeking a Senior ML Infra Engineer to design and implement scalable...  ...learning training and evaluation. This role is pivotal for accelerating...  ...distributed systems, solid coding skills in Python or C++, and...  ...This position can be fully remote or hybrid...
    Remote work

    Israelvcforum

    Sunnyvale, CA
    17 hours ago
  • $60 per hour

     ...is seeking experienced quantitative professionals to evaluate and improve AI models. This fully remote role allows you to work from anywhere in the US and offers...  ...should have 2+ years in a quantitative field, coding skills, and fluency in English. Responsibilities include... 
    Remote job
    Flexible hours

    DataAnnotation

    New York, NY
    1 day ago
  • $183k - $246k

     ...growth as we scale AI agents across the jobsite...  ...0+ employees (50+ engineers), we’re scaling...  ...and improve evaluation and experimentation...  ...production-grade code and raise the bar...  ...shipping production ML/AI systems. ~ Proven...  ...work options (remote and hybrid) Free... 
    Remote work
    Work at office
    Flexible hours

    Trunk Tools, Inc.

    New York, NY
    6 days ago
  • $152k - $228k

     ...Description Job Description Senior ML Engineer About Invoca Invoca is...  ..., and care deeply about code quality, reliability, and...  ...artifacts — including model evaluation, versioning, and automated deployment...  ...Location This is a remote-first role. We are currently... 
    Remote work
    Currently hiring
    Flexible hours

    Invoca

    New York, NY
    8 days ago
  •  ...looking for experienced professionals to evaluate AI-generated security content and solve technical...  ...cybersecurity problems. This role is remote and offers flexible scheduling, with...  ...hands-on experience in cybersecurity, some coding skills, and strong analytical and writing... 
    Remote job
    Hourly pay
    Full time
    Part time
    Flexible hours

    DataAnnotation

    New York, NY
    4 days ago
  • $40 per hour

     ...provider seeks experienced professionals to evaluate AI-generated content related to security...  ...include 2+ years in cybersecurity, coding skills, and strong analytical capabilities...  ...a bachelor's degree. This is a flexible remote position that allows for project and schedule... 
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    Washington DC
    4 days ago
  • $40 per hour

     ...experienced cybersecurity professionals to evaluate AI-generated security content and design technical problems for AI training. This remote position allows you to work on a flexible...  ...practical cybersecurity experience, some coding skills, and strong analytical abilities.... 
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    Phoenix, AZ
    2 days ago
