Software Engineer (Python/C++) - Code Evaluation | Remote
Crossing Hurdles
- Remote job
Commitment: Part-time availability with flexibility for increased hours

Role Responsibilities
- Review and refine AI-generated prompts, responses, and code
- Validate algorithms and software concepts for technical accuracy
- Provide structured, actionable feedback on solution quality and clarity
- Tag and organize content by topic, difficulty, or language
- Support benchmarking efforts to evaluate and compare model capabilities

Requirements
- Strong experience in software engineering, technical research, or educational content development
- Strong experience with debugging, testing, and validating code
- Ability to assess technical accuracy and communicate improvements clearly
- High attention to detail and comfort with technical documentation
- ...with a leading AI research lab for a project supporting Frontier Code Agents. Contributors will evaluate AI coding models through technical assessments that focus on practical machine learning engineering workflows. The position requires 2+ years of experience in machine...
  Remote job | $400 per month
- ...leading AI research lab focused on frontier coding models. The role involves using frontier... ...machine learning tasks, where you will evaluate model outputs and assess performance.... ...should have 2+ years in machine learning engineering and experience with AI coding agents....
  Remote job | $400 per month
- Mercor is seeking machine learning engineers to join a project with a leading AI research lab. You will evaluate and complete complex engineering tasks using frontier AI coding agents. This sprint-based project offers $400 for each accepted task, which typically takes 2...
  Remote job | $400 per month
- Mercor is partnering with an AI research lab for a project assessing frontier AI coding models. The role involves evaluating machine learning engineering workflows through structured tasks with compensation tied to accepted work. Ideal candidates have 2+ years of experience...
  Remote job | $400 per unit
- ...collaborating with a leading AI research lab on a project focused on evaluating frontier AI coding models. In this role, you'll use advanced AI coding agents to carry out machine learning engineering tasks, review model-generated implementations, and identify various issues...
  Remote job | $400 per month
- ...proficient programmers to join their virtual coding team. You'll work on diverse programming... ...mobile app development and code evaluations. This position offers competitive pay, a... ...in languages like Kotlin, JavaScript, or Python. A bachelor’s degree is also preferred....
  Remote work | Flexible hours
- ...to contribute to cutting-edge AI systems. Enjoy the flexibility of remote work and choose projects that align with your schedule. Responsibilities include designing coding problems for AI, evaluating AI-generated code, and writing high-quality code. Ideal candidates have...
  Remote job | $60 per hour
- ...programmers to contribute to cutting-edge AI systems while enjoying fully remote work and a flexible schedule. Responsibilities include solving coding problems, writing high-quality code, and evaluating AI-generated code. Ideal candidates should have a bachelor’s degree...
  Remote job | Hourly pay | Flexible hours | $60 per hour
- ...lab to support a project focusing on AI coding capabilities. The role involves utilizing... ...AI coding agents to tackle complex engineering tasks, with a compensation of $400 per accepted... ...flexibility and requires the ability to evaluate designs from multiple AI models...
  Remote job | $400 per month
- ...for contributors to participate in a Frontier Code Agents project partnering with a leading AI research lab. You will evaluate and improve AI coding models through structured assessments, focusing on realistic engineering scenarios. This project operates in short sprint...
  Remote job | $400 per month
- Mercor is seeking contributors to evaluate and improve frontier AI coding models through a structured project. Ideal candidates will have 2+ years in systems engineering and experience with AI coding agents. The project is sprint-based, running in 12-24 hour stretches,...
  Remote job | $400 per month
- ...pipelines, plus top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality,... ...Project Overview What Does a Typical Day Look Like? Evaluate and refine AI-generated code across backend and frontend contexts to ensure that...
  Remote work | For contractors | Flexible hours
- ...validating datasets for evaluating large language models (LLMs) in realistic software engineering scenarios. You will... ...Set up and configure code repositories, including... ...Strong experience with C++. Proficiency with Git... ...to work in a fully remote environment. Opportunity...
  Remote work | For contractors | $40 per hour
- ...cutting-edge AI systems. Enjoy the flexibility of remote work and set your own schedule while taking on challenging... ...projects. Key responsibilities include designing coding problems for AI, providing feedback, and evaluating AI code. Candidates should be fluent in English and...
  Remote job | $60 per hour
- ...focuses on advancing the evaluation and development of cutting-edge coding agents. You will... ...intersection of AI research, software engineering, and model evaluation,... ...with expertise in Python, C++, or comparable programming... ...Terms Full-time, remote position. Compensation...
  Remote job | Full time | $220k
- ...Mercor is seeking contributors for a unique role involving frontier AI coding agents to enhance security engineering tasks. This position focuses on evaluating AI-generated code and identifying vulnerabilities within security workflows. The role requires 2+ years in security...
  Remote work | $400 per month
- ChatGPT Jobs is looking for an ML Engineer (Coding Agent Experience) in Chicago, IL. The position focuses on using frontier AI coding agents for complex machine learning tasks. Candidates should have at least 2 years of experience and be familiar with various AI tools....
  Remote job | Hourly pay | $85 per hour
- ...is looking for contributors to join a project with a leading AI research lab, evaluating frontier AI coding models. You will be tasked with using AI coding agents to tackle security engineering tasks, reviewing code for potential vulnerabilities and flaws. The role is...
  Remote work | $400 per month
- ...Mercor is collaborating with a leading AI research lab for a Frontier Code Agents project in Glendale, California. The role involves evaluating AI coding models through security engineering tasks. Contributors will assess vulnerabilities in model-generated code and apply...
  Remote work | $400 per month
- ...seeking contributors for a project with a leading AI research lab focused on evaluating and improving AI coding models. The role involves using frontier AI coding agents to tackle security engineering tasks. Candidates should have at least 2 years of security engineering...
  Remote work | Flexible hours | $400 per month
- WorksHub is seeking experienced developers to design and validate complex coding tasks for AI research. This fully remote position allows you to own tasks from creation to evaluation, ensuring they challenge AI capabilities. You’ll earn $80 per accepted task and $90 for...
  Remote job | $80 per hour
- ...seeking contributors for a project partnering with a leading AI research lab. The role involves evaluating frontier AI coding models and completing complex security engineering tasks using AI coding agents. Applicants with 2+ years of experience in security engineering...
  Remote job | $400 per month
- ...Staff specializing in Coding Research to lead innovative evaluation frameworks for coding... ...strong background in software engineering and expertise in programming... ...languages such as Python and C++. The ideal candidate will... .... This is a full-time remote position offering a...
  Remote job | Full time | $220k
- Mercor is collaborating with a leading AI research lab for a project focused on frontier AI coding agents. This role engages in evaluating security engineering workflows through technical assessments, with a commitment of 12-24 hours based on client needs. Applicants need...
  Remote job | $400 per month
- Mercor is seeking backend engineers to join a project evaluating frontier AI coding models. You will apply your expertise to complete and assess engineering tasks using cutting-edge AI tools. Applicants should have significant backend engineering experience and familiarity...
  Remote job | $400 per month
- ...collaborating with a leading AI research lab for the Frontier Code Agents project. This role involves evaluating and improving AI coding models through structured assessments centered around security engineering workflows. The position is compensated at $400 per accepted...
  Remote job | $400 per month
- ...advance AI systems with the flexibility of remote work. Responsibilities include designing coding problems, evaluating AI-generated code, and writing clear code snippets... ...one programming language such as Java, Python, or C++. This role allows for flexible scheduling and...
  Remote job | Flexible hours
- ...seeking a Cloud Platform Engineer to join their team. The position is remote and focuses on training and evaluating AI chatbots. The ideal candidate... ...like JavaScript, Python, or C++, and possess strong analytical... ...Responsibilities include providing coding challenges to AI chatbots...
  Remote job | Hourly pay | Flexible hours | $30 per hour
- ...Alignerr is seeking a Senior Python Infrastructure Engineer to work remotely on critical AI model development tasks. You will design, build, and optimize data pipelines, annotation tools, and evaluation systems essential for next-generation AI models. This contract role...
  Remote work | Contract work | Flexible hours
- ...programmers to contribute to cutting-edge AI systems remotely. In this role, you will tackle diverse coding challenges, work with advanced AI models, and enjoy... ...experience in languages like Kotlin, JavaScript, or Python. A bachelor’s degree is preferred. Join now to impact...
  Remote job | Hourly pay | Flexible hours | $60 per hour


