Software Engineer - AI Evaluation

Virtual Vocations Inc

Evaluating and benchmarking AI-generated code, the full-time Software Engineer - AI Evaluation will assess correctness, efficiency, and maintainability of coding solutions while working remotely on a contractor basis. Key responsibilities Review and evaluate AI-generated code for correctness and adherence to requirements Analyze software engineering tasks and validate proposed solutions against expected outcomes Create and maintain evaluation datasets, benchmarks, and grading rubrics for coding tasks Required qualifications Bachelor's or Master's degree in Computer Science, Software Engineering, or a related technical field 3+ years of professional software engineering experience Strong proficiency in two or more programming languages such as Python, Java, or C/C++ Experience performing code reviews and evaluating code quality in large-scale codebases Familiarity with version control systems and modern software development workflows

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Software Engineer - AI Evaluation in United States vacancy

Senior Software Engineer II - Applied AI and Evaluations (Remote Eligible)
$175k - $245k
...Senior Software Engineer II - Applied AI and Evaluations (Remote Eligible) -REMOTE, USA- For over 20 years, Smartsheet has helped people and teams achieve–well, anything. From seamless work management to smart, scalable solutions, we’ve always worked with flow. We’re building...
Suggested
Full time
Temporary work
Local area
Immediate start
Remote work
Smartsheet
New York, NY
3 days ago
Contract Senior Software Engineer - AI Code Review & Evaluation
$50 - $150 per hour
A leading AI company is seeking a software engineer to review and evaluate model-generated code. This contract role requires several years of software engineering experience, particularly as a full-stack engineer at notable tech firms. You will assess code quality and...
Suggested
Hourly pay
Contract work
Flexible hours
Turing
San Francisco, CA
2 days ago
Software Engineer, Agent Evaluation and Quality
...© 2025 Repovive, Inc. All rights reserved. Back to Jobs Apply Now Compensation Not listed Posted April 25, 2026 Required Skills AI evaluation data pipelines agent instrumentation Requirements Mid/Senior Visa Sponsorship Not mentioned Relocation Not mentioned About the Role...
Suggested
Relocation
Visa sponsorship
Repovive, Inc.
New York, NY
4 days ago
Senior Software Engineer - Siri Agentic Evaluation Platform
Senior Software Engineer - Siri Agentic Evaluation Platform Cambridge, England, United Kingdom • Machine Learning and AI Siri is the world’s most popular intelligent assistant. Available on iPhone, iPad, Mac, Apple Watch, HomePod, AppleTV and Vision Pro, hundreds of millions...
Suggested
Apple Inc.
Cambridge, MA
3 days ago
Software Engineer, Search Evaluation Platforms
$147k - $211k
Software Engineer, Search Evaluation Platforms corporate_fare Google place Mountain View, CA, USA Apply Qualifications Bachelor’s degree or equivalent... ...that meets standards, applying artificial intelligence (AI) tools in the development flow, and ensuring your work is...
Suggested
Temporary work
Google Inc.
Mountain View, CA
4 days ago
(Senior) Software Engineer - Evaluation
$120k - $250k
...2016 in Silicon Valley, Pony.ai has quickly become a global leader... ..., and multi-dimensional evaluation. Design and implement high-performance... ...and optimize downstream engineering workflows for Large Language... ...skills in C/C++, Python, and software design Strong foundation in...
Temporary work
pony.ai
Denver, CO
10 hours ago
Senior Software Engineer, Metrics and Evaluation - Autonomous Vehicles
$148k - $356.5k
Senior Software Engineer, Metrics and Evaluation - Autonomous Vehicles page is loaded Senior Software Engineer, Metrics and Evaluation - Autonomous Vehicles... ...to tackle challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest...
Full time
Remote work
NVIDIA Corporation
Raleigh, NC
3 days ago
Senior Software Engineer, Autonomy Evaluation
$144.7k - $221.4k
...introspect autonomous driving software performance at interfaces... ...autonomy developers and systems engineers. Design and implement analysis... ...autonomy stack, including evaluation of perception, prediction, and... ...systems. Experience leveraging AI‑assisted development and...
Local area
Remote work
Relocation
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
1 day ago
Software Engineer (Model Evaluation & Benchmarking)
Software Engineer (Model Evaluation & Benchmarking) About the Role We are hiring Engineers focused on AI Model Evaluation to build the systems that ensure multimodal AI behaves reliably, consistently, and predictably as it moves from research into production. This position...
SpreeAI
San Francisco, CA
10 hours ago
iOS Software Engineer, AIML Evaluation
$147.4k - $272.1k
Cupertino, California, United States Software and Services At Apple, great... ...to the next revolution in AI and Machine Learning. We're a dynamic team seeking engineers who are enthusiastic about influencing... ...signals, and advance the evaluation pipeline to the next level. Continuously...
Relocation
Apple
Cupertino, CA
10 hours ago
Remote Software Engineer — AI Coding Evaluator (Contractor)
Turing is searching for a Software Engineering evaluator based in Seattle, United States. You will work on AI model training by curating code examples and evaluating AI-generated code, ensuring efficiency and reliability. This role requires strong full-stack application...
Remote job
For contractors
10 hours per week
Flexible hours
Turing
Seattle, WA
2 days ago
Remote AI Evaluation Software Engineer - Freelance
Feedinkoo is looking for experienced software engineers to contribute their expertise to AI evaluation and improvement projects. This role involves applying software engineering skills to assess AI systems with no prior AI experience required, as training will be provided...
Remote job
Freelance
Feedinkoo
New York, NY
10 hours ago
Remote Web Platform Engineer - AI Model Evaluator
...SupportFinity™ is seeking a Web Platform Engineer to train AI models in South Dakota. This role entails coding challenges for AI chatbots, evaluating their outputs for quality, and requires proficiency in programming languages like Python or JavaScript. As a flexible...
Hourly pay
Remote work
Flexible hours
SupportFinity
Sioux Falls, SD
3 days ago
Software Engineer, Agent Evaluation and Quality Engineering · · San Francisco; New York Apply →
...inventive research, design, and engineering. Our organization is very... ...shipping code. About the Role As a Software Engineer on the Agent Quality... ...’ll build the measurement, evaluation, and feedback-loop... ...Designing and building best-in‑class AI evaluation system: curated...
Anysphere
San Francisco, CA
2 days ago
Remote Python Backend Engineer for AI Evaluation Tools
An innovative AI company based in the US is seeking a Mid-Senior level developer. The role involves developing and maintaining evaluation servers, implementing logic for agent actions, and creating testing tools. Candidates should have 4+ years of Python experience, solid...
Part time
Remote work
Mind Rift
Houston, TX
5 days ago
AI Security Engineer: Train & Evaluate Cyber AI Models
A cybersecurity technology company is seeking experienced professionals to evaluate AI-generated security content and solve technical cybersecurity problems. This role offers a flexible schedule, with options for full-time or part-time remote work. Candidates should have...
Full time
Part time
Remote work
Flexible hours
DataAnnotation
Louisiana, MO
4 days ago
AI Security Engineer: Train & Evaluate Cyber AI Models
A cybersecurity consultancy is seeking experienced individuals to enhance AI capabilities by evaluating cybersecurity content and solving security-related challenges. You will play a pivotal role in validating AI outputs and providing critical feedback to advance cybersecurity...
Remote work
Flexible hours
DataAnnotation
Phoenix, AZ
10 hours ago
AI Security Engineer: Train & Evaluate Cyber AI Models
$40 per hour
A leading AI security firm is seeking experienced cybersecurity professionals in the United States to evaluate AI-generated security content and solve technical problems. The role offers flexible scheduling, the opportunity to work on varied projects, and hourly pay starting...
Hourly pay
Full time
Part time
Remote work
Flexible hours
DataAnnotation
Kansas City, MO
1 day ago
AI Security Engineer: Train & Evaluate Cyber AI Models
$40 per hour
A cybersecurity technology company is seeking experienced professionals to evaluate AI-generated content and solve technical problems. In this remote role, candidates will work with AI systems to enhance their reasoning about real-world threats. Required qualifications...
Hourly pay
Remote work
DataAnnotation
Lansing, MI
4 days ago
AI Security Engineer: Train & Evaluate Cyber AI Models
$40 per hour
A cybersecurity company is seeking experienced cybersecurity professionals to join their team. You will evaluate AI-generated security content, solve technical problems, and provide critical feedback to enhance AI systems. This role is remote, flexible, and offers hourly...
Hourly pay
Remote work
Flexible hours
DataAnnotation
Oregon, WI
4 days ago
AI Security Engineer: Train & Evaluate Cyber AI Models
$40 per hour
A cybersecurity solutions company is seeking experienced professionals to evaluate AI-generated security content and solve technical security problems. Candidates should have over 2 years of hands-on experience in cybersecurity and coding skills, with strong writing and...
Hourly pay
Remote work
Flexible hours
DataAnnotation
Saint Paul, MN
4 days ago
AI Security Engineer: Train & Evaluate Cyber AI Models
$40 per hour
A cybersecurity-focused company is seeking experienced professionals to evaluate AI-generated security content and improve AI systems. Responsibilities include assessing threats and providing technical feedback, ideal for those with 2+ years in cybersecurity roles like...
Hourly pay
Remote work
Flexible hours
DataAnnotation
Springfield, IL
1 day ago
AI Security Engineer: Train & Evaluate Cyber AI Models
$40 per hour
A leader in AI training for cybersecurity is seeking experienced cybersecurity professionals to evaluate AI-generated content and solve technical problems. This role offers full-time or part-time remote work with the flexibility to choose projects and work hours. Candidates...
Hourly pay
Full time
Part time
Remote work
DataAnnotation
Madison, WI
10 hours ago
AI Security Engineer: Train & Evaluate Cyber AI Models
$40 per hour
...cybersecurity professionals to join our team to help train AI models. In this role, you will evaluate AI-generated security content, solve technical... ...penetration testing, red teaming, incident response, detection engineering, DFIR, malware analysis, threat intelligence, or...
Hourly pay
Full time
Part time
Remote work
DataAnnotation
New York, NY
10 hours ago
AI Security Engineer: Train & Evaluate Cyber AI Models
$40 per hour
A cybersecurity-focused technology firm is seeking experienced professionals to evaluate AI-generated security content and solve technical problems. This remote position offers flexible hours and the ability to choose projects, paying $40+ per hour. Candidates should have...
Hourly pay
Remote work
Flexible hours
DataAnnotation
Virginia, MN
10 hours ago
AI Security Engineer: Train & Evaluate Cyber AI Models
$40 per hour
A cybersecurity firm is seeking experienced professionals to evaluate AI-generated security content and solve technical problems. This role is flexible, allowing you to choose projects and work on your own schedule. Candidates should have over 2 years of hands-on cybersecurity...
Hourly pay
Remote work
Flexible hours
DataAnnotation
Honolulu, HI
1 day ago
AI Evaluation Engineer
$130k - $220k
...opportunity summary by the Joinrs AI : The selection process... ...insights company. They help engineers, enterprises, investors, media... ...is best described as an AI Evaluation Engineer / Technical Generalist. It is not a traditional software engineering role and not a pure...
Full time
Worldwide
Aurora Jobs ApS
San Francisco, CA
8 hours ago
AI Security Engineer: Train & Evaluate Cyber AI Models
$40 per hour
A cybersecurity firm in the United States is seeking experienced professionals to evaluate AI-generated security content and solve technical problems. You'll work directly with AI models to enhance their accuracy and improve cybersecurity tools. Ideal candidates have 2+...
Hourly pay
Remote work
DataAnnotation
Charleston, WV
4 days ago
AI Security Engineer: Train & Evaluate Cyber AI Models
$40 per hour
A leading tech firm is seeking experienced cybersecurity professionals to evaluate AI-generated content and solve technical problems. In this remote role, you will work to enhance AI systems, requiring 2+ years of hands-on experience in cybersecurity and some coding knowledge...
Hourly pay
Remote work
Flexible hours
DataAnnotation
Iowa, LA
4 days ago
Remote AI Web App Developer | Train & Evaluate Chatbots
$40 per hour
A technology solutions company is seeking a Web Application Developer to improve AI models by evaluating coding outputs and performance. Candidates should be proficient in Python or JavaScript and have experience with algorithms and debugging. This remote position allows...
Hourly pay
Remote work
Flexible hours
DataAnnotation
Providence, RI
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer - AI Evaluation. Be the first to apply!