Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer - AI Evaluation

Virtual Vocations Inc

Evaluating and benchmarking AI-generated code, the full-time Software Engineer - AI Evaluation will assess correctness, efficiency, and maintainability of coding solutions while working remotely on a contractor basis. Key responsibilities Review and evaluate AI-generated code for correctness and adherence to requirements Analyze software engineering tasks and validate proposed solutions against expected outcomes Create and maintain evaluation datasets, benchmarks, and grading rubrics for coding tasks Required qualifications Bachelor's or Master's degree in Computer Science, Software Engineering, or a related technical field 3+ years of professional software engineering experience Strong proficiency in two or more programming languages such as Python, Java, or C/C++ Experience performing code reviews and evaluating code quality in large-scale codebases Familiarity with version control systems and modern software development workflows

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Software Engineer - AI Evaluation in United States vacancy
  • $175k - $245k

     ...Senior Software Engineer II - Applied AI and Evaluations (Remote Eligible) -REMOTE, USA- For over 20 years, Smartsheet has helped people and teams achieve–well, anything. From seamless work management to smart, scalable solutions, we’ve always worked with flow. We’re building... 
    Suggested
    Full time
    Temporary work
    Local area
    Immediate start
    Remote work

    Smartsheet

    New York, NY
    3 days ago
  • $50 - $150 per hour

    A leading AI company is seeking a software engineer to review and evaluate model-generated code. This contract role requires several years of software engineering experience, particularly as a full-stack engineer at notable tech firms. You will assess code quality and... 
    Suggested
    Hourly pay
    Contract work
    Flexible hours

    Turing

    San Francisco, CA
    2 days ago
  •  ...© 2025 Repovive, Inc. All rights reserved. Back to Jobs Apply Now Compensation Not listed Posted April 25, 2026 Required Skills AI evaluation data pipelines agent instrumentation Requirements Mid/Senior Visa Sponsorship Not mentioned Relocation Not mentioned About the Role... 
    Suggested
    Relocation
    Visa sponsorship

    Repovive, Inc.

    New York, NY
    4 days ago
  • Senior Software Engineer - Siri Agentic Evaluation Platform Cambridge, England, United Kingdom • Machine Learning and AI Siri is the world’s most popular intelligent assistant. Available on iPhone, iPad, Mac, Apple Watch, HomePod, AppleTV and Vision Pro, hundreds of millions... 
    Suggested

    Apple Inc.

    Cambridge, MA
    3 days ago
  • $147k - $211k

    Software Engineer, Search Evaluation Platforms corporate_fare Google place Mountain View, CA, USA Apply Qualifications Bachelor’s degree or equivalent...  ...that meets standards, applying artificial intelligence (AI) tools in the development flow, and ensuring your work is... 
    Suggested
    Temporary work

    Google Inc.

    Mountain View, CA
    4 days ago
  • $120k - $250k

     ...2016 in Silicon Valley, Pony.ai has quickly become a global leader...  ..., and multi-dimensional evaluation. Design and implement high-performance...  ...and optimize downstream engineering workflows for Large Language...  ...skills in C/C++, Python, and software design Strong foundation in... 
    Temporary work

    pony.ai

    Denver, CO
    10 hours ago
  • $148k - $356.5k

    Senior Software Engineer, Metrics and Evaluation - Autonomous Vehicles page is loaded Senior Software Engineer, Metrics and Evaluation - Autonomous Vehicles...  ...to tackle challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest... 
    Full time
    Remote work

    NVIDIA Corporation

    Raleigh, NC
    3 days ago
  • $144.7k - $221.4k

     ...introspect autonomous driving software performance at interfaces...  ...autonomy developers and systems engineers. Design and implement analysis...  ...autonomy stack, including evaluation of perception, prediction, and...  ...systems. Experience leveraging AI‑assisted development and... 
    Local area
    Remote work
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    1 day ago
  • Software Engineer (Model Evaluation & Benchmarking) About the Role We are hiring Engineers focused on AI Model Evaluation to build the systems that ensure multimodal AI behaves reliably, consistently, and predictably as it moves from research into production. This position... 

    SpreeAI

    San Francisco, CA
    10 hours ago
  • $147.4k - $272.1k

    Cupertino, California, United States Software and Services At Apple, great...  ...to the next revolution in AI and Machine Learning. We're a dynamic team seeking engineers who are enthusiastic about influencing...  ...signals, and advance the evaluation pipeline to the next level. Continuously... 
    Relocation

    Apple

    Cupertino, CA
    10 hours ago
  • Turing is searching for a Software Engineering evaluator based in Seattle, United States. You will work on AI model training by curating code examples and evaluating AI-generated code, ensuring efficiency and reliability. This role requires strong full-stack application... 
    Remote job
    For contractors
    10 hours per week
    Flexible hours

    Turing

    Seattle, WA
    2 days ago
  • Feedinkoo is looking for experienced software engineers to contribute their expertise to AI evaluation and improvement projects. This role involves applying software engineering skills to assess AI systems with no prior AI experience required, as training will be provided... 
    Remote job
    Freelance

    Feedinkoo

    New York, NY
    10 hours ago
  •  ...SupportFinity™ is seeking a Web Platform Engineer to train AI models in South Dakota. This role entails coding challenges for AI chatbots, evaluating their outputs for quality, and requires proficiency in programming languages like Python or JavaScript. As a flexible... 
    Hourly pay
    Remote work
    Flexible hours

    SupportFinity

    Sioux Falls, SD
    3 days ago
  •  ...inventive research, design, and engineering. Our organization is very...  ...shipping code. About the Role As a Software Engineer on the Agent Quality...  ...’ll build the measurement, evaluation, and feedback-loop...  ...Designing and building best-in‑class AI evaluation system: curated... 

    Anysphere

    San Francisco, CA
    2 days ago
  • An innovative AI company based in the US is seeking a Mid-Senior level developer. The role involves developing and maintaining evaluation servers, implementing logic for agent actions, and creating testing tools. Candidates should have 4+ years of Python experience, solid... 
    Part time
    Remote work

    Mind Rift

    Houston, TX
    5 days ago
  • A cybersecurity technology company is seeking experienced professionals to evaluate AI-generated security content and solve technical cybersecurity problems. This role offers a flexible schedule, with options for full-time or part-time remote work. Candidates should have... 
    Full time
    Part time
    Remote work
    Flexible hours

    DataAnnotation

    Louisiana, MO
    4 days ago
  • A cybersecurity consultancy is seeking experienced individuals to enhance AI capabilities by evaluating cybersecurity content and solving security-related challenges. You will play a pivotal role in validating AI outputs and providing critical feedback to advance cybersecurity... 
    Remote work
    Flexible hours

    DataAnnotation

    Phoenix, AZ
    10 hours ago
  • $40 per hour

    A leading AI security firm is seeking experienced cybersecurity professionals in the United States to evaluate AI-generated security content and solve technical problems. The role offers flexible scheduling, the opportunity to work on varied projects, and hourly pay starting... 
    Hourly pay
    Full time
    Part time
    Remote work
    Flexible hours

    DataAnnotation

    Kansas City, MO
    1 day ago
  • $40 per hour

    A cybersecurity technology company is seeking experienced professionals to evaluate AI-generated content and solve technical problems. In this remote role, candidates will work with AI systems to enhance their reasoning about real-world threats. Required qualifications... 
    Hourly pay
    Remote work

    DataAnnotation

    Lansing, MI
    4 days ago
  • $40 per hour

    A cybersecurity company is seeking experienced cybersecurity professionals to join their team. You will evaluate AI-generated security content, solve technical problems, and provide critical feedback to enhance AI systems. This role is remote, flexible, and offers hourly... 
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    Oregon, WI
    4 days ago
  • $40 per hour

    A cybersecurity solutions company is seeking experienced professionals to evaluate AI-generated security content and solve technical security problems. Candidates should have over 2 years of hands-on experience in cybersecurity and coding skills, with strong writing and... 
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    Saint Paul, MN
    4 days ago
  • $40 per hour

    A cybersecurity-focused company is seeking experienced professionals to evaluate AI-generated security content and improve AI systems. Responsibilities include assessing threats and providing technical feedback, ideal for those with 2+ years in cybersecurity roles like... 
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    Springfield, IL
    1 day ago
  • $40 per hour

    A leader in AI training for cybersecurity is seeking experienced cybersecurity professionals to evaluate AI-generated content and solve technical problems. This role offers full-time or part-time remote work with the flexibility to choose projects and work hours. Candidates... 
    Hourly pay
    Full time
    Part time
    Remote work

    DataAnnotation

    Madison, WI
    10 hours ago
  • $40 per hour

     ...cybersecurity professionals to join our team to help train AI models. In this role, you will evaluate AI-generated security content, solve technical...  ...penetration testing, red teaming, incident response, detection engineering, DFIR, malware analysis, threat intelligence, or... 
    Hourly pay
    Full time
    Part time
    Remote work

    DataAnnotation

    New York, NY
    10 hours ago
  • $40 per hour

    A cybersecurity-focused technology firm is seeking experienced professionals to evaluate AI-generated security content and solve technical problems. This remote position offers flexible hours and the ability to choose projects, paying $40+ per hour. Candidates should have... 
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    Virginia, MN
    10 hours ago
  • $40 per hour

    A cybersecurity firm is seeking experienced professionals to evaluate AI-generated security content and solve technical problems. This role is flexible, allowing you to choose projects and work on your own schedule. Candidates should have over 2 years of hands-on cybersecurity... 
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    Honolulu, HI
    1 day ago
  • $130k - $220k

     ...opportunity summary by the Joinrs AI : The selection process...  ...insights company. They help engineers, enterprises, investors, media...  ...is best described as an AI Evaluation Engineer / Technical Generalist. It is not a traditional software engineering role and not a pure... 
    Full time
    Worldwide

    Aurora Jobs ApS

    San Francisco, CA
    8 hours ago
  • $40 per hour

    A cybersecurity firm in the United States is seeking experienced professionals to evaluate AI-generated security content and solve technical problems. You'll work directly with AI models to enhance their accuracy and improve cybersecurity tools. Ideal candidates have 2+... 
    Hourly pay
    Remote work

    DataAnnotation

    Charleston, WV
    4 days ago
  • $40 per hour

    A leading tech firm is seeking experienced cybersecurity professionals to evaluate AI-generated content and solve technical problems. In this remote role, you will work to enhance AI systems, requiring 2+ years of hands-on experience in cybersecurity and some coding knowledge... 
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    Iowa, LA
    4 days ago
  • $40 per hour

    A technology solutions company is seeking a Web Application Developer to improve AI models by evaluating coding outputs and performance. Candidates should be proficient in Python or JavaScript and have experience with algorithms and debugging. This remote position allows... 
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    Providence, RI
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer - AI Evaluation. Be the first to apply!