Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Evaluation Engineer: Code & Test Automation

$80 per hour

Mindrift

A tech consulting firm is seeking experienced freelance contributors to develop coding test cases for AI systems, allowing for flexible part-time work. Ideal candidates should have a degree in computer science or related fields, over 5 years of experience in software development mainly in Python, and a strong background in full-stack development. Compensation may reach up to $80/hour based on project-specific criteria, fostering a stimulating work environment with the freedom to manage task schedules flexibly. #J-18808-Ljbffr Mindrift

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the AI Evaluation Engineer: Code & Test Automation in New York, NY vacancy
  • TWG Global AI in New York is seeking a Senior or Staff AI Software Engineer in Test to develop test automation frameworks for AI products. The ideal candidate will have significant experience in Python and Java, along with a strong software engineering background focused... 
    Suggested

    TWG Global AI

    New York, NY
    2 days ago
  • $80 per hour

    A leading AI project company in the United States is seeking experienced software...  ...in Computer Science or Software Engineering, at least 5 years of experience in...  ...Responsibilities include reviewing coding tasks, writing functional tests, and analyzing AI model failures. The... 
    Suggested
    Contract work
    Flexible hours

    Mindrift

    Brooklyn, NY
    4 days ago
  • $60 per hour

    A leading AI development company seeks proficient programmers to contribute to cutting-edge AI systems while...  ...flexible schedule. Responsibilities include solving coding problems, writing high-quality code, and evaluating AI-generated code. Ideal candidates should have a... 
    Suggested
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    New York, NY
    4 days ago
  • $60 per hour

    A prominent AI technology company seeks proficient programmers to join a remote coding team that drives AI development. Responsibilities include designing coding problems, writing high-quality code, and evaluating AI-generated code. The ideal candidate is fluent in English... 
    Suggested
    Remote job

    DataAnnotation

    New York, NY
    1 day ago
  •  ...Responsibilities Review and refine AI-generated prompts, responses, and code Validate algorithms and...  ...benchmarking efforts to evaluate and compare model...  ...experience in software engineering, technical research, or...  ...with debugging, testing, and validating code Ability... 
    Suggested
    Remote job
    Part time

    Crossing Hurdles

    New York, NY
    4 days ago
  • $60 per hour

     ...seeking proficient programmers to contribute to cutting-edge AI development. This fully remote role allows candidates...  ...choose projects. Responsibilities include designing coding problems for AI systems and evaluating AI-generated code. Candidates should be fluent in... 
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    New York, NY
    4 days ago
  •  ...looking for  Senior Python and Test Engineer. New York City Location....  ...package. Senior Python & Test Automation Engineer In this role you...  ..., modular design, and clean code practices. Experience...  ...UAT (Planning, Execution and Evaluation) Participate in planning... 
    Work at office
    Visa sponsorship
    2 days per week
    3 days per week

    Brains Workgroup

    New York, NY
    16 days ago
  • $150k - $250k

     ...About Distyl AI Distyl is an applied...  ...AI systems using Evaluation-Driven Development...  ...AI Evaluation Engineers focus on designing...  ...production Python code, build evaluation...  ...and maintain golden test cases and regression...  ...graders, aligning automated judgments with... 
    3 days per week

    Distyl AI

    New York, NY
    1 day ago
  • $180k - $200k

     ...Responsibilities Senior AI Engineer Location: New...  ...LLMs: Build evaluation framework for open-source...  ...deployment systems to enable automated diagnostics, runbook...  ...designing, architecting, testing, and launching...  ...SageMaker), and infra-as-code (Terraform/CloudFormation... 
    Full time

    eClercx

    New York, NY
    3 days ago
  •  ...execution, leveraging AI, data, and...  ...Generative AI and AI Automation to optimize productivity...  ...essence, the AI Engineer ensures that the...  ...: Write low code (ex/python) to connect...  ...meeting penetration testing requirements)....  ...stack. For example, evaluating whether to use a framework... 
    Remote work

    Hiretruss

    New York, NY
    4 days ago
  • Description Software Engineering Principal AI Engineer This is a hands...  ..., or marketing automation operations role. You...  ...are writing production code and shipping working...  ...practices: automated testing, CI/CD, infrastructure...  ...context design, and evaluation frameworks for reliable... 
    Immediate start

    B Capital

    New York, NY
    1 day ago
  • $130k - $250k

     ...Machine Learning Engineer Goldman Sachs is...  ...through large scale automation, best-in-class...  ...classical ML with agentic AI, we deliver...  ...Productionize LLMs: Build evaluation framework for open...  ..., architecting, testing, and launching...  ...), and infra-as-code (Terraform/CloudFormation... 
    Full time
    Temporary work
    Part time
    Worldwide

    Goldman Sachs

    Jersey City, NJ
    3 days ago
  • $72k - $184.44k

     ...of stakeholders. They evaluate compliance with regulations...  ...technology, and automation tools to leverage data...  ...guidance), the Firm's code of conduct, and independence...  ...evaluation and assessment of AI solutions, securing...  ...in AI/ML model and testing and validation, risk management... 
    H1b

    PricewaterhouseCoopers

    New York, NY
    15 days ago
  • $119k - $299.93k

     ...of stakeholders. They evaluate compliance with...  ...wrangling technology, and automation tools to leverage data...  ...), the Firm's code of conduct, and independence...  ...of enterprise AI/ML solutions, setting...  ...professional AI/ML development, engineering, or testing experience What... 
    H1b

    PricewaterhouseCoopers

    New York, NY
    1 day ago
  • $150k - $200k

    AI Deployment Engineer AI & Automation Practice Stable Rock is hiring an AI Deployment Engineer to help build...  ...comfortable writing production code, deploying AI into complex client...  ...production systems. Build monitoring, testing, evaluation, and operational support processes... 
    Immediate start
    Flexible hours

    EmergencyMD

    New York, NY
    13 hours ago
  • $99k - $252.45k

     ...variety of stakeholders. They evaluate compliance with...  ...wrangling technology, and automation tools to leverage data and...  ...audit guidance), the Firm's code of conduct, and independence...  ...As part of the AI Engineering team, you will design, test, and deploy innovative AI... 
    H1b

    PricewaterhouseCoopers

    New York, NY
    13 hours ago
  • $40 per hour

    A technology firm focusing on AI is seeking experienced cybersecurity professionals...  ...for a remote role. You will evaluate AI-generated content, solve...  ..., including skills in penetration testing and incident response, as well as some coding experience. Strong writing and analytical... 
    Remote job
    Hourly pay

    DataAnnotation

    New York, NY
    1 day ago
  • $156.4k - $301k

     ...project, performing coding, debugging, testing and troubleshooting...  ...across Development & Engineering and Architecture teams...  ...management and automation tools such as GitHub...  ...systems that combine AI models with external...  ...Ability to design robust evaluation frameworks for AI... 
    Summer holiday
    Local area
    Flexible hours

    EY

    Hoboken, NJ
    7 days ago
  •  ...degree in Computer Science, Engineering, or related discipline. ~...  ...years of experience in UI automation with a strong focus on UFT (Unified Functional Testing) in enterprise environments...  ...~ Experience using Agentic AI tools to support coding and automation tasks. ~ Excellent... 
    Work from home
    Flexible hours

    Sparc Technology Services Inc

    New York, NY
    6 days ago
  •  ...solutions provider is looking for Software Engineering, Data Science, and Systems Design Experts proficient in C programming to evaluate responses generated by large language...  ...completeness of LLM outputs, as well as conducting code testing and validation. Candidates should... 
    Remote job
    Flexible hours

    Crossing Hurdles

    New York, NY
    4 days ago
  • $127 per hour

     ...and Power Platform AI capabilities...  ...cases to advanced, code‑first AI solution...  ...Design, build, test, deploy, and manage...  ...workflows using Power Automate, structured knowledge...  ...capabilities and evaluate their potential impact...  ...of LLMs, prompt engineering, RAG, and vector... 
    Work at office
    Local area
    Remote work
    1 day per week

    MUFG

    Jersey City, NJ
    9 days ago
  • Sigma Software LLC is seeking a Principal AI-Augmented Test Automation Engineer to lead and scale AI-native E2E test automation offerings. This role involves shaping engagement models and establishing scalable testing processes in AI-driven quality engineering. The successful... 
    Remote job

    Sigma Software LLC

    New York, NY
    2 days ago
  • $100k - $110k

    Litify is seeking an AI QA Engineer to join the Product & Engineering team located anywhere in the United States. This role focuses on managing AI-generated test outputs, improving quality assurance, and driving transformation in an agile environment. Applicants should... 

    Litify

    New York, NY
    2 days ago
  •  ...Title : Lead Software Engineer in Test Team : Global...  ...scalable, and reliable automated testing solutions across...  .... You will also drive AI-augmented development...  ...test authoring velocity, code reviews, and defect...  ...root-cause analysis. • Evaluate and integrate functional... 
    Contract work
    Local area
    Remote work
    Shift work

    Paramount

    New York, NY
    5 hours ago
  •  ...Description Senior Software Engineer - AI II Recruiter: Symbiotic...  ...workflows through automation and intelligence. About the...  ...and compliance through strong testing frameworks, monitoring solutions...  ...proof-of-concepts (POCs) to evaluate emerging technologies and... 
    Work at office
    Remote work
    Flexible hours

    Symbiotic Corp.

    New York, NY
    3 days ago
  • $170k - $200k

     ...Metropolis is seeking a Senior AI Engineer to join our Applied AI...  ...shipping AI-powered tools and automation pipelines that replace manual...  ...maintain quality over time Evaluate and integrate emerging AI tooling...  ...write clean, maintainable code and are comfortable building... 
    Temporary work
    Work at office
    Local area
    Immediate start

    Metropolis Corp

    New York, NY
    13 hours ago
  • $60 per hour

    A leading AI development team is seeking proficient programmers to contribute to cutting-edge AI systems. Enjoy...  ...projects. Key responsibilities include designing coding problems for AI, providing feedback, and evaluating AI code. Candidates should be fluent in English and... 
    Remote job

    DataAnnotation

    New York, NY
    2 days ago
  • VP AI Innovation & Solutions Engineering location_on New York, NY, United States...  ...transferring the application code, cloud integration...  ...standards. Drive automation of deployment, testing, and monitoring processes...  ...practices, continuously evaluating and recommending new... 
    Full time
    Temporary work
    Work at office

    Goldman Sachs Bank AG

    New York, NY
    3 days ago
  • $60 per hour

     ...proficient programmers to contribute remotely to AI systems development. Ideal candidates should have...  ...JavaScript and Python. Responsibilities include designing coding problems for AI training, writing clear code, and evaluating AI-generated scripts. This role offers... 
    Remote job
    Hourly pay

    DataAnnotation

    Brooklyn, NY
    2 days ago
  • $60 per hour

    A cutting-edge AI development company is looking for proficient programmers to contribute to AI systems...  ...-time jobs. Responsibilities include designing coding problems for training AI systems, writing code snippets, and evaluating AI-generated code. Candidates should have... 
    Remote job
    Full time

    DataAnnotation

    New York, NY
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Evaluation Engineer: Code & Test Automation. Be the first to apply!