AI Evaluation Engineer: Code & Test Automation
$80 per hourMindrift
A tech consulting firm is seeking experienced freelance contributors to develop coding test cases for AI systems, allowing for flexible part-time work. Ideal candidates should have a degree in computer science or related fields, over 5 years of experience in software development mainly in Python, and a strong background in full-stack development. Compensation may reach up to $80/hour based on project-specific criteria, fostering a stimulating work environment with the freedom to manage task schedules flexibly. #J-18808-Ljbffr Mindrift
- TWG Global AI in New York is seeking a Senior or Staff AI Software Engineer in Test to develop test automation frameworks for AI products. The ideal candidate will have significant experience in Python and Java, along with a strong software engineering background focused...Suggested
$80 per hour
A leading AI project company in the United States is seeking experienced software... ...in Computer Science or Software Engineering, at least 5 years of experience in... ...Responsibilities include reviewing coding tasks, writing functional tests, and analyzing AI model failures. The...SuggestedContract workFlexible hours$60 per hour
A leading AI development company seeks proficient programmers to contribute to cutting-edge AI systems while... ...flexible schedule. Responsibilities include solving coding problems, writing high-quality code, and evaluating AI-generated code. Ideal candidates should have a...SuggestedRemote jobHourly payFlexible hours$60 per hour
A prominent AI technology company seeks proficient programmers to join a remote coding team that drives AI development. Responsibilities include designing coding problems, writing high-quality code, and evaluating AI-generated code. The ideal candidate is fluent in English...SuggestedRemote job- ...Responsibilities Review and refine AI-generated prompts, responses, and code Validate algorithms and... ...benchmarking efforts to evaluate and compare model... ...experience in software engineering, technical research, or... ...with debugging, testing, and validating code Ability...SuggestedRemote jobPart time
$60 per hour
...seeking proficient programmers to contribute to cutting-edge AI development. This fully remote role allows candidates... ...choose projects. Responsibilities include designing coding problems for AI systems and evaluating AI-generated code. Candidates should be fluent in...Remote jobHourly payFlexible hours- ...looking for Senior Python and Test Engineer. New York City Location.... ...package. Senior Python & Test Automation Engineer In this role you... ..., modular design, and clean code practices. Experience... ...UAT (Planning, Execution and Evaluation) Participate in planning...Work at officeVisa sponsorship2 days per week3 days per week
$150k - $250k
...About Distyl AI Distyl is an applied... ...AI systems using Evaluation-Driven Development... ...AI Evaluation Engineers focus on designing... ...production Python code, build evaluation... ...and maintain golden test cases and regression... ...graders, aligning automated judgments with...3 days per week$180k - $200k
...Responsibilities Senior AI Engineer Location: New... ...LLMs: Build evaluation framework for open-source... ...deployment systems to enable automated diagnostics, runbook... ...designing, architecting, testing, and launching... ...SageMaker), and infra-as-code (Terraform/CloudFormation...Full time- ...execution, leveraging AI, data, and... ...Generative AI and AI Automation to optimize productivity... ...essence, the AI Engineer ensures that the... ...: Write low code (ex/python) to connect... ...meeting penetration testing requirements).... ...stack. For example, evaluating whether to use a framework...Remote work
- Description Software Engineering Principal AI Engineer This is a hands... ..., or marketing automation operations role. You... ...are writing production code and shipping working... ...practices: automated testing, CI/CD, infrastructure... ...context design, and evaluation frameworks for reliable...Immediate start
$130k - $250k
...Machine Learning Engineer Goldman Sachs is... ...through large scale automation, best-in-class... ...classical ML with agentic AI, we deliver... ...Productionize LLMs: Build evaluation framework for open... ..., architecting, testing, and launching... ...), and infra-as-code (Terraform/CloudFormation...Full timeTemporary workPart timeWorldwide$72k - $184.44k
...of stakeholders. They evaluate compliance with regulations... ...technology, and automation tools to leverage data... ...guidance), the Firm's code of conduct, and independence... ...evaluation and assessment of AI solutions, securing... ...in AI/ML model and testing and validation, risk management...H1b$119k - $299.93k
...of stakeholders. They evaluate compliance with... ...wrangling technology, and automation tools to leverage data... ...), the Firm's code of conduct, and independence... ...of enterprise AI/ML solutions, setting... ...professional AI/ML development, engineering, or testing experience What...H1b$150k - $200k
AI Deployment Engineer AI & Automation Practice Stable Rock is hiring an AI Deployment Engineer to help build... ...comfortable writing production code, deploying AI into complex client... ...production systems. Build monitoring, testing, evaluation, and operational support processes...Immediate startFlexible hours$99k - $252.45k
...variety of stakeholders. They evaluate compliance with... ...wrangling technology, and automation tools to leverage data and... ...audit guidance), the Firm's code of conduct, and independence... ...As part of the AI Engineering team, you will design, test, and deploy innovative AI...H1b$40 per hour
A technology firm focusing on AI is seeking experienced cybersecurity professionals... ...for a remote role. You will evaluate AI-generated content, solve... ..., including skills in penetration testing and incident response, as well as some coding experience. Strong writing and analytical...Remote jobHourly pay$156.4k - $301k
...project, performing coding, debugging, testing and troubleshooting... ...across Development & Engineering and Architecture teams... ...management and automation tools such as GitHub... ...systems that combine AI models with external... ...Ability to design robust evaluation frameworks for AI...Summer holidayLocal areaFlexible hours- ...degree in Computer Science, Engineering, or related discipline. ~... ...years of experience in UI automation with a strong focus on UFT (Unified Functional Testing) in enterprise environments... ...~ Experience using Agentic AI tools to support coding and automation tasks. ~ Excellent...Work from homeFlexible hours
- ...solutions provider is looking for Software Engineering, Data Science, and Systems Design Experts proficient in C programming to evaluate responses generated by large language... ...completeness of LLM outputs, as well as conducting code testing and validation. Candidates should...Remote jobFlexible hours
$127 per hour
...and Power Platform AI capabilities... ...cases to advanced, code‑first AI solution... ...Design, build, test, deploy, and manage... ...workflows using Power Automate, structured knowledge... ...capabilities and evaluate their potential impact... ...of LLMs, prompt engineering, RAG, and vector...Work at officeLocal areaRemote work1 day per week- Sigma Software LLC is seeking a Principal AI-Augmented Test Automation Engineer to lead and scale AI-native E2E test automation offerings. This role involves shaping engagement models and establishing scalable testing processes in AI-driven quality engineering. The successful...Remote job
$100k - $110k
Litify is seeking an AI QA Engineer to join the Product & Engineering team located anywhere in the United States. This role focuses on managing AI-generated test outputs, improving quality assurance, and driving transformation in an agile environment. Applicants should...- ...Title : Lead Software Engineer in Test Team : Global... ...scalable, and reliable automated testing solutions across... .... You will also drive AI-augmented development... ...test authoring velocity, code reviews, and defect... ...root-cause analysis. • Evaluate and integrate functional...Contract workLocal areaRemote workShift work
- ...Description Senior Software Engineer - AI II Recruiter: Symbiotic... ...workflows through automation and intelligence. About the... ...and compliance through strong testing frameworks, monitoring solutions... ...proof-of-concepts (POCs) to evaluate emerging technologies and...Work at officeRemote workFlexible hours
$170k - $200k
...Metropolis is seeking a Senior AI Engineer to join our Applied AI... ...shipping AI-powered tools and automation pipelines that replace manual... ...maintain quality over time Evaluate and integrate emerging AI tooling... ...write clean, maintainable code and are comfortable building...Temporary workWork at officeLocal areaImmediate start$60 per hour
A leading AI development team is seeking proficient programmers to contribute to cutting-edge AI systems. Enjoy... ...projects. Key responsibilities include designing coding problems for AI, providing feedback, and evaluating AI code. Candidates should be fluent in English and...Remote job- VP AI Innovation & Solutions Engineering location_on New York, NY, United States... ...transferring the application code, cloud integration... ...standards. Drive automation of deployment, testing, and monitoring processes... ...practices, continuously evaluating and recommending new...Full timeTemporary workWork at office
$60 per hour
...proficient programmers to contribute remotely to AI systems development. Ideal candidates should have... ...JavaScript and Python. Responsibilities include designing coding problems for AI training, writing clear code, and evaluating AI-generated scripts. This role offers...Remote jobHourly pay$60 per hour
A cutting-edge AI development company is looking for proficient programmers to contribute to AI systems... ...-time jobs. Responsibilities include designing coding problems for training AI systems, writing code snippets, and evaluating AI-generated code. Candidates should have...Remote jobFull time
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Evaluation Engineer: Code & Test Automation. Be the first to apply!
- senior ai engineer New York, NY
- ai ml engineer New York, NY
- ai engineer remote New York, NY
- ai engineer New York, NY
- ai prompt engineer New York, NY
- ai developer New York, NY
- ai research engineer New York, NY
- machine learning ai engineer New York, NY
- senior software test automation engineer New York, NY
- qa automation engineer New York, NY



