Computer Vision Specialist for AI Model Evaluation
$80 - $110 per hour
SaidGig
Join a leading AI lab's cutting-edge GenAI team to be at the core of the AI revolution, where your expertise fuels the development of the most advanced AI models.
Overview
A leading AI lab is building and evaluating frontier models and needs experienced computer vision practitioners to act as ground-truth experts. You will author challenging, real-world vision tasks, generate reference solutions, and evaluate model outputs to surface reasoning and capability gaps in a target model.
The work centers on designing robust computer vision problems (e.g., detection, segmentation, recognition, vision-language and multimodal reasoning, or generation), building them out with executable tests where applicable, and then analyzing how models and agents behave against them. All applicants are expected to have working proficiency in Python.
This is a part-time W-2 employment position with Cincinnatus LLC, with the opportunity to be placed at a leading AI lab as part of their extended workforce. This role is fully remote within the United States, at approximately 20 hours per week.
Key Responsibilities
- Task design and development: Design challenging, real-world computer vision problems drawn from your area of expertise (e.g., object detection, segmentation, recognition, multimodal/vision-language reasoning, or image/video generation) that target specific capability gaps in a frontier AI model.
- Spec and golden-solution generation: Integrate the problems into an agentic development environment, preparing all necessary components using Python.
- Evaluation and analysis: Evaluate the target model's performance on your tasks.
- Headroom identification: Identify tasks where the target model fails, and classify the nature of the failure.
- Collaborate with other experts: Work alongside fellow subject-matter experts to keep evaluations consistent and accurate.
Qualifications
- Deep, hands-on experience in computer vision, from applied industry work, research, or a graduate/PhD background in the field.
- Working proficiency in Python, applied in research, industry, or open-source work (not theoretical familiarity).
- Strong command of modern computer vision methods, including deep learning architectures, evaluation, and standard tooling (e.g., PyTorch/TensorFlow).
- Ability to engage reliably for approximately 20 hours per week.
- Past experience in AI training, model evaluation, or data annotation is preferred.
- Strong written communication and the ability to work independently and manage your own time.
This is a part-time W-2 employment position, fully remote within the United States, with an expected commitment of approximately 20 hours per week.
Compensation
The hourly compensation for this role ranges from $80 to $110.
Eligibility
Applicants must be legally eligible to work in the United States.



