Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Computer Vision Specialist for AI Model Evaluation

$80 - $110 per hour

SaidGig

Join a leading AI lab''s cutting-edge GenAI team to be at the core of the AI revolution, where your expertise fuels the development of the most advanced AI models. Overview

A leading AI lab is building and evaluating frontier models and needs experienced computer vision practitioners to act as ground-truth experts. You will author challenging, real-world vision tasks, generate reference solutions, and evaluate model outputs to surface reasoning and capability gaps in a target model.

The work centers on designing robust computer vision problems (e.g., detection, segmentation, recognition, vision-language and multimodal reasoning, or generation), building them out with executable tests where applicable, and then analyzing how models and agents behave against them. All applicants are expected to have working proficiency in Python.

This is a part-time W-2 employment position with Cincinnatus LLC, with the opportunity to be placed at a leading AI lab as part of their extended workforce. This role is fully remote within the United States, at approximately 20 hours per week.

Key Responsibilities
  • Task design and development: Design challenging, real-world computer vision problems drawn from your area of expertise (e.g., object detection, segmentation, recognition, multimodal/vision-language reasoning, or image/video generation) that target specific capability gaps in a frontier AI model.
  • Spec and golden-solution generation: Integrate the problems into an agentic development environment, preparing all necessary components using Python.
  • Evaluation and analysis: Evaluate the target model''s performance on your tasks.
  • Headroom identification: Identify tasks where the target model fails, and classify the nature of the failure.
  • Collaborate with other experts: Work alongside fellow subject-matter experts to keep evaluations consistent and accurate.
Core Qualifications
  • Deep, hands-on experience in computer vision, from applied industry work, research, or a graduate/PhD background in the field.
  • Working proficiency in Python, applied in research, industry, or open-source work (not theoretical familiarity).
  • Strong command of modern computer vision methods, including deep learning architectures, evaluation, and standard tooling (e.g., PyTorch/TensorFlow).
  • Ability to engage reliably for approximately 20 hours per week.
  • Past experience in AI training, model evaluation, or data annotation is preferred.
  • Strong written communication and the ability to work independently and manage your own time.
Work Terms

This is a part-time W-2 employment position, fully remote within the United States, with an expected commitment of approximately 20 hours per week.

Compensation

The hourly compensation for this role ranges from $80 to $110.

Eligibility

Applicants must be legally eligible to work in the United States.

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Computer Vision Specialist for AI Model Evaluation in United States vacancy
  •  ...Washington is looking for an Analyst to train AI models. The role involves providing complex mathematics problems to AI chatbots and evaluating their outputs for correctness....  ...with a focus on applied mathematics or computer science. This is a flexible, independent... 
    Computer
    Remote job
    Hourly pay
    Contract work
    Flexible hours

    DataAnnotation

    Washington DC
    3 days ago
  • $224k - $356.5k

    NVIDIA has been transforming computer graphics, PC gaming, and accelerated...  ...the unlimited potential of AI to define the next era of...  ...never been done before takes vision, innovation, and the world’s...  ...Principal Deep Learning Engineer — Model Evaluation & AI Systems, you will play a... 
    Computer

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • Software Engineer (Model Evaluation & Benchmarking) About the Role We are hiring Engineers focused on AI Model Evaluation to build the systems...  ...on evaluating generative and vision-based models through...  ...pipelines Qualifications Degree in Computer Science, AI, Engineering, or... 
    Computer

    SpreeAI

    San Francisco, CA
    5 days ago
  • $197.3k - $225.1k

     ...Lead AI Engineer (Vision model customization, VLM) Overview At Capital One, we are creating responsible...  ...similarity search, guardrails, model evaluation, experimentation, governance, and...  ...: Bachelor's degree in Computer Science, AI, Electrical Engineering,... 
    Computer
    Full time
    Part time
    Local area

    Capital One

    Cambridge, MA
    3 days ago
  • $60 per hour

     ...contribute to developing cutting-edge AI systems, while enjoying the...  ...advance AI development. AI models are increasingly capable of...  ...the-art AI models on tasks like evaluating AI-generated quantitative...  ...field is preferred (Statistics, Computer Science, Mathematics, Engineering... 
    Computer
    Hourly pay
    Full time
    Remote work
    Flexible hours

    DataAnnotation

    Sioux Falls, SD
    5 days ago
  • $100 per hour

     ...Geoscientists leverage their expertise to support AI research by evaluating AI-generated content and providing critical feedback on geology concepts...  ...internet connection and access to a desktop or laptop computer. Schedule: Flexible and asynchronous, with no minimum... 
    Computer
    Remote job
    Hourly pay
    Full time
    Contract work
    For contractors
    Flexible hours

    SaidGig

    United States
    16 days ago
  • $197.3k - $225.1k

    Lead AI Engineer (Vision Model Customization, VML) Capital One is a leader in applying machine learning...  ...similarity search, guardrails, model evaluation, experimentation, governance, and...  ...Qualifications Bachelor’s degree in Computer Science, AI, Electrical Engineering,... 
    Computer
    Local area

    Capital One

    New York, NY
    4 days ago
  •  ...leverage their clinical expertise to support AI research through flexible, project-based...  ...real-world experience in neurology to evaluate AI-generated content, providing valuable...  ...connection and access to a desktop or laptop computer. Schedule: Flexible and asynchronous,... 
    Computer
    Remote job
    Full time
    Contract work
    For contractors
    Flexible hours

    SaidGig

    United States
    16 days ago
  • $75 per hour

     ...testing, and maintaining equipment to support AI research through flexible, hourly...  ...to utilize your real-world experience to evaluate AI-generated content and provide valuable...  ...connection and access to a desktop or laptop computer. Schedule: Flexible and asynchronous,... 
    Computer
    Remote job
    Hourly pay
    Full time
    Contract work
    For contractors
    Flexible hours

    SaidGig

    United States
    16 days ago
  • $150 per hour

    Prolific is looking for PhD experts in STEM fields to train and evaluate AI models. Successful candidates will work on complex problems while...  ...role requires a strong understanding of subjects such as Computer Science, Statistics, and Mechanical Engineering. Join Prolific... 
    Computer

    Prolific

    New Bremen, OH
    3 days ago
  • Refresh AI is seeking a Research Engineer in San Francisco to push the boundaries of benchmarking technology. You will build benchmarks that labs use for evaluating coding abilities and computer-use capability. Your role will require expertise in reinforcement learning... 
    Computer
    Full time

    Refresh AI

    San Francisco, CA
    3 days ago
  •  ...in the United States is seeking a Data Scientist to train AI models and evaluate their outputs. In this role, you will ensure quality and performance...  ...understanding of data analysis and skills in statistics or computer science. This position offers competitive hourly pay and... 
    Computer
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    Brooklyn, NY
    1 day ago
  • $60 per hour

    Prolific is looking for Computer Science Specialists in Washington, DC to join their Expert Network. Successful candidates will work on evaluating AI models, ranking AI-generated responses, and reviewing technical research. Responsibilities include fact-checking research... 
    Computer

    Prolific

    Washington DC
    3 days ago
  • $40 per hour

    A leading AI solutions company is seeking a Data Scientist to train AI models and enhance their quality through evaluation and problem-solving. This flexible and remote role allows you to...  ...deep understanding of statistics and computer science. Only applicants located in the... 
    Computer
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    New York, NY
    2 days ago
  • $301.75k - $355k

     ...vertically integrated AI infrastructure company...  ...time. The demand for AI compute is boundless, and power...  ...Senior Director for the Model LifeCycle team will undertake...  ...: versioning, lineage, evaluation, and reproducible fine‑...  ...health, dental & vision insurance ~ Employer... 
    Computer
    Temporary work

    Crusoe Energy Systems LLC

    San Francisco, CA
    2 days ago
  •  ...remote Kotlin Engineer to review AI-generated responses and create...  ...AI performance, and ensuring model accuracy. The ideal candidate has a Bachelor's degree in Computer Science, 2+ years of Kotlin...  ...network and requires critical evaluation of technical concepts. #J-188... 
    Computer
    Remote job

    SME Careers

    New York, NY
    3 days ago
  • $141.8k - $258.6k

    AI Experience Researcher, Product Evaluation, Vision Products Group Sunnyvale, California, United States Machine Learning...  ...— to recognize patterns in model behaviors and outputs, and to develop...  ...degree in Cognitive Psychology, Human-Computer Interaction (HCI), User... 
    Computer
    Relocation

    Apple Inc.

    Sunnyvale, CA
    4 days ago
  • A leading AI company is seeking a Biology Specialist to help fine-tune large language models. Ideal candidates will be pursuing or hold a Ph.D. in Biology or a related field and possess strong research skills. The role involves solving complex biological problems and collaborating... 
    Remote job

    Turing

    Boston, MA
    18 days ago
  • $60 per hour

    Prolific is looking for Computer Science Specialists to join our Expert Network to help train AI models. Candidates with a BSc in Computer Science will evaluate responses to technical prompts and review scientific papers. The role offers competitive pay rates up to $60... 
    Computer
    Remote job
    Work from home
    Flexible hours

    Prolific

    Tucson, AZ
    5 days ago
  • $197.3k - $225.1k

     ...Overview Lead AI Engineer (Vision model customization, VML) At Capital One, we are creating responsible...  ...similarity search, guardrails, model evaluation, experimentation, governance, and...  ...: Bachelor's degree in Computer Science, AI, Electrical Engineering,... 
    Computer
    Full time
    Part time
    Local area

    Capital One

    New York, NY
    22 days ago
  • $150k

     ...Institute of Foundation Models We are a dedicated...  ...the next generation of AI builders, and drive transformative...  ...for high-performance computing in deep learning,...  ...Research Scientist in the Vision Language Model (VLM)...  ...and post-training, and evaluation benchmarks. The role combines... 
    Computer

    Institute of Foundation Models

    Sunnyvale, CA
    5 days ago
  • $115k - $157k

    # Senior SAS/Python Model Validation and Modernization...  ...stakeholders to evaluate model performance, conduct...  ...* Bachelor’s degree in Computer Science, Data Science,...  ...technologies, particularly AI* Strategic mindset with...  ...medical, dental, and vision plans.* You can join one... 
    Computer
    Full time
    Work at office
    Remote work

    TryApplyNow

    Mc Lean, VA
    3 days ago
  • $93.6k - $220.4k

     ...Safety (T&S) Responsible AI Policy team's mission...  ...the development of GenAI models and applications are...  ...drive end-to-end policy to evaluate workflows for your...  ...experience. Degree in Computer Science, Human-Computer...  ...to medical, dental, and vision insurance, a 401(k) savings... 
    Computer
    Temporary work
    Local area

    TikTok

    San Francisco, CA
    4 days ago
  • Medical Professionals can apply their expertise to evaluate AI models and enhance their understanding of healthcare tasks and terminology. This role involves assessing content relevant to your field and providing clear, structured feedback to improve AI performance. No... 
    Hourly pay
    Temporary work
    Part time
    Remote work
    Flexible hours

    SaidGig

    United States
    15 hours ago
  • $40 per hour

     ...D Mathematician to work remotely. This role involves training AI models and measuring their progress by providing complex mathematical...  ...mathematical reasoning and familiarity with applied mathematics or computer science. The position offers flexible scheduling and... 
    Computer
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    Oklahoma City, OK
    1 day ago
  • $272k - $431.25k

     ...seeking a Senior Research Manager to lead world‑model evaluation and benchmarking across NVIDIA’s Physical AI model portfolio. The role will build a team and...  ...Strong research background in machine learning, computer vision, multimodal AI, robotics, world models, representation... 
    Computer

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $300k - $320k

     ...role: We are seeking a Technical Program Manager to lead our AI model evaluation initiatives across multiple workstreams. This role will be...  ...equity donation matching. Comprehensive health, dental, and vision insurance for you and all your dependents. 401(k) plan with... 
    Work at office
    Home office
    Visa sponsorship
    Relocation package

    Anthropic

    Seattle, WA
    3 days ago
  •  ...Chicago-headquartered medical‑AI company applying artificial intelligence...  ...cancer detection, with a vision of a future where cancer no...  ...lives. Our foundational model, ABCD (AI Biomarker Cancer Detection...  ...Steward governs how ABCD is evaluated, validated, and released — owning... 

    accentedge

    Chicago, IL
    1 day ago
  •  ...Research Engineer - Language Model Pre-Training , you\'ll...  ..., processing, and evaluation Architecture and methodology...  ...a scientific subject (Computer Science, EE/EECS, Math,...  ...do and love discussing AI Benefits and Perks:...  ...medical, dental, vision, and FSA plans Competitive... 
    Computer
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    3 days ago
  • $75 per hour

    Cartographers and photogrammetrists can apply their expertise to evaluate AI models and enhance their understanding of geographical data. In...  ...charts. Utilize precision stereoplotting apparatus and computer graphics for delineating topographic and cultural features.... 
    Computer
    Remote job
    Flexible hours

    SaidGig

    United States
    16 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Computer Vision Specialist for AI Model Evaluation. Be the first to apply!