Machine Learning and NLP Practitioner for AI Model Evaluation

$80 - $110 per hour

SaidGig

Join a cutting-edge GenAI team at a leading AI lab, where your expertise will be pivotal in developing advanced AI models. This role focuses on designing and evaluating machine learning and natural language processing tasks that will help identify and address capability gaps in frontier AI models.

Key Responsibilities

Task design and development: Create challenging, real-world ML and NLP problems from your area of expertise, targeting specific capability gaps in a frontier AI model.
Spec and golden-solution generation: Prepare all necessary components for the problems in an agentic development environment using Python.
Evaluation and analysis: Assess the target model's performance on your tasks.
Headroom identification: Identify and classify tasks where the target model fails.
Collaborate with other experts: Work with fellow subject-matter experts to ensure consistent and accurate evaluations.

Core Qualifications

Deep, hands-on experience in machine learning and/or natural language processing, gained through applied industry work, research, or a graduate/PhD background.
Working proficiency in Python, applied in research, industry, or open-source projects.
Strong command of modern ML/NLP methods, including model training and evaluation, transformers, large language models, and standard tooling.
Availability to engage for approximately 20 hours per week.
Preferred: experience in AI training, model evaluation, or data annotation.
Strong written communication skills and the ability to work independently while managing your own time.

Work Terms

This is a part-time W-2 employment position with Cincinnatus LLC, offering the opportunity to work remotely within the United States.

Compensation

Hourly compensation ranges from $80 to $110.

Eligibility

This role is open to candidates located in the United States.

Vacancy posted 21 days ago
