Remote | Urdu-English AI Safety Red Team Evaluator — $20-$30/hour

$20 - $30 per hour

24-MAG

Remote job

We are sharing a specialised part-time consulting opportunity for Urdu-English bilingual professionals experienced in AI safety evaluation, red team testing, adversarial review, vulnerability classification, and structured feedback on sensitive text-based AI outputs.

This role supports current and upcoming remote consulting opportunities focused on AI safety evaluation, bilingual red team testing, conversational model assessment, misuse-risk review, vulnerability annotation, and high-quality project execution. Selected professionals will test AI systems using structured adversarial scenarios, identify safety weaknesses, classify risks, and produce clear English-language evaluation artifacts across English and Urdu contexts.

Key Responsibilities

Professionals in this role may contribute to:

Bilingual AI Safety & Red Team Testing

Review English and Urdu AI outputs for safety, reliability, bias, misinformation, and harmful-behavior risks
Stress-test conversational AI models and agents using structured adversarial scenarios
Evaluate model behavior across multi-turn conversations, sensitive topics, and edge-case prompts
Identify vulnerabilities that require stronger safety controls, clearer refusals, or improved response quality

Vulnerability Classification & Risk Review

Annotate failures, classify vulnerabilities, and flag recurring safety patterns
Apply taxonomies, benchmarks, and project-specific playbooks to keep testing consistent
Assess misuse cases, bias exploitation, prompt-injection scenarios, and socio-technical risk patterns at a high level
Generate high-quality human evaluation data through careful review and structured judgment

Reproducible Documentation & Evaluation Artifacts

Produce clear reports, datasets, test cases, and written summaries that support model improvement
Document findings reproducibly so results can be reviewed, compared, and acted upon
Explain risks clearly for both technical and non-technical audiences
Maintain accuracy, consistency, and strong attention to detail across submitted evaluations

Ideal Profile

Strong candidates may have:

Native-level fluency in both English and Urdu
Prior experience in AI red teaming, adversarial testing, cybersecurity, trust and safety, socio-technical risk review, or conversational AI evaluation
Ability to think adversarially while staying structured, careful, and methodical
Experience using frameworks, benchmarks, or rubrics rather than unstructured testing alone
Strong written communication skills and ability to explain safety findings clearly
Comfort reviewing text-based content involving sensitive topics under clear guidelines
Adaptability across project types, safety categories, and evaluation workflows

Educational Background

Formal degree requirements may vary based on project needs
Backgrounds in AI safety, cybersecurity, linguistics, policy, trust and safety, social science, psychology, writing, data evaluation, or technical analysis may be highly relevant
Practical experience in red team testing, model evaluation, content risk analysis, or structured review work may also be valuable

Nice to Have

Experience with adversarial ML concepts, jailbreak datasets, prompt injection, RLHF/DPO attack patterns, or model behavior testing
Cybersecurity experience such as penetration testing, exploit analysis, reverse engineering, or security assessment
Socio-technical risk experience involving harassment, misinformation, abuse analysis, bias testing, or conversational AI safety
Creative probing background, including psychology, acting, writing, role-play design, or unconventional adversarial thinking
Experience producing reproducible reports, labeled datasets, structured risk notes, or benchmark-style evaluation artifacts

Why This Opportunity

Apply Urdu-English bilingual expertise to structured AI safety and red team evaluation work
Contribute to stronger, safer, and more reliable AI systems through careful adversarial testing
Work on flexible assignments aligned with language skills, safety judgment, and structured analysis
Build experience in AI safety evaluation based on human-generated data and in bilingual risk review
Remote structure with competitive hourly compensation

Contract Details

Independent contractor role
Fully remote with flexible scheduling
Eligible professionals may be based in approved project locations depending on project needs
Native-level fluency in both English and Urdu is required for project work
Work is text-based and may involve sensitive topics such as bias, misinformation, harassment, or harmful-behavior risks
Topic areas will be communicated before exposure to content, and participation in higher-sensitivity projects may depend on candidate comfort and project fit
Part-time commitment depending on project availability
Competitive rates between $20 and $30 per hour, depending on expertise and project scope
Weekly payments via Stripe or Wise
Projects may be extended, shortened, or adjusted depending on scope and performance
Work will not involve access to confidential or proprietary information from any employer, client, or institution

About the Platform

This opportunity is available through 24-MAG LLC. We connect experienced professionals with remote consulting opportunities across technical, evaluation, and project-based workstreams.

By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy.

Vacancy posted 9 days ago

