Remote | Urdu-English AI Safety Red Team Evaluator — $20-$30/hour
$20 - $30 per hour24-MAG
- Remote job
We are sharing a specialised part-time consulting opportunity for Urdu-English bilingual professionals experienced in AI safety evaluation, red team testing, adversarial review, vulnerability classification, and structured feedback on sensitive text-based AI outputs.
This role supports current and upcoming remote consulting opportunities focused on AI safety evaluation, bilingual red team testing, conversational model assessment, misuse-risk review, vulnerability annotation, and high-quality project execution. Selected professionals will test AI systems using structured adversarial scenarios, identify safety weaknesses, classify risks, and produce clear English-language evaluation artifacts across English and Urdu contexts.
Key Responsibilities
Professionals in this role may contribute to:
Bilingual AI Safety & Red Team Testing
- Review English and Urdu AI outputs for safety, reliability, bias, misinformation, and harmful-behavior risks
- Stress-test conversational AI models and agents using structured adversarial scenarios
- Evaluate model behavior across multi-turn conversations, sensitive topics, and edge-case prompts
- Identify vulnerabilities that require stronger safety controls, clearer refusals, or improved response quality
Vulnerability Classification & Risk Review
- Annotate failures, classify vulnerabilities, and flag recurring safety patterns
- Apply taxonomies, benchmarks, and project-specific playbooks to keep testing consistent
- Assess misuse cases, bias exploitation, prompt-injection scenarios, and socio-technical risk patterns at a high level
- Generate high-quality human evaluation data through careful review and structured judgment
Reproducible Documentation & Evaluation Artifacts
- Produce clear reports, datasets, test cases, and written summaries that support model improvement
- Document findings reproducibly so results can be reviewed, compared, and acted upon
- Explain risks clearly for both technical and non-technical audiences
- Maintain accuracy, consistency, and strong attention to detail across submitted evaluations
Ideal Profile
Strong candidates may have:
- Native-level fluency in both English and Urdu
- Prior experience in AI red teaming, adversarial testing, cybersecurity, trust and safety, socio-technical risk review, or conversational AI evaluation
- Ability to think adversarially while staying structured, careful, and methodical
- Experience using frameworks, benchmarks, or rubrics rather than unstructured testing alone
- Strong written communication skills and ability to explain safety findings clearly
- Comfort reviewing text-based content involving sensitive topics under clear guidelines
- Adaptability across project types, safety categories, and evaluation workflows
Educational Background
- Formal degree requirements may vary based on project needs
- Backgrounds in AI safety, cybersecurity, linguistics, policy, trust and safety, social science, psychology, writing, data evaluation, or technical analysis may be highly relevant
- Practical experience in red team testing, model evaluation, content risk analysis, or structured review work may also be valuable
Nice to Have
- Experience with adversarial ML concepts, jailbreak datasets, prompt injection, RLHF/DPO attack patterns, or model behavior testing
- Cybersecurity experience such as penetration testing, exploit analysis, reverse engineering, or security assessment
- Socio-technical risk experience involving harassment, misinformation, abuse analysis, bias testing, or conversational AI safety
- Creative probing background, including psychology, acting, writing, role-play design, or unconventional adversarial thinking
- Experience producing reproducible reports, labeled datasets, structured risk notes, or benchmark-style evaluation artifacts
Why This Opportunity
- Apply Urdu-English bilingual expertise to structured AI safety and red team evaluation work
- Contribute to stronger, safer, and more reliable AI systems through careful adversarial testing
- Work on flexible assignments aligned with language skills, safety judgment, and structured analysis
- Build experience in human data-driven AI safety evaluation and bilingual risk review
- Remote structure with competitive hourly compensation
Contract Details
- Independent contractor role
- Fully remote with flexible scheduling
- Eligible professionals may be based in approved project locations depending on project needs
- Native-level English and Urdu fluency are required for project work
- Work is text-based and may involve sensitive topics such as bias, misinformation, harassment, or harmful-behavior risks
- Topic areas will be communicated before exposure to content, and participation in higher-sensitivity projects may depend on candidate comfort and project fit
- Part-time commitment depending on project availability
- Competitive rates between $20–$30 per hour depending on expertise and project scope
- Weekly payments via Stripe or Wise
- Projects may be extended, shortened, or adjusted depending on scope and performance
- Work will not involve access to confidential or proprietary information from any employer, client, or institution
About the Platform
This opportunity is available through 24-MAG LLC. We connect experienced professionals with remote consulting opportunities across technical, evaluation, and project-based workstreams.
By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy: .
$20 - $22 per hour
...talent with leading AI research labs. Headquartered... .... Position: AI Safety Experts — English & Urdu Type: Contract... ...Compensation: $20–$22/hour Location: Remote Role Responsibilities Red team conversational AI... ...Process (Takes 20–30 mins to complete)...Remote workEnglish language skillsUrdu language skillsContract workSummer work$10 - $20 per hour
...consulting opportunity for Urdu-English bilingual... ...experienced in language evaluation, LLM response... ...and upcoming remote consulting opportunities... ...on Urdu-language AI response... ...with competitive hourly compensation... ...rates between $10–$20 per hour depending...Remote jobEnglish language skillsUrdu language skillsHourly payWeekly payContract workPart timeFor contractorsFlexible hours$20 - $22 per hour
Mercor is seeking AI Safety Experts with fluency in English and Marathi to join our team remotely. The role involves identifying vulnerabilities... ...in AI models through red teaming and generating... ...offers compensation around $20-$22 per hour. Interested applicants can apply...Remote jobEnglish language skillsHourly pay$20 - $22 per hour
...talent with leading AI research labs. Headquartered... .... Position: AI Safety Experts — English & Telugu Type:... ...Compensation: $20–$22/hour Location: Remote Role Responsibilities Red team conversational AI models... ...Application Process (Takes 20–30 mins to complete)...Remote workEnglish language skillsHourly payWeekly payContract workFor contractorsSummer workFlexible hours- ...seeking an Italian Data Trainer for a remote, hourly-paid contractor role. The position focuses on improving the safety and reasoning quality of AI systems by evaluating AI-generated content and... ...proficiency and a minimum C1 level in English, along with experience in Trust...Remote jobEnglish language skillsHourly payFor contractors
- ...Italian Data Trainer for a remote, hourly-paid contractor role aimed at improving AI systems' safety, reliability, and... ...Responsibilities include evaluating AI-generated content, conducting red-teaming, and ensuring... ...Italian proficiency, C1 English proficiency, and experience...Remote jobHourly payFor contractors
- Mercor is seeking an AI Safety Expert fluent in English and Telugu, responsible for red teaming AI models to uncover vulnerabilities. This remote position offers flexible hours with an hourly compensation of $20-$22, paid weekly. Ideal candidates will have red teaming experience...Remote jobEnglish language skillsHourly payWeekly payFlexible hours
$20 - $22 per hour
Mercor is hiring AI Safety Experts fluent in English and Punjabi to connect elite talent... ...labs. The role involves red teaming AI models, generating... ...projects. The position is a remote contract role with compensation ranging from $20 to $22 per hour. Interested candidates should...Remote jobEnglish language skillsHourly payContract work$20 per hour
...Receptionist to assist in training AI models. This role involves evaluating the logic of AI chatbots and... ...Applicants should be fluent in English, detail-oriented, and possess strong... ...with projects and an hourly pay starting at $20+ USD, with bonuses for high-quality...Remote workEnglish language skillsHourly payFlexible hours$20 - $22 per hour
...talent with leading AI research labs.... ...Position: AI Safety Experts — English & Bengali Type... ...Compensation: $20–$22/hour Location: Remote Role Responsibilities Red team conversational AI... ...Process (Takes 20–30 mins to complete)...Remote jobEnglish language skillsContract workSummer work- ...seeking an Italian Data Trainer for a remote position focused on improving AI systems. You will review AI-generated content, ensuring accuracy and safety, and will apply linguistic and cultural judgment across Italian and English. Ideal candidates will have a relevant...Remote jobEnglish language skills
- YO IT Consulting is looking for a remote Italian Data Trainer to enhance AI systems' safety and reasoning quality. Responsibilities include evaluating AI-generated content, conducting safety... ...in Italian along with C1 level English. This role demands experience in Trust...Remote jobEnglish language skillsFlexible hours
$20 - $22 per hour
Location: Remote Fluent Language Skills Required: English & Tamil. Native fluency in English... ...believe the safest AI is the one that’s... ...are assembling a red team for this project -... ...AI systems. Evaluation coverage expands: more... ...customers trust the safety of their AI because...Remote jobEnglish language skillsHourly pay$20 per hour
...talent with leading AI research labs.... ...: Generalist - English & Urdu Type:... ...Compensation: $15–$20/hour Location: Remote Role... ...high-quality human evaluation data by identifying... ...(Takes 20–30 mins to complete... ...com PS: Our team reviews applications...Remote jobEnglish language skillsUrdu language skillsContract workPart timeSummer work- Mercor seeks a red team member to enhance AI safety. You will probe conversational AI models for vulnerabilities... ...teaming experience and be fluent in English and Tamil. You'll contribute... ...trustworthy. This role is entirely remote, providing a unique opportunity to influence...Remote workEnglish language skills
$40 per hour
...cybersecurity professionals to evaluate AI-generated security content... ...This role allows you to work remotely and on your own schedule with... ...projects starting at $40+ per hour. The ideal candidate will have... ...cybersecurity, be fluent in English, and possess strong analytical...Remote jobEnglish language skillsHourly pay- Mercor is seeking an AI Safety Expert who is proficient in both English and Punjabi to join their remote team. The role involves red teaming conversational AI models, generating high-quality datasets, and effectively communicating risks. The ideal candidate has prior experience...Remote jobEnglish language skillsContract work
$40 per hour
...professionals to join our team to help train AI models. In this role, you will evaluate AI-generated... ...time or part-time REMOTE position You’ll... ...Projects are paid hourly starting at $40+ USD... ...testing, red teaming, incident... ...required Fluency in English (native or bilingual...Remote jobHourly payFull timePart time$40 per hour
...seeking a Biology Teacher to join its team remotely to train AI models by evaluating their logic and solving problems... .... Candidates should be fluent in English and possess a strong understanding... ...offers flexible projects with hourly payment starting at $40. #J-18808-...Remote workEnglish language skillsHourly payContract workFlexible hours$40 per hour
...Biology Teacher to join its team and train AI models. This is a remote position where you can... ...complex biology questions and evaluating their logic. Ideal... ...be fluently bilingual in English and possess a strong background... ...competitive at $40+ per hour, with bonuses for high-...Remote workEnglish language skillsHourly payFlexible hours$26 per hour
...consulting firm is seeking a remote contractor to engage in AI red teaming activities. The ideal candidate will work on evaluating conversational AI systems... .... Fluency in English and Spanish is required,... ...cybersecurity. This is a flexible hourly commitment of 10 to 40 hours...Remote jobHourly payFor contractors10 hours per weekFlexible hours$26 per hour
Type: Hourly contract Compensation: $26/hour Location: Remote Commitment: 10-40 hours/week... ...Red-team conversational AI systems using jailbreaks... ...adversarial test cases. Evaluate AI outputs... ...alignment with defined safety guidelines.... ...fluency in both English and Spanish. Prior...Remote jobHourly payContract work$15 - $20 per hour
...Bilingual Competency interview in Urdu to be considered for this role... ...Urdu (native fluency) and English (strong proficiency) Why this... ...matters Your job is to assess Urdu AI-generated responses and... ...Generate high‑quality human evaluation data by identifying response strengths...Remote jobEnglish language skillsUrdu language skillsHourly payContract work- ...technology company is seeking a remote AI Red Teamer to assess conversational... ...vulnerabilities through red teaming exercises. Candidates should be fluent in both English and Brazilian Portuguese, with... ...technical stakeholders. Competitive hourly compensation is offered. #J-188...Remote workEnglish language skillsHourly pay
$30 - $150 per hour
...Mercor Generalist to evaluate AI-generated outputs... ...requires strong English fluency, critical... ...This position is remote, with an expected... ...approximately 10-20 hours per week and an hourly... ...hourly pay range of $30 to $150. Ideal... ...Join a collaborative team in the IT...Remote jobEnglish language skillsHourly payContract workPart time10 hours per week- ...Consulting is seeking a Remote Italian Data Trainer to improve the safety and reliability of AI systems. The role involves evaluating AI-generated content,... ...Italian proficiency, and C1 English proficiency. Experience in Trust & Safety and red-teaming is necessary. The job...Remote jobEnglish language skillsFor contractors
- ...is seeking an Italian Data Trainer for a remote position focused on evaluating AI-generated content. The role involves ensuring safety and quality in AI systems through detailed... ...bachelor's degree, native Italian and C1 English proficiency, and experience in Trust & Safety...Remote jobFor contractors
$32.25 per hour
A tech security company is seeking an AI Security Specialist fluent in English and Arabic to work remotely. Responsibilities include red teaming conversational AI models and identifying... ...us for a flexible role requiring 10-40 hours a week, with compensation set at $32.25...Remote jobEnglish language skillsFlexible hours$65 per hour
A leading AI consulting firm is seeking an AI Tutor... ...freelance role involves evaluating AI models, creating test... ...degree, possess advanced English skills, and have... ...security. With flexible hours, this position allows you to work remotely on challenging AI projects...Remote jobEnglish language skillsPart timeFreelanceFlexible hours$28.74 per hour
...technology firm is seeking an AI Red-Teamer for adversarial testing in a remote capacity. Candidates must be fluent in both English and Brazilian Portuguese,... ...experience in AI red teaming and cybersecurity.... ...offers pay at $28.74 per hour and requires strong communication...Remote jobEnglish language skillsHourly payContract work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Remote | Urdu-English AI Safety Red Team Evaluator — $20-$30/hour. Be the first to apply!
- quality evaluator New York, NY
- work from home social media evaluator New York, NY
- social media evaluator New York, NY
- evaluator New York, NY
- clinical evaluator New York, NY
- program evaluator New York, NY
- nurse evaluator New York, NY
- work from home web search evaluator New York, NY
- program coordinator remote New York, NY
- procurement specialist remote New York, NY



