Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Remote | Urdu-English AI Safety Red Team Evaluator — $20-$30/hour

$20 - $30 per hour

24-MAG

New York, NY
  • Remote job

We are sharing a specialised part-time consulting opportunity for Urdu-English bilingual professionals experienced in AI safety evaluation, red team testing, adversarial review, vulnerability classification, and structured feedback on sensitive text-based AI outputs.

This role supports current and upcoming remote consulting opportunities focused on AI safety evaluation, bilingual red team testing, conversational model assessment, misuse-risk review, vulnerability annotation, and high-quality project execution. Selected professionals will test AI systems using structured adversarial scenarios, identify safety weaknesses, classify risks, and produce clear English-language evaluation artifacts across English and Urdu contexts.

Key Responsibilities

Professionals in this role may contribute to:

Bilingual AI Safety & Red Team Testing

  • Review English and Urdu AI outputs for safety, reliability, bias, misinformation, and harmful-behavior risks
  • Stress-test conversational AI models and agents using structured adversarial scenarios
  • Evaluate model behavior across multi-turn conversations, sensitive topics, and edge-case prompts
  • Identify vulnerabilities that require stronger safety controls, clearer refusals, or improved response quality

Vulnerability Classification & Risk Review

  • Annotate failures, classify vulnerabilities, and flag recurring safety patterns
  • Apply taxonomies, benchmarks, and project-specific playbooks to keep testing consistent
  • Assess misuse cases, bias exploitation, prompt-injection scenarios, and socio-technical risk patterns at a high level
  • Generate high-quality human evaluation data through careful review and structured judgment

Reproducible Documentation & Evaluation Artifacts

  • Produce clear reports, datasets, test cases, and written summaries that support model improvement
  • Document findings reproducibly so results can be reviewed, compared, and acted upon
  • Explain risks clearly for both technical and non-technical audiences
  • Maintain accuracy, consistency, and strong attention to detail across submitted evaluations

Ideal Profile

Strong candidates may have:

  • Native-level fluency in both English and Urdu
  • Prior experience in AI red teaming, adversarial testing, cybersecurity, trust and safety, socio-technical risk review, or conversational AI evaluation
  • Ability to think adversarially while staying structured, careful, and methodical
  • Experience using frameworks, benchmarks, or rubrics rather than unstructured testing alone
  • Strong written communication skills and ability to explain safety findings clearly
  • Comfort reviewing text-based content involving sensitive topics under clear guidelines
  • Adaptability across project types, safety categories, and evaluation workflows

Educational Background

  • Formal degree requirements may vary based on project needs
  • Backgrounds in AI safety, cybersecurity, linguistics, policy, trust and safety, social science, psychology, writing, data evaluation, or technical analysis may be highly relevant
  • Practical experience in red team testing, model evaluation, content risk analysis, or structured review work may also be valuable

Nice to Have

  • Experience with adversarial ML concepts, jailbreak datasets, prompt injection, RLHF/DPO attack patterns, or model behavior testing
  • Cybersecurity experience such as penetration testing, exploit analysis, reverse engineering, or security assessment
  • Socio-technical risk experience involving harassment, misinformation, abuse analysis, bias testing, or conversational AI safety
  • Creative probing background, including psychology, acting, writing, role-play design, or unconventional adversarial thinking
  • Experience producing reproducible reports, labeled datasets, structured risk notes, or benchmark-style evaluation artifacts

Why This Opportunity

  • Apply Urdu-English bilingual expertise to structured AI safety and red team evaluation work
  • Contribute to stronger, safer, and more reliable AI systems through careful adversarial testing
  • Work on flexible assignments aligned with language skills, safety judgment, and structured analysis
  • Build experience in human data-driven AI safety evaluation and bilingual risk review
  • Remote structure with competitive hourly compensation

Contract Details

  • Independent contractor role
  • Fully remote with flexible scheduling
  • Eligible professionals may be based in approved project locations depending on project needs
  • Native-level English and Urdu fluency are required for project work
  • Work is text-based and may involve sensitive topics such as bias, misinformation, harassment, or harmful-behavior risks
  • Topic areas will be communicated before exposure to content, and participation in higher-sensitivity projects may depend on candidate comfort and project fit
  • Part-time commitment depending on project availability
  • Competitive rates between $20–$30 per hour depending on expertise and project scope
  • Weekly payments via Stripe or Wise
  • Projects may be extended, shortened, or adjusted depending on scope and performance
  • Work will not involve access to confidential or proprietary information from any employer, client, or institution

About the Platform

This opportunity is available through 24-MAG LLC. We connect experienced professionals with remote consulting opportunities across technical, evaluation, and project-based workstreams.

By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy: .

Vacancy posted 9 days ago
Similar jobs that could be interesting for youBased on the Remote | Urdu-English AI Safety Red Team Evaluator — $20-$30/hour in New York, NY vacancy
  • $20 - $22 per hour

     ...talent with leading AI research labs. Headquartered...  .... Position: AI Safety Experts — English & Urdu Type: Contract...  ...Compensation: $20–$22/hour Location: Remote Role Responsibilities Red team conversational AI...  ...Process (Takes 20–30 mins to complete)... 
    Remote work
    English language skills
    Urdu language skills
    Contract work
    Summer work

    Mercor

    San Francisco, CA
    14 days ago
  • $10 - $20 per hour

     ...consulting opportunity for Urdu-English bilingual...  ...experienced in language evaluation, LLM response...  ...and upcoming remote consulting opportunities...  ...on Urdu-language AI response...  ...with competitive hourly compensation...  ...rates between $10–$20 per hour depending... 
    Remote job
    English language skills
    Urdu language skills
    Hourly pay
    Weekly pay
    Contract work
    Part time
    For contractors
    Flexible hours

    24-MAG

    New York, NY
    9 days ago
  • $20 - $22 per hour

    Mercor is seeking AI Safety Experts with fluency in English and Marathi to join our team remotely. The role involves identifying vulnerabilities...  ...in AI models through red teaming and generating...  ...offers compensation around $20-$22 per hour. Interested applicants can apply... 
    Remote job
    English language skills
    Hourly pay

    Mercor

    San Francisco, CA
    5 days ago
  • $20 - $22 per hour

     ...talent with leading AI research labs. Headquartered...  .... Position: AI Safety Experts — English & Telugu Type:...  ...Compensation: $20–$22/hour Location: Remote Role Responsibilities Red team conversational AI models...  ...Application Process (Takes 20–30 mins to complete)... 
    Remote work
    English language skills
    Hourly pay
    Weekly pay
    Contract work
    For contractors
    Summer work
    Flexible hours

    Mercor

    San Francisco, CA
    14 days ago
  •  ...seeking an Italian Data Trainer for a remote, hourly-paid contractor role. The position focuses on improving the safety and reasoning quality of AI systems by evaluating AI-generated content and...  ...proficiency and a minimum C1 level in English, along with experience in Trust... 
    Remote job
    English language skills
    Hourly pay
    For contractors

    YO IT Consulting

    Newark, NJ
    4 days ago
  •  ...Italian Data Trainer for a remote, hourly-paid contractor role aimed at improving AI systems' safety, reliability, and...  ...Responsibilities include evaluating AI-generated content, conducting red-teaming, and ensuring...  ...Italian proficiency, C1 English proficiency, and experience... 
    Remote job
    Hourly pay
    For contractors

    YO IT Consulting

    San Francisco, CA
    3 days ago
  • Mercor is seeking an AI Safety Expert fluent in English and Telugu, responsible for red teaming AI models to uncover vulnerabilities. This remote position offers flexible hours with an hourly compensation of $20-$22, paid weekly. Ideal candidates will have red teaming experience... 
    Remote job
    English language skills
    Hourly pay
    Weekly pay
    Flexible hours

    Mercor

    San Francisco, CA
    5 days ago
  • $20 - $22 per hour

    Mercor is hiring AI Safety Experts fluent in English and Punjabi to connect elite talent...  ...labs. The role involves red teaming AI models, generating...  ...projects. The position is a remote contract role with compensation ranging from $20 to $22 per hour. Interested candidates should... 
    Remote job
    English language skills
    Hourly pay
    Contract work

    Mercor

    San Francisco, CA
    4 days ago
  • $20 per hour

     ...Receptionist to assist in training AI models. This role involves evaluating the logic of AI chatbots and...  ...Applicants should be fluent in English, detail-oriented, and possess strong...  ...with projects and an hourly pay starting at $20+ USD, with bonuses for high-quality... 
    Remote work
    English language skills
    Hourly pay
    Flexible hours

    DataAnnotation

    New York, NY
    2 days ago
  • $20 - $22 per hour

     ...talent with leading AI research labs....  ...Position: AI Safety Experts — English & Bengali Type...  ...Compensation: $20–$22/hour Location: Remote Role Responsibilities Red team conversational AI...  ...Process (Takes 20–30 mins to complete)... 
    Remote job
    English language skills
    Contract work
    Summer work

    Mercor

    New York, NY
    5 days ago
  •  ...seeking an Italian Data Trainer for a remote position focused on improving AI systems. You will review AI-generated content, ensuring accuracy and safety, and will apply linguistic and cultural judgment across Italian and English. Ideal candidates will have a relevant... 
    Remote job
    English language skills

    YO IT Consulting

    San Marino, CA
    1 day ago
  • YO IT Consulting is looking for a remote Italian Data Trainer to enhance AI systems' safety and reasoning quality. Responsibilities include evaluating AI-generated content, conducting safety...  ...in Italian along with C1 level English. This role demands experience in Trust... 
    Remote job
    English language skills
    Flexible hours

    YO IT Consulting

    New York, NY
    5 days ago
  • $20 - $22 per hour

    Location: Remote Fluent Language Skills Required: English & Tamil. Native fluency in English...  ...believe the safest AI is the one that’s...  ...are assembling a red team for this project -...  ...AI systems. Evaluation coverage expands: more...  ...customers trust the safety of their AI because... 
    Remote job
    English language skills
    Hourly pay

    Mercor

    Fort Worth, TX
    2 days ago
  • $20 per hour

     ...talent with leading AI research labs....  ...: Generalist - English & Urdu Type:...  ...Compensation: $15–$20/hour Location: Remote Role...  ...high-quality human evaluation data by identifying...  ...(Takes 20–30 mins to complete...  ...com PS: Our team reviews applications... 
    Remote job
    English language skills
    Urdu language skills
    Contract work
    Part time
    Summer work

    Mercor

    San Francisco, CA
    24 days ago
  • Mercor seeks a red team member to enhance AI safety. You will probe conversational AI models for vulnerabilities...  ...teaming experience and be fluent in English and Tamil. You'll contribute...  ...trustworthy. This role is entirely remote, providing a unique opportunity to influence... 
    Remote work
    English language skills

    Mercor

    Fort Worth, TX
    2 days ago
  • $40 per hour

     ...cybersecurity professionals to evaluate AI-generated security content...  ...This role allows you to work remotely and on your own schedule with...  ...projects starting at $40+ per hour. The ideal candidate will have...  ...cybersecurity, be fluent in English, and possess strong analytical... 
    Remote job
    English language skills
    Hourly pay

    DataAnnotation

    California, MO
    5 days ago
  • Mercor is seeking an AI Safety Expert who is proficient in both English and Punjabi to join their remote team. The role involves red teaming conversational AI models, generating high-quality datasets, and effectively communicating risks. The ideal candidate has prior experience... 
    Remote job
    English language skills
    Contract work

    Mercor

    New York, NY
    5 days ago
  • $40 per hour

     ...professionals to join our team to help train AI models. In this role, you will evaluate AI-generated...  ...time or part-time REMOTE position You’ll...  ...Projects are paid hourly starting at $40+ USD...  ...testing, red teaming, incident...  ...required Fluency in English (native or bilingual... 
    Remote job
    Hourly pay
    Full time
    Part time

    DataAnnotation

    Denver, CO
    4 days ago
  • $40 per hour

     ...seeking a Biology Teacher to join its team remotely to train AI models by evaluating their logic and solving problems...  .... Candidates should be fluent in English and possess a strong understanding...  ...offers flexible projects with hourly payment starting at $40. #J-18808-... 
    Remote work
    English language skills
    Hourly pay
    Contract work
    Flexible hours

    DataAnnotation

    Iowa, LA
    8 hours ago
  • $40 per hour

     ...Biology Teacher to join its team and train AI models. This is a remote position where you can...  ...complex biology questions and evaluating their logic. Ideal...  ...be fluently bilingual in English and possess a strong background...  ...competitive at $40+ per hour, with bonuses for high-... 
    Remote work
    English language skills
    Hourly pay
    Flexible hours

    DataAnnotation

    El Paso, TX
    8 hours ago
  • $26 per hour

     ...consulting firm is seeking a remote contractor to engage in AI red teaming activities. The ideal candidate will work on evaluating conversational AI systems...  .... Fluency in English and Spanish is required,...  ...cybersecurity. This is a flexible hourly commitment of 10 to 40 hours... 
    Remote job
    Hourly pay
    For contractors
    10 hours per week
    Flexible hours

    Crossing Hurdles

    New York, NY
    4 days ago
  • $26 per hour

    Type: Hourly contract Compensation: $26/hour Location: Remote Commitment: 10-40 hours/week...  ...Red-team conversational AI systems using jailbreaks...  ...adversarial test cases. Evaluate AI outputs...  ...alignment with defined safety guidelines....  ...fluency in both English and Spanish. Prior... 
    Remote job
    Hourly pay
    Contract work

    Crossing Hurdles

    New York, NY
    4 days ago
  • $15 - $20 per hour

     ...Bilingual Competency interview in Urdu to be considered for this role...  ...Urdu (native fluency) and English (strong proficiency) Why this...  ...matters Your job is to assess Urdu AI-generated responses and...  ...Generate high‑quality human evaluation data by identifying response strengths... 
    Remote job
    English language skills
    Urdu language skills
    Hourly pay
    Contract work

    Mercor

    Redwood City, CA
    4 days ago
  •  ...technology company is seeking a remote AI Red Teamer to assess conversational...  ...vulnerabilities through red teaming exercises. Candidates should be fluent in both English and Brazilian Portuguese, with...  ...technical stakeholders. Competitive hourly compensation is offered. #J-188... 
    Remote work
    English language skills
    Hourly pay

    Crossing Hurdles

    New York, NY
    3 days ago
  • $30 - $150 per hour

     ...Mercor Generalist to evaluate AI-generated outputs...  ...requires strong English fluency, critical...  ...This position is remote, with an expected...  ...approximately 10-20 hours per week and an hourly...  ...hourly pay range of $30 to $150. Ideal...  ...Join a collaborative team in the IT... 
    Remote job
    English language skills
    Hourly pay
    Contract work
    Part time
    10 hours per week

    Crossing Hurdles

    New York, NY
    4 days ago
  •  ...Consulting is seeking a Remote Italian Data Trainer to improve the safety and reliability of AI systems. The role involves evaluating AI-generated content,...  ...Italian proficiency, and C1 English proficiency. Experience in Trust & Safety and red-teaming is necessary. The job... 
    Remote job
    English language skills
    For contractors

    YO IT Consulting

    Boston, MA
    3 days ago
  •  ...is seeking an Italian Data Trainer for a remote position focused on evaluating AI-generated content. The role involves ensuring safety and quality in AI systems through detailed...  ...bachelor's degree, native Italian and C1 English proficiency, and experience in Trust & Safety... 
    Remote job
    For contractors

    YO IT Consulting

    Miami, FL
    1 day ago
  • $32.25 per hour

    A tech security company is seeking an AI Security Specialist fluent in English and Arabic to work remotely. Responsibilities include red teaming conversational AI models and identifying...  ...us for a flexible role requiring 10-40 hours a week, with compensation set at $32.25... 
    Remote job
    English language skills
    Flexible hours

    Crossing Hurdles

    New York, NY
    4 days ago
  • $65 per hour

    A leading AI consulting firm is seeking an AI Tutor...  ...freelance role involves evaluating AI models, creating test...  ...degree, possess advanced English skills, and have...  ...security. With flexible hours, this position allows you to work remotely on challenging AI projects... 
    Remote job
    English language skills
    Part time
    Freelance
    Flexible hours

    Mindrift

    San Antonio, TX
    5 days ago
  • $28.74 per hour

     ...technology firm is seeking an AI Red-Teamer for adversarial testing in a remote capacity. Candidates must be fluent in both English and Brazilian Portuguese,...  ...experience in AI red teaming and cybersecurity....  ...offers pay at $28.74 per hour and requires strong communication... 
    Remote job
    English language skills
    Hourly pay
    Contract work

    Crossing Hurdles

    New York, NY
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Remote | Urdu-English AI Safety Red Team Evaluator — $20-$30/hour. Be the first to apply!