AI Red Teamer, LLM Generalist
Handshake
About the Role As an AI Red Teamer, you will stress-test large language models by intentionally trying to break them. Rather than checking whether an answer is correct, you will design creative, adversarial prompts that expose vulnerabilities: unsafe content, bias, broken guardrails, hallucinations, prompt injection weaknesses, and unexpected behaviors. Your work directly supports AI safety and model robustness for leading research labs. This is a generalist red teaming role. You will probe models across the full spectrum of risk categories, including content safety, CBRN (chemical, biological, radiological, nuclear), cybersecurity, persuasion and influence operations, child safety, self-harm, over-companionship, and regulatory compliance. Red teaming may span text, image, voice, and agentic model capabilities depending on project needs. This role requires creativity, curiosity, and an ability to think like an adversary while operating with strong ethical judgment. Craft creative prompts and multi-turn scenarios to stress-test AI guardrails across diverse risk categories Discover ways around safety filters, restrictions, and defenses using jailbreak, evasion, and prompt injection techniques Explore edge cases to provoke disallowed, harmful, or incorrect outputs Evaluate and score model responses against structured harm taxonomies and severity rubrics Document experiments clearly, including what you tried, why you tried it, and what it revealed Review and refine adversarial prompts generated by other team members Contribute to harm taxonomy development, calibration exercises, and inter-rater reliability work Collaborate with engineers, data scientists, and researchers to share findings and strengthen defenses Work with potentially disturbing content on a regular basis (see Content Warning below) Stay current on jailbreaks, attack methods, and evolving model behaviors Desired Capabilities Strong hands-on experience using multiple LLMs (ChatGPT, Claude, Gemini, open-source models, etc.) Intuition for crafting adversarial prompts; familiarity with jailbreak or evasion techniques is a strong plus Creative, adversarial problem-solving skills Clear and thoughtful written communication Strong ethical judgment and the ability to separate adversarial thinking from personal values Self‑directed, collaborative, and comfortable in feedback‑heavy environments Curiosity, persistence, and comfort with frequent failure in experimentation Extra Credit Familiarity with Python or other scripting languages Experience working with LLM APIs or evaluation tooling Comfort with structured data annotation and rubric‑based scoring Prior work in trust and safety, content moderation, QA, or security research Subject matter expertise in any high‑risk domain (cybersecurity, chemistry, biology, medicine, law, finance, etc.) You Will Thrive Here If You treat every model response as a hypothesis to challenge You can switch between creative free‑association and rigorous documentation in the same session You go deep into unusual interests (fandoms, niche internet cultures, gaming exploits, Wikipedia rabbit holes, etc.) You come from a creative background: writing, visual art, improv, puzzle design, or similar You are energized by finding the thing nobody else thought to try You are genuinely passionate about AI and follow the space closely Content Warning This role involves regular and deliberate exposure to harmful content. You will encounter and intentionally generate content involving violence, self‑harm, hate speech, sexually explicit material, child safety scenarios, and other categories of harmful output as part of structured adversarial testing. Candidates must be able to engage with this material professionally and sustainably. Support resources are available. #J-18808-Ljbffr
- ...In 2025, we started Handshake AI and built the fastest-growing... ...About the Role As an AI Red Teamer, you will stress-test large language... ...research labs. This is a generalist red teaming role. You will... ...Experience working with LLM APIs or evaluation tooling...Suggested
- Handshake is looking for an AI Red Teamer in Seattle, USA, to stress-test large language models by designing adversarial prompts that expose vulnerabilities. This role requires creativity and the ability to think like an adversary while maintaining strong ethical judgment...Suggested
- ...institutions. In 2025, we started Handshake AI and built the fastest-growing AI data... ...largest scale. About the Role As a CBRNE Red Teamer, you will evaluate whether AI models appropriately... ...with Python or scripting languages, LLM APIs, or evaluation tooling Published research...SuggestedImmediate start
- Handshake is hiring a CBRNE Red Teamer in Seattle to evaluate how AI models handle queries related to hazardous threats. The role requires crafting adversarial scenarios and analyzing model outputs to identify safety gaps. The ideal candidate will have a graduate-level...Suggested
$184k - $287.5k
...requisition id: JR2016042Join our team at NVIDIA and help bring AI solutions to our largest customers. We are seeking an expert Solutions... ...performance aspects related to tasks like large scale LLM training and inference.* Conducting regular technical customer meetings...Suggested- ...building the future of trustworthy AI. Grounded in behavioral science... ...work in areas like RL gyms, red teaming, and benchmarking, we... ...experts, annotators, reviewers, red teamers, contractors, and quality... ...sumé. Experience in AI safety, LLM evaluation, or trust & safety operations...Contract workFor contractors
$230k - $280k
...CTEM). The HackerOne Platform unites agentic AI solutions with the ingenuity of the world's... ...disclosure, agentic pentesting, AI red teaming, and code security, HackerOne delivers... ...memory systems, RAG, long-horizon tasks, and LLM-based models-into the HackerOne platform, applying...ApprenticeshipWork at officeLocal areaRemote workFlexible hoursShift work1 day per week- ...AI Lead San Francisco / Seattle , PST time. (On-site) Primary focus for someone with good audio/video/chat interface experience... ...Develop scalable model pipelines and reusable AI components GenAI & LLM (If applicable) Build solutions on Azure OpenAI, OpenAI,...Contract work
- A leading technology company in Seattle seeks a Senior Software Engineer to join their AI Networking team. This role involves building ML tools for optimizing AI workloads across data centers, focusing on large-scale deep learning. Candidates should have a PhD or equivalent...
$166.8k
...real-world?threats to our nation and the world. The AI and Data Analytics Division, part of NSD, combines... ...hyperspectral imagery. Hands on experience with LLM/LVM/Foundation Model and Frontier AI evaluation, red teaming, uncertainty analysis, or safety control implementation...For contractorsWork experience placementWork at officeLocal areaRemote workRelocation packageFlexible hours$123.5k - $185.3k
...thrive. Role Overview F5 is expanding its AI Center of Excellence and is hiring a Specialist... ...deep expertise in AI, Data Science, and LLM behavior to support our AI Runtime Security... ...outcomes from proofs of concept (POCs), red-teaming exercises, and runtime guardrail evaluations...Local area$190k - $230k
...CTEM). The HackerOne Platform unites agentic AI solutions with the ingenuity of the world's... ...disclosure, agentic pentesting, AI red teaming, and code security, HackerOne delivers... ..., Memory systems, RAG, Long horizon tasks, LLM-based models into the HackerOne platform, applying...ApprenticeshipWork at officeLocal areaRemote workFlexible hoursShift work1 day per week$77 - $98 per hour
...data stack (dbt, snowflake/Big Query, version control, CI/CD pipelines) Experience and familiarity in DataOps/MLOps frameworks, AI/LLM tooling such as OpenAI ,Langchain, Hugging face Transformers Strong communication and stakeholder engagement skills Experience...Hourly payFull timeContract workTemporary workWork experience placementImmediate startWorldwideFlexible hours$185k - $220k
...Principal Data Scientist job at Curative AI. Bellevue, WA. About Curative AI, Inc. Curative AI, Inc. is an ambitious innovative... ...Natural Language Processing (NLP) and/or Large Language Models (LLM) highly desired ~ Excellent understanding of statistical methods...Full timeH1b- ...AI / Emerging Tech Security Analyst (AI Training) About the Role What if your security... ...week What You'll Do Analyze AI and LLM security scenarios to understand how models... ...-on experience with penetration testing, red teaming, or vulnerability research...Hourly payOngoing contractContract workFreelanceRemote workFlexible hours
$30 - $60 per hour
Portland Seed Fund is seeking part-time Red Teaming Experts in Seattle to support AI safety evaluation campaigns. Candidates will design and simulate AI conversations, identifying risks and evaluating performance. This role requires strong analytical skills and creative...Part time- ...Senior Machine Learning Engineer (AI Training) About the Role What if your deep expertise in machine learning could directly... ...Author complex, high-fidelity reasoning traces that show how an LLM should plan, use tools, and make decisions to solve sophisticated...Hourly payOngoing contractContract workFreelanceRemote workFlexible hours
- ...power of data to drive decisions, create exceptional customer experiences, fight fraud, and power AI innovation. We are a small, collaborative group that values generalists over specialists, simple solutions over clever ones, and delivering results over talking. You...Work at office
$141.9k - $190.3k
...learning areas with primary focus on specialization in generative AI applications, including generative mixed media, language models,... ...generative video, generative image, generative audio, chatbots, LLM applications, and mixed agentic workflows. Work will additionally...$201.3k - $302.2k
...Overview Seattle, Washington, United States Machine Learning and AI Apple is where individual imaginations gather together, committing... .... We leverage cutting-edge technologies like Visual Search and LLM to empower user-facing features like Visual Look-up and Visual Intelligence...Relocation- ...bidding optimization recommendations and insights oDevelop LLM based solutions using prompt engineering embeddings and basic RAG... ...scalable explainable and production ready following Responsible AI and governance standards Business Collaboration...Contract work
- YO IT Consulting is looking for a Red-Teaming Quality Assurance Lead to ensure quality and consistency across AI red-teaming projects. This remote position involves evaluating AI-generated evaluations and providing feedback to maintain quality standards. The ideal candidate...Remote job
- KPMG is looking for a Senior Associate, AI Engineer to develop GenAI applications and integrate solutions within their Advisory Services. The role requires at least three years of experience in AI/ML, along with proficiency in Python and familiarity with cloud platforms...
$160k - $200k
Madrona Venture Labs is seeking experienced engineers passionate about AI to design and build LLM powered systems in Seattle, Washington. The role offers a competitive salary between $160K and $200K and is hybrid. Responsibilities include developing production AI systems...Flexible hours$300k - $320k
...a Technical Program Manager to lead our AI model evaluation initiatives across multiple... ...who are comfortable acting as adaptable generalists who add value fast. We excel at... ...including designing test suites, coordinating red team exercises, and analyzing results Create...Work at officeHome officeVisa sponsorshipRelocation package- ...chronic conditions a highly personalized experience enhanced by AI and powered by mobile apps, connected devices, and a team of health... ...Partner directly with our Applied AI function to ensure every LLM answer is anchored in your data models and canonical metrics. Evaluate...Full timeTemporary workPart timeWork at officeRemote workRelocationMonday to ThursdayFlexible hours
- ...Seattle, Washington, United States Machine Learning and AI Do you get excited by driving product impact via measurement and evaluation... ...methods as needed, including prompt engineering and building LLM as judges. Conduct analysis that includes data collection and quality...Work experience placement
$166k - $203k
...CTEM). The HackerOne Platform unites agentic AI solutions with the ingenuity of the world's... ...disclosure, agentic pentesting, AI red teaming, and code security, HackerOne delivers... ...building and integrating AI capabilities such as LLM-powered workflows, RAG pipelines, or...ApprenticeshipWork at officeLocal areaRemote workFlexible hoursShift work1 day per week$172.5k - $260.1k
...Engineering Job Details About Salesforce Salesforce is the #1 AI CRM, where humans with agents drive customer success together.... ..., forecasting, recommendation) Design AI agents that combine LLM reasoning, tool usage, and ML decisioning Implement reusable...$139.5k - $258.1k
Seattle, Washington, United States Machine Learning and AI The Apple Knowledge Quality Team is building the next-generation of machine... ...Experience in building evaluation solution powered by LLM is a plus Experience in designing and developing large-scale data...Relocation
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Red Teamer, LLM Generalist. Be the first to apply!


