AI Red Teamer, CBRNE
Handshake
About Handshake Handshake was founded on a simple belief that everyone deserves a path to a great career, regardless of where they went to school or who they know. Today, we power 25 million job seekers, 1 million+ employers, and 1,600 educational institutions. In 2025, we started Handshake AI and built the fastest-growing AI data business in history. We work directly with frontier AI lab researchers to create evaluations, publish benchmarks, and push the boundary of data. We’ve grown from $0 to ~$1B run rate and pay ~$60M to over 30K individuals every month. Why join Handshake now: Shape how every career evolves in the AI economy, at global scale, with impact your friends, family and peers can see and feel Partner hand-in-hand with world-class AI labs, Fortune 500 partners and the world’s top educational institutions Work together with engineers, scientists, operators, and more from Palantir, Meta, Scale AI, and former YC founders Build a massive, fast-growing business with billions in revenue About Handshake AI Human data is the core infrastructure to AI advancement. Frontier AI labs currently improve model capabilities with various data-intensive post-training techniques. We believe that data spend for AI training will increase by 3-5x in the next few years and continue for much longer as models take on new domains. Handshake AI supports all of the frontier AI labs, working on their most complex data at the largest scale. About the Role As a CBRNE Red Teamer, you will evaluate whether AI models appropriately handle queries related to chemical, biological, radiological, nuclear, and explosive threats. Your job is to probe models for dangerous knowledge gaps in their safety guardrails, testing whether they can be manipulated into providing meaningful uplift toward the creation, acquisition, or deployment of weapons or hazardous materials. This work sits at the intersection of AI safety and national security. You will use your domain expertise to craft realistic, technically grounded adversarial scenarios that stress-test model defenses far beyond what a non-expert could attempt. The goal is not to generate harmful content, but to find and document the places where models fail to refuse, hedge, or redirect appropriately so that labs can fix them before those failures reach the real world. This role requires deep subject matter expertise in at least one CBRNE domain, strong ethical judgment, and the ability to think like a sophisticated threat actor while operating within a structured evaluation framework. Day-to-Day Responsibilities Design technically grounded adversarial prompts that test whether models provide meaningful uplift toward CBRNE threats Evaluate model outputs for technical accuracy, assessing whether responses contain genuinely dangerous information versus superficial or publicly available knowledge Probe dual-use knowledge boundaries, testing how models handle queries that blend legitimate scientific, medical, or industrial use cases with potential weapons applications Test multi-step and multi-turn attack chains that simulate how a motivated actor might extract dangerous information incrementally Score model responses against structured harm taxonomies and severity rubrics calibrated to real-world risk Document findings with clear technical reasoning, including what a response gets right, what it gets wrong, and why the failure matters Identify and articulate the difference between information that is freely available in open literature and information that constitutes genuine uplift beyond baseline Contribute to the development and refinement of CBRNE-specific evaluation frameworks and threat models Collaborate with other red teamers, AI researchers, and policy teams to translate findings into actionable model improvements Stay current on evolving model capabilities, jailbreak techniques, and relevant developments in your domain Desired Capabilities Core Graduate-level education or equivalent professional experience in a relevant CBRNE field (chemistry, biochemistry, microbiology, virology, nuclear physics, radiochemistry, materials science, munitions/ordnance, chemical engineering, or closely related disciplines) Ability to evaluate the technical accuracy and real-world consequence of model outputs in your domain Understanding of dual-use research concerns and the distinction between open-source knowledge and operationally significant uplift Strong hands-on experience using multiple LLMs (ChatGPT, Claude, Gemini, open-source models, etc.) Creative, adversarial problem-solving skills Clear and precise written communication, including the ability to explain technical risk to non-specialist audiences Strong ethical judgment and the ability to separate adversarial thinking from personal values Self-directed, collaborative, and comfortable in feedback-heavy environments Nice to Have Active or prior security clearance (Secret, Top Secret, or SCI) Experience in threat assessment, WMD analysis, intelligence analysis, or arms control verification Background in biosafety/biosecurity, chemical safety, nuclear nonproliferation, or explosive ordnance disposal Familiarity with relevant regulatory frameworks (CWC, BWC, IAEA safeguards, ATF regulations, Export Administration Regulations) Experience in red teaming, penetration testing, or structured adversarial evaluation in any context Familiarity with Python or scripting languages, LLM APIs, or evaluation tooling Published research or professional presentations in a relevant CBRNE domain Prior work in trust and safety, content moderation, or AI evaluation You Will Thrive Here If You have spent years building deep expertise in a CBRNE-relevant field and want to apply that knowledge to AI safety You can look at a model response about synthesis routes, enrichment processes, or dispersal mechanisms and immediately assess whether it crosses the line from textbook to actionable You think in attack trees and threat models, not just individual prompts You are comfortable working at the boundary between helpful scientific information and genuinely dangerous knowledge You care about getting this right because you understand what the consequences of getting it wrong look like Content Warning This role involves regular and deliberate engagement with sensitive CBRNE-related content. You will craft and evaluate scenarios involving weapons of mass destruction, toxic industrial chemicals, biological agents, radiological and nuclear materials, and explosive devices. All work is conducted within a structured evaluation framework with strict ethical guidelines and operational security protocols. Candidates must be able to engage with this material professionally and sustainably. #J-18808-Ljbffr Handshake
- Handshake is hiring a CBRNE Red Teamer in Seattle to evaluate how AI models handle queries related to hazardous threats. The role requires crafting adversarial scenarios and analyzing model outputs to identify safety gaps. The ideal candidate will have a graduate-level...Suggested
- ...About the Role As an AI Red Teamer, you will stress-test large language models by intentionally trying to break them. Rather than checking whether an answer is correct, you will design creative, adversarial prompts that expose vulnerabilities: unsafe content, bias, broken...Suggested
- Handshake is looking for an AI Red Teamer in Seattle, USA, to stress-test large language models by designing adversarial prompts that expose vulnerabilities. This role requires creativity and the ability to think like an adversary while maintaining strong ethical judgment...Suggested
$30 - $60 per hour
Portland Seed Fund is seeking part-time Red Teaming Experts in Seattle to support AI safety evaluation campaigns. Candidates will design and simulate AI conversations, identifying risks and evaluating performance. This role requires strong analytical skills and creative...SuggestedPart time- YO IT Consulting is looking for a Red-Teaming Quality Assurance Lead to ensure quality and consistency across AI red-teaming projects. This remote position involves evaluating AI-generated evaluations and providing feedback to maintain quality standards. The ideal candidate...SuggestedRemote job
- ...mpathic is building the future of trustworthy AI. Grounded in behavioral science and human-... ...on our work in areas like RL gyms, red teaming, and benchmarking, we are creating... ...teams of experts, annotators, reviewers, red teamers, contractors, and quality assurance personnel...Contract workFor contractors
$139.5k - $258.1k
...and Services We are the Apple Services Engineering (ASE) Security Red Team. We focus on deep technical security review work of critical... ...review depth and quality. We are growing our team and looking an AI Security Engineer to lead deep reviews that identify meaningful...Relocation$230k - $280k
...Exposure Management (CTEM). The HackerOne Platform unites agentic AI solutions with the ingenuity of the world's largest community of... ...like bug bounty, vulnerability disclosure, agentic pentesting, AI red teaming, and code security, HackerOne delivers measurable,...ApprenticeshipWork at officeLocal areaRemote workFlexible hoursShift work1 day per week$208k - $251.03k
BBG Ventures, LLC is seeking a Staff AI Security Engineer in Seattle to define and evolve the AI security strategy protecting sensitive mental health data. This hybrid role requires being in the office 2-3 days a week. Ideal candidates will bring 10+ years of software...Work at office2 days per week3 days per week$30 per hour
...connectivity and cloud infrastructure is limited, as well as areas where data needs to be processed locally for real-time analytics and AI at the edge. We’re looking to bring on the most brilliant minds to help further our mission of bridging the digital divide with...InternshipWork at officeRemote work- ...exceptional and forward-thinking leader to drive the next generation of AI-powered Document Verification and Biometrics solutions. This role... ...attack detection for multimodal and agent-based systems Agentic red teaming , proactively identifying and mitigating vulnerabilities...
$123.5k - $185.3k
...where each individual can thrive. Role Overview F5 is expanding its AI Center of Excellence and is hiring a Specialist Solutions... ...testing results—particularly outcomes from proofs of concept (POCs), red-teaming exercises, and runtime guardrail evaluations. You will act...Local area- Role Overview The Security Product Lead - Product Security & AI Security is responsible for defining the strategic direction, roadmap... ...required to deliver roadmap objectives (e.g., implementing an AI Red Team program, rolling out a new static analysis tool). Define milestones...Remote work
$185k - $220k
...Principal Data Scientist job at Curative AI. Bellevue, WA. About Curative AI, Inc. Curative AI, Inc. is an ambitious innovative early‑stage startup revolutionizing the healthcare industry through cutting‑edge AI‑powered SaaS solutions. We are currently delivering...Full timeH1b$200k - $245k
...future. Unleash the power of you! What You'll Be Doing: We are seeking an analytical and innovative Data Scientist to join our Data & AI team. You will play a key role in developing and deploying advanced machine learning models to solve real-world business challenges....Worldwide- ...ll work with leading companies across industries, helping them shape their hybrid cloud and AI journeys. With support from our strategic partners, robust IBM technology, and Red Hat, you’ll have the tools to drive meaningful change and accelerate client impact. At IBM Consulting...Worldwide
- ...The AI/ML Solutions Architect will be instrumental in designing and implementing end-to-end artificial intelligence and machine learning solutions for a key Randstad client in the DC area. This role requires an expert-level blend of advanced AI/ML model development (including...Permanent employmentContract workLocal area
$156k - $312k
...systems involving machine learning, supervised and unsupervised and deep learning. Hands on experience in Computer Visions and NLP. Gen AI, Agentic AI Lead, mentor and?grow a team of data scientists, ML engineers and data analysts dedicated to Marketplace initiatives...Full timeTemporary workPart time$1,400 per month
Overview We are hiring an experienced AI Deep Learning Engineer to join our team in Washington. In this role you will design, develop, and implement deep learning algorithms for various projects, analyze and interpret complex data sets, and collaborate with software engineers...Full timeWork at officeLocal areaFree visa- ...Job Description Job Description Aircall is a unicorn, AI-powered customer communications platform used by 22,000+ companies worldwide to drive revenue, resolve issues faster, and scale customer-facing teams. We’re redefining customer communications by bringing voice...Worldwide
- ...off fires into systemic fixes. Drive design and implementation of AI (Agentic) workflows for data quality and analytics. Influence... ...prioritize inclusion, collaboration, and growth over hierarchy or red tape. Some of our benefits include: Flexible PTO - take time when...Live inWork at officeWork from homeWorldwideFlexible hours
$300k - $320k
...the role: We are seeking a Technical Program Manager to lead our AI model evaluation initiatives across multiple workstreams. This role... ...evaluations, including designing test suites, coordinating red team exercises, and analyzing results Create and manage dashboards...Work at officeHome officeVisa sponsorshipRelocation package$186.1k - $300.55k
...only detect incidents in real-time but to troubleshoot and resolve them autonomously. If you are passionate about applying complex AI architectures to massive datasets (billions of telemetry points) to solve real-world reliability challenges, this is the role for you...Work at officeLocal areaRemote work2 days per week$237.5k - $321.3k
...that deliver measurable business and customer impact. This highly visible role works directly with senior leaders to define and deliver AI‑native products that improve and reinvent customer experiences globally, making them more delightful and helpful for millions of...Flexible hoursDay shift- ...at the intersection of data engineering, system performance optimization, and client-facing technical operations, ensuring that our AI personalization solution runs reliably in production and delivers measurable value. You will collaborate closely with Product Managers...Fixed term contractWork at officeRemote workWork from home
- ...government/military DNEA, Exploitation Analyst (EA) operations, elite red teams, or nation‑state threat research. You have defined... ...analysis and large‑scale data processing systems. You might work with AI/ML workflows applied to adversary emulation and automation. You...Full timeWork at officeFlexible hours
$139.5k - $258.1k
Seattle, Washington, United States Machine Learning and AI The Apple Knowledge Quality Team is building the next-generation of machine learning solutions for Knowledge Q&A at Apple and help power features including Siri and Spotlight. The features we build are redefining...Relocation$68k - $102k
...to real‑time streaming technologies including Kafka, Azure Event Hub, Delta Live Tables, or Spark Structured Streaming. Exposure to AI/ML pipelines or building data products that support ML workloads. Familiarity with legacy data platforms such as Teradata, Oracle, or...Full timeTemporary workWork experience placementLocal areaFlexible hours- ...ambiguous situations. Nice to have Knowledge and experience in securing AI/ML based products. Experience with the risk management associated with financial technology companies. Experience with red teaming or penetration testing. Additional compensation in the form(s)...Work experience placementLocal area
- Red Cell Partners in McLean, VA is seeking a Principal AI Researcher to lead research in autonomous systems and AI infrastructures. This pivotal role focuses on defining long-term research directions and translating AI innovations into scalable systems. The candidate should...Remote job
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Red Teamer, CBRNE. Be the first to apply!



