AI Red Teamer, CBRNE

Handshake

Seattle, United States of America

3 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Compensation

$ 329K

Job location

Seattle, United States of America

Tech stack

Artificial Intelligence

Software System Penetration Testing

Intelligence Analysis

Python

Open Source Technology

Software Safety

Scripting (Bash/Python/Go/Ruby)

Large Language Models

GPT

Job description

As a CBRNE Red Teamer, you will evaluate whether AI models appropriately handle queries related to chemical, biological, radiological, nuclear, and explosive threats. Your job is to probe models for dangerous knowledge gaps in their safety guardrails, testing whether they can be manipulated into providing meaningful uplift toward the creation, acquisition, or deployment of weapons or hazardous materials.

This work sits at the intersection of AI safety and national security. You will use your domain expertise to craft realistic, technically grounded adversarial scenarios that stress-test model defenses far beyond what a non-expert could attempt. The goal is not to generate harmful content, but to find and document the places where models fail to refuse, hedge, or redirect appropriately so that labs can fix them before those failures reach the real world.

This role requires deep subject matter expertise in at least one CBRNE domain, strong ethical judgment, and the ability to think like a sophisticated threat actor while operating within a structured evaluation framework.

Day-to-Day Responsibilities

Design technically grounded adversarial prompts that test whether models provide meaningful uplift toward CBRNE threats
Evaluate model outputs for technical accuracy, assessing whether responses contain genuinely dangerous information versus superficial or publicly available knowledge
Probe dual-use knowledge boundaries, testing how models handle queries that blend legitimate scientific, medical, or industrial use cases with potential weapons applications
Test multi-step and multi-turn attack chains that simulate how a motivated actor might extract dangerous information incrementally
Score model responses against structured harm taxonomies and severity rubrics calibrated to real-world risk
Document findings with clear technical reasoning, including what a response gets right, what it gets wrong, and why the failure matters
Identify and articulate the difference between information that is freely available in open literature and information that constitutes genuine uplift beyond baseline
Contribute to the development and refinement of CBRNE-specific evaluation frameworks and threat models
Collaborate with other red teamers, AI researchers, and policy teams to translate findings into actionable model improvements
Stay current on evolving model capabilities, jailbreak techniques, and relevant developments in your domain

Requirements

Do you have experience in Threat intelligence?, * Graduate-level education or equivalent professional experience in a relevant CBRNE field (chemistry, biochemistry, microbiology, virology, nuclear physics, radiochemistry, materials science, munitions/ordnance, chemical engineering, or closely related disciplines)

Ability to evaluate the technical accuracy and real-world consequence of model outputs in your domain
Understanding of dual-use research concerns and the distinction between open-source knowledge and operationally significant uplift
Strong hands-on experience using multiple LLMs (ChatGPT, Claude, Gemini, open-source models, etc.)
Creative, adversarial problem-solving skills
Clear and precise written communication, including the ability to explain technical risk to non-specialist audiences
Strong ethical judgment and the ability to separate adversarial thinking from personal values
Self-directed, collaborative, and comfortable in feedback-heavy environments

Nice to Have

Active or prior security clearance (Secret, Top Secret, or SCI)
Experience in threat assessment, WMD analysis, intelligence analysis, or arms control verification
Background in biosafety/biosecurity, chemical safety, nuclear nonproliferation, or explosive ordnance disposal
Familiarity with relevant regulatory frameworks (CWC, BWC, IAEA safeguards, ATF regulations, Export Administration Regulations)
Experience in red teaming, penetration testing, or structured adversarial evaluation in any context
Familiarity with Python or scripting languages, LLM APIs, or evaluation tooling
Published research or professional presentations in a relevant CBRNE domain
Prior work in trust and safety, content moderation, or AI evaluation

You Will Thrive Here If

You have spent years building deep expertise in a CBRNE-relevant field and want to apply that knowledge to AI safety
You can look at a model response about synthesis routes, enrichment processes, or dispersal mechanisms and immediately assess whether it crosses the line from textbook to actionable
You think in attack trees and threat models, not just individual prompts
You are comfortable working at the boundary between helpful scientific information and genuinely dangerous knowledge
You care about getting this right because you understand what the consequences of getting it wrong look like

Content Warning

This role involves regular and deliberate engagement with sensitive CBRNE-related content. You will craft and evaluate scenarios involving weapons of mass destruction, toxic industrial chemicals, biological agents, radiological and nuclear materials, and explosive devices. All work is conducted within a structured evaluation framework with strict ethical guidelines and operational security protocols. Candidates must be able to engage with this material professionally and sustainably.

About the company

Handshake was founded on a simple belief that everyone deserves a path to a great career, regardless of where they went to school or who they know. Today, we power 25 million job seekers, 1 million+ employers, and 1,600 educational institutions. In 2025, we started Handshake AI and built the fastest-growing AI data business in history. We work directly with frontier AI lab researchers to create evaluations, publish benchmarks, and push the boundary of data. We've grown from $0 to ~$1B run rate and pay ~$60M to over 30K individuals every month. Why join Handshake now: * Shape how every career evolves in the AI economy, at global scale, with impact your friends, family and peers can see and feel * Partner hand-in-hand with world-class AI labs, Fortune 500 partners and the world's top educational institutions * Work together with engineers, scientists, operators, and more from Palantir, Meta, Scale AI, and former YC founders * Build a massive, fast-growing business with billions in revenue About Handshake AI Human data is the core infrastructure to AI advancement. Frontier AI labs currently improve model capabilities with various data-intensive post-training techniques. We believe that data spend for AI training will increase by 3-5x in the next few years and continue for much longer as models take on new domains. Handshake AI supports all of the frontier AI labs, working on their most complex data at the largest scale.

Role details

Job location

Tech stack

Job description

Requirements

About the company

Apply for this position

Good distractions

Moments

Videos View all