Threat Intelligence Exchange

AI Vulnerability Registry

Curated, severity-scored intelligence on jailbreaks, prompt injections, DLP bypasses, and governance failures across enterprise AI providers. Updated continuously by the research community.

7Total Entries
2Critical
2No Mitigation
4With Policy Pack

Showing 7 of 7 entries

CRITICALPrompt InjectionSMP-2026-0341
2026-05-11GPT-4o

Multilingual Role-Switch Bypass via Bahasa Code-Switching

Adversary alternates between English and Bahasa Indonesia mid-sentence to defeat English-only moderation classifiers, enabling full system prompt extraction.

Detection:92%
Available Policy Packby NUS_SecLab
CRITICALJailbreakSMP-2026-0298
2026-05-10Claude 3.5

Base64 Double-Encode Token-Level PII Filter Bypass

Double-encoding PII payloads in Base64 circumvents token-level scan buffers. Model decodes internally and includes raw PII in response.

Detection:78%
Partial Policy Packby r_vasquez
HIGHGovernance FailureSMP-2026-0271
2026-05-09Gemini 1.5

System Prompt Extraction via Few-Shot Induction

Carefully crafted few-shot examples induce the model to pattern-match and reveal confidential system prompt fragments.

Detection:55%
Noneby iitb_ai
HIGHDLP BypassSMP-2026-0254
2026-05-08Copilot

Unicode Zero-Width Join Sequence Bypasses Content Filter

Inserting U+200D (ZWNJ) between sensitive tokens prevents string-matching content filters from flagging the payload.

Detection:88%
Available Policy Packby anon_r847
MEDIUMPII LeakageSMP-2026-0231
2026-05-07GPT-4o

Malaysian IC Regex Evasion via Transliteration

Transliterating digits in Malaysian IC numbers (NRIC) to look-alike Unicode characters bypasses regex-based PII detection.

Detection:94%
Available Policy Packby anon_r312
HIGHSemantic AttackSMP-2026-0218
2026-05-06Llama 3

Multi-Step Reasoning Jailbreak via Hypothetical Framing

Adversary chains 5+ hypothetical-framing prompts across a conversation context window to gradually erode safety constraints.

Detection:41%
Noneby kyoto_sec
MEDIUMPrompt InjectionSMP-2026-0199
2026-05-04GPT-4o

Indirect Injection via Calendar Event Description

Malicious instructions embedded in calendar event descriptions are executed by Copilot / ChatGPT when summarizing user's calendar.

Detection:67%
Partialby NUS_SecLab