Last updated: April 29, 2026
Jailbreaks are prompts crafted to bypass a model's safety training. New variants appear constantly.
Common patterns
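- Roleplay: “Pretend you are DAN (Do Anything Now)”
- Encoding: the request is hidden in base64, ROT13, or leetspeak
- Multi-turn: gradually shift the conversation's context away from policy
- Character set tricks: Unicode confusables (look-alike characters) that slip past string filters
- Adversarial suffixes (GCG, Greedy Coordinate Gradient): optimized token sequences appended to a prompt that suppress refusals
- Crescendo: a named multi-turn attack that escalates step by step toward sensitive content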
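To make the encoding and character set patterns concrete, here is a minimal sketch of how an input filter might expand a prompt into its plausible plain-text readings before any policy check runs. The function name `candidate_decodings` and the `LEET` table are hypothetical and exist only for illustration. The sketch does not address roleplay, multi-turn, or adversarial-suffix attacks.

```python
import base64
import codecs
import unicodedata

# Hypothetical leetspeak table, for illustration only; real substitutions vary.
LEET = str.maketrans({"4": "a", "3": "e", "1": "i", "0": "o", "5": "s", "7": "t"})


def candidate_decodings(text: str) -> set[str]:
    """Return lowercased plain-text readings of `text` worth checking."""
    # NFKC folds compatibility forms (such as fullwidth letters) to their
    # ordinary equivalents. It does NOT fold cross-script look-alikes such
    # as Cyrillic letters; those need a separate confusables table.
    base = unicodedata.normalize("NFKC", text).lower()

    readings = {
        base,
        codecs.decode(base, "rot13"),  # ROT13 is its own inverse
        base.translate(LEET),          # undo simple digit-for-letter swaps
    }

    # Try base64 on the raw input, since lowercasing would corrupt it.
    try:
        raw = base64.b64decode(text.strip(), validate=True)
        readings.add(raw.decode("utf-8").lower())
    except ValueError:
        # Covers binascii.Error, UnicodeDecodeError, and non-ASCII input.
        pass

    return readings
```

A downstream check would then run against every string in the returned set rather than only the raw input. This is a design choice for the sketch: over-generating readings is cheap, while missing one lets an encoded request through.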