Module 12 · LLM Jailbreak Defence

Manish Garg
Manish Garg Associate of (ISC)² · RingSafe
Apr 27, 2026
1 min read
Read as

Last updated: April 29, 2026

100% Free

No signup. No paywall. No catch. One of our 10 most-requested practitioner modules — published in full so anyone can learn for free. We earn through consulting, not by gating knowledge.

See all 10 free modules →

Jailbreaks bypass model safety training. New variants constant.

Jailbreaks bypass model safety training. New variants constant.

Common patterns

  • Roleplay — “Pretend you are DAN (Do Anything Now)”
  • Encoding — base64, ROT13, leetspeak
  • Multi-turn — gradually shift context away from policy
  • Character set tricks — Unicode confusables
  • Adversarial suffixes (GCG) — discovered tokens that flip safety
  • Crescendo — multi-turn gradient toward sensitive content
Want this for your team?

Custom team training + practitioner advisory

Beyond the free academy — we run private workshops, vCISO advisory, and red-team exercises tailored to your stack. For Indian SMBs scaling past their first hire.

Book team training call Replies in 4 working hrs · India-only · Senior consultants