| Check Type | What It Does |
|---|---|
| Prompt Injection Detection | Catches "ignore previous instructions" patterns |
| PII Scrubbing | Strips emails, SSNs, phone numbers from input |
| Topic/Intent Classification | Ensures query is within allowed scope |
| Token Length Validation | Prevents context overflow attacks |
| Jailbreak Pattern Matching | Regex/classifier on known bypass attempts |
Created
March 16, 2026 19:18
-
-
Save shubhamkakkar/9a6898f1c3f7165804585691715fe202 to your computer and use it in GitHub Desktop.
What Pre-Checks Cover
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment