GuardrailsAI/prompt-saturation-attack-detector Text Classification • 4.39M • Updated Nov 14, 2024 • 29.7k • • 2
qualifire/prompt-injection-jailbreak-sentinel-v2 Text Classification • 0.6B • Updated Sep 28 • 1.83k • 15