Spotting Harmful Attacks on Language Models
Using perplexity to identify risky inputs in language models.
Computation and Language
2025-10-03T23:20:54+00:00 · 5 min read