AVATAR's Tricks Unmasked
Computation and Language
AVATAR: Mischief in Language Models
Discover how AVATAR cleverly disguises harmful intents in language models.
2025-03-27T11:33:27+00:00 · 6 min read