Savvas Zannettou

Automated tools often misjudge African American English, leading to unfair treatment online.

2025-09-14T22:35:48+00:00 ― 7 min read

A new framework assesses the effectiveness of image safety classifiers against harmful content.

2025-08-13T09:48:48+00:00 ― 10 min read

Examining the threats posed by autonomous language model agents and their weaknesses.

2025-07-04T23:55:12+00:00 ― 6 min read