CAST offers a precise approach to managing language model responses.
― 7 min read
Cutting-edge science explained simply
Examining how flagging mechanisms influence user trust and perceptions of fairness.
― 6 min read
Using open-source models to improve harmful content detection efficiently and effectively.
― 6 min read
A new dataset aims to improve the classification of harmful content online.
― 7 min read
Examining how labeling strategies affect minority voices in sexism detection.
― 7 min read
New methods improve detection of offensive language using sentiment analysis.
― 6 min read
Exploring new methods for effective social media content moderation.
― 5 min read
Examining the support and harm of online spaces for eating disorders.
― 5 min read
The SAFE-MEME framework helps identify hate speech hidden in memes.
― 7 min read