Examining how evolving language affects models detecting online abuse.
― 8 min read
Cutting edge science explained simply
Examining how evolving language affects models detecting online abuse.
― 8 min read
A new framework enhances detection of harmful online language through continuous learning.
― 7 min read
An overview of user awareness and experiences with safety technologies on social media.
― 5 min read
AI-generated images fuel the rise of deceptive social media accounts.
― 4 min read
Phishing tactics are evolving with AI, posing new risks for organizations.
― 7 min read
Exploring the challenges and strategies for moderating hate speech online.
― 8 min read
A look at deception online and how to detect it effectively.
― 7 min read
Examining the challenges LGBTQ+ youth face on social media platforms.
― 4 min read
A new method enhances detection of hateful content in memes.
― 5 min read
A new framework enhances hate speech detection by generating realistic test cases.
― 5 min read
Examining content moderation's role in online mental health communities.
― 8 min read
Examining how users perceive fairness in moderation decisions on platforms.
― 6 min read
This article discusses how language models help identify hate speech.
― 5 min read
A study on the impact and identification of hateful memes in the Bengali language.
― 6 min read
This study tackles user-intended attacks on offensive language in Korean social media.
― 5 min read
New evaluation methods aim to improve detection of harmful content online.
― 7 min read
This article explores how users perceive adversarial phishing sites and ways to improve detection.
― 6 min read
Enhancing tools to detect harmful language in online spaces is crucial for safety.
― 6 min read
SGHateCheck focuses on local languages to tackle online hate speech effectively.
― 7 min read
A new framework that protects online identities while preserving message clarity.
― 4 min read
New research reveals gaps in detecting transient domains used for online abuse.
― 6 min read
Examining the impact and spread of malicious information on social platforms.
― 5 min read
Examining the impact of nudges on password change behavior after data breaches.
― 10 min read
Research reveals harmful content exposure for Amharic speakers on YouTube.
― 7 min read
A new approach using conversational agents for safer discussions on WhatsApp.
― 4 min read
A new framework improves detecting harmful language in online spaces.
― 4 min read
An analysis of how online services handle children's data privacy.
― 5 min read
LLMs assist human raters in effectively identifying harmful online content.
― 5 min read
LionGuard enhances content safety by focusing on Singapore's unique language context.
― 4 min read
Addressing the challenges of E2EE and account recovery methods.
― 6 min read
This paper introduces Demarcation to tackle online abusive speech effectively.
― 9 min read
A new dataset aims to improve hate speech detection in Indonesia.
― 8 min read
A study assessing the quality of datasets for identifying hate speech online.
― 7 min read
A new method improves identification of hate speech in Arabic social media.
― 5 min read
A new method promotes fairness in identifying targeted toxic language across demographic groups.
― 7 min read
A new model improves password guessing and strength assessment.
― 5 min read
Learn to detect and prevent brute-force attacks through practical training scenarios.
― 6 min read
A study on the effectiveness of offensive language detection systems and datasets.
― 6 min read
PhishLang offers improved detection for phishing websites using advanced analysis techniques.
― 5 min read
A new system targets hate speech in memes effectively.
― 6 min read