Latest Articles for Online Safety

Computation and Language The Challenge of Temporal Bias in Abusive Language Detection

Examining how evolving language affects models detecting online abuse.

2025-09-22T00:34:42+00:00 ― 8 min read

Computation and Language Improving Online Harmful Content Detection

A new framework enhances detection of harmful online language through continuous learning.

2025-09-20T19:36:42+00:00 ― 7 min read

Computers and Society Understanding Online Safety Tools and User Engagement

An overview of user awareness and experiences with safety technologies on social media.

2025-09-19T02:47:42+00:00 ― 5 min read

Computers and Society Fake Profiles on Twitter: The AI Threat

AI-generated images fuel the rise of deceptive social media accounts.

2025-09-19T00:49:12+00:00 ― 4 min read

Cryptography and Security Phishing Threats in the Age of AI

Phishing tactics are evolving with AI, posing new risks for organizations.

2025-09-16T00:40:00+00:00 ― 7 min read

Computation and Language Addressing Online Hate Speech in Digital Spaces

Exploring the challenges and strategies for moderating hate speech online.

2025-09-13T11:26:30+00:00 ― 8 min read

Computation and Language Understanding Deception in the Digital Age

A look at deception online and how to detect it effectively.

2025-09-12T15:41:30+00:00 ― 7 min read

Human-Computer Interaction Navigating Risks: LGBTQ+ Youth on Instagram

Examining the challenges LGBTQ+ youth face on social media platforms.

2025-09-08T01:45:00+00:00 ― 4 min read

Computation and Language Improving Hateful Meme Detection with Visual and Text Features

A new method enhances detection of hateful content in memes.

2025-09-07T15:36:42+00:00 ― 5 min read

Computation and Language Improving Hate Speech Detection with GPT-HateCheck

A new framework enhances hate speech detection by generating realistic test cases.

2025-09-04T17:18:06+00:00 ― 5 min read

Psychiatry and Clinical Psychology The Challenge of Moderation in Online Mental Health Spaces

Examining content moderation's role in online mental health communities.

2025-08-31T04:41:00+00:00 ― 8 min read

Human-Computer Interaction Fairness in Content Moderation on Social Media

Examining how users perceive fairness in moderation decisions on platforms.

2025-08-31T00:43:36+00:00 ― 6 min read

Computation and Language Using AI to Detect Hate Speech Online

This article discusses how language models help identify hate speech.

2025-08-30T05:53:54+00:00 ― 5 min read

Computation and Language Addressing Hateful Memes in Bengali

A study on the impact and identification of hateful memes in the Bengali language.

2025-08-28T19:08:18+00:00 ― 6 min read

Computation and Language Addressing Offensive Language Detection in Korean Online Spaces

This study tackles user-intended attacks on offensive language in Korean social media.

2025-08-27T17:04:06+00:00 ― 5 min read

Machine Learning Rethinking Malicious Content Detection Models

New evaluation methods aim to improve detection of harmful content online.

2025-08-23T05:06:06+00:00 ― 7 min read

Cryptography and Security Understanding Adversarial Phishing Webpages and User Perception

This article explores how users perceive adversarial phishing sites and ways to improve detection.

2025-08-22T22:23:12+00:00 ― 6 min read

Computation and Language Tackling Toxicity: Improving Online Language Detection

Enhancing tools to detect harmful language in online spaces is crucial for safety.

2025-08-21T05:02:36+00:00 ― 6 min read

Computation and Language A New Tool for Detecting Hate Speech in Singapore

SGHateCheck focuses on local languages to tackle online hate speech effectively.

2025-08-13T23:54:06+00:00 ― 7 min read

Computation and Language Enhancing Online Privacy with Automatic Text Rewriting

A new framework that protects online identities while preserving message clarity.

2025-08-10T15:35:06+00:00 ― 4 min read

Networking and Internet Architecture Addressing the Challenge of Transient Domains in Cybersecurity

New research reveals gaps in detecting transient domains used for online abuse.

2025-08-09T16:32:36+00:00 ― 6 min read

Computers and Society The Threat of Mal-Info on Social Media

Examining the impact and spread of malicious information on social platforms.

2025-08-07T20:41:54+00:00 ― 5 min read

Cryptography and Security Changing Passwords After Data Breaches: A Study

Examining the impact of nudges on password change behavior after data breaches.

2025-08-07T11:36:48+00:00 ― 10 min read

Human-Computer Interaction Amharic Speakers Face Challenges on YouTube

Research reveals harmful content exposure for Amharic speakers on YouTube.

2025-08-06T21:39:24+00:00 ― 7 min read

Human-Computer Interaction Addressing Harmful Content in WhatsApp Groups

A new approach using conversational agents for safer discussions on WhatsApp.

2025-08-04T17:38:54+00:00 ― 4 min read

Computation and Language Rethinking Toxic Language Detection Online

A new framework improves detecting harmful language in online spaces.

2025-08-02T19:18:06+00:00 ― 4 min read

Cryptography and Security Protecting Young Users: A Privacy Audit of Online Services

An analysis of how online services handle children's data privacy.

2025-07-31T01:59:42+00:00 ― 5 min read

Cryptography and Security Large Language Models Enhancing Content Moderation

LLMs assist human raters in effectively identifying harmful online content.

2025-07-27T07:00:48+00:00 ― 5 min read

Computation and Language Introducing LionGuard: A Localized Moderation Tool for Singapore

LionGuard enhances content safety by focusing on Singapore's unique language context.

2025-07-24T16:52:00+00:00 ― 4 min read

Cryptography and Security Secure Communication in a Digital Age

Addressing the challenges of E2EE and account recovery methods.

2025-07-23T21:22:48+00:00 ― 6 min read

Computation and Language Addressing Abusive Speech Online: A New Approach

This paper introduces Demarcation to tackle online abusive speech effectively.

2025-07-23T14:47:48+00:00 ― 9 min read

Computation and Language Addressing Hate Speech Online in Indonesia

A new dataset aims to improve hate speech detection in Indonesia.

2025-07-23T12:01:54+00:00 ― 8 min read

Computation and Language Evaluating Datasets for Hate Speech Detection

A study assessing the quality of datasets for identifying hate speech online.

2025-07-23T04:07:54+00:00 ― 7 min read

Computation and Language Detecting Hate Speech in Arabic Tweets

A new method improves identification of hate speech in Arabic social media.

2025-07-20T22:16:48+00:00 ― 5 min read

Machine Learning Ensuring Fairness in Toxic Language Detection

A new method promotes fairness in identifying targeted toxic language across demographic groups.

2025-07-12T15:12:00+00:00 ― 7 min read

Cryptography and Security Enhancing Password Security with PassTSL

A new model improves password guessing and strength assessment.

2025-07-10T04:04:54+00:00 ― 5 min read

Cryptography and Security Brute-Force Attacks: A Training Guide for Beginners

Learn to detect and prevent brute-force attacks through practical training scenarios.

2025-07-08T01:31:18+00:00 ― 6 min read

Computation and Language Addressing Offensive Language Detection Challenges

A study on the effectiveness of offensive language detection systems and datasets.

2025-07-06T11:04:30+00:00 ― 6 min read

Cryptography and Security PhishLang: A New Tool Against Phishing Scams

PhishLang offers improved detection for phishing websites using advanced analysis techniques.

2025-06-29T17:23:18+00:00 ― 5 min read

Artificial Intelligence Addressing Hate Speech in Memes with HateSieve

A new system targets hate speech in memes effectively.

2025-06-29T11:12:00+00:00 ― 6 min read