Hateful Content
Hateful content online, encompassing hate speech and offensive language, is a significant research area aiming to develop automated detection and mitigation strategies. Current research focuses on improving detection accuracy using various machine learning models, including transformer-based architectures (like BERT and its variants) and graph neural networks, often incorporating multimodal data (text, images, audio) and contextual information from user networks and online discussions. These advancements are crucial for creating safer online environments and informing effective content moderation strategies, while also highlighting the challenges of cross-lingual generalization, bias in datasets, and the need for explainable AI in this sensitive domain.
Papers
Analysis and Detection of Multilingual Hate Speech Using Transformer Based Deep Learning
Arijit Das, Somashree Nandy, Rupam Saha, Srijan Das, Diganta Saha
Using LLMs to discover emerging coded antisemitic hate-speech in extremist social media
Dhanush Kikkisetti, Raza Ul Mustafa, Wendy Melillo, Roberto Corizzo, Zois Boukouvalas, Jeff Gill, Nathalie Japkowicz
Attentive Fusion: A Transformer-based Approach to Multimodal Hate Speech Detection
Atanu Mandal, Gargi Roy, Amit Barman, Indranil Dutta, Sudip Kumar Naskar