Hateful Content
Hateful content online, encompassing hate speech and offensive language, is a significant research area aiming to develop automated detection and mitigation strategies. Current research focuses on improving detection accuracy using various machine learning models, including transformer-based architectures (like BERT and its variants) and graph neural networks, often incorporating multimodal data (text, images, audio) and contextual information from user networks and online discussions. These advancements are crucial for creating safer online environments and informing effective content moderation strategies, while also highlighting the challenges of cross-lingual generalization, bias in datasets, and the need for explainable AI in this sensitive domain.
Papers
CRUSH: Contextually Regularized and User anchored Self-supervised Hate speech Detection
Souvic Chakraborty, Parag Dutta, Sumegh Roychowdhury, Animesh Mukherjee
TIB-VA at SemEval-2022 Task 5: A Multimodal Architecture for the Detection and Classification of Misogynous Memes
Sherzod Hakimov, Gullal S. Cheema, Ralph Ewerth