Hate Speech
Hate speech, encompassing discriminatory and derogatory language targeting individuals or groups, is a significant online problem. Current research focuses on improving automated hate speech detection, employing various deep learning models like BERT, LSTM, and transformer-based architectures, often incorporating multimodal data (text and images) and addressing challenges like implicit hate, code-mixing, and cross-cultural variations. These efforts aim to enhance the accuracy and fairness of hate speech detection systems, ultimately contributing to safer online environments and informing content moderation strategies. The field also explores methods for generating counterspeech and mitigating biases within detection models.
Papers
Generative AI for Hate Speech Detection: Evaluation and Findings
Sagi Pendzel, Tomer Wullach, Amir Adler, Einat Minkov
Overview of the HASOC Subtrack at FIRE 2023: Identification of Tokens Contributing to Explicit Hate in English by Span Detection
Sarah Masud, Mohammad Aflah Khan, Md. Shad Akhtar, Tanmoy Chakraborty