Hate Speech
Hate speech, encompassing discriminatory and derogatory language targeting individuals or groups, is a significant online problem. Current research focuses on improving automated hate speech detection, employing various deep learning models like BERT, LSTM, and transformer-based architectures, often incorporating multimodal data (text and images) and addressing challenges like implicit hate, code-mixing, and cross-cultural variations. These efforts aim to enhance the accuracy and fairness of hate speech detection systems, ultimately contributing to safer online environments and informing content moderation strategies. The field also explores methods for generating counterspeech and mitigating biases within detection models.
Papers
A Target-Aware Analysis of Data Augmentation for Hate Speech Detection
Camilla Casula, Sara Tonelli
Human and LLM Biases in Hate Speech Annotations: A Socio-Demographic Analysis of Annotators and Targets
Tommaso Giorgi, Lorenzo Cima, Tiziano Fagni, Marco Avvenuti, Stefano Cresci
A Hate Speech Moderated Chat Application: Use Case for GDPR and DSA Compliance
Jan Fillies, Theodoros Mitsikas, Ralph Schäfermeier, Adrian Paschke