Hate Speech
Hate speech, encompassing discriminatory and derogatory language targeting individuals or groups, is a significant online problem. Current research focuses on improving automated hate speech detection, employing various deep learning models like BERT, LSTM, and transformer-based architectures, often incorporating multimodal data (text and images) and addressing challenges like implicit hate, code-mixing, and cross-cultural variations. These efforts aim to enhance the accuracy and fairness of hate speech detection systems, ultimately contributing to safer online environments and informing content moderation strategies. The field also explores methods for generating counterspeech and mitigating biases within detection models.
Papers
Untangling Hate Speech Definitions: A Semantic Componential Analysis Across Cultures and Domains
Katerina Korre, Arianna Muti, Federico Ruggeri, Alberto Barrón-Cedeño
A Unified Multi-Task Learning Architecture for Hate Detection Leveraging User-Based Information
Prashant Kapil, Asif Ekbal
1-800-SHARED-TASKS @ NLU of Devanagari Script Languages: Detection of Language, Hate Speech, and Targets using LLMs
Jebish Purbey, Siddartha Pullakhandam, Kanwal Mehreen, Muhammad Arham, Drishti Sharma, Ashay Srivastava, Ram Mohan Rao Kadiyala
A Target-Aware Analysis of Data Augmentation for Hate Speech Detection
Camilla Casula, Sara Tonelli
Human and LLM Biases in Hate Speech Annotations: A Socio-Demographic Analysis of Annotators and Targets
Tommaso Giorgi, Lorenzo Cima, Tiziano Fagni, Marco Avvenuti, Stefano Cresci
A Hate Speech Moderated Chat Application: Use Case for GDPR and DSA Compliance
Jan Fillies, Theodoros Mitsikas, Ralph Schäfermeier, Adrian Paschke