Hate Speech

Hate speech, encompassing discriminatory and derogatory language targeting individuals or groups, is a significant online problem. Current research focuses on improving automated hate speech detection, employing various deep learning models like BERT, LSTM, and transformer-based architectures, often incorporating multimodal data (text and images) and addressing challenges like implicit hate, code-mixing, and cross-cultural variations. These efforts aim to enhance the accuracy and fairness of hate speech detection systems, ultimately contributing to safer online environments and informing content moderation strategies. The field also explores methods for generating counterspeech and mitigating biases within detection models.

Papers

September 20, 2024

Trustworthy Hate Speech Detection Through Visual Augmentation
Ziyuan Yang, Ming Yan, Yingyu Chen, Hui Wang, Zexin Lu, Yi Zhang
Hate Speech Hate Speech Detection Semantic Representation

September 19, 2024

Exploring the topics, sentiments and hate speech in the Spanish information environment
ALEJANDRO BUITRAGO LOPEZ, Javier Pastor-Galindo, José Antonio Ruipérez-Valiente
Hate Speech Implicit Sentiment Toxic Comment Significant Topic Public Discourse Clear Human Aware Sentiment Orientation Spanish Dictionary

September 8, 2024

August 12, 2024

An Investigation Into Explainable Audio Hate Speech Detection
Jinmyeong An, Wonjun Lee, Yejin Jeon, Jungseul Ok, Yunsu Kim, Gary Geunbae Lee
Hate Speech Hate Speech Detection

August 11, 2024

HateSieve: A Contrastive Learning Framework for Detecting and Segmenting Hateful Content in Multimodal Memes
Xuanyu Su, Yansong Li, Diana Inkpen, Nathalie Japkowicz
Data Detection Hate Speech Hate Speech Detection Hateful Content Contrastive Learning Framework Hateful Meme Multimodal Meme Meme Generation

August 7, 2024

Hate Speech Detection and Classification in Amharic Text with Deep Learning
Samuel Minale Gashe, Seid Muhie Yimam, Yaregal Assabie
Deep Learning Classification Code Hate Speech Hate Speech Detection Non Hate Speech Amharic Speech Emotion Dataset Amharic Scene Text

July 28, 2024

MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili
Han Wang, Tan Rui Yang, Usman Naseem, Roy Ka-Wei Lee
Hate Speech Hate Speech Detection Multilingual Benchmark Multi Head Offensive Language Online Hate

July 26, 2024

Towards Generalized Offensive Language Identification
Alphaeus Dmonte, Tejas Arya, Tharindu Ranasinghe, Marcos Zampieri
Natural Language Processing Hate Speech Offensive Content Offensive Language Detection

July 24, 2024

Towards Transfer Unlearning: Empirical Evidence of Cross-Domain Bias Mitigation
Huimin Lu, Masaru Isonuma, Junichiro Mori, Ichiro Sakata
Large Language Model Hate Speech Unlearning Framework Debiasing Method Empirical Evidence

July 1, 2024

Sociocultural Considerations in Monitoring Anti-LGBTQ+ Content on Social Media
Sidney G. -J. Wong
Social Medium Hate Speech Anti LGBTQ+

June 27, 2024

June 20, 2024

Watching the Watchers: A Comparative Fairness Audit of Cloud-based Content Moderation Services
David Hartmann, Amin Oueslati, Dimitri Staufer
Hate Speech Hate Speech Detection Content Moderation Live Streaming Viewer Content Moderation Software

June 18, 2024

COT: A Generative Approach for Hate Speech Counter-Narratives via Contrastive Optimal Transport
Linhao Zhang, Li Jin, Guangluan Xu, Xiaoyu Li, Xian Sun
Optimal Transport Hate Speech Generative Approach Token Representation Counter Narrative BED Turnaround Time

June 17, 2024

Investigating Annotator Bias in Large Language Models for Hate Speech Detection
Amit Das, Zheng Zhang, Najib Hasan, Souvika Sarkar, Fatemeh Jamshidi, Tathagata Bhattacharya, Mostafa Rahgouy, Nilanjana Raychawdhary, Dongji Feng, Vinija Jain, Aman Chadha, Mary Sandage, Lauramarie Pope, Gerry Dozier, Cheryl Seals
Hate Speech Hate Speech Detection Speech Data Annotated Dataset Data Annotation Annotation Bias

June 12, 2024

Label-aware Hard Negative Sampling Strategies with Momentum Contrastive Learning for Implicit Hate Speech Detection
Jaehoon Kim, Seungwan Jin, Sohyun Park, Someen Park, Kyungsik Han
Contrastive Learning Hate Speech Hate Speech Detection Negative Sampling Momentum Contrast

June 7, 2024

HateDebias: On the Diversity and Variability of Hate Speech Debiasing
Nankai Lin, Hongyan Wu, Zhengming Chen, Zijian Li, Lianxi Wang, Shengyi Jiang, Dong Zhou, Aimin Yang
Hate Speech Diversity Awareness Hate Speech Detection Non Pathological Variability Bias Attribute

June 6, 2024

Hate Speech

Papers

Trustworthy Hate Speech Detection Through Visual Augmentation

Exploring the topics, sentiments and hate speech in the Spanish information environment

MHS-STMA: Multimodal Hate Speech Detection via Scalable Transformer-Based Multilevel Attention Framework

Hate Content Detection via Novel Pre-Processing Sequencing and Ensemble Methods

An Investigation Into Explainable Audio Hate Speech Detection

HateSieve: A Contrastive Learning Framework for Detecting and Segmenting Hateful Content in Multimodal Memes

Hate Speech Detection and Classification in Amharic Text with Deep Learning

MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili

Towards Generalized Offensive Language Identification

Towards Transfer Unlearning: Empirical Evidence of Cross-Domain Bias Mitigation

Sociocultural Considerations in Monitoring Anti-LGBTQ+ Content on Social Media

IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language

Empirical Evaluation of Public HateSpeech Datasets

Watching the Watchers: A Comparative Fairness Audit of Cloud-based Content Moderation Services

COT: A Generative Approach for Hate Speech Counter-Narratives via Contrastive Optimal Transport

Investigating Annotator Bias in Large Language Models for Hate Speech Detection

Label-aware Hard Negative Sampling Strategies with Momentum Contrastive Learning for Implicit Hate Speech Detection

HateDebias: On the Diversity and Variability of Hate Speech Debiasing

Explainability and Hate Speech: Structured Explanations Make Social Media Moderators Faster

Tox-BART: Leveraging Toxicity Attributes for Explanation Generation of Implicit Hate Speech