Hate Speech Detection Method
Hate speech detection aims to automatically identify hateful content online, mitigating its harmful effects. Current research focuses on improving model robustness against adversarial attacks and biases, enhancing interpretability through techniques like rationale extraction using large language models, and addressing data scarcity, particularly in low-resource languages, often employing deep learning architectures such as transformers and LSTMs. These advancements are crucial for creating more accurate and fair hate speech detection systems, impacting online safety and fostering more constructive online discourse.
Papers
September 25, 2024
May 13, 2024
March 19, 2024
June 26, 2023
August 29, 2022