hARmful Meme
Harmful memes, combining text and images to spread harmful ideologies or attack individuals, are a growing concern requiring automated detection. Current research focuses on developing multimodal models, often leveraging large language models (LLMs) and incorporating techniques like image captioning and optical character recognition (OCR), to analyze both visual and textual content for nuanced understanding of toxicity. This work is crucial for improving online safety and informing the development of effective content moderation strategies, with a particular emphasis on explainability and addressing challenges posed by multilingual and low-resource contexts.
Papers
Detecting the Role of an Entity in Harmful Memes: Techniques and Their Limitations
Rabindra Nath Nandi, Firoj Alam, Preslav Nakov
Detecting and Understanding Harmful Memes: A Survey
Shivam Sharma, Firoj Alam, Md. Shad Akhtar, Dimitar Dimitrov, Giovanni Da San Martino, Hamed Firooz, Alon Halevy, Fabrizio Silvestri, Preslav Nakov, Tanmoy Chakraborty