Multimodal Topic Modeling

Multimodal topic modeling aims to discover the underlying themes of datasets that contain multiple data types, such as text and images, by analyzing the modalities jointly rather than in isolation. Current research focuses on neural network-based models, often incorporating large language models or graph-based representations to integrate and interpret the different modalities, as well as on better evaluation metrics for the coherence and diversity of the discovered topics. The field matters for applications such as social media analysis (e.g., understanding meme trends) and relation extraction, where combining visual and textual signals improves accuracy and yields richer insights.
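
To make the general idea concrete, the sketch below shows one common embedding-and-clustering recipe: text and images are mapped into a shared CLIP space, the per-document embeddings are fused, and clusters of fused embeddings are read off as topics. The dataset, the averaging fusion, the number of topics, and the TF-IDF topic descriptions are illustrative assumptions, not the method of any particular paper.

```python
# Minimal multimodal topic modeling sketch, assuming a small corpus of
# (caption, image_path) pairs. All data, model choices, and the simple
# averaging fusion are hypothetical illustrations.
from PIL import Image
from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer
import numpy as np

# Hypothetical dataset: each document has a caption and an associated image.
docs = [("cats playing with yarn", "cat1.jpg"),
        ("a kitten chasing a ball", "cat2.jpg"),
        ("stock market hits record high", "chart1.jpg"),
        ("investors react to interest rate cut", "chart2.jpg")]

# CLIP maps images and text into a shared embedding space.
clip = SentenceTransformer("clip-ViT-B-32")
text_emb = clip.encode([caption for caption, _ in docs])
img_emb = clip.encode([Image.open(path) for _, path in docs])

# Fuse the modalities by simple averaging (research models typically learn this fusion).
doc_emb = (text_emb + img_emb) / 2.0

# Cluster the fused embeddings; each cluster is treated as a topic.
n_topics = 2
labels = KMeans(n_clusters=n_topics, n_init=10, random_state=0).fit_predict(doc_emb)

# Describe each topic by the top TF-IDF terms of its member captions.
vectorizer = TfidfVectorizer(stop_words="english")
tfidf = vectorizer.fit_transform([caption for caption, _ in docs])
vocab = np.array(vectorizer.get_feature_names_out())
for t in range(n_topics):
    scores = np.asarray(tfidf[labels == t].mean(axis=0)).ravel()
    top_terms = vocab[scores.argsort()[::-1][:3]]
    print(f"topic {t}: {', '.join(top_terms)}")
```

A learned fusion network, a contrastive objective, or an LLM-generated topic label can replace the averaging and TF-IDF steps; the overall pipeline of joint embedding, clustering, and topic description stays the same.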

Papers