Document Categorization
Document categorization aims to automatically assign documents to predefined categories or clusters, facilitating efficient information retrieval and knowledge discovery. Current research emphasizes improving the accuracy and interpretability of categorization, focusing on techniques like graph neural networks for leveraging sentence-level context and advanced clustering methods that analyze document parts to identify nuanced patterns, such as influence campaigns. These advancements address challenges like handling multi-page documents, noisy data, and imbalanced class distributions, ultimately improving the effectiveness of information organization and analysis across diverse applications.
Papers
February 27, 2024
February 20, 2024
August 24, 2023
August 2, 2022
June 6, 2022
March 15, 2022
December 29, 2021
December 13, 2021