Text Clustering
Text clustering aims to automatically group similar text documents based on their content, facilitating efficient organization and analysis of large datasets where manual labeling is impractical. Current research emphasizes leveraging large language models (LLMs) for improved embedding generation and cluster interpretation, exploring both unsupervised and supervised approaches, and incorporating techniques like contrastive learning and attention mechanisms to enhance performance. These advancements are improving the accuracy and efficiency of text clustering, with applications ranging from data augmentation in legal contexts to improved information retrieval and resource recommendation in digital libraries.
Papers
January 3, 2023
December 19, 2022
October 31, 2022
August 2, 2022
August 1, 2022
January 8, 2022
December 16, 2021
December 15, 2021