Data Summarization

Data summarization aims to create concise, representative subsets from large datasets, improving interpretability and facilitating analysis. Current research focuses on developing efficient algorithms, including those leveraging large language models for diverse data types (text, tabular, time series) and submodular optimization for non-monotone functions, to generate diverse and informative summaries. These advancements are impacting various fields, enabling more effective data exploration and analysis for both technical and non-technical users in applications ranging from fraud detection to time series forecasting. The development of guided exploration methods further enhances the utility of data summarization by allowing for iterative refinement of summaries based on user feedback.

Papers