Paper ID: 2311.17771

Supervising the Centroid Baseline for Extractive Multi-Document Summarization

Simão Gonçalves, Gonçalo Correia, Diogo Pernes, Afonso Mendes

The centroid method is a simple approach for extractive multi-document summarization and many improvements to its pipeline have been proposed. We further refine it by adding a beam search process to the sentence selection and also a centroid estimation attention model that leads to improved results. We demonstrate this in several multi-document summarization datasets, including in a multilingual scenario.

Submitted: Nov 29, 2023