Paper ID: 2210.14174

Towards Interpretable Summary Evaluation via Allocation of Contextual Embeddings to Reference Text Topics

Ben Schaper, Christopher Lohse, Marcell Streile, Andrea Giovannini, Richard Osuala

Despite extensive recent advances in summary generation models, evaluation of auto-generated summaries still widely relies on single-score systems insufficient for transparent assessment and in-depth qualitative analysis. Towards bridging this gap, we propose the multifaceted interpretable summary evaluation method (MISEM), which is based on allocation of a summary's contextual token embeddings to semantic topics identified in the reference text. We further contribute an interpretability toolbox for automated summary evaluation and interactive visual analysis of summary scoring, topic identification, and token-topic allocation. MISEM achieves a promising .404 Pearson correlation with human judgment on the TAC'08 dataset.

Submitted: Oct 25, 2022

Topics

Evaluation Metric
Higher Quality Reference
Contextual Embeddings
Summarization Quality
Capital Allocation
Generated Summary
Interpretability Tool

Links

arXiv PDF