Summarization Datasets
Summarization datasets are crucial for training and evaluating text summarization models, aiming to create resources that accurately reflect the complexities of human summarization. Current research focuses on improving dataset quality through expert curation, addressing issues like factual consistency and hallucination in large language models (LLMs), and exploring diverse data types including multilingual and multimodal resources. These efforts are vital for advancing summarization techniques across various domains and languages, ultimately leading to more accurate, reliable, and efficient summarization systems with practical applications in information retrieval and knowledge synthesis.
Papers
October 24, 2024
March 8, 2024
February 7, 2024
November 15, 2023
May 23, 2023
May 11, 2023
April 5, 2023
February 14, 2023
January 26, 2023
November 9, 2022
October 31, 2022
October 24, 2022
October 23, 2022
October 22, 2022
September 26, 2022
August 1, 2022
May 31, 2022
May 25, 2022
May 4, 2022