News Dataset
News datasets are collections of news articles used to train and evaluate machine learning models for various tasks, including recommendation, summarization, and misinformation detection. Current research focuses on improving model performance through ensemble methods, addressing issues like news avoidance in recommender systems, and enhancing dataset quality via techniques like LLM-based data cleansing and careful label definition. These advancements are crucial for improving the accuracy and reliability of news-related applications, impacting fields ranging from social science research to financial market forecasting and risk management.
Papers
March 5, 2023
February 25, 2023
December 22, 2022
MN-DS: A Multilabeled News Dataset for News Articles Hierarchical Classification
Alina Petukhova, Nuno Fachada
Multilingual News Location Detection using an Entity-Based Siamese Network with Semi-Supervised Contrastive Learning and Knowledge Base
Víctor Suárez-Paniagua, Steven Derby, Tri Kurniawan Wijaya
November 13, 2022
October 19, 2022
October 17, 2022
September 23, 2022
September 17, 2022
August 11, 2022
March 10, 2022
January 7, 2022
November 22, 2021