Open Corpus
Open corpora, large collections of publicly available text and other data, are increasingly crucial for advancing various fields of research. Current research focuses on developing and improving these corpora, including creating benchmarks for evaluating multi-object tracking and building models to extract information like character and emotion from narratives or mathematical concepts from scientific texts. This work facilitates advancements in natural language processing, knowledge graph construction, and other areas by providing researchers with standardized, accessible datasets for training and evaluating algorithms, ultimately leading to more robust and reliable models.
Papers
July 22, 2024
July 19, 2024
March 29, 2024
March 21, 2024
January 31, 2024
December 19, 2023
October 30, 2023
August 8, 2023
May 18, 2023
August 29, 2022
June 15, 2022
June 8, 2022