Metadata Extraction
Metadata extraction aims to automatically identify and extract key information from diverse data sources, improving searchability and analysis. Current research focuses on developing robust methods using machine learning, including deep learning models like transformers and techniques leveraging layout analysis of documents (e.g., PDFs) and multimodal data integration (combining text and image information). These advancements are crucial for managing the ever-increasing volume of digital data in various fields, from cultural heritage preservation to scientific research, enabling more efficient data discovery and knowledge synthesis.
Papers
November 8, 2024
October 13, 2024
July 9, 2024
November 28, 2023
November 8, 2023
December 5, 2022
September 20, 2022
March 9, 2022
December 23, 2021