Rich Document
Rich document understanding (RDU) focuses on automatically extracting information from complex documents containing text, images, tables, and layouts. Current research emphasizes developing robust models, often incorporating graph neural networks or attention mechanisms, to handle diverse document structures and multimodal features, and addressing the challenges of limited labeled data through techniques like synthetic data generation and active learning. This field is crucial for automating information extraction from various sources, impacting diverse applications such as business process automation, digital humanities research, and knowledge-based question answering systems.
Papers
November 8, 2024
October 16, 2024
October 2, 2024
August 28, 2024
July 9, 2024
May 8, 2024
May 2, 2024
February 20, 2024
February 16, 2024
January 23, 2024
December 15, 2023
December 12, 2023
September 24, 2023
March 1, 2023
November 15, 2022
October 28, 2022
July 14, 2022