Rich Document

Rich document understanding (RDU) focuses on automatically extracting information from complex documents containing text, images, tables, and layouts. Current research emphasizes developing robust models, often incorporating graph neural networks or attention mechanisms, to handle diverse document structures and multimodal features, and addressing the challenges of limited labeled data through techniques like synthetic data generation and active learning. This field is crucial for automating information extraction from various sources, impacting diverse applications such as business process automation, digital humanities research, and knowledge-based question answering systems.

Papers