Document Layout Analysis
Document layout analysis (DLA) aims to automatically understand the structure of documents by identifying and classifying different regions (e.g., text, images, tables) and their relationships. Current research emphasizes improving model accuracy and robustness using various architectures, including transformers, graph neural networks, and object detection models like Mask R-CNN and YOLOv5, often incorporating techniques like knowledge distillation and self-supervised learning to address data scarcity. Advances in DLA are crucial for enabling efficient information extraction, document understanding, and accessibility, impacting fields ranging from digital humanities to automated document processing in various industries.
Papers
December 17, 2024
October 16, 2024
June 12, 2024
June 10, 2024
May 20, 2024
April 27, 2024
April 15, 2024
March 21, 2024
February 8, 2024
January 30, 2024
January 22, 2024
January 16, 2024
December 15, 2023
December 14, 2023
October 2, 2023
September 29, 2023
August 31, 2023
August 29, 2023