Document Image

Document image analysis focuses on automatically understanding and extracting information from scanned or photographed documents, with the aim of bridging physical and digital formats. Current research emphasizes efficient and accurate models for three main tasks: layout analysis (using transformer networks and hybrid approaches), optical character recognition (OCR), and information extraction (leveraging multimodal learning and techniques such as masked autoencoders). These advances matter for improving access to digitized archives, automating document processing in industries such as finance and healthcare, and enabling applications such as historical document analysis and forensic document examination.

Papers