Full Length Document
Research on full-length document understanding focuses on efficiently extracting information and answering questions from complex, often visually rich, documents. Current efforts involve developing multimodal models that integrate text, layout, and image information, employing techniques like large language models (LLMs), attention mechanisms (e.g., shifted window attention), and graph-based representations to capture relationships between entities and temporal information. These advancements aim to improve information retrieval, question answering, and document summarization, impacting fields like scientific literature analysis, business intelligence, and digital archiving.
Papers
November 5, 2024
November 1, 2024
October 30, 2024
October 28, 2024
October 23, 2024
October 4, 2024
July 31, 2024
March 30, 2024
March 15, 2024
March 7, 2024
March 1, 2024
December 1, 2023
November 8, 2023
September 27, 2023
July 18, 2023
May 20, 2023
May 15, 2023
October 8, 2022
September 12, 2022