Document Understanding
Document understanding aims to enable computers to comprehend the content and structure of documents, including text, images, and layouts, to extract key information and answer questions. Current research focuses on improving the efficiency and accuracy of multimodal large language models (MLLMs) for this task, often employing techniques like knowledge distillation, synthetic data generation, and efficient visual processing to handle high-resolution and long-context documents. These advancements are significant because they improve information retrieval, automate document processing tasks, and address privacy concerns through techniques like machine unlearning, ultimately impacting various fields from healthcare to finance.
Papers
November 12, 2024
November 2, 2024
October 25, 2024
October 8, 2024
October 4, 2024
September 17, 2024
August 27, 2024
August 8, 2024
July 19, 2024
July 17, 2024
July 9, 2024
July 2, 2024
July 1, 2024
June 27, 2024
June 17, 2024
June 14, 2024
June 12, 2024
May 28, 2024
May 23, 2024