Visually Rich Document
Visually rich documents (VRDs) combine diverse elements such as text, images, tables, and charts, which makes automated information extraction difficult. Current research focuses on robust multimodal models, often built on transformer architectures and graph neural networks, that integrate visual and textual information. Key problems include layout understanding and reading order prediction, both of which affect extraction accuracy and efficiency. The field is central to document understanding across many domains, with applications ranging from scientific literature analysis to business process automation.
Papers
April 28, 2023
April 24, 2023
March 1, 2023
February 6, 2023
December 20, 2022
December 19, 2022
November 15, 2022
October 28, 2022
October 12, 2022
September 18, 2022
July 14, 2022
June 27, 2022
May 23, 2022
May 5, 2022
March 14, 2022
February 3, 2022