OCR Model
Optical Character Recognition (OCR) models aim to automatically extract text from images, a crucial task with broad applications. Current research emphasizes developing more robust and versatile OCR models, including those that integrate vision and language models within a unified framework (e.g., transformer-based architectures) or operate without relying on separate OCR engines. These advancements focus on improving accuracy, efficiency, and adaptability across diverse languages, document types, and image qualities, ultimately enhancing accessibility to information in various digital archives and applications.
Papers
October 17, 2024
October 8, 2024
July 17, 2024
December 5, 2023
August 29, 2023
May 11, 2023
April 5, 2023
November 25, 2022
May 5, 2022