Text Detection
Text detection, the task of automatically locating textual regions within images and videos, aims to improve machine understanding of visual data containing text. Current research focuses on enhancing robustness to diverse backgrounds, artistic styles, and varying text granularities, employing deep learning architectures like transformers and convolutional neural networks, often combined with techniques like attention mechanisms and feature fusion. These advancements are crucial for applications ranging from automated document processing and scene understanding to combating misinformation spread through AI-generated text detection, impacting fields like computer vision, natural language processing, and information security.
Papers
IndicSTR12: A Dataset for Indic Scene Text Recognition
Harsh Lunia, Ajoy Mondal, C V Jawahar
GPT-generated Text Detection: Benchmark Dataset and Tensor-based Detection Method
Zubair Qazi, William Shiao, Evangelos E. Papalexakis
MENTOR: Multilingual tExt detectioN TOward leaRning by analogy
Hsin-Ju Lin, Tsu-Chun Chung, Ching-Chun Hsiao, Pin-Yu Chen, Wei-Chen Chiu, Ching-Chun Huang