Recognition Rate
Recognition rate, the accuracy of correctly identifying objects or patterns, is a central theme across diverse fields, from biometric security to image analysis. Current research focuses on improving recognition rates through advanced deep learning architectures like Convolutional Neural Networks (CNNs), Vision Transformers (ViTs), and recurrent models, often incorporating techniques like transfer learning, multi-modal fusion, and generative models to enhance performance, particularly in challenging scenarios such as low-resolution images or noisy data. These advancements have significant implications for various applications, including automated surveillance, medical diagnosis, and human-computer interaction, by enabling more reliable and efficient systems.
Papers
Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments
Yicheng Du, Aditya Arie Nugraha, Kouhei Sekiguchi, Yoshiaki Bando, Mathieu Fontaine, Kazuyoshi Yoshii
Single Shot Self-Reliant Scene Text Spotter by Decoupled yet Collaborative Detection and Recognition
Jingjing Wu, Pengyuan Lyu, Guangming Lu, Chengquan Zhang, Wenjie Pei