Recognition Rate
Recognition rate, the accuracy of correctly identifying objects or patterns, is a central theme across diverse fields, from biometric security to image analysis. Current research focuses on improving recognition rates through advanced deep learning architectures like Convolutional Neural Networks (CNNs), Vision Transformers (ViTs), and recurrent models, often incorporating techniques like transfer learning, multi-modal fusion, and generative models to enhance performance, particularly in challenging scenarios such as low-resolution images or noisy data. These advancements have significant implications for various applications, including automated surveillance, medical diagnosis, and human-computer interaction, by enabling more reliable and efficient systems.
Papers
Design of Recognition and Evaluation System for Table Tennis Players' Motor Skills Based on Artificial Intelligence
Zhuo-yong Shi, Ye-tao Jia, Ke-xin Zhang, Ding-han Wang, Long-meng Ji, Yong Wu
Recognition of Heat-Induced Food State Changes by Time-Series Use of Vision-Language Model for Cooking Robot
Naoaki Kanazawa, Kento Kawaharazuka, Yoshiki Obinata, Kei Okada, Masayuki Inaba
LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices
Joerg Schmalenstroeer, Tobias Gburrek, Reinhold Haeb-Umbach
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition
Hakan Erdogan, Scott Wisdom, Xuankai Chang, Zalán Borsos, Marco Tagliasacchi, Neil Zeghidour, John R. Hershey
Using Text Injection to Improve Recognition of Personal Identifiers in Speech
Yochai Blau, Rohan Agrawal, Lior Madmony, Gary Wang, Andrew Rosenberg, Zhehuai Chen, Zorik Gekhman, Genady Beryozkin, Parisa Haghani, Bhuvana Ramabhadran
PGT-Net: Progressive Guided Multi-task Neural Network for Small-area Wet Fingerprint Denoising and Recognition
Yu-Ting Li, Ching-Te Chiu, An-Ting Hsieh, Mao-Hsiu Hsu, Long Wenyong, Jui-Min Hsu