Keyword Spotting
Keyword spotting (KWS) focuses on efficiently and accurately detecting predefined words within continuous audio streams, a crucial component in voice-activated devices and other applications. Current research emphasizes improving KWS robustness in noisy environments and resource-constrained settings, exploring techniques like contrastive learning, multi-task learning, and novel architectures such as Transformers and Spiking Neural Networks, often incorporating attention mechanisms and efficient feature extraction methods. These advancements aim to enhance accuracy, reduce latency and energy consumption, and enable personalized and multilingual KWS capabilities, impacting fields ranging from voice assistants to aviation safety.
Papers
Multitaper mel-spectrograms for keyword spotting
Douglas Baptista de Souza, Khaled Jamal Bakri, Fernanda Ferreira, Juliana Inacio
Written Term Detection Improves Spoken Term Detection
Bolaji Yusuf, Murat SaraƧlar
Few-Shot Keyword Spotting from Mixed Speech
Junming Yuan, Ying Shi, LanTian Li, Dong Wang, Askar Hamdulla