Google Speech Commands

Research on Google Speech Commands focuses on keyword spotting: identifying spoken keywords in audio efficiently and accurately, a capability that applications such as voice assistants depend on. Current work concentrates on improving model robustness (e.g., against noise and adversarial attacks) and on reducing resource requirements (computation, memory, energy) through techniques such as contrastive learning, synthesized data augmentation, and efficient architectures (e.g., transformers, spiking neural networks, and compact CNNs). These advances matter for deploying accurate, low-power keyword spotting systems on resource-constrained devices, where they improve both model efficiency and the user experience of voice-activated technologies.
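As one illustration of what testing robustness against noise can involve, the sketch below mixes a noise signal into a clip at a chosen signal-to-noise ratio (SNR). It is not taken from any of the papers listed here. The function names `rms` and `mix_at_snr`, the 16 kHz sample rate, and the synthetic tone that stands in for a spoken keyword are all assumptions made for the example.

```python
import math
import random


def rms(samples):
    """Root-mean-square amplitude of a sequence of float samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def mix_at_snr(speech, noise, snr_db):
    """Return speech plus noise, with the noise scaled to the requested SNR in dB."""
    if len(speech) != len(noise):
        raise ValueError("speech and noise must have the same length")
    noise_rms = rms(noise)
    if noise_rms == 0.0:
        return list(speech)  # silent noise: nothing to add
    # SNR_dB = 20 * log10(rms_speech / rms_scaled_noise), solved for the gain.
    gain = rms(speech) / (noise_rms * 10 ** (snr_db / 20.0))
    return [s + gain * n for s, n in zip(speech, noise)]


# Hypothetical one-second clip at an assumed 16 kHz sample rate.
# A 440 Hz tone stands in for a spoken keyword; the noise is white Gaussian.
SAMPLE_RATE = 16000
rng = random.Random(0)
speech = [0.5 * math.sin(2 * math.pi * 440 * t / SAMPLE_RATE) for t in range(SAMPLE_RATE)]
noise = [rng.gauss(0.0, 1.0) for _ in range(SAMPLE_RATE)]

noisy = mix_at_snr(speech, noise, snr_db=10.0)

# Recover the added noise and check that the achieved SNR matches the request.
added = [n - s for n, s in zip(noisy, speech)]
measured_snr_db = 20 * math.log10(rms(speech) / rms(added))
```

Evaluating a keyword spotter on clips mixed at several SNR levels is one simple way to see how its accuracy degrades as conditions worsen. Real evaluations would typically use recorded background noise rather than white noise.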

Papers