End-to-End Spoken Language Understanding
End-to-end spoken language understanding (SLU) maps spoken audio directly to semantic representations, bypassing the traditional pipeline of separate automatic speech recognition (ASR) and natural language understanding (NLU) stages. Current research emphasizes robustness to noisy audio and ASR errors, often using transformer-based architectures together with techniques such as knowledge distillation and multi-task learning to improve accuracy and efficiency, particularly in low-resource settings. The field matters for human-computer interaction because it enables more natural and effective voice-controlled interfaces, in applications ranging from virtual assistants to smart home devices.
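As a rough illustration of the knowledge distillation mentioned above, the sketch below computes a combined loss for a single utterance in an intent-classification setting. It assumes a student SLU model that predicts intent logits from audio and a teacher (for example, a text-based NLU model) that supplies logits for the same utterance. The function names, the `alpha` weighting, and the `temperature` value are illustrative assumptions, not taken from any particular paper or library.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    log_sum_exp = m + math.log(sum(math.exp(x - m) for x in scaled))
    return [x - log_sum_exp for x in scaled]


def distillation_loss(student_logits, teacher_logits, label,
                      alpha=0.5, temperature=2.0):
    """Weighted sum of hard-label cross-entropy and soft-label KL divergence.

    alpha and temperature are hypothetical hyperparameters chosen for
    illustration only.
    """
    # Hard-label term: cross-entropy of the student against the true intent.
    ce = -log_softmax(student_logits)[label]

    # Soft-label term: KL(teacher || student) on temperature-softened outputs.
    log_t = log_softmax(teacher_logits, temperature)
    log_s = log_softmax(student_logits, temperature)
    kl = sum(math.exp(lt) * (lt - ls) for lt, ls in zip(log_t, log_s))

    # The temperature**2 factor is the usual scaling that keeps the soft term
    # comparable in magnitude to the hard term.
    return alpha * ce + (1.0 - alpha) * temperature ** 2 * kl


# Example: three hypothetical intents, true intent at index 2.
loss = distillation_loss([0.2, 0.5, 2.0], [0.1, 0.3, 3.0], label=2)
```

When the student's logits match the teacher's, the KL term vanishes and only the cross-entropy term remains, which is the behavior one would expect from this kind of objective.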