Spoken Language Understanding

Spoken Language Understanding (SLU) focuses on enabling computers to comprehend human speech, aiming to extract meaning and intent from spoken dialogue. Current research emphasizes improving the robustness and accuracy of SLU systems, particularly in handling noisy speech, low-resource languages, and out-of-distribution data, often employing large language models (LLMs) and contrastive learning techniques within various architectures like end-to-end models and hybrid approaches combining speech encoders with LLMs. Advances in SLU are crucial for enhancing human-computer interaction in applications such as virtual assistants, improving accessibility for diverse languages, and advancing the broader field of artificial intelligence.

Papers

June 1, 2024

Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning
Keqi Deng, Guangzhi Sun, Philip C. Woodland
Large Language Model Zero Shot Automatic Speech Recognition Spoken Language Understanding Better Zero Prompt Generation

May 31, 2024

Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning
Xuxin Cheng, Wanshi Xu, Zhihong Zhu, Hongxiang Li, Yuexian Zou
Contrastive Learning Spoken Language Understanding Task Oriented Dialogue System Multi Intent Spoken Language Understanding

May 23, 2024

Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding
Suyoung Kim, Jiyeon Hwang, Ho-Young Jung
Automatic Speech Recognition Speech Recognition Spoken Language Understanding Natural Language Understanding Consistency Learning Noisy Channel

May 19, 2024

MSNER: A Multilingual Speech Dataset for Named Entity Recognition
Quentin Meeus, Marie-Francine Moens, Hugo Van hamme
Entity Recognition Named Entity Recognition Spoken Language Understanding Multilingual Speech

May 10, 2024

HC$^2$L: Hybrid and Cooperative Contrastive Learning for Cross-lingual Spoken Language Understanding
Bowen Xing, Ivor W. Tsang
Contrastive Learning Spoken Language Understanding Semantic Alignment Zero Shot Cross Lingual

April 27, 2024

TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality
Tiantian Feng, Xuan Shi, Rahul Gupta, Shrikanth S. Narayanan
Training Data Speech Data Spoken Language Understanding Imputation Task Missingness Resilient

April 16, 2024

Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training
Pavel Denisov, Ngoc Thang Vu
Spoken Language Understanding Multilingual Large Language Model Multilingual Speech Multilingual Encoders Multilingual Speech Representation Training Point

April 3, 2024

Large Language Models for Expansion of Spoken Language Understanding Systems to New Languages
Jakub Hoscilowicz, Pawel Pawlowski, Marcin Skorupa, Marcin Sowański, Artur Janicki
Spoken Language Understanding Prompt Expansion New Language LLM Based Machine Translation

April 1, 2024

Modeling Output-Level Task Relatedness in Multi-Task Learning with Feedback Mechanism
Xiangming Xi, Feng Gao, Jun Xu, Fangtai Guo, Tianlei Jin
Multi Task Learning Spoken Language Understanding Feedback Mechanism Task Relatedness

March 28, 2024

New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark
Nadège Alavoine, Gaëlle Laperriere, Christophe Servan, Sahar Ghannay, Sophie Rosset
Spoken Language Understanding Intent Classification Semantic Task French Census

March 22, 2024

Privacy-Preserving End-to-End Spoken Language Understanding
Yinggui Wang, Wei Huang, Le Yang
Speech Recognition Privacy Preserving Spoken Language Understanding Human Speech End to End Spoken Language

February 28, 2024

A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames
Hongshen Xu, Ruisheng Cao, Su Zhu, Sheng Jiang, Hanchong Zhang, Lu Chen, Kai Yu
Spoken Language Understanding Intent Detection User Utterance Multi Intent Spoken Language Understanding

February 16, 2024

Evaluating and Improving Continual Learning in Spoken Language Understanding
Muqiao Yang, Xiang Li, Umberto Cappellazzo, Shinji Watanabe, Bhiksha Raj
Knowledge Distillation Continual LEArning Continual Learning Spoken Language Understanding Continual Learning Algorithm

February 12, 2024

The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese
Ajinkya Kulkarni, Anna Tokareva, Rameez Qureshi, Miguel Couceiro
Automatic Speech Recognition Spoken Language Understanding Automatic Speech Recognition System Balancing Strategy Brazilian Portuguese

February 8, 2024

Integrating Self-supervised Speech Model with Pseudo Word-level Targets from Visually-grounded Speech Model
Hung-Chieh Fang, Nai-Xuan Ye, Yi-Jen Shih, Puyuan Peng, Hsuan-Fu Wang, Layne Berry, Hung-yi Lee, David Harwath
Spoken Language Understanding Speech Model Self Supervised Speech Model First Integral Visually Grounded Target Word Speech Text Data Pseudo Target

February 6, 2024

Pro-HAN: A Heterogeneous Graph Attention Network for Profile-Based Spoken Language Understanding
Dechuan Teng, Chunlin Lu, Xiao Xu, Wanxiang Che, Libo Qin
Knowledge Graph MAESTRO Dataset Graph Attention Network Spoken Language Understanding Multi Source

January 5, 2024

Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks
Kevin Everson, Yile Gu, Huck Yang, Prashanth Gurunath Shivakumar, Guan-Ting Lin, Jari Kolehmainen, Ivan Bulyko, Ankur Gandhe, Shalini Ghosh, Wael Hamza, Hung-yi Lee, Ariya Rastrow, Andreas Stolcke
Large Language Model Context Learning Speech Recognition Spoken Language Understanding Natural Language Understanding

December 25, 2023

Compositional Generalization in Spoken Language Understanding
Avik Ray, Yilin Shen, Hongxia Jin
Compositional Generalization Spoken Language Understanding Compositional Problem

December 11, 2023

Creating Spoken Dialog Systems in Ultra-Low Resourced Settings
Moayad Elamin, Muhammad Omer, Yonas Chanie, Henslaac Ndlovu
Automatic Speech Recognition Dialogue System Spoken Language Understanding Automatic Speech Recognition System Intent Classification

November 30, 2023

Speech Understanding on Tiny Devices with A Learning Cache
Afsara Benazir, Zhiming Xu, Felix Xiaozhu Lin
Spoken Language Understanding Embedded System Speech Input Speech Benchmark

Spoken Language Understanding

Papers

Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning

Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning

Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding

MSNER: A Multilingual Speech Dataset for Named Entity Recognition

HC$^2$L: Hybrid and Cooperative Contrastive Learning for Cross-lingual Spoken Language Understanding

TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality

Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training

Large Language Models for Expansion of Spoken Language Understanding Systems to New Languages

Modeling Output-Level Task Relatedness in Multi-Task Learning with Feedback Mechanism

New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark

Privacy-Preserving End-to-End Spoken Language Understanding

A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames

Evaluating and Improving Continual Learning in Spoken Language Understanding

The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese

Integrating Self-supervised Speech Model with Pseudo Word-level Targets from Visually-grounded Speech Model

Pro-HAN: A Heterogeneous Graph Attention Network for Profile-Based Spoken Language Understanding

Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks

Compositional Generalization in Spoken Language Understanding

Creating Spoken Dialog Systems in Ultra-Low Resourced Settings

Speech Understanding on Tiny Devices with A Learning Cache