Spoken Language Understanding

Spoken Language Understanding (SLU) focuses on enabling computers to comprehend human speech, aiming to extract meaning and intent from spoken dialogue. Current research emphasizes improving the robustness and accuracy of SLU systems, particularly in handling noisy speech, low-resource languages, and out-of-distribution data, often employing large language models (LLMs) and contrastive learning techniques within various architectures like end-to-end models and hybrid approaches combining speech encoders with LLMs. Advances in SLU are crucial for enhancing human-computer interaction in applications such as virtual assistants, improving accessibility for diverse languages, and advancing the broader field of artificial intelligence.

Papers

May 27, 2023

CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training
Linhao Dong, Zhecheng An, Peihao Wu, Jun Zhang, Lu Lu, Zejun Ma
Speech Representation Spoken Language Understanding Text Representation Bridging Text Speech to Text Continual Pre Training Pre Training Paradigm Language Model Distillation

May 23, 2023

Sequence-Level Knowledge Distillation for Class-Incremental End-to-End Spoken Language Understanding
Umberto Cappellazzo, Muqiao Yang, Daniele Falavigna, Alessio Brutti
Knowledge Distillation Continual Learning Sequence to Sequence Spoken Language Understanding End to End Spoken Language Sequence Level Knowledge Distillation Entity Prediction

May 22, 2023

May 20, 2023

Sentence Embedder Guided Utterance Encoder (SEGUE) for Spoken Language Understanding
Yi Xuan Tan, Navonil Majumder, Soujanya Poria
Knowledge Distillation Pre Trained Language Understanding Spoken Language Understanding

May 17, 2023

OpenSLU: A Unified, Modularized, and Extensible Toolkit for Spoken Language Understanding
Libo Qin, Qiguang Chen, Xiao Xu, Yunlong Feng, Wanxiang Che
Spoken Language Understanding Unified Alignment Task Oriented Dialogue System Multi Intent

May 16, 2023

The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation
Mutian He, Philip N. Garner
Language Model Machine Translation Language Understanding Speech Translation Spoken Language Understanding Speech Model Multiple Meaning Professional Sign Language Interpreter

May 9, 2023

Joint Multi-scale Cross-lingual Speaking Style Transfer with Bidirectional Attention Mechanism for Automatic Dubbing
Jingbei Li, Sipan Li, Ping Chen, Luwen Zhang, Yi Meng, Zhiyong Wu, Helen Meng, Qiao Tian, Yuping Wang, Yuxuan Wang
Style Transfer Spoken Language Understanding Automatic Dubbing Bidirectional Attention

May 4, 2023

End-to-end spoken language understanding using joint CTC loss and self-supervised, pretrained acoustic encoders
Jixuan Wang, Martin Radfar, Kai Wei, Clement Chung
Automatic Speech Recognition Language Understanding Spoken Language Understanding Text Embeddings Connectionist Temporal Classification

May 2, 2023

May 1, 2023

Joint Modelling of Spoken Language Understanding Tasks with Integrated Dialog History
Siddhant Arora, Hayato Futami, Emiru Tsunoo, Brian Yan, Shinji Watanabe
Spoken Language Understanding Dialogue Context Joint Modeling Spoken Utterance Speaker Characteristic Spoken Conversation Dialog History

April 21, 2023

Non-autoregressive End-to-end Approaches for Joint Automatic Speech Recognition and Spoken Language Understanding
Mohan Li, Rama Doddipatla
Spoken Language Understanding Connectionist Temporal Classification Bidirectional Encoder Representation Automatic Speech Recognition Hypothesis Speech Recognition Accuracy Joint Audio Non Autoregressive End to End

March 2, 2023

Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding
Yingting Li, Ambuj Mehrish, Shuai Zhao, Rishabh Bhardwaj, Amir Zadeh, Navonil Majumder, Rada Mihalcea, Soujanya Poria
Transfer Learning Spoken Language Understanding Large Pre Trained Model Parameter Efficient Parameter Efficient Transfer Learning

February 28, 2023

Information-Restricted Neural Language Models Reveal Different Brain Regions' Sensitivity to Semantics, Syntax and Context
Alexandre Pasquiou, Yair Lakretz, Bertrand Thirion, Christophe Pallier
Context Information Spoken Language Understanding Neural Language Model Semantics Surfaced Code Syntax Syntactic Information Word Level Brain Region Lexical Processing Semantic Processing

January 25, 2023

Fillers in Spoken Language Understanding: Computational and Psycholinguistic Perspectives
Tanvi Dinkar, Chloé Clavel, Ioana Vasilescu
Automatic Speech Recognition Speech Analysis Spoken Language Understanding Computational Approach Psycholinguistic Research Speech Recording Speech Disfluency Filler Word

January 5, 2023

HIT-SCIR at MMNLU-22: Consistency Regularization for Multilingual Spoken Language Understanding
Bo Zheng, Zhouyang Li, Fuxuan Wei, Qiguang Chen, Libo Qin, Wanxiang Che
Language Understanding Spoken Language Understanding Intent Detection Consistency Regularization

December 21, 2022

Spoken Language Understanding for Conversational AI: Recent Advances and Future Direction
Soyeon Caren Han, Siqu Long, Henry Weld, Josiah Poon
Natural Language Language Understanding Future Direction Recent Advance Spoken Language Understanding Intent Detection Conversational AI Intent Classification

December 20, 2022

SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks
Suwon Shon, Siddhant Arora, Chyi-Jiunn Lin, Ankita Pasad, Felix Wu, Roshan Sharma, Wei-Lun Wu, Hung-Yi Lee, Karen Livescu, Shinji Watanabe
Spoken Language Understanding Benchmark Suite Speech Task

Spoken Language Understanding

Papers

CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training

Sequence-Level Knowledge Distillation for Class-Incremental End-to-End Spoken Language Understanding

Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding

Extrapolating Multilingual Understanding Models as Multilingual Generators

Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization

Sentence Embedder Guided Utterance Encoder (SEGUE) for Spoken Language Understanding

OpenSLU: A Unified, Modularized, and Extensible Toolkit for Spoken Language Understanding

The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation

Joint Multi-scale Cross-lingual Speaking Style Transfer with Bidirectional Attention Mechanism for Automatic Dubbing

End-to-end spoken language understanding using joint CTC loss and self-supervised, pretrained acoustic encoders

A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge

The Pipeline System of ASR and NLU with MLM-based Data Augmentation toward STOP Low-resource Challenge

Joint Modelling of Spoken Language Understanding Tasks with Integrated Dialog History

Non-autoregressive End-to-end Approaches for Joint Automatic Speech Recognition and Spoken Language Understanding

Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding

Information-Restricted Neural Language Models Reveal Different Brain Regions' Sensitivity to Semantics, Syntax and Context

Fillers in Spoken Language Understanding: Computational and Psycholinguistic Perspectives

HIT-SCIR at MMNLU-22: Consistency Regularization for Multilingual Spoken Language Understanding

Spoken Language Understanding for Conversational AI: Recent Advances and Future Direction

SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks