Speech Emotion Recognition

Speech emotion recognition (SER) aims to automatically identify human emotions from speech, primarily focusing on improving accuracy and robustness across diverse languages and contexts. Current research emphasizes leveraging self-supervised learning models, particularly transformer-based architectures, and exploring techniques like cross-lingual adaptation, multi-modal fusion (combining speech with text or visual data), and efficient model compression for resource-constrained environments. Advances in SER have significant implications for various applications, including mental health monitoring, human-computer interaction, and personalized healthcare, by enabling more natural and empathetic interactions between humans and machines.

Papers

September 15, 2023

Foundation Model Assisted Automatic Speech Emotion Recognition: Transcribing, Annotating, and Augmenting
Tiantian Feng, Shrikanth Narayanan
Speech Emotion Recognition Speech Data Annotation Rather Emotional Speech Unlabeled Speech Coherent Voice Transcription

September 9, 2023

Speech Emotion Recognition with Distilled Prosodic and Linguistic Affect Representations
Debaditya Shome, Ali Etemad
Speech Emotion Recognition Prosodic Feature Target Emotion Cross Modal Knowledge Distillation

September 7, 2023

LanSER: Language-Model Supported Speech Emotion Recognition
Taesik Gong, Josh Belanich, Krishna Somandepalli, Arsha Nagrani, Brian Eoff, Brendan Jou
Speech Emotion Recognition Weak Supervision Weakly Supervised Learning Emotion Label Large Scale Speech

September 5, 2023

Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion Recognition
Minh Tran, Yufeng Yin, Mohammad Soleymani
Pre Trained Emotion Recognition Speech Emotion Recognition Speaker Embeddings Robust Speaker Representation Personalized Adaptation

September 3, 2023

Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement
Yu-Wen Chen, Julia Hirschberg, Yu Tsao
Speech Enhancement Industrial Disturbing Noise Speech Emotion Recognition Speech Signal Signal to Noise Ratio

August 31, 2023

Supervised Contrastive Learning with Nearest Neighbor Search for Speech Emotion Recognition
Xuechen Wang, Shiwan Zhao, Yong Qin
Contrastive Learning Loss Function Speech Emotion Recognition Cross Entropy Loss Nearest Neighbor Supervised Contrastive Loss

August 28, 2023

August 17, 2023

Decoding Emotions: A comprehensive Multilingual Study of Speech Models for Speech Emotion Recognition
Anant Singh, Akshat Gupta
Speech Emotion Recognition Speech Representation Speech Model Cross Linguistic Emotion Understanding

August 8, 2023

MSAC: Multiple Speech Attribute Control Method for Reliable Speech Emotion Recognition
Yu Pan, Yuguang Yang, Yuheng Huang, Jixun Yao, Jingjing Yin, Yanni Hu, Heng Lu, Lei Ma, Jianjun Zhao
Speech Emotion Recognition Speech Emotion Fine Grained Emotion Emotion Attribute

August 6, 2023

"We care": Improving Code Mixed Speech Emotion Recognition in Customer-Care Conversations
N V S Abhishek, Pushpak Bhattacharyya
Emotion Recognition Speech Emotion Recognition Customer Service Emotional Speech Speech Emotion

August 4, 2023

Capturing Spectral and Long-term Contextual Information for Speech Emotion Recognition Using Deep Learning Techniques
Samiul Islam, Md. Maksudul Haque, Abu Jobayer Md. Sadat
Deep Learning Emotion Recognition Multimodal Data Speech Emotion Recognition Long Range Context

July 21, 2023

A Change of Heart: Improving Speech Emotion Recognition through Speech-to-Text Modality Conversion
Zeinab Sadat Taghavi, Ali Satvaty, Hossein Sameti
Emotion Recognition Speech Emotion Recognition Speech to Text Pre Change Information Human Heart Modality Conversion

July 20, 2023

July 12, 2023

Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers
Siddique Latif, Muhammad Usama, Mohammad Ibrahim Malik, Björn W. Schuller
Speech Emotion Recognition Speech Data New Frontier Emotional Speech

July 6, 2023

Evaluating raw waveforms with deep learning frameworks for speech emotion recognition
Zeynep Hilal Kilimci, Ulku Bayraktar, Ayhan Kucukmanisa
New Framework Speech Emotion Recognition Speech Processing Speech Signal Audio Classification Waveform Domain Mel Frequency Cepstral Coefficient

June 30, 2023

Empirical Interpretation of the Relationship Between Speech Acoustic Context and Emotion Recognition
Anna Ollerenshaw, Md Asif Jalal, Rosanna Milner, Thomas Hain
Emotion Recognition Speech Emotion Recognition Human Relationship Empirical Analysis Speech Emotion Acoustic Context

June 26, 2023

Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition
Samuel Cahyawijaya, Holy Lovenia, Willy Chung, Rita Frieske, Zihan Liu, Pascale Fung
Speech Emotion Recognition Cross Lingual Speech Emotion Cross Lingual Transferability

June 23, 2023

Cross-Language Speech Emotion Recognition Using Multimodal Dual Attention Transformers
Syed Aun Muhammad Zaidi, Siddique Latif, Junaid Qadir
Emotion Recognition Speech Emotion Recognition Cross Modality Dual Attention

Speech Emotion Recognition

Papers

Foundation Model Assisted Automatic Speech Emotion Recognition: Transcribing, Annotating, and Augmenting

Speech Emotion Recognition with Distilled Prosodic and Linguistic Affect Representations

LanSER: Language-Model Supported Speech Emotion Recognition

Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion Recognition

Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement

Supervised Contrastive Learning with Nearest Neighbor Search for Speech Emotion Recognition

Multiscale Contextual Learning for Speech Emotion Recognition in Emergency Call Center Conversations

Time-Frequency Transformer: A Novel Time Frequency Joint Learning Method for Speech Emotion Recognition

Decoding Emotions: A comprehensive Multilingual Study of Speech Models for Speech Emotion Recognition

MSAC: Multiple Speech Attribute Control Method for Reliable Speech Emotion Recognition

"We care": Improving Code Mixed Speech Emotion Recognition in Customer-Care Conversations

Capturing Spectral and Long-term Contextual Information for Speech Emotion Recognition Using Deep Learning Techniques

A Change of Heart: Improving Speech Emotion Recognition through Speech-to-Text Modality Conversion

Cross-Corpus Multilingual Speech Emotion Recognition: Amharic vs. Other Languages

Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition

Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers

Evaluating raw waveforms with deep learning frameworks for speech emotion recognition

Empirical Interpretation of the Relationship Between Speech Acoustic Context and Emotion Recognition

Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition

Cross-Language Speech Emotion Recognition Using Multimodal Dual Attention Transformers