Speech Emotion

Speech emotion recognition (SER) aims to automatically identify the emotions conveyed in spoken language, modeling them either as discrete categories (e.g., happy, sad) or as continuous dimensions (e.g., valence, arousal). Current research emphasizes improving model robustness and generalization across languages and demographics, employing techniques such as self-supervised learning, large language models (LLMs), and various deep learning architectures (e.g., CNNs, Transformers). Advances in SER have significant implications for human-computer interaction, particularly in applications requiring emotional intelligence, such as customer service, mental health monitoring, and personalized education.
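To make the categorical/dimensional distinction concrete, below is a minimal, hypothetical PyTorch sketch (not drawn from any specific paper listed here): a shared acoustic encoder over frame-level features feeds both a categorical emotion classifier and a valence/arousal regressor. The feature dimensions, class set, and layer sizes are illustrative assumptions only.

```python
import torch
import torch.nn as nn

class SERModel(nn.Module):
    """Toy SER model: a shared 1-D CNN encoder over acoustic features,
    with a categorical head (discrete emotion classes) and a
    dimensional head (valence/arousal regression)."""

    def __init__(self, n_features: int = 80, n_emotions: int = 4):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv1d(n_features, 128, kernel_size=5, padding=2),
            nn.ReLU(),
            nn.Conv1d(128, 128, kernel_size=5, padding=2),
            nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),  # mean-pool over the time axis
        )
        self.categorical_head = nn.Linear(128, n_emotions)  # e.g. happy/sad/angry/neutral
        self.dimensional_head = nn.Linear(128, 2)           # valence, arousal

    def forward(self, x: torch.Tensor):
        # x: (batch, n_features, time), e.g. log-mel spectrogram frames
        h = self.encoder(x).squeeze(-1)  # (batch, 128) utterance embedding
        return self.categorical_head(h), self.dimensional_head(h)

# Usage: a batch of 2 utterances, 80 mel bins, 300 frames
model = SERModel()
logits, va = model(torch.randn(2, 80, 300))
print(logits.shape, va.shape)  # torch.Size([2, 4]) torch.Size([2, 2])
```

In practice, the hand-rolled CNN encoder would typically be replaced by a pretrained self-supervised speech model, with the two heads trained jointly on labeled emotion corpora.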

Papers