A Comparison of Time-based Models for Multimodal Emotion Recognition [2306.13076]