Challenge Task
Challenge tasks in computer vision, audio processing, and natural language processing drive advancements by focusing research efforts on specific, well-defined problems. Current research emphasizes developing robust and efficient models, often employing deep learning architectures like transformers, convolutional neural networks, and variational autoencoders, to improve performance metrics such as accuracy, efficiency, and generalization across diverse datasets and conditions. These challenges yield valuable benchmark datasets and innovative solutions with significant implications for various applications, including medical imaging, video enhancement, speech technology, and AI safety.
Papers
L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment
Eric Guizzo, Christian Marinoni, Marco Pennese, Xinlei Ren, Xiguang Zheng, Chen Zhang, Bruno Masiero, Aurelio Uncini, Danilo Comminiello
The PCG-AIID System for L3DAS22 Challenge: MIMO and MISO convolutional recurrent Network for Multi Channel Speech Enhancement and Speech Recognition
Jingdong Li, Yuanyuan Zhu, Dawei Luo, Yun Liu, Guohui Cui, Zhaoxia Li