Discrete Speech Representation
Discrete speech representation focuses on encoding speech signals into discrete units, aiming to improve efficiency and performance in various speech processing tasks. Current research emphasizes the development and application of these representations within transformer-based models, including autoregressive and non-autoregressive architectures, often leveraging self-supervised learning techniques and incorporating techniques like quantization and hierarchical structures. This approach shows promise for enhancing automatic speech recognition, speech synthesis, voice conversion, and cross-lingual speech processing, particularly in low-resource scenarios and for improving robustness to noise and variations in speech.
Papers
September 13, 2024
September 12, 2024
July 16, 2024
July 3, 2024
June 3, 2024
August 21, 2023
July 31, 2023
July 28, 2023
February 7, 2023
October 31, 2022
October 29, 2022
September 30, 2022
June 25, 2022
March 29, 2022
February 5, 2022
January 14, 2022