Vec Tok Codec
VecTok codecs are a class of neural audio and visual codecs that aim to improve compression efficiency and reconstruction quality, primarily for speech and image processing with large language models (LLMs). Current research emphasizes codecs that preserve semantic content through compression and decompression, often using vector quantization, residual vector quantization, and diffusion models within architectures such as VQ-VAEs and transformer networks. These advances aim to improve the speed and quality of tasks such as speech synthesis, voice conversion, and image generation with LLMs, with applications ranging from communication technology to multimedia analysis.
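To make the residual vector quantization idea concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the method of any particular codec: the function names (`nearest`, `rvq_encode`, `rvq_decode`) and the tiny hand-written codebooks are hypothetical, and real systems learn their codebooks during training. Each stage quantizes whatever the previous stages failed to capture, so a vector is represented by one integer index per stage.

```python
def nearest(codebook, vec):
    """Return the index of the codebook entry closest to vec (squared L2 distance)."""
    best_idx, best_dist = 0, float("inf")
    for idx, entry in enumerate(codebook):
        dist = sum((a - b) ** 2 for a, b in zip(entry, vec))
        if dist < best_dist:
            best_idx, best_dist = idx, dist
    return best_idx


def rvq_encode(codebooks, vec):
    """Greedy residual VQ: each stage quantizes the residual left by earlier stages."""
    residual = list(vec)
    codes = []
    for codebook in codebooks:
        idx = nearest(codebook, residual)
        codes.append(idx)
        # Subtract the chosen entry; the next stage sees only what is left over.
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return codes


def rvq_decode(codebooks, codes):
    """Reconstruct a vector by summing the selected entry from each stage."""
    out = [0.0] * len(codebooks[0][0])
    for codebook, idx in zip(codebooks, codes):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out


# Hypothetical two-stage codebooks (assumption: learned in a real codec).
# The second stage uses smaller entries, so it refines the first stage's choice.
codebooks = [
    [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]],
    [[0.0, 0.0], [0.5, 0.0], [0.0, 0.5]],
]

codes = rvq_encode(codebooks, [1.0, 0.5])
print(codes)                         # [1, 2]
print(rvq_decode(codebooks, codes))  # [1.0, 0.5]
```

Plain vector quantization corresponds to the single-stage case, where `codebooks` holds one codebook. Including a zero entry in every stage, as in this sketch, guarantees that adding a stage never increases the reconstruction error, because a stage can always select the zero vector and leave the residual unchanged.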