Discrete Token
Discrete tokens represent a crucial advancement in various machine learning domains, aiming to improve efficiency and performance by converting continuous data (like images, speech, or trajectories) into discrete symbolic representations. Current research focuses on applying discrete tokenization within self-supervised learning frameworks, particularly using autoregressive models and contrastive learning, to enhance tasks such as speech recognition, image generation, and multimodal understanding. This approach offers benefits including faster processing, reduced data storage needs, and improved model generalization, impacting fields ranging from natural language processing and computer vision to robotics and autonomous driving.
Papers
November 29, 2024
November 21, 2024
November 13, 2024
November 7, 2024
October 31, 2024
September 13, 2024
September 6, 2024
July 12, 2024
June 19, 2024
May 27, 2024
January 30, 2024
December 26, 2023
December 7, 2023
December 1, 2023
November 28, 2023
September 14, 2023
March 27, 2023
March 8, 2023
March 4, 2023