Compound Token
Compound tokens represent a novel approach in various machine learning domains, aiming to improve efficiency and performance by grouping related sub-tokens into single units. Current research focuses on optimizing their use within transformer architectures, exploring methods like autoregressive decoding and dynamic compute allocation to enhance model capabilities while mitigating computational costs. This approach shows promise in improving the efficiency and performance of large language models, vision-language models, and other sequence-based tasks, leading to advancements in areas such as text generation, image synthesis, and human pose estimation.
Papers
October 16, 2024
August 2, 2024
June 12, 2024
May 20, 2024
April 30, 2024
April 2, 2024
March 17, 2024
December 6, 2023
October 25, 2023
October 15, 2023
June 26, 2023
May 25, 2023
March 21, 2023
March 1, 2023
December 2, 2022
July 14, 2022