Sequence Compression

Sequence compression aims to reduce the computational cost and memory footprint of processing long data sequences, such as those found in speech, video, and reinforcement learning. Current research focuses on efficient compression techniques, including methods inspired by large language model tokenization (such as byte pair encoding) and methods that learn latent representations of continuous-time processes to achieve variable or adaptive compression rates. These advances matter because they let models process long sequences faster and with less memory, improving the scalability of machine learning systems across these domains.
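To make the tokenization-inspired approach concrete, below is a minimal sketch of byte-pair-encoding-style sequence compression: the most frequent adjacent token pair is repeatedly replaced with a new token, shortening the sequence while keeping it exactly recoverable. The function names (bpe_compress, bpe_decompress) and the integer-token representation are illustrative assumptions, not an implementation from any particular paper.

```python
from collections import Counter

def bpe_compress(seq, num_merges):
    """BPE-style compression of an integer token sequence (illustrative sketch).

    Repeatedly merges the most frequent adjacent token pair into a new token,
    shortening the sequence at each step. Returns the compressed sequence and
    the ordered list of merges needed to undo the compression.
    """
    seq = list(seq)
    merges = []                               # ordered (pair, new_token) records
    next_token = max(seq) + 1 if seq else 0   # first unused token id
    for _ in range(num_merges):
        pairs = Counter(zip(seq, seq[1:]))    # counts of adjacent pairs
        if not pairs:
            break
        (a, b), count = pairs.most_common(1)[0]
        if count < 2:
            break                             # no pair repeats; merging saves nothing
        merged, i = [], 0
        while i < len(seq):                   # left-to-right, non-overlapping merge
            if i + 1 < len(seq) and seq[i] == a and seq[i + 1] == b:
                merged.append(next_token)
                i += 2
            else:
                merged.append(seq[i])
                i += 1
        merges.append(((a, b), next_token))
        seq = merged
        next_token += 1
    return seq, merges

def bpe_decompress(seq, merges):
    """Invert the merges in reverse order to recover the original sequence."""
    for (a, b), tok in reversed(merges):
        expanded = []
        for t in seq:
            expanded.extend((a, b) if t == tok else (t,))
        seq = expanded
    return seq

if __name__ == "__main__":
    original = [1, 2, 1, 2, 3, 1, 2, 1, 2]
    compressed, merges = bpe_compress(original, num_merges=4)
    assert bpe_decompress(compressed, merges) == original
    print(original, "->", compressed)        # e.g. [1,2,1,2,3,1,2,1,2] -> [5, 3, 5]
```

The same greedy merge loop underlies LLM tokenizers; applied to discretized audio, video, or action tokens, it trades a small learned vocabulary of merges for shorter sequences at the model's input.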

Papers