Many Sparse
Many Sparse research focuses on developing efficient methods for handling sparse data and models, primarily aiming to reduce computational costs and memory consumption while maintaining or improving performance. Current efforts concentrate on sparse neural network architectures (including Mixture-of-Experts models and various pruning techniques), sparse attention mechanisms in transformers, and sparse representations for various data types (e.g., point clouds, images). This work is significant for advancing machine learning applications in resource-constrained environments and enabling the scaling of large models to previously intractable sizes and complexities.
Papers
September 16, 2023
September 15, 2023
September 8, 2023
September 6, 2023
September 5, 2023
August 27, 2023
August 13, 2023
August 7, 2023
August 3, 2023
August 2, 2023
July 27, 2023
July 25, 2023
July 22, 2023
July 21, 2023
July 19, 2023
July 18, 2023