Key Value Memory
Key-value memory (KVM) is a rapidly developing area focusing on efficient data storage and retrieval within neural networks, particularly transformers. Current research emphasizes optimizing KVM architectures for reduced memory footprint and improved inference speed, exploring techniques like L2 norm-based compression and dynamic memory allocation, often within the context of large language models. These advancements are crucial for deploying large models on resource-constrained devices and improving the efficiency of various NLP and computer vision tasks, addressing the limitations of traditional memory-intensive approaches.
Papers
September 24, 2024
June 17, 2024
June 4, 2024
March 14, 2024
February 19, 2024
January 10, 2024
November 28, 2023
October 24, 2023
August 17, 2023
February 13, 2023
January 3, 2023
October 30, 2022
July 31, 2022
July 22, 2022
March 23, 2022