Unified Memory
Unified memory research explores efficient and scalable methods for integrating and utilizing memory within various computational systems, primarily focusing on improving the performance and flexibility of deep learning models. Current efforts concentrate on developing parameter-efficient fine-tuning techniques (like LoRA), memory-augmented architectures that leverage retrieval mechanisms and efficient compression strategies, and novel memory management systems for handling large-scale datasets and diverse model architectures. These advancements are crucial for enabling more efficient and effective AI systems, particularly in resource-constrained environments and for applications requiring continual learning or adaptation to new tasks.
Papers
November 12, 2024
November 11, 2024
October 25, 2024
August 23, 2024
March 18, 2024
January 10, 2024
November 6, 2023
August 28, 2023
May 22, 2023
April 24, 2023
September 1, 2022
April 10, 2022