Catastrophic Forgetting
Catastrophic forgetting describes the phenomenon where artificial neural networks, upon learning new tasks, lose previously acquired knowledge. Current research focuses on mitigating this issue through various strategies, including parameter-efficient fine-tuning methods (like LoRA), generative model-based data replay, and novel optimization algorithms that constrain gradient updates or leverage hierarchical task structures. Addressing catastrophic forgetting is crucial for developing robust and adaptable AI systems capable of continuous learning in real-world applications, particularly in domains like medical imaging, robotics, and natural language processing where data streams are constantly evolving.
Papers
Alopex: A Computational Framework for Enabling On-Device Function Calls with LLMs
Yide Ran, Zhaozhuo Xu, Yuhang Yao, Zijian Hu, Shanshan Han, Han Jin, Alay Dilipbhai Shah, Jipeng Zhang, Dimitris Stripelis, Tong Zhang, Salman Avestimehr, Chaoyang He
Gradient Localization Improves Lifelong Pretraining of Language Models
Jared Fernandez, Yonatan Bisk, Emma Strubell
Reducing catastrophic forgetting of incremental learning in the absence of rehearsal memory with task-specific token
Young Jo Choi, Min Kyoon Yoo, Yu Rang Park
Exploring the Stability Gap in Continual Learning: The Role of the Classification Head
Wojciech Łapacz, Daniel Marczak, Filip Szatkowski, Tomasz Trzciński
SAFE: Slow and Fast Parameter-Efficient Tuning for Continual Learning with Pre-Trained Models
Linglan Zhao, Xuerui Zhang, Ke Yan, Shouhong Ding, Weiran Huang
Masked Autoencoders are Parameter-Efficient Federated Continual Learners
Yuchen He, Xiangfeng Wang
FPPL: An Efficient and Non-IID Robust Federated Continual Learning Framework
Yuchen He, Chuyun Shen, Xiangfeng Wang, Bo Jin
UFT: Unifying Fine-Tuning of SFT and RLHF/DPO/UNA through a Generalized Implicit Reward Function
Zhichao Wang, Bin Bi, Zixu Zhu, Xiangbo Mao, Jun Wang, Shiyu Wang
Task Confusion and Catastrophic Forgetting in Class-Incremental Learning: A Mathematical Framework for Discriminative and Generative Modelings
Milad Khademi Nori, Il-Min Kim