Anti-Forgetting
"Anti-forgetting" in machine learning focuses on mitigating the tendency of models, particularly deep neural networks and large language models, to lose previously learned information when acquiring new knowledge. Current research emphasizes techniques like experience replay, regularization methods, and novel optimization algorithms (e.g., momentum-filtered optimizers) to improve knowledge retention across various tasks and datasets, often within continual learning or machine unlearning frameworks. This field is crucial for developing more robust and adaptable AI systems, impacting areas like robotics, personalized medicine, and natural language processing by enabling lifelong learning and efficient knowledge management.
Papers
LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging
Ke Wang, Nikolaos Dimitriadis, Alessandro Favero, Guillermo Ortiz-Jiménez, François Fleuret, Pascal Frossard
Exploring Forgetting in Large Language Model Pre-Training
Chonghua Liao, Ruobing Xie, Xingwu Sun, Haowen Sun, Zhanhui Kang