Unlearning Framework
Machine unlearning aims to remove the influence of specific data points from trained machine learning models, addressing privacy concerns and promoting data control. Current research focuses on developing efficient unlearning algorithms for various model architectures, including deep neural networks, large language models, and graph neural networks, often employing techniques like gradient manipulation, parameter editing, and contrastive learning. This field is crucial for ensuring compliance with data privacy regulations and enhancing the trustworthiness and safety of deployed AI systems across diverse applications, from medical diagnosis to personalized recommendations. Challenges remain in balancing effective unlearning with the preservation of model utility and robustness against adversarial attacks.
Papers
LLM Unlearning via Loss Adjustment with Only Forget Data
Yaxuan Wang, Jiaheng Wei, Chris Yuhao Liu, Jinlong Pang, Quan Liu, Ankit Parag Shah, Yujia Bao, Yang Liu, Wei Wei
Edge Unlearning is Not "on Edge"! An Adaptive Exact Unlearning System on Resource-Constrained Devices
Xiaoyu Xia, Ziqi Wang, Ruoxi Sun, Bowen Liu, Ibrahim Khalil, Minhui Xue
Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning
Chongyu Fan, Jiancheng Liu, Licong Lin, Jinghan Jia, Ruiqi Zhang, Song Mei, Sijia Liu
Dissecting Fine-Tuning Unlearning in Large Language Models
Yihuai Hong, Yuelin Zou, Lijie Hu, Ziqian Zeng, Di Wang, Haiqin Yang
Machine Unlearning in Forgettability Sequence
Junjie Chen, Qian Chen, Jian Lou, Xiaoyu Zhang, Kai Wu, Zilong Wang