Concept Erasure
Concept erasure focuses on removing specific information (concepts) from machine learning models, particularly large language models and diffusion models for image generation, while preserving overall model functionality. Current research emphasizes developing efficient and robust algorithms, such as those based on low-rank updates, weight pruning, or closed-form solutions, to achieve complete concept removal without significantly impairing model performance. This field is crucial for addressing ethical concerns like bias mitigation, privacy protection (GDPR compliance), and the prevention of harmful content generation, impacting both the responsible development of AI and its practical applications.
Papers
October 11, 2024
October 10, 2024
October 3, 2024
September 22, 2024
September 1, 2024
July 17, 2024
May 29, 2024
May 24, 2024
May 12, 2024
April 4, 2024
March 10, 2024
February 3, 2024
January 24, 2024
December 20, 2023
November 30, 2023
November 29, 2023
October 23, 2023
June 6, 2023