Concept Erasure
Concept erasure removes specific information (concepts) from machine learning models, particularly large language models and image-generating diffusion models, while preserving overall model functionality. Current research emphasizes efficient and robust algorithms, such as those based on low-rank updates, weight pruning, or closed-form solutions, that aim to remove a target concept thoroughly without significantly impairing performance on unrelated inputs. The field addresses ethical and legal concerns such as bias mitigation, privacy protection (for example, GDPR compliance), and the prevention of harmful content generation, and so bears on both the responsible development of AI and its practical applications.
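To make the idea of a closed-form, low-rank edit concrete, here is a minimal sketch on toy data. It is not the method of any particular paper. It assumes a concept is encoded linearly in a set of feature vectors, estimates a single concept direction as the difference between two class means, and removes it with a rank-one projection. The function names `concept_direction` and `erase_direction`, and the synthetic data, are illustrative assumptions.

```python
import numpy as np

def concept_direction(X, y):
    """Unit vector pointing from the class-0 mean to the class-1 mean."""
    d = X[y == 1].mean(axis=0) - X[y == 0].mean(axis=0)
    return d / np.linalg.norm(d)

def erase_direction(X, d):
    """Project every row of X onto the subspace orthogonal to unit vector d."""
    # P = I - d d^T is a rank-one update of the identity, computed in closed form.
    P = np.eye(d.shape[0]) - np.outer(d, d)
    return X @ P

# Toy data (assumption): 8-dimensional features where the binary concept
# label y shifts the first coordinate.
rng = np.random.default_rng(0)
n, dim = 200, 8
y = rng.integers(0, 2, size=n)
X = rng.normal(size=(n, dim))
X[:, 0] += 3.0 * y

d = concept_direction(X, y)
X_erased = erase_direction(X, d)

# Distance between the two class means before and after erasure.
gap_before = np.linalg.norm(X[y == 1].mean(axis=0) - X[y == 0].mean(axis=0))
gap_after = np.linalg.norm(X_erased[y == 1].mean(axis=0) - X_erased[y == 0].mean(axis=0))
print(f"class-mean gap before: {gap_before:.3f}, after: {gap_after:.2e}")
```

After the projection, the two class means coincide up to floating-point error, while components orthogonal to the concept direction are left unchanged. This only removes the mean difference along one direction. Erasing concepts from the weights of a language or diffusion model, as the approaches named above do, involves considerably more than this sketch shows.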