Neuron Attribution
Neuron attribution seeks to identify which neurons in a neural network, biological or artificial, are most responsible for a given output or behavior. Current research develops methods for identifying these influential neurons, examines their roles across tasks and languages (especially in large language models), and designs new neuron architectures and training algorithms to improve model efficiency and interpretability. This work matters for making complex neural systems more explainable and for improving model design, and it may lead to advances in fields such as neuroscience, AI, and secure communication.
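As a rough illustration of what "identifying influential neurons" can mean in practice, the sketch below scores the hidden neurons of a tiny two-layer ReLU network by activation times gradient, one common attribution heuristic among many. The network, its random weights, and the function name `neuron_attributions` are all hypothetical, chosen only for illustration; they are not taken from any particular paper on this topic.

```python
import numpy as np

def neuron_attributions(x, W1, b1, W2, b2, target):
    """Score each hidden neuron as activation * d(out[target]) / d(activation).

    Hypothetical helper for a two-layer ReLU MLP; an assumption for
    illustration, not a method from a specific paper.
    """
    pre = W1 @ x + b1            # hidden pre-activations
    h = np.maximum(pre, 0.0)     # ReLU activations
    out = W2 @ h + b2            # linear readout
    grad_h = W2[target]          # out[target] is linear in h, so its gradient is this row
    return h * grad_h, out

# Toy network with random weights (illustrative values only).
rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(4, 3)), np.zeros(4)
W2, b2 = rng.normal(size=(2, 4)), np.zeros(2)
x = np.array([1.0, -0.5, 2.0])

scores, out = neuron_attributions(x, W1, b1, W2, b2, target=0)
ranking = np.argsort(-np.abs(scores))   # most influential hidden neurons first
print("attribution scores:", scores)
print("neurons ranked by |score|:", ranking)
```

Because the readout layer is linear, the scores in this toy setting sum (together with the output bias) to the target output exactly. Deeper or more nonlinear models generally need other techniques, such as integrated gradients or ablation, to get comparably faithful scores.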