Linear Probing
Linear probing is a technique used to analyze and understand the internal representations of complex machine learning models, primarily focusing on identifying what information the model has learned and how it's encoded. Current research explores linear probing's application in diverse areas, including assessing copyright infringement in large language models, improving transfer learning via enhanced probing layers (e.g., Kolmogorov-Arnold Networks), and detecting adversarial examples and biases. This methodology offers valuable insights into model interpretability, facilitating the development of more robust, reliable, and ethically sound AI systems across various domains, from natural language processing to medical image analysis.
Papers
October 26, 2023
October 24, 2023
October 5, 2023
June 14, 2023
May 29, 2023
May 2, 2023
April 27, 2023
April 24, 2023
April 20, 2023
March 27, 2023
March 2, 2023
January 27, 2023
December 21, 2022
December 11, 2022
November 17, 2022
October 28, 2022
October 22, 2022
October 21, 2022