Model Representation
Model representation research focuses on understanding and improving how machine learning models encode information, aiming to enhance efficiency, interpretability, and robustness. Current efforts concentrate on analyzing the geometric properties of model representations learned through next-token prediction, mitigating catastrophic forgetting during fine-tuning, and developing methods to efficiently transfer representations between models and datasets. These advancements are crucial for improving the performance, scalability, and trustworthiness of various machine learning applications, particularly in large language models and computer vision.
Papers
October 31, 2024
October 6, 2024
August 27, 2024
June 18, 2024
May 23, 2024
February 6, 2024
August 16, 2023
August 4, 2023
February 16, 2023
December 22, 2022
December 18, 2022
November 2, 2022
July 2, 2022
December 9, 2021