Leverage Score

Leverage scores are a crucial concept in various fields, including machine learning and statistics, used to identify influential data points or features within a dataset. Current research focuses on improving the efficiency of leverage score computation, particularly for large datasets and streaming data, employing techniques like approximate Newton methods and randomized linear algebra. These advancements are significant for enhancing the accuracy and speed of algorithms in regression analysis, matrix computations, and other applications, ultimately impacting the scalability and performance of machine learning models. Furthermore, research explores the use of leverage scores in understanding model behavior, such as analyzing the impact of visual information on language models and improving the faithfulness of long-context models.

Papers