High Quality Subset
High-quality subset selection aims to identify optimal or near-optimal subsets of a larger dataset, so that downstream models or analyses retain most of their performance at a lower computational cost. Current research explores diverse approaches, including Bayesian optimization over graph-structured data, optimal transport methods for noisy and imbalanced datasets, and large language models used as data filters in specific domains such as legal text analysis. These methods can make machine learning models more efficient and robust, support data analysis in complex domains, and help allocate resources in computationally intensive tasks.
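To make the idea of picking a small, representative subset concrete, here is a minimal sketch of greedy k-center (farthest-point) selection. It is a classic baseline chosen only for illustration, not one of the Bayesian optimization, optimal transport, or LLM-based methods mentioned above. The function names `greedy_k_center` and `coverage_radius`, the Euclidean distance, and the toy points are all assumptions made for this example.

```python
import math


def greedy_k_center(points, k, first=0):
    """Return indices of up to k points chosen by farthest-point selection.

    Illustrative baseline (an assumption of this sketch): each step adds the
    point that is farthest from everything selected so far, so the subset
    spreads out to cover the whole dataset.
    """
    if not 0 < k <= len(points):
        raise ValueError("k must be between 1 and len(points)")
    selected = [first]
    # min_dist[i] is the distance from point i to its nearest selected point.
    min_dist = [math.dist(p, points[first]) for p in points]
    while len(selected) < k:
        # The worst-covered point becomes the next member of the subset.
        nxt = max(range(len(points)), key=min_dist.__getitem__)
        if min_dist[nxt] == 0:
            break  # every point coincides with a selected one; stop early
        selected.append(nxt)
        for i, p in enumerate(points):
            d = math.dist(p, points[nxt])
            if d < min_dist[i]:
                min_dist[i] = d
    return selected


def coverage_radius(points, selected):
    """Largest distance from any point to its nearest selected point."""
    return max(min(math.dist(p, points[j]) for j in selected) for p in points)


if __name__ == "__main__":
    # Toy data: three tight clusters in the plane (hypothetical values).
    points = [(0, 0), (0.1, 0), (0, 0.1), (5, 5), (5.1, 5), (10, 0), (10, 0.1)]
    subset = greedy_k_center(points, k=3)
    print(subset)                           # one index per cluster
    print(coverage_radius(points, subset))  # small when every cluster is covered
```

On the toy data the three selected indices fall in three different clusters, so every point lies within about 0.1 of a selected point. Each step costs one pass over the data, which is why this kind of greedy rule is a common cheap starting point before trying the more elaborate methods surveyed here.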