Pool Based Active Learning
Pool-based active learning aims to efficiently train machine learning models by strategically selecting a subset of unlabeled data for labeling, maximizing model performance with minimal annotation effort. Current research focuses on improving the scalability and efficiency of algorithms, particularly for large, imbalanced datasets and multi-class problems, often employing techniques like generative flow networks and novel uncertainty sampling methods. This approach is crucial for applications where labeled data is scarce or expensive, impacting fields ranging from scientific computing and medical diagnosis to natural language processing and entity matching by reducing annotation costs and improving model accuracy.
Papers
January 2, 2025
December 31, 2024
September 11, 2024
April 18, 2024
April 8, 2024
December 23, 2023
October 2, 2023
September 11, 2023
July 6, 2023
June 26, 2023
May 23, 2023
November 1, 2022
June 25, 2022
February 11, 2022
February 8, 2022
January 29, 2022