Active Learning
Active learning is a machine learning paradigm focused on optimizing data labeling efficiency by strategically selecting the most informative samples for annotation from a larger unlabeled pool. Current research emphasizes developing novel acquisition functions and data pruning strategies to reduce computational costs associated with large datasets, exploring the integration of active learning with various model architectures (including deep neural networks, Gaussian processes, and language models), and addressing challenges like privacy preservation and handling open-set noise. This approach holds significant promise for reducing the substantial cost and effort of data labeling in diverse fields, ranging from image classification and natural language processing to materials science and healthcare.
Papers
Exploiting Diversity of Unlabeled Data for Label-Efficient Semi-Supervised Active Learning
Felix Buchert, Nassir Navab, Seong Tae Kim
Active Learning Strategies for Weakly-supervised Object Detection
Huy V. Vo, Oriane Siméoni, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Jean Ponce
Efficient Classification with Counterfactual Reasoning and Active Learning
Azhar Mohammed, Dang Nguyen, Bao Duong, Thin Nguyen
Improving decision-making via risk-based active learning: Probabilistic discriminative classifiers
Aidan J. Hughes, Paul Gardner, Lawrence A. Bull, Nikolaos Dervilis, Keith Worden
Patient Aware Active Learning for Fine-Grained OCT Classification
Yash-yee Logan, Ryan Benkert, Ahmad Mustafa, Gukyeong Kwon, Ghassan AlRegib