Active Learning
Active learning is a machine learning paradigm focused on optimizing data labeling efficiency by strategically selecting the most informative samples for annotation from a larger unlabeled pool. Current research emphasizes developing novel acquisition functions and data pruning strategies to reduce computational costs associated with large datasets, exploring the integration of active learning with various model architectures (including deep neural networks, Gaussian processes, and language models), and addressing challenges like privacy preservation and handling open-set noise. This approach holds significant promise for reducing the substantial cost and effort of data labeling in diverse fields, ranging from image classification and natural language processing to materials science and healthcare.
Papers
Frugal Algorithm Selection
Erdem Kuş, Özgür Akgün, Nguyen Dang, Ian Miguel
ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios
Markus Bayer, Christian Reuter
Future Aware Safe Active Learning of Time Varying Systems using Gaussian Processes
Markus Lange-Hegermann, Christoph Zimmer
Agnostic Active Learning of Single Index Models with Linear Sample Complexity
Aarshvi Gajjar, Wai Ming Tai, Xingyu Xu, Chinmay Hegde, Yi Li, Christopher Musco
Perception Without Vision for Trajectory Prediction: Ego Vehicle Dynamics as Scene Representation for Efficient Active Learning in Autonomous Driving
Ross Greer, Mohan Trivedi
Active Learning with Simple Questions
Vasilis Kontonis, Mingchen Ma, Christos Tzamos
Maximizing Information Gain in Privacy-Aware Active Learning of Email Anomalies
Mu-Huan Miles Chung, Sharon Li, Jaturong Kongmanee, Lu Wang, Yuhong Yang, Calvin Giang, Khilan Jerath, Abhay Raman, David Lie, Mark Chignell
Conformal Validity Guarantees Exist for Any Data Distribution (and How to Find Them)
Drew Prinster, Samuel Stanton, Anqi Liu, Suchi Saria
Boolean matrix logic programming for active learning of gene functions in genome-scale metabolic network models
Lun Ai, Stephen H. Muggleton, Shi-Shun Liang, Geoff S. Baldwin