Label Proportion
Learning from Label Proportions (LLP) is a weakly supervised learning paradigm where models are trained using only the aggregate class proportions within groups of instances (bags), rather than individual instance labels. Current research focuses on improving the accuracy and efficiency of LLP methods, particularly for large bags and tabular data, employing techniques like contrastive learning, debiasing methods, and belief propagation. LLP's significance lies in its ability to address privacy concerns and data scarcity in various applications, including user modeling, medical image analysis, and remote sensing, by enabling model training with aggregated data.
Papers
PAC Learning Linear Thresholds from Label Proportions
Anand Brahmbhatt, Rishi Saket, Aravindan Raghuveer
LLP-Bench: A Large Scale Tabular Benchmark for Learning from Label Proportions
Anand Brahmbhatt, Mohith Pokala, Rishi Saket, Aravindan Raghuveer
Label Differential Privacy via Aggregation
Anand Brahmbhatt, Rishi Saket, Shreyas Havaldar, Anshul Nasery, Aravindan Raghuveer