Class Imbalance
Class imbalance, the uneven distribution of classes in a dataset, hinders the performance of machine learning models by biasing them towards the majority class. Current research focuses on mitigating this imbalance through various techniques, including data resampling (oversampling minority classes, undersampling majority classes), cost-sensitive learning (assigning different misclassification costs), and algorithmic modifications (e.g., adapting loss functions, employing novel regularization methods within models like GBDTs, and using contrastive learning). Addressing class imbalance is crucial for improving the fairness, robustness, and accuracy of machine learning models across diverse applications, from medical diagnosis and financial risk assessment to environmental monitoring and traffic sign recognition.
Papers
An Embarrassingly Simple Baseline for Imbalanced Semi-Supervised Learning
Hao Chen, Yue Fan, Yidong Wang, Jindong Wang, Bernt Schiele, Xing Xie, Marios Savvides, Bhiksha Raj
An interpretable imbalanced semi-supervised deep learning framework for improving differential diagnosis of skin diseases
Futian Weng, Yuanting Ma, Jinghan Sun, Shijun Shan, Qiyuan Li, Jianping Zhu, Yang Wang, Yan Xu
IMB-NAS: Neural Architecture Search for Imbalanced Datasets
Rahul Duggal, Shengyun Peng, Hao Zhou, Duen Horng Chau
Fed-CBS: A Heterogeneity-Aware Client Sampling Mechanism for Federated Learning via Class-Imbalance Reduction
Jianyi Zhang, Ang Li, Minxue Tang, Jingwei Sun, Xiang Chen, Fan Zhang, Changyou Chen, Yiran Chen, Hai Li