Class Imbalance
Class imbalance, the uneven distribution of classes in a dataset, hinders the performance of machine learning models by biasing them towards the majority class. Current research focuses on mitigating this imbalance through various techniques, including data resampling (oversampling minority classes, undersampling majority classes), cost-sensitive learning (assigning different misclassification costs), and algorithmic modifications (e.g., adapting loss functions, employing novel regularization methods within models like GBDTs, and using contrastive learning). Addressing class imbalance is crucial for improving the fairness, robustness, and accuracy of machine learning models across diverse applications, from medical diagnosis and financial risk assessment to environmental monitoring and traffic sign recognition.
Papers
Enhancing Performance for Highly Imbalanced Medical Data via Data Regularization in a Federated Learning Setting
Georgios Tsoumplekas, Ilias Siniosoglou, Vasileios Argyriou, Ioannis D. Moscholios, Panagiotis Sarigiannidis
1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem
Mingjie Chen, Hezhao Zhang, Yuanchao Li, Jiachen Luo, Wen Wu, Ziyang Ma, Peter Bell, Catherine Lai, Joshua Reiss, Lin Wang, Philip C. Woodland, Xie Chen, Huy Phan, Thomas Hain