Class Distribution

Class distribution, the frequency of different categories within a dataset, is a critical factor influencing the performance and fairness of machine learning models. Current research focuses on mitigating the negative impacts of imbalanced class distributions, particularly in semi-supervised learning and federated learning settings, employing techniques like dual-branch training, class-wise averaging, and adaptive distribution fusion. These advancements aim to improve model accuracy and robustness across all classes, addressing biases towards majority categories and enhancing generalization to unseen data, with significant implications for various applications including medical image analysis, facial expression recognition, and object detection.

Papers