Label Noise
Label noise, the presence of incorrect labels in training data, degrades the accuracy and robustness of machine learning models. Current research on mitigating it explores loss-function modifications, sample selection (identifying noisy samples, then removing or down-weighting them), and robust methods based on nearest neighbors or contrastive learning, typically applied to deep neural networks or gradient-boosted decision trees. Handling label noise matters for the reliability and generalization of models in applications ranging from medical image analysis to natural language processing, and the problem is driving the development of new benchmark datasets and evaluation metrics.
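As a rough illustration of the sample selection idea mentioned above, the sketch below keeps only the fraction of samples with the smallest per-sample cross-entropy loss, on the assumption that mislabeled samples tend to incur larger loss. This "small-loss" heuristic is one common choice and is not taken from any of the papers listed here; the function name `small_loss_mask`, the `keep_fraction` parameter, and the toy data are all hypothetical.

```python
import numpy as np


def small_loss_mask(probs, labels, keep_fraction=0.8):
    """Keep the `keep_fraction` of samples with the smallest cross-entropy loss.

    probs:  (n_samples, n_classes) predicted class probabilities
    labels: (n_samples,) given, possibly noisy, integer labels
    Returns a boolean keep-mask and the per-sample losses.
    """
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels, dtype=int)

    # Per-sample cross-entropy against the given labels (clipped to avoid log(0)).
    p_given = probs[np.arange(len(labels)), labels]
    losses = -np.log(np.clip(p_given, 1e-12, 1.0))

    # Select the indices of the smallest losses; always keep at least one sample.
    n_keep = max(1, int(round(keep_fraction * len(labels))))
    keep_idx = np.argsort(losses)[:n_keep]

    mask = np.zeros(len(labels), dtype=bool)
    mask[keep_idx] = True
    return mask, losses


# Toy example (made-up numbers): the third sample's label disagrees
# with a confident prediction, so it receives the largest loss.
probs = np.array([[0.90, 0.10],
                  [0.20, 0.80],
                  [0.85, 0.15],
                  [0.10, 0.90]])
labels = np.array([0, 1, 1, 1])

mask, losses = small_loss_mask(probs, labels, keep_fraction=0.75)
```

In practice the mask would be recomputed as training progresses, and a soft weight could replace the hard cut-off when discarding samples outright is too aggressive.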
Papers
Identity Overlap Between Face Recognition Train/Test Data: Causing Optimistic Bias in Accuracy Measurement
Haiyu Wu, Sicong Tian, Jacob Gutierrez, Aman Bhatta, Kağan Öztürk, Kevin W. Bowyer
Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels
Guozhang Liu, Ting Liu, Mengke Yuan, Tao Pang, Guangxing Yang, Hao Fu, Tao Wang, Tongkui Liao