Real World Noisy Datasets
Real-world datasets are frequently contaminated with noisy labels, hindering the performance of machine learning models. Current research focuses on developing robust training methods that mitigate the impact of this noise, employing techniques like sample selection (identifying and removing or correcting noisy samples), noise-robust loss functions, and the integration of external knowledge sources (e.g., large language models). These advancements are crucial for improving the reliability and generalizability of models trained on real-world data, impacting diverse fields from medical image analysis to autonomous driving where perfectly labeled data is often unavailable or prohibitively expensive to obtain.
Papers
January 14, 2025
December 16, 2024
December 9, 2024
December 3, 2024
November 26, 2024
September 10, 2024
June 22, 2024
April 7, 2024
March 23, 2024
March 2, 2024
February 3, 2024
December 23, 2023
November 30, 2023
November 2, 2023
October 24, 2023
October 5, 2023
September 21, 2023
August 26, 2023
July 11, 2023