Dataset Refinement
Dataset refinement improves the quality and utility of the datasets used to train machine learning models, with the goal of better model performance, robustness, and fairness. Current research emphasizes automated methods for identifying and correcting errors such as noisy labels or biased samples, often using Shapley value analysis, generative models (e.g., diffusion models), and human-in-the-loop iterative refinement. These advances support the development of reliable AI systems across applications ranging from robotics and medical image analysis to natural language processing.
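To make the Shapley value idea concrete, here is a minimal sketch of how a data-valuation score can flag a suspicious training label. It is an illustration under stated assumptions, not the method of any particular paper: the toy 1D dataset, the 1-nearest-neighbour utility, the chance-level score of 0.5 for an empty training set, and the names `knn1_accuracy` and `exact_data_shapley` are all hypothetical. It enumerates every ordering of the training points, which is only feasible for a handful of examples; practical approaches typically sample orderings instead.

```python
from itertools import permutations


def knn1_accuracy(train, val):
    """Validation accuracy of a 1-nearest-neighbour classifier.

    Assumption: an empty training set scores 0.5, i.e. chance level
    on a balanced binary validation set.
    """
    if not train:
        return 0.5
    correct = 0
    for x, y in val:
        # Predict the label of the closest training point.
        nearest = min(train, key=lambda p: abs(p[0] - x))
        correct += nearest[1] == y
    return correct / len(val)


def exact_data_shapley(train, val, utility=knn1_accuracy):
    """Average marginal contribution of each training point over all orderings."""
    n = len(train)
    values = [0.0] * n
    n_orderings = 0
    for order in permutations(range(n)):
        subset = []
        prev = utility(subset, val)
        for i in order:
            subset.append(train[i])
            cur = utility(subset, val)
            values[i] += cur - prev  # marginal gain from adding point i
            prev = cur
        n_orderings += 1
    return [v / n_orderings for v in values]


# Toy data as (feature, label). Index 2 sits in the class-0 cluster but
# carries label 1, standing in for a noisy label.
train = [(0.0, 0), (0.6, 0), (1.0, 1), (4.0, 1), (5.0, 1)]
val = [(0.9, 0), (1.1, 0), (4.2, 1), (4.8, 1)]

values = exact_data_shapley(train, val)
suspect = min(range(len(train)), key=values.__getitem__)
print("Shapley values:", [round(v, 3) for v in values])
print("Lowest-valued training index:", suspect)
```

In this toy setup the mislabeled point never improves validation accuracy and sometimes lowers it, so it receives a negative value while the clean points receive positive ones. A refinement loop could send such low-valued points to a human reviewer or relabel them, then recompute the values.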