Training Sample
Training sample selection and manipulation are crucial for optimizing machine learning model performance and addressing various challenges, including data scarcity, bias, and generalization. Current research focuses on developing strategies for selecting informative samples, generating synthetic data to augment existing datasets, and mitigating the negative impacts of memorization and distribution shifts. These efforts aim to improve model accuracy, efficiency, and fairness across diverse applications, from natural language processing and speech recognition to medical image analysis and audio processing. The ultimate goal is to develop more robust and reliable models that generalize well to unseen data and avoid undesirable biases.
Papers
November 17, 2024
November 5, 2024
August 18, 2024
July 18, 2024
July 5, 2024
June 22, 2024
June 5, 2024
June 3, 2024
May 24, 2024
April 8, 2024
March 12, 2024
February 19, 2024
February 12, 2024
November 25, 2023
November 21, 2023
June 30, 2023
June 19, 2023
June 12, 2023