Automatic Curation
Automatic curation focuses on using computational methods to efficiently organize, label, and enhance datasets, addressing the limitations of manual curation in terms of cost, time, and scalability. Current research emphasizes developing algorithms and models, including those based on transformers, diffusion models, and clustering techniques, to automate tasks such as data cleaning, annotation, and selection for various data types (text, images, videos). This automated approach is crucial for advancing machine learning across diverse fields, from biomedical research and scientific publishing to autonomous driving and public art curation, by providing high-quality, readily accessible datasets for training and evaluation.
Papers
February 14, 2024
February 13, 2024
January 13, 2024
December 27, 2023
December 20, 2023
October 31, 2023
July 26, 2023
June 13, 2023
June 6, 2023
May 8, 2023
April 26, 2023
January 5, 2023
December 28, 2022
October 23, 2022
August 6, 2022
July 28, 2022
July 13, 2022
February 25, 2022
January 28, 2022