Synthetic Data Generation
Synthetic data generation creates artificial datasets that mimic the statistical properties of real data, addressing limited data availability, privacy concerns, and the high cost of data annotation. Current research focuses on generative models, including diffusion models, generative adversarial networks, and methods built on large language models, to produce high-fidelity synthetic data across diverse data types (tabular, image, text, and 3D models). The field matters for machine learning because it enables training robust models where real data is scarce, expensive, or sensitive, and it can help improve the reliability and fairness of AI systems.
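As a rough illustration of what "mimicking the statistical properties of real data" can mean in the simplest tabular case, the sketch below fits one normal distribution per column and samples new rows from those fits. It is a toy baseline under stated assumptions, not one of the generative models named above: the columns are treated as independent and Gaussian, so correlations between columns are not preserved. The data values and the helper names `fit_marginals` and `sample_synthetic` are hypothetical.

```python
from statistics import NormalDist, mean, stdev


def fit_marginals(rows):
    """Fit one normal distribution per column of a list of numeric rows."""
    columns = list(zip(*rows))
    return [NormalDist.from_samples(col) for col in columns]


def sample_synthetic(marginals, n, seed=0):
    """Draw n synthetic rows, sampling each column independently.

    Assumption: columns are independent, so cross-column correlation is lost.
    """
    cols = [dist.samples(n, seed=seed + i) for i, dist in enumerate(marginals)]
    return [tuple(values) for values in zip(*cols)]


# Hypothetical "real" data: (height_cm, weight_kg) pairs, made up for illustration.
real_rows = [
    (170.0, 65.0), (182.0, 80.0), (165.0, 58.0),
    (175.0, 72.0), (190.0, 88.0), (160.0, 55.0),
]

marginals = fit_marginals(real_rows)
synthetic_rows = sample_synthetic(marginals, n=1000)

# Compare per-column means and standard deviations of real vs. synthetic data.
for i, name in enumerate(["height_cm", "weight_kg"]):
    real_col = [r[i] for r in real_rows]
    synth_col = [r[i] for r in synthetic_rows]
    print(name, "real mean/std:", mean(real_col), stdev(real_col))
    print(name, "synthetic mean/std:", mean(synth_col), stdev(synth_col))
```

The synthetic rows contain none of the original records, yet their per-column means and spreads should land close to the real ones. Matching richer structure (joint dependencies, images, text) is where the more advanced generative models come in.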
Papers
July 12, 2022
July 7, 2022
June 20, 2022
May 31, 2022
May 29, 2022
May 28, 2022
May 19, 2022
May 18, 2022
May 13, 2022
April 27, 2022
March 11, 2022
March 3, 2022
January 19, 2022
December 17, 2021
December 6, 2021
December 3, 2021
November 25, 2021
November 15, 2021