Synthetic Data Generation
Synthetic data generation aims to create artificial datasets that mimic the statistical properties of real data, addressing limitations in data availability, privacy concerns, and the high cost of data annotation. Current research focuses on developing advanced generative models, including diffusion models, generative adversarial networks, and methods leveraging large language models, to produce high-fidelity synthetic data across diverse data types (tabular, image, text, and even 3D models). This field is crucial for advancing machine learning in various domains, enabling the training of robust models in situations where real data is scarce, expensive, or sensitive, and improving the reliability and fairness of AI systems.
Papers
September 13, 2023
September 12, 2023
September 10, 2023
August 28, 2023
August 15, 2023
August 7, 2023
July 29, 2023
July 28, 2023
July 14, 2023
July 10, 2023
June 30, 2023
June 29, 2023
June 27, 2023
June 26, 2023
June 22, 2023
June 19, 2023
June 12, 2023
June 2, 2023
May 30, 2023