High Quality
High-quality data is paramount for the success of machine learning models, driving research into efficient and reliable methods for data creation, curation, and evaluation. Current efforts focus on developing novel algorithms and model architectures, such as diffusion models, generative adversarial networks (GANs), and large language models (LLMs), to improve data quality across diverse domains, including image generation, speech processing, and natural language processing. These advancements are crucial for enhancing the performance and reliability of machine learning systems and enabling new applications in various fields, from medical imaging to robotics. The development of robust evaluation metrics and automated quality control methods is also a key area of focus.
Papers
PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems
Bryan Wilie, Yan Xu, Willy Chung, Samuel Cahyawijaya, Holy Lovenia, Pascale Fung
QASnowball: An Iterative Bootstrapping Framework for High-Quality Question-Answering Data Generation
Kunlun Zhu, Shihao Liang, Xu Han, Zhi Zheng, Guoyang Zeng, Zhiyuan Liu, Maosong Sun
Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation
Xuefei Ning, Zinan Lin, Zixuan Zhou, Zifu Wang, Huazhong Yang, Yu Wang
Recovering high-quality FODs from a reduced number of diffusion-weighted images using a model-driven deep learning architecture
J Bartlett, C E Davey, L A Johnston, J Duan