Data Augmentation
Data augmentation is a technique used to artificially expand datasets by creating modified versions of existing data, primarily to improve the performance and robustness of machine learning models, especially when training data is scarce. Current research focuses on developing more sophisticated augmentation methods, including those leveraging generative models like GANs and diffusion models, and integrating augmentation with other techniques such as contrastive learning and transfer learning, often applied within architectures like transformers and convolutional neural networks. This work is significant because it addresses the limitations of limited datasets across various domains, from image classification and object detection to natural language processing and time series forecasting, leading to more accurate and generalizable models for diverse applications.
Papers
Augment on Manifold: Mixup Regularization with UMAP
Yousef El-Laham, Elizabeth Fons, Dillon Daudert, Svitlana Vyetrenko
Enhancing Neural Theorem Proving through Data Augmentation and Dynamic Sampling Method
Rahul Vishwakarma, Subhankar Mishra
Domain Similarity-Perceived Label Assignment for Domain Generalized Underwater Object Detection
Xisheng Li, Wei Li, Pinhao Song, Mingjun Zhang, Jie Zhou
AdvST: Revisiting Data Augmentations for Single Domain Generalization
Guangtao Zheng, Mengdi Huai, Aidong Zhang
Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation in ultra low-data regimes
Nabeel Seedat, Nicolas Huynh, Boris van Breugel, Mihaela van der Schaar
Classification of complex local environments in systems of particle shapes through shape-symmetry encoded data augmentation
Shih-Kuang, Lee, Sun-Ting Tsai, Sharon Glotzer
TiMix: Text-aware Image Mixing for Effective Vision-Language Pre-training
Chaoya Jiang, Wei ye, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang, Shikun Zhang
Dissecting vocabulary biases datasets through statistical testing and automated data augmentation for artifact mitigation in Natural Language Inference
Dat Thanh Nguyen