Data Augmentation
Data augmentation is a technique used to artificially expand datasets by creating modified versions of existing data, primarily to improve the performance and robustness of machine learning models, especially when training data is scarce. Current research focuses on developing more sophisticated augmentation methods, including those leveraging generative models like GANs and diffusion models, and integrating augmentation with other techniques such as contrastive learning and transfer learning, often applied within architectures like transformers and convolutional neural networks. This work is significant because it addresses the limitations of limited datasets across various domains, from image classification and object detection to natural language processing and time series forecasting, leading to more accurate and generalizable models for diverse applications.
Papers
Meta-learning and Data Augmentation for Stress Testing Forecasting Models
Ricardo Inácio, Vitor Cerqueira, Marília Barandas, Carlos Soares
Task Oriented In-Domain Data Augmentation
Xiao Liang, Xinyu Hu, Simiao Zuo, Yeyun Gong, Qiang Lou, Yi Liu, Shao-Lun Huang, Jian Jiao
Data Augmentation of Multi-turn Psychological Dialogue via Knowledge-driven Progressive Thought Prompting
Jiyue Jiang, Liheng Chen, Sheng Wang, Lingpeng Kong, Yu Li, Chuan Wu
Improving robustness to corruptions with multiplicative weight perturbations
Trung Trinh, Markus Heinonen, Luigi Acerbi, Samuel Kaski
Class-specific Data Augmentation for Plant Stress Classification
Nasla Saleem, Aditya Balu, Talukder Zaki Jubery, Arti Singh, Asheesh K. Singh, Soumik Sarkar, Baskar Ganapathysubramanian
Composited-Nested-Learning with Data Augmentation for Nested Named Entity Recognition
Xingming Liao, Nankai Lin, Haowen Li, Lianglun Cheng, Zhuowei Wang, Chong Chen
Self-Supervised Time-Series Anomaly Detection Using Learnable Data Augmentation
Kukjin Choi, Jihun Yi, Jisoo Mok, Sungroh Yoon
Data Augmentation by Fuzzing for Neural Test Generation
Yifeng He, Jicheng Wang, Yuyang Rong, Hao Chen
Enhancing Activity Recognition After Stroke: Generative Adversarial Networks for Kinematic Data Augmentation
Aaron J. Hadley, Christopher L. Pulliam
Dataset Enhancement with Instance-Level Augmentations
Orest Kupyn, Christian Rupprecht