Data Augmentation
Data augmentation is a technique used to artificially expand datasets by creating modified versions of existing data, primarily to improve the performance and robustness of machine learning models, especially when training data is scarce. Current research focuses on developing more sophisticated augmentation methods, including those leveraging generative models like GANs and diffusion models, and integrating augmentation with other techniques such as contrastive learning and transfer learning, often applied within architectures like transformers and convolutional neural networks. This work is significant because it addresses the limitations of limited datasets across various domains, from image classification and object detection to natural language processing and time series forecasting, leading to more accurate and generalizable models for diverse applications.
Papers
Data Augmentation for Multivariate Time Series Classification: An Experimental Study
Romain Ilbert, Thai V. Hoang, Zonghua Zhang
Data Augmentation in Earth Observation: A Diffusion Model Approach
Tiago Sousa, Benoît Ries, Nicolas Guelfi
Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog Systems
Christos Vlachos, Themos Stafylakis, Ion Androutsopoulos
Deterministic Reversible Data Augmentation for Neural Machine Translation
Jiashu Yao, Heyan Huang, Zeming Liu, Yuhang Guo
Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion
Colin Hansen, Simas Glinskis, Ashwin Raju, Micha Kornreich, JinHyeong Park, Jayashri Pawar, Richard Herzog, Li Zhang, Benjamin Odry
EMOE: Expansive Matching of Experts for Robust Uncertainty Based Rejection
Yunni Qu, James Wellnitz, Alexander Tropsha, Junier Oliva
TabMDA: Tabular Manifold Data Augmentation for Any Classifier using Transformers with In-context Subsetting
Andrei Margeloiu, Adrián Bazaga, Nikola Simidjievski, Pietro Liò, Mateja Jamnik
Robust Classification by Coupling Data Mollification with Label Smoothing
Markus Heinonen, Ba-Hien Tran, Michael Kampffmeyer, Maurizio Filippone
DDA: Dimensionality Driven Augmentation Search for Contrastive Learning in Laparoscopic Surgery
Yuning Zhou, Henry Badgery, Matthew Read, James Bailey, Catherine E. Davey
FaceMixup: Enhancing Facial Expression Recognition through Mixed Face Regularization
Fabio A. Faria, Mateus M. Souza, Raoni F. da S. Teixeira, Mauricio P. Segundo
PGA-SciRE: Harnessing LLM on Data Augmentation for Enhancing Scientific Relation Extraction
Yang Zhou, Shimin Shan, Hongkui Wei, Zhehuan Zhao, Wenshuo Feng