Data Augmentation
Data augmentation is a technique used to artificially expand datasets by creating modified versions of existing data, primarily to improve the performance and robustness of machine learning models, especially when training data is scarce. Current research focuses on developing more sophisticated augmentation methods, including those leveraging generative models like GANs and diffusion models, and integrating augmentation with other techniques such as contrastive learning and transfer learning, often applied within architectures like transformers and convolutional neural networks. This work is significant because it addresses the limitations of limited datasets across various domains, from image classification and object detection to natural language processing and time series forecasting, leading to more accurate and generalizable models for diverse applications.
Papers
Data Augmentation: a Combined Inductive-Deductive Approach featuring Answer Set Programming
Pierangela Bruno, Francesco Calimeri, Cinzia Marte, Simona Perri
PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
Gaurav Sahu, Olga Vechtomova, Dzmitry Bahdanau, Issam H. Laradji
Filling the Missing: Exploring Generative AI for Enhanced Federated Learning over Heterogeneous Mobile Edge Devices
Peichun Li, Hanwen Zhang, Yuan Wu, Liping Qian, Rong Yu, Dusit Niyato, Xuemin Shen
Toward Generative Data Augmentation for Traffic Classification
Chao Wang, Alessandro Finamore, Pietro Michiardi, Massimo Gallo, Dario Rossi
Gaussian processes based data augmentation and expected signature for time series classification
Marco Romito, Francesco Triggiano
BanglaNLP at BLP-2023 Task 1: Benchmarking different Transformer Models for Violence Inciting Text Detection in Bengali
Saumajit Saha, Albert Nanda
Contextual Data Augmentation for Task-Oriented Dialog Systems
Dustin Axman, Avik Ray, Shubham Garg, Jing Huang
Data Augmentation for Time-Series Classification: An Extensive Empirical Study and Comprehensive Survey
Zijun Gao, Haibao Liu, Lingbo Li
Class-Specific Data Augmentation: Bridging the Imbalance in Multiclass Breast Cancer Classification
Kanan Mahammadli, Abdullah Burkan Bereketoglu, Ayse Gul Kabakci
AdaptSSR: Pre-training User Model with Augmentation-Adaptive Self-Supervised Ranking
Yang Yu, Qi Liu, Kai Zhang, Yuren Zhang, Chao Song, Min Hou, Yuqing Yuan, Zhihao Ye, Zaixi Zhang, Sanshi Lei Yu
Revisiting Data Augmentation for Rotational Invariance in Convolutional Neural Networks
Facundo Manuel Quiroga, Franco Ronchetti, Laura Lanzarini, Aurelio Fernandez-Bariviera
DualAug: Exploiting Additional Heavy Augmentation with OOD Data Rejection
Zehao Wang, Yiwen Guo, Qizhang Li, Guanglei Yang, Wangmeng Zuo