Heterogeneous Data
Heterogeneous data, encompassing datasets with varying distributions, formats, and qualities across sources, presents a significant challenge in machine learning. Current research focuses on developing robust algorithms and model architectures, such as federated learning with adaptive aggregation and personalized models, to effectively handle this heterogeneity in diverse applications like medical imaging and industrial settings. These efforts aim to improve model accuracy, fairness, and robustness while addressing privacy concerns inherent in decentralized data collection. The successful management of heterogeneous data is crucial for advancing machine learning's applicability to real-world problems where data is inherently diverse and distributed.
Papers
EPISODE: Episodic Gradient Clipping with Periodic Resampled Corrections for Federated Learning with Heterogeneous Data
Michael Crawshaw, Yajie Bao, Mingrui Liu
Heterogeneous Anomaly Detection for Software Systems via Semi-supervised Cross-modal Attention
Cheryl Lee, Tianyi Yang, Zhuangbin Chen, Yuxin Su, Yongqiang Yang, Michael R. Lyu