Heterogeneous Data
Heterogeneous data, encompassing datasets with varying distributions, formats, and qualities across sources, presents a significant challenge in machine learning. Current research focuses on developing robust algorithms and model architectures, such as federated learning with adaptive aggregation and personalized models, to effectively handle this heterogeneity in diverse applications like medical imaging and industrial settings. These efforts aim to improve model accuracy, fairness, and robustness while addressing privacy concerns inherent in decentralized data collection. The successful management of heterogeneous data is crucial for advancing machine learning's applicability to real-world problems where data is inherently diverse and distributed.
Papers
Nonconvex Federated Learning on Compact Smooth Submanifolds With Heterogeneous Data
Jiaojiao Zhang, Jiang Hu, Anthony Man-Cho So, Mikael Johansson
DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels
Samuele Cornell, Janek Ebbers, Constance Douwes, Irene Martín-Morató, Manu Harju, Annamaria Mesaros, Romain Serizel