Heterogeneous Data
Heterogeneous data, encompassing datasets with varying distributions, formats, and qualities across sources, presents a significant challenge in machine learning. Current research focuses on developing robust algorithms and model architectures, such as federated learning with adaptive aggregation and personalized models, to effectively handle this heterogeneity in diverse applications like medical imaging and industrial settings. These efforts aim to improve model accuracy, fairness, and robustness while addressing privacy concerns inherent in decentralized data collection. The successful management of heterogeneous data is crucial for advancing machine learning's applicability to real-world problems where data is inherently diverse and distributed.
Papers
Nonconvex Federated Learning on Compact Smooth Submanifolds With Heterogeneous Data
Jiaojiao Zhang, Jiang Hu, Anthony Man-Cho So, Mikael Johansson
DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels
Samuele Cornell, Janek Ebbers, Constance Douwes, Irene Martín-Morató, Manu Harju, Annamaria Mesaros, Romain Serizel
Efficient and Interpretable Information Retrieval for Product Question Answering with Heterogeneous Data
Biplob Biswas, Rajiv Ramnath
Stochastic Inference of Plate Bending from Heterogeneous Data: Physics-informed Gaussian Processes via Kirchhoff-Love Theory
Igor Kavrakov, Gledson Rodrigo Tondo, Guido Morgenthal