Paper ID: 2411.01580
Federated Learning Clients Clustering with Adaptation to Data Drifts
Minghao Li (1), Dmitrii Avdiukhin (2), Rana Shahout (1), Nikita Ivkin (3), Vladimir Braverman (4), Minlan Yu (1) ((1) Harvard University, (2) Northwestern University, (3) Amazon, (4) Rice University)
Federated Learning (FL) enables deep learning model training across edge devices and protects user privacy by retaining raw data locally. Data heterogeneity in client distributions slows model convergence and leads to plateauing with reduced precision. Clustered FL solutions address this by grouping clients with statistically similar data and training a model for each cluster. However, maintaining consistent client similarity within each group becomes challenging when data drifts occur, which significantly degrades model accuracy. In this paper, we introduce Fielding, a clustered FL framework that handles data drifts promptly with low overhead. Fielding detects drifts on all clients and performs selective label distribution-based re-clustering to balance cluster optimality and model performance, remaining robust to malicious clients and varied degrees of heterogeneity. Our evaluations show that Fielding improves final model accuracy by 1.9%-5.9% and reaches target accuracies 1.16x-2.61x faster.
Submitted: Nov 3, 2024
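The abstract's idea of detecting drift and selectively re-clustering clients by label distribution can be illustrated with a minimal sketch. This is not the Fielding algorithm itself, whose details the abstract does not give. It assumes that each client reports its local labels, that drift is measured as the L1 distance between a client's old and new label distributions, and that a drifted client simply moves to the nearest existing cluster centroid. All function names and the `drift_threshold` parameter are hypothetical.

```python
from collections import Counter


def label_distribution(labels, num_classes):
    """Normalize a client's label list into a probability vector."""
    if not labels:
        raise ValueError("client has no labels")
    counts = Counter(labels)
    total = len(labels)
    return [counts.get(c, 0) / total for c in range(num_classes)]


def l1_distance(p, q):
    """L1 distance between two distributions (assumed drift metric)."""
    return sum(abs(a - b) for a, b in zip(p, q))


def nearest_cluster(dist, centroids):
    """Index of the centroid closest to the given label distribution."""
    return min(range(len(centroids)),
               key=lambda k: l1_distance(dist, centroids[k]))


def reassign_drifted(old_dists, new_dists, assignment, centroids,
                     drift_threshold=0.2):
    """Move only clients whose label distribution changed noticeably.

    Clients below the (hypothetical) drift threshold keep their cluster,
    so stable clusters are left undisturbed.
    """
    updated = dict(assignment)
    moved = []
    for client, new in new_dists.items():
        if l1_distance(old_dists[client], new) > drift_threshold:
            target = nearest_cluster(new, centroids)
            if target != assignment[client]:
                updated[client] = target
                moved.append(client)
    return updated, moved
```

In this sketch, a client whose labels shift from mostly class 0 to mostly class 1 would be moved to the cluster whose centroid is dominated by class 1, while clients with unchanged distributions stay where they are. A fuller treatment would also update the centroids and decide when a global re-clustering is warranted.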