Federated Learning With Adaptive Aggregation Weights for Non-IID Data in Edge Networks

Xiaodong Li, Yulong Gao*, Yansha Deng, Xinzhuo Jiang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Federated learning (FL) enables edge nodes to collaboratively train a global model under the coordination of a server without sharing local private data. However, data heterogeneity across nodes leads to serious performance degradation or even non-convergence of the learned model. To tackle this challenge, most existing methods typically involve either local model regularization or global model adjustments. Nevertheless, these methods primarily perform model aggregation based on the dataset size proportion, while the exploration of the impact of non-independent and identically distributed (non-IID) data on aggregation weights remains insufficient. To this end, we theoretically derive an analytical expression for the aggregation weights by minimizing the convergence upper bound of standard FL on non-IID data across nodes, achieving a tighter bound and superior convergence performance. Accordingly, we propose an adaptive aggregation weight strategy, called FedAAW. It can be easily incorporated into other FL methods to improve their convergence performance with negligible additional communication overhead. Extensive experiments on four common datasets show that FedAAW can effectively mitigate the performance degradation caused by data heterogeneity in various cases. Simply applying FedAAW to other methods can significantly improve their performance, achieving a maximum improvement of 37.32% in test accuracy and outperforming other state-of-the-art aggregation weight strategies.

Original languageEnglish
JournalIEEE Transactions on Cognitive Communications and Networking
DOIs
Publication statusAccepted/In press - 2025

Keywords

  • adaptive aggregation weight
  • edge networks
  • Federated learning
  • non-IID data

Fingerprint

Dive into the research topics of 'Federated Learning With Adaptive Aggregation Weights for Non-IID Data in Edge Networks'. Together they form a unique fingerprint.

Cite this