FedAdaVR: 有限客户端参与下自适应方差缩减的鲁棒联邦学习

📄 中文摘要

联邦学习在异构环境下,面临梯度噪声、客户端漂移和部分客户端参与错误等严峻挑战,其中部分客户端参与错误最为普遍但现有文献关注不足。FedAdaVR是一种新颖的联邦学习算法,旨在通过引入自适应优化器和方差缩减机制,解决由零星客户端参与引起的异构性问题。该算法的核心在于动态调整学习率和方差缩减强度,以适应不同轮次中客户端参与率的变化。当参与客户端数量较少时,FedAdaVR能够更有效地抑制梯度方差的增加,防止模型训练过程中的震荡和发散。通过结合客户端本地数据分布的统计信息,FedAdaVR能够智能地为每个客户端分配合适的权重,并优化全局模型的聚合策略。

📄 English Summary

FedAdaVR: Adaptive Variance Reduction for Robust Federated Learning under Limited Client Participation

Federated learning (FL) encounters substantial challenges due to heterogeneity, leading to gradient noise, client drift, and partial client participation errors. The latter is particularly pervasive but remains insufficiently addressed in current literature. FedAdaVR proposes a novel FL algorithm specifically designed to mitigate heterogeneity issues arising from sporadic client participation by integrating an adaptive optimizer with a variance reduction mechanism. The core innovation of FedAdaVR lies in its ability to dynamically adjust learning rates and variance reduction strengths, adapting to varying client participation rates across different communication rounds. When the number of participating clients is small, FedAdaVR effectively suppresses the increase in gradient variance, preventing oscillations and divergence during model training. By incorporating statistical information about local data distributions, FedAdaVR intelligently assigns appropriate weights to each client and optimizes the global model aggregation strategy. Specifically, FedAdaVR introduces an adaptive variance estimation method based on historical gradient information, enabling accurate estimation and compensation of gradient variance even in complex scenarios involving frequent client churn and non-IID data distributions. Experimental evaluations demonstrate that FedAdaVR significantly accelerates convergence and achieves higher model accuracy compared to existing state-of-the-art FL algorithms across various heterogeneous datasets and diverse client participation patterns. Its robustness and performance advantages are particularly pronounced under extremely low client participation rates. FedAdaVR offers an efficient and practical solution to the persistent problem of uncertain client participation in federated learning, thereby facilitating its deployment in real-world applications.

Powered by Cloudflare Workers + Payload CMS + Claude 3.5

数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等