论文标题
迈向有效的聚类联合学习:具有自适应邻居匹配的点对点框架
Towards Effective Clustered Federated Learning: A Peer-to-peer Framework with Adaptive Neighbor Matching
论文作者
论文摘要
在联合学习(FL)中,客户可能具有多种目标,将所有客户的知识合并为一个全球模型将导致负面转移到本地绩效。因此,提出了聚类的FL将相似客户分组为群集并维护多个全局模型。在文献中,集中式的FL算法需要假设集群数量,因此不足以探索客户之间的潜在关系。在本文中,我们在不假设簇数的情况下提出了一个名为Panm的点对点(P2P)FL算法。在PANM中,客户与同行沟通以适应性地形成有效的聚类拓扑。具体而言,我们提出了两个新型指标,用于测量客户相似性和一个基于算法的蒙特卡洛方法和在高斯混合模型假设下的两阶段邻居匹配的指标和期望最大化。我们已经对PANM进行了有关邻居估计的概率以及群集最佳的误差差距的理论分析。我们还在合成和现实世界聚类异质性下实施了广泛的实验。理论分析和经验实验表明,所提出的算法优于P2P FL对应物,并且比集中式群集FL方法更好。 Panm即使在沟通预算极低的情况下也有效。
In federated learning (FL), clients may have diverse objectives, and merging all clients' knowledge into one global model will cause negative transfer to local performance. Thus, clustered FL is proposed to group similar clients into clusters and maintain several global models. In the literature, centralized clustered FL algorithms require the assumption of the number of clusters and hence are not effective enough to explore the latent relationships among clients. In this paper, without assuming the number of clusters, we propose a peer-to-peer (P2P) FL algorithm named PANM. In PANM, clients communicate with peers to adaptively form an effective clustered topology. Specifically, we present two novel metrics for measuring client similarity and a two-stage neighbor matching algorithm based Monte Carlo method and Expectation Maximization under the Gaussian Mixture Model assumption. We have conducted theoretical analyses of PANM on the probability of neighbor estimation and the error gap to the clustered optimum. We have also implemented extensive experiments under both synthetic and real-world clustered heterogeneity. Theoretical analysis and empirical experiments show that the proposed algorithm is superior to the P2P FL counterparts, and it achieves better performance than the centralized cluster FL method. PANM is effective even under extremely low communication budgets.