联合学习应用的基于群体的合奏学习证明

论文标题

联合学习应用的基于群体的合奏学习证明

Proof of Swarm Based Ensemble Learning for Federated Learning Applications

论文作者

Raza, Ali, Tran, Kim Phuc, Koehl, Ludovic, Li, Shujun

论文摘要

合奏学习结合了多个机器学习模型的结果，以便提供一个更好，优化的预测模型，并减少偏差，差异和改进的预测。但是，在联邦学习中，由于隐私问题，直接应用集中式合奏学习是不可行的。因此，需要一种机制来结合本地模型的结果以产生全局模型。大多数分布式共识算法，例如拜占庭式容错（BFT），通常在此类应用中表现不佳。这是因为，在这样的方法中，对某些同龄人的预测被忽略了，因此大多数同龄人甚至可以在不考虑其他同行的决定的情况下获胜。此外，通常不考虑每个同伴结果的置信度评分，尽管这是集合学习的重要特征。此外，诸如BFT之类的方法通常会使领带事件的问题未解决。为了填补这些研究空白，我们提出了POSW（群的证明），这是一种新型的分布式共识算法，用于在联合环境中进行集合学习，该算法的灵感来自基于粒子群的算法来解决优化问题。从理论上讲，所提出的算法始终以相对较少的步骤收敛，并具有解决绑带事件的机制，同时试图实现次级最低的解决方案。我们通过实验性地验证了使用ECG分类作为医疗保健中的示例应用程序的拟议算法的性能，这表明合奏学习模型的表现优于所有本地模型甚至基于FL的全局模型。据我们所知，提出的算法是对使用联合学习训练的分布式模型的输出结果达成共识的首次尝试。

Ensemble learning combines results from multiple machine learning models in order to provide a better and optimised predictive model with reduced bias, variance and improved predictions. However, in federated learning it is not feasible to apply centralised ensemble learning directly due to privacy concerns. Hence, a mechanism is required to combine results of local models to produce a global model. Most distributed consensus algorithms, such as Byzantine fault tolerance (BFT), do not normally perform well in such applications. This is because, in such methods predictions of some of the peers are disregarded, so a majority of peers can win without even considering other peers' decisions. Additionally, the confidence score of the result of each peer is not normally taken into account, although it is an important feature to consider for ensemble learning. Moreover, the problem of a tie event is often left un-addressed by methods such as BFT. To fill these research gaps, we propose PoSw (Proof of Swarm), a novel distributed consensus algorithm for ensemble learning in a federated setting, which was inspired by particle swarm based algorithms for solving optimisation problems. The proposed algorithm is theoretically proved to always converge in a relatively small number of steps and has mechanisms to resolve tie events while trying to achieve sub-optimum solutions. We experimentally validated the performance of the proposed algorithm using ECG classification as an example application in healthcare, showing that the ensemble learning model outperformed all local models and even the FL-based global model. To the best of our knowledge, the proposed algorithm is the first attempt to make consensus over the output results of distributed models trained using federated learning.

下载PDF全文

下载文献需遵守相关版权规定

论文标题