贝叶斯矩阵分解的高性能实施，沟通有限

论文标题

贝叶斯矩阵分解的高性能实施，沟通有限

A High-Performance Implementation of Bayesian Matrix Factorization with Limited Communication

论文作者

Aa, Tom Vander, Qin, Xiangju, Blomstedt, Paul, Wuyts, Roel, Verachtert, Wilfried, Kaski, Samuel

论文摘要

矩阵分解是建议系统中非常常见的机器学习技术。贝叶斯基质分解（BMF）算法将很有吸引力，因为它们能够量化预测的不确定性并避免过度拟合，并结合高预测准确性。但是，由于它们的计算成本过高，因此尚未在大规模数据上广泛使用。在最近的工作中，已经通过提高BMF算法的可扩展性及其实施来降低成本的努力，但至今主要是分开的。在本文中，我们表明，可以将两种可扩展性方法的最新方法结合在一起。我们将最近高度易于估计的后验传播算法与BMF合作，该算法将矩阵块的计算与分布式BMF实现相结合，该算法与每个块中的分布式BMF实现相结合。我们表明，这两种方法的组合在缩短壁式锁定时间时，在Web尺度数据集上可扩展BMF的可伸缩性。

Matrix factorization is a very common machine learning technique in recommender systems. Bayesian Matrix Factorization (BMF) algorithms would be attractive because of their ability to quantify uncertainty in their predictions and avoid over-fitting, combined with high prediction accuracy. However, they have not been widely used on large-scale data because of their prohibitive computational cost. In recent work, efforts have been made to reduce the cost, both by improving the scalability of the BMF algorithm as well as its implementation, but so far mainly separately. In this paper we show that the state-of-the-art of both approaches to scalability can be combined. We combine the recent highly-scalable Posterior Propagation algorithm for BMF, which parallelizes computation of blocks of the matrix, with a distributed BMF implementation that users asynchronous communication within each block. We show that the combination of the two methods gives substantial improvements in the scalability of BMF on web-scale datasets, when the goal is to reduce the wall-clock time.

下载PDF全文

下载文献需遵守相关版权规定

论文标题