论文标题
用于聚类的二维半非矩阵分解
Two-Dimensional Semi-Nonnegative Matrix Factorization for Clustering
论文作者
论文摘要
在本文中,我们为二维(2D)数据提出了一种新的半非矩阵分解方法,称为TS-NMF。它克服了现有方法的缺点,这些方法通过在预处理步骤中将2D数据转换为向量,从而严重损害了数据的空间信息。特别是,在构建新的数据表示形式的指导下寻求投影矩阵,以便保留空间信息,并通过聚类的目标增强了预测,这有助于构建最佳投影方向。此外,为了利用数据的非线性结构,在预计的子空间中构建了歧管,该子空间会根据预测进行自适应更新,并且较少受到数据的噪声和离群值的困扰,因此在预计的空间中更具代表性。因此,寻求预测,构建新的数据表示和学习歧管在单个模型中无缝集成,从而相互增强其他模型并导致强大的数据表示。与几种最先进的算法相比,全面的实验结果验证了TS-NMF的有效性,这表明所提出的现实世界应用方法的潜力很高。
In this paper, we propose a new Semi-Nonnegative Matrix Factorization method for 2-dimensional (2D) data, named TS-NMF. It overcomes the drawback of existing methods that seriously damage the spatial information of the data by converting 2D data to vectors in a preprocessing step. In particular, projection matrices are sought under the guidance of building new data representations, such that the spatial information is retained and projections are enhanced by the goal of clustering, which helps construct optimal projection directions. Moreover, to exploit nonlinear structures of the data, manifold is constructed in the projected subspace, which is adaptively updated according to the projections and less afflicted with noise and outliers of the data and thus more representative in the projected space. Hence, seeking projections, building new data representations, and learning manifold are seamlessly integrated in a single model, which mutually enhance other and lead to a powerful data representation. Comprehensive experimental results verify the effectiveness of TS-NMF in comparison with several state-of-the-art algorithms, which suggests high potential of the proposed method for real world applications.