论文标题
与标签信息的歧管对齐
Manifold Alignment with Label Information
论文作者
论文摘要
多域数据变得越来越普遍,并提出了数据科学界的挑战和机遇。不同数据视图的集成可以用于探索性数据分析,并使下游分析受益,包括机器学习相关任务。考虑到这一点,我们提出了一种称为Mali的新型歧管对准方法(带有标签信息的多种对齐方式),该方法学习了两个不同领域之间的对应关系。马里可以被认为是属于更常见的半监督歧管比对问题之间的中间立场,这两个域之间的某些已知对应关系和纯粹无监督的情况,没有提供已知的对应关系。为此,马里通过扩散过程学习了两个域中的多种结构,然后利用离散的类标签来指导对齐。通过对齐两个不同的域,马里恢复了一个配对和一个共同表示,揭示了两个域中的相关样本。此外,马里可用于转移学习问题,称为域适应。我们表明,马里的表现优于多个数据集的当前最新歧管对齐方法。
Multi-domain data is becoming increasingly common and presents both challenges and opportunities in the data science community. The integration of distinct data-views can be used for exploratory data analysis, and benefit downstream analysis including machine learning related tasks. With this in mind, we present a novel manifold alignment method called MALI (Manifold alignment with label information) that learns a correspondence between two distinct domains. MALI can be considered as belonging to a middle ground between the more commonly addressed semi-supervised manifold alignment problem with some known correspondences between the two domains, and the purely unsupervised case, where no known correspondences are provided. To do this, MALI learns the manifold structure in both domains via a diffusion process and then leverages discrete class labels to guide the alignment. By aligning two distinct domains, MALI recovers a pairing and a common representation that reveals related samples in both domains. Additionally, MALI can be used for the transfer learning problem known as domain adaptation. We show that MALI outperforms the current state-of-the-art manifold alignment methods across multiple datasets.