论文标题
作者名称在异构信息网络上具有对抗表示学习的歧义歧义
Author Name Disambiguation on Heterogeneous Information Network with Adversarial Representation Learning
论文作者
论文摘要
作者命名歧义会导致学术信息检索的不足和不便,这增加了作者名称消除歧义的必要性(and)。现有和方法可以分为两个类别:重点关注内容信息的模型,以区分同一作者是否编写了两个论文,这些模型着重于关系信息以表示网络上的边缘信息并量化论文之间的相似性。但是,前者需要足够的标签样本和信息性的负样本,并且在衡量论文之间的高阶连接方面也无效,而后者则需要复杂的功能工程或监督来构建网络。我们提出了一个新颖的生成对抗框架,以将两类模型种植在一起:(i)区分模块区分了两篇论文是否与同一位作者区分开来,(ii)生成模块选择可能直接从异质信息网络中选择可能均匀的论文,从而消除了复杂的特征工程。通过这种方式,判别模块指导生成模块选择均匀的论文,而生成模块生成了高质量的负样本来训练判别模块,以使其意识到论文之间的高阶连接。此外,判别模块和随机步行生成算法的自我训练策略旨在使训练稳定和高效。对两个现实世界和基准测试的广泛实验表明,我们的模型对最先进的方法提供了显着的性能改善。
Author name ambiguity causes inadequacy and inconvenience in academic information retrieval, which raises the necessity of author name disambiguation (AND). Existing AND methods can be divided into two categories: the models focusing on content information to distinguish whether two papers are written by the same author, the models focusing on relation information to represent information as edges on the network and to quantify the similarity among papers. However, the former requires adequate labeled samples and informative negative samples, and are also ineffective in measuring the high-order connections among papers, while the latter needs complicated feature engineering or supervision to construct the network. We propose a novel generative adversarial framework to grow the two categories of models together: (i) the discriminative module distinguishes whether two papers are from the same author, and (ii) the generative module selects possibly homogeneous papers directly from the heterogeneous information network, which eliminates the complicated feature engineering. In such a way, the discriminative module guides the generative module to select homogeneous papers, and the generative module generates high-quality negative samples to train the discriminative module to make it aware of high-order connections among papers. Furthermore, a self-training strategy for the discriminative module and a random walk based generating algorithm are designed to make the training stable and efficient. Extensive experiments on two real-world AND benchmarks demonstrate that our model provides significant performance improvement over the state-of-the-art methods.