Paper Title
GraphMAE: Self-Supervised Masked Graph Autoencoders
Paper Authors
Paper Abstract
Self-supervised learning (SSL) has been extensively explored in recent years. In particular, generative SSL has seen emerging success in natural language processing and other AI fields, such as the wide adoption of BERT and GPT. Despite this, contrastive learning, which heavily relies on structural data augmentation and complicated training strategies, has been the dominant approach in graph SSL, while the progress of generative SSL on graphs, especially graph autoencoders (GAEs), has thus far fallen short of the promise it has shown in other fields. In this paper, we identify and examine the issues that negatively impact the development of GAEs, including their reconstruction objective, training robustness, and error metric. We present GraphMAE, a masked graph autoencoder that mitigates these issues for generative self-supervised graph pre-training. Instead of reconstructing graph structures, we propose to focus on feature reconstruction, using a masking strategy and a scaled cosine error that together benefit the robust training of GraphMAE. We conduct extensive experiments on 21 public datasets across three different graph learning tasks. The results show that GraphMAE, a simple graph autoencoder with careful designs, consistently outperforms both contrastive and generative state-of-the-art baselines. This study provides an understanding of graph autoencoders and demonstrates the potential of generative self-supervised pre-training on graphs.
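To make the two designs highlighted in the abstract concrete, below is a minimal PyTorch sketch of (1) masking node features with a shared mask token and (2) the scaled cosine error (SCE) used to score feature reconstruction on the masked nodes. The helper names, tensor shapes, and the zero-vector stand-in for the learnable mask token are illustrative assumptions, not the authors' released implementation.

```python
# Minimal sketch of GraphMAE's masked feature reconstruction pieces,
# independent of any particular GNN library. Assumed helpers:
# mask_node_features() and scaled_cosine_error() are hypothetical names.

import torch
import torch.nn.functional as F


def mask_node_features(x, mask_rate=0.5, mask_token=None):
    """Replace a random subset of node feature rows with a shared mask token.

    x: (num_nodes, feat_dim) node feature matrix.
    Returns the corrupted features and the indices of the masked nodes.
    """
    num_nodes = x.size(0)
    perm = torch.randperm(num_nodes)
    masked_idx = perm[: int(mask_rate * num_nodes)]

    if mask_token is None:
        # The paper uses a learnable [MASK] token; a zero vector stands in here.
        mask_token = torch.zeros(x.size(1))

    x_corrupted = x.clone()
    x_corrupted[masked_idx] = mask_token
    return x_corrupted, masked_idx


def scaled_cosine_error(x_true, x_rec, gamma=2.0):
    """Scaled cosine error averaged over the masked nodes.

    SCE = mean_i (1 - cos(x_i, z_i)) ** gamma, where gamma >= 1
    down-weights samples that are already well reconstructed.
    """
    cos = F.cosine_similarity(x_true, x_rec, dim=-1)
    return ((1.0 - cos) ** gamma).mean()


if __name__ == "__main__":
    x = torch.randn(100, 32)                 # toy node features
    x_corrupted, masked_idx = mask_node_features(x, mask_rate=0.5)
    x_rec = torch.randn_like(x)              # stand-in for a decoder's output
    loss = scaled_cosine_error(x[masked_idx], x_rec[masked_idx])
    print(f"SCE loss on masked nodes: {loss.item():.4f}")
```

In a full pipeline, `x_corrupted` would be fed through a GNN encoder-decoder and the loss would be backpropagated; computing the error only on the masked rows is what makes the objective a masked-reconstruction task rather than a plain autoencoding one.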