论文标题
与局部邻里公平的图形学习
Graph Learning with Localized Neighborhood Fairness
论文作者
论文摘要
为下游应用程序学习公平的图表表示越来越重要,但是现有工作主要集中在通过修改图形结构或目标函数的情况下,而无需考虑节点的本地社区,以提高全球级别的公平性。在这项工作中,我们正式介绍了邻里公平的概念,并开发了一个计算框架来学习此类当地公平的嵌入。我们认为,由于基于GNN的模型在节点的当地社区层面上运行,因此邻里公平的概念更为合适。我们的邻居公平框架具有两个主要组成部分,可以灵活地从任意数据中学习公正的图表表示:第一个旨在为图中的任何任意节点构建公平的邻居,第二个旨在使这些公平的社区适应这些公平的社区,以更好地捕获某些依赖于数据的限制或与数据相关的约束,例如使社区更加偏向于某些属性或在图中进行依据。首先要研究公平链接分类的图表表示学习任务。我们演示了提议的邻里公平框架对各种图形机器学习任务的有效性,包括公平的链接预测,链接分类和学习公平的图形嵌入。值得注意的是,我们的方法不仅取得了更好的公平性,而且还提高了大多数案例中各种各样的图表,问题设置和指标的准确性。
Learning fair graph representations for downstream applications is becoming increasingly important, but existing work has mostly focused on improving fairness at the global level by either modifying the graph structure or objective function without taking into account the local neighborhood of a node. In this work, we formally introduce the notion of neighborhood fairness and develop a computational framework for learning such locally fair embeddings. We argue that the notion of neighborhood fairness is more appropriate since GNN-based models operate at the local neighborhood level of a node. Our neighborhood fairness framework has two main components that are flexible for learning fair graph representations from arbitrary data: the first aims to construct fair neighborhoods for any arbitrary node in a graph and the second enables adaption of these fair neighborhoods to better capture certain application or data-dependent constraints, such as allowing neighborhoods to be more biased towards certain attributes or neighbors in the graph.Furthermore, while link prediction has been extensively studied, we are the first to investigate the graph representation learning task of fair link classification. We demonstrate the effectiveness of the proposed neighborhood fairness framework for a variety of graph machine learning tasks including fair link prediction, link classification, and learning fair graph embeddings. Notably, our approach achieves not only better fairness but also increases the accuracy in the majority of cases across a wide variety of graphs, problem settings, and metrics.