论文标题
PGE:可靠的产品图嵌入错误检测的学习
PGE: Robust Product Graph Embedding Learning for Error Detection
论文作者
论文摘要
尽管近年来产品图(PGS)在产品搜索和建议中的成功应用中引起了人们的注意,但PGS的广泛功能可能会受到各种错误的不可避免的参与而受到限制。因此,至关重要的是,验证PG中三倍的正确性以提高其可靠性。知识图(kg)嵌入方法具有强大的错误检测能力。但是,由于其独特的特征,现有的kg嵌入方法可能不直接适用于PG:(1)PG包含丰富的文本信号,这需要对文本信息和图形结构进行联合探索; (2)PG包含大量属性三元组,其中属性值由自由文本表示。由于自由文本太灵活,无法在kgs中定义实体,因此使用IDS将实体映射到其嵌入的传统方式不再适合属性值表示; (3)PG中的嘈杂三元组误导了嵌入学习,并严重损害了错误检测的性能。为了应对上述挑战,我们提出了一个端到端的嵌入学习框架,PGE,以共同利用PG中的文本信息和图形结构,以学习嵌入嵌入以进行错误检测。现实世界中产品图的实验结果证明了与最新方法相比,提出的框架的有效性。
Although product graphs (PGs) have gained increasing attentions in recent years for their successful applications in product search and recommendations, the extensive power of PGs can be limited by the inevitable involvement of various kinds of errors. Thus, it is critical to validate the correctness of triples in PGs to improve their reliability. Knowledge graph (KG) embedding methods have strong error detection abilities. Yet, existing KG embedding methods may not be directly applicable to a PG due to its distinct characteristics: (1) PG contains rich textual signals, which necessitates a joint exploration of both text information and graph structure; (2) PG contains a large number of attribute triples, in which attribute values are represented by free texts. Since free texts are too flexible to define entities in KGs, traditional way to map entities to their embeddings using ids is no longer appropriate for attribute value representation; (3) Noisy triples in a PG mislead the embedding learning and significantly hurt the performance of error detection. To address the aforementioned challenges, we propose an end-to-end noise-tolerant embedding learning framework, PGE, to jointly leverage both text information and graph structure in PG to learn embeddings for error detection. Experimental results on real-world product graph demonstrate the effectiveness of the proposed framework comparing with the state-of-the-art approaches.