论文标题
三角图兴趣网络用于点击率预测
Triangle Graph Interest Network for Click-through Rate Prediction
论文作者
论文摘要
点击率预测是在线广告中的关键任务。当前,许多现有方法试图从历史点击行为序列中提取用户潜在兴趣。但是,很难处理稀疏的用户行为或扩大兴趣探索。最近,一些研究人员将项目项目共发生图作为辅助图。由于用户兴趣的令人震惊,这些作品仍然无法确定用户点击行为的真正动机。此外,这些作品更偏向于流行或类似商品。他们缺乏打破多样性限制的有效机制。在本文中,我们指出了针对推荐系统的项目 - 项目图中三角形的两个特殊特性:三角形同质和三角间异质。基于此,我们提出了一个名为Triangle Graph兴趣网络(TGIN)的新颖有效的框架。对于用户行为序列中的每个点击项目,我们将三角形在其项目 - 项目图的附近介绍作为补充。 Tgin将这些三角形视为用户兴趣的基本单位,该单位提供了捕获用户单击项目的真实动机的线索。我们通过汇总几个兴趣单位的信息来减轻难以捉摸的动机问题来表征每个点击行为。注意机制决定了用户对不同兴趣单位的偏爱。通过选择各种和相对三角形,Tgin带来了新颖和偶然的项目,以扩大用户兴趣的探索机会。然后,我们汇总了历史行为序列的多层次兴趣,以改善CTR预测。对公共和工业数据集进行的广泛实验清楚地验证了我们框架的有效性。
Click-through rate prediction is a critical task in online advertising. Currently, many existing methods attempt to extract user potential interests from historical click behavior sequences. However, it is difficult to handle sparse user behaviors or broaden interest exploration. Recently, some researchers incorporate the item-item co-occurrence graph as an auxiliary. Due to the elusiveness of user interests, those works still fail to determine the real motivation of user click behaviors. Besides, those works are more biased towards popular or similar commodities. They lack an effective mechanism to break the diversity restrictions. In this paper, we point out two special properties of triangles in the item-item graphs for recommendation systems: Intra-triangle homophily and Inter-triangle heterophiy. Based on this, we propose a novel and effective framework named Triangle Graph Interest Network (TGIN). For each clicked item in user behavior sequences, we introduce the triangles in its neighborhood of the item-item graphs as a supplement. TGIN regards these triangles as the basic units of user interests, which provide the clues to capture the real motivation for a user clicking an item. We characterize every click behavior by aggregating the information of several interest units to alleviate the elusive motivation problem. The attention mechanism determines users' preference for different interest units. By selecting diverse and relative triangles, TGIN brings in novel and serendipitous items to expand exploration opportunities of user interests. Then, we aggregate the multi-level interests of historical behavior sequences to improve CTR prediction. Extensive experiments on both public and industrial datasets clearly verify the effectiveness of our framework.