重新加权型型摩擦式匹配的负面样品

论文标题

重新加权型型摩擦式匹配的负面样品

Re-weighting Negative Samples for Model-Agnostic Matching

论文作者

Lou, Jiazhen, Wen, Hong, Lv, Fuyu, Zhang, Jing, Yuan, Tengfei, Li, Zhao

论文摘要

推荐系统（RS）是一种有效的工具，可以从一个非常大的语料库中发现用户感兴趣的物品，吸引了学术界和行业的关注。作为RS的初始阶段，大规模匹配是基本但充满挑战的。一个典型的食谱是通过两个较高的体系结构学习用户和项目表示，然后计算两个表示向量之间的相似性得分，但是在如何正确处理负面样本方面仍然挣扎。在本文中，我们发现，从整个空间中随机采样负样本并同样对其进行处理的共同做法不是最佳选择，因为不同阶段的不同子空间的负样本对匹配模型具有不同的重要性。为了解决这个问题，我们提出了一种名为“无偏模型匹配方法”（UMA $^2 $）的新颖方法。它由两个基本模块组成，包括1）通用匹配模型（GMM），该模型是模型不合时宜的，可以作为任何基于嵌入式的两个较高型模型实现； 2）负样本DEBIAS网络（NSDN），该网络通过借用反向倾向加权（IPW）的概念来区分负样本，并重新赢得GMM中的损失。 UMA $^2 $无缝将这两个模块集成到端到端的多任务学习框架中。对现实世界脱机数据集和在线A/B测试的广泛实验证明了其优于最先进的方法。

Recommender Systems (RS), as an efficient tool to discover users' interested items from a very large corpus, has attracted more and more attention from academia and industry. As the initial stage of RS, large-scale matching is fundamental yet challenging. A typical recipe is to learn user and item representations with a two-tower architecture and then calculate the similarity score between both representation vectors, which however still struggles in how to properly deal with negative samples. In this paper, we find that the common practice that randomly sampling negative samples from the entire space and treating them equally is not an optimal choice, since the negative samples from different sub-spaces at different stages have different importance to a matching model. To address this issue, we propose a novel method named Unbiased Model-Agnostic Matching Approach (UMA$^2$). It consists of two basic modules including 1) General Matching Model (GMM), which is model-agnostic and can be implemented as any embedding-based two-tower models; and 2) Negative Samples Debias Network (NSDN), which discriminates negative samples by borrowing the idea of Inverse Propensity Weighting (IPW) and re-weighs the loss in GMM. UMA$^2$ seamlessly integrates these two modules in an end-to-end multi-task learning framework. Extensive experiments on both real-world offline dataset and online A/B test demonstrate its superiority over state-of-the-art methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题