论文标题
CTR预测的持续学习:一种混合方法
Continual Learning for CTR Prediction: A Hybrid Approach
论文作者
论文摘要
点击率(CTR)预测是每次点击成本(CPC)广告系统的核心任务,并且已经由机器学习从业人员进行了广泛的研究。尽管许多现有的方法已成功地在实践中部署,但其中大多数是基于I.I.D.(独立且分布相同的)假设的,忽略了用于培训和推理的点击数据是通过时间收集的,并且本质上是非平稳的和漂流的。这种不匹配将不可避免地导致次优性能。为了解决这个问题,我们将CTR预测作为一项持续的学习任务,并提出COLF,这是CTR预测的混合持续学习框架,该框架具有基于内存的模块化体系结构,旨在适应,当面对非平稳的漂移点击数据流时,旨在适应,持续学习和提供预测。 COLF已与记忆种群方法明确控制记忆和目标数据之间的差异,能够从其历史经验中获得积极的知识,并改善CTR预测。从中国一个主要的购物应用程序收集的点击日志上的经验评估证明了我们的方法优于现有方法。此外,我们已经在网上部署了我们的方法,并观察到了CTR和收入的显着改善,这进一步证明了我们的方法的功效。
Click-through rate(CTR) prediction is a core task in cost-per-click(CPC) advertising systems and has been studied extensively by machine learning practitioners. While many existing methods have been successfully deployed in practice, most of them are built upon i.i.d.(independent and identically distributed) assumption, ignoring that the click data used for training and inference is collected through time and is intrinsically non-stationary and drifting. This mismatch will inevitably lead to sub-optimal performance. To address this problem, we formulate CTR prediction as a continual learning task and propose COLF, a hybrid COntinual Learning Framework for CTR prediction, which has a memory-based modular architecture that is designed to adapt, learn and give predictions continuously when faced with non-stationary drifting click data streams. Married with a memory population method that explicitly controls the discrepancy between memory and target data, COLF is able to gain positive knowledge from its historical experience and makes improved CTR predictions. Empirical evaluations on click log collected from a major shopping app in China demonstrate our method's superiority over existing methods. Additionally, we have deployed our method online and observed significant CTR and revenue improvement, which further demonstrates our method's efficacy.