电子商务搜索中指定实体识别的端到端解决方案

论文标题

电子商务搜索中指定实体识别的端到端解决方案

An End-to-End Solution for Named Entity Recognition in eCommerce Search

论文作者

Cheng, Xiang, Bowden, Mitchell, Bhange, Bhushan Ramesh, Goyal, Priyanka, Packer, Thomas, Javed, Faizan

论文摘要

命名实体识别（NER）是现代搜索查询理解中的关键步骤。在电子商务领域，确定关键实体（例如品牌和产品类型）可以帮助搜索引擎检索相关产品，从而提供引人入胜的购物体验。最近的研究表明，使用深度学习方法对共享基准NER任务的有希望的结果，但是行业中关于领域知识，培训数据和模型生产的独特挑战。本文展示了解决这些挑战的端到端解决方案。解决方案的核心是一个新颖的模型培训框架“ Triplealearn”，它从三个独立的训练数据集中迭代学习，而不是传统上进行的一个训练集。使用这种方法，最佳模型将Holdout测试数据中的F1分数从69.5提高到93.3。在我们的离线实验中，与使用一组培训数据的传统培训方法相比，Triplelearn提高了模型性能。此外，在在线A/B测试中，我们看到了用户参与度和收入转换的显着改善。该模型已在Homedepot.com上直播了9个月以上，从而增加了搜索转化和收入。除了我们的应用外，这个三重核心框架以及端到端的流程是独立于模型的和与问题无关的，因此可以将其推广到更多的工业应用程序，尤其是对于具有类似数据基础和问题的电子商务行业。

Named entity recognition (NER) is a critical step in modern search query understanding. In the domain of eCommerce, identifying the key entities, such as brand and product type, can help a search engine retrieve relevant products and therefore offer an engaging shopping experience. Recent research shows promising results on shared benchmark NER tasks using deep learning methods, but there are still unique challenges in the industry regarding domain knowledge, training data, and model production. This paper demonstrates an end-to-end solution to address these challenges. The core of our solution is a novel model training framework "TripleLearn" which iteratively learns from three separate training datasets, instead of one training set as is traditionally done. Using this approach, the best model lifts the F1 score from 69.5 to 93.3 on the holdout test data. In our offline experiments, TripleLearn improved the model performance compared to traditional training approaches which use a single set of training data. Moreover, in the online A/B test, we see significant improvements in user engagement and revenue conversion. The model has been live on homedepot.com for more than 9 months, boosting search conversions and revenue. Beyond our application, this TripleLearn framework, as well as the end-to-end process, is model-independent and problem-independent, so it can be generalized to more industrial applications, especially to the eCommerce industry which has similar data foundations and problems.

下载PDF全文

下载文献需遵守相关版权规定

论文标题