论文标题
通过自定义预训练的多转响应选择的图形推理网络
A Graph Reasoning Network for Multi-turn Response Selection via Customized Pre-training
论文作者
论文摘要
我们研究基于检索的聊天机器人中多转交谈的响应选择。现有研究通过基于学习的功能来计算匹配分数,更多地关注话语和响应之间的匹配,从而导致模型推理能力不足。在本文中,我们提出了一个图形结构网络(GRN)来解决该问题。 GRN首先使用专门为响应选择而设计的下一个话语预测和话语顺序预测任务,基于Albert进行预训练。这两个定制的预训练任务可以使我们的模型具有捕获语音之间的语义和按时间顺序依赖性的能力。然后,我们在具有序列推理和图形推理结构的集成网络上微调模型。序列推理模块从全球角度的话语 - 响应对的高度汇总上下文向量进行推理。图形推理模块从局部角度进行了引言级图神经网络的推理。两个对话推理数据集的实验表明,我们的模型可以极大地超过强大的基线方法,并且可以实现与人级接近的性能。
We investigate response selection for multi-turn conversation in retrieval-based chatbots. Existing studies pay more attention to the matching between utterances and responses by calculating the matching score based on learned features, leading to insufficient model reasoning ability. In this paper, we propose a graph-reasoning network (GRN) to address the problem. GRN first conducts pre-training based on ALBERT using next utterance prediction and utterance order prediction tasks specifically devised for response selection. These two customized pre-training tasks can endow our model with the ability of capturing semantical and chronological dependency between utterances. We then fine-tune the model on an integrated network with sequence reasoning and graph reasoning structures. The sequence reasoning module conducts inference based on the highly summarized context vector of utterance-response pairs from the global perspective. The graph reasoning module conducts the reasoning on the utterance-level graph neural network from the local perspective. Experiments on two conversational reasoning datasets show that our model can dramatically outperform the strong baseline methods and can achieve performance which is close to human-level.