论文标题
通过问答和回答答案检索的问题和回答互轴增强双重编码器
Enhancing Dual-Encoders with Question and Answer Cross-Embeddings for Answer Retrieval
论文作者
论文摘要
双重编码器是回答回答答案(QA)系统的有前途的机制。目前,大多数传统的双重编码器仅通过匹配分数来学习问题和答案的语义表示。研究人员建议在评分函数中介绍质量检查功能,但以推理阶段效率低的代价。为了在推理阶段进行独立编码问题和答案,进一步介绍了各种自动编码器,以重建问题(答案)嵌入的答案(问题)作为辅助任务,以增强训练阶段中表示中的质量检查。但是,文本生成和答案检索的需求不同,这导致了训练的硬度。在这项工作中,我们提出了一个框架来增强双重编码器模型,并通过问题答案交叉插件和一种新颖的几何形状对准机制(GAM)使双重编码器的嵌入几何形状与跨编码器的几何形状相结合。广泛的实验结果表明,我们的框架显着改善了双编码器模型,并且在多个答案检索数据集上胜过最新方法。
Dual-Encoders is a promising mechanism for answer retrieval in question answering (QA) systems. Currently most conventional Dual-Encoders learn the semantic representations of questions and answers merely through matching score. Researchers proposed to introduce the QA interaction features in scoring function but at the cost of low efficiency in inference stage. To keep independent encoding of questions and answers during inference stage, variational auto-encoder is further introduced to reconstruct answers (questions) from question (answer) embeddings as an auxiliary task to enhance QA interaction in representation learning in training stage. However, the needs of text generation and answer retrieval are different, which leads to hardness in training. In this work, we propose a framework to enhance the Dual-Encoders model with question answer cross-embeddings and a novel Geometry Alignment Mechanism (GAM) to align the geometry of embeddings from Dual-Encoders with that from Cross-Encoders. Extensive experimental results show that our framework significantly improves Dual-Encoders model and outperforms the state-of-the-art method on multiple answer retrieval datasets.