论文标题

学会解释:用于识别多ihop问题避开中有效推理链的数据集和模型

Learning to Explain: Datasets and Models for Identifying Valid Reasoning Chains in Multihop Question-Answering

论文作者

Jhamtani, Harsh, Clark, Peter

论文摘要

尽管MultiHop提问(QA)取得了迅速的进展,但模型仍然很难解释为什么答案是正确的,并且有限的解释培训数据可以从中学习。为了解决这个问题,我们介绍了三个解释数据集,其中用语料库事实形成的解释被注释。我们的第一个数据集EQASC包含多台面问题答案数据集QASC的9​​8K解释注释,并且是第一个注释每个答案的多个候选解释的注释。第二个数据集EQASC受扰动是通过众包扰动(同时保留其有效性)来构建的,以测试解释预测模型的一致性和概括性。第三个数据集EOBQA是通过向OBQA数据集添加说明注释来测试在EQASC训练的模型的概括来构建的。我们表明,这些数据可用于使用基于BERT的分类器来显着提高解释质量(在强率基线上+14%的绝对F1),但仍在上限后面,为将来的研究带来了新的挑战。我们还探索了一种避开链表示,其中重复的名词短语被变量替换,从而将它们变成了广义的推理链(例如:“ x is a y”和“ y has hm ham has z”暗示“ x具有z”)。我们发现,广义链保持性能,同时对某些扰动更加强大。

Despite the rapid progress in multihop question-answering (QA), models still have trouble explaining why an answer is correct, with limited explanation training data available to learn from. To address this, we introduce three explanation datasets in which explanations formed from corpus facts are annotated. Our first dataset, eQASC, contains over 98K explanation annotations for the multihop question answering dataset QASC, and is the first that annotates multiple candidate explanations for each answer. The second dataset eQASC-perturbed is constructed by crowd-sourcing perturbations (while preserving their validity) of a subset of explanations in QASC, to test consistency and generalization of explanation prediction models. The third dataset eOBQA is constructed by adding explanation annotations to the OBQA dataset to test generalization of models trained on eQASC. We show that this data can be used to significantly improve explanation quality (+14% absolute F1 over a strong retrieval baseline) using a BERT-based classifier, but still behind the upper bound, offering a new challenge for future research. We also explore a delexicalized chain representation in which repeated noun phrases are replaced by variables, thus turning them into generalized reasoning chains (for example: "X is a Y" AND "Y has Z" IMPLIES "X has Z"). We find that generalized chains maintain performance while also being more robust to certain perturbations.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源