捍卫开放域问题中的虚假信息攻击回答

论文标题

捍卫开放域问题中的虚假信息攻击回答

Defending Against Disinformation Attacks in Open-Domain Question Answering

论文作者

Weller, Orion, Khan, Aleem, Weir, Nathaniel, Lawrie, Dawn, Van Durme, Benjamin

论文摘要

开放域问答（ODQA）的最新工作表明，搜索收集的对抗中毒可能会导致生产系统的准确性大量下降。但是，几乎没有工作提出了防御这些攻击的方法。为此，我们依靠直觉，即大型语料库中经常存在冗余信息。为了找到它，我们介绍了一种使用查询增强的方法来搜索可以回答原始问题但不太可能被中毒的各种段落。我们通过设计一种新颖的置信度方法将这些新段落整合到模型中，将预测的答案与其在检索到的上下文中的外观进行比较（我们称之为答案冗余的信心，即汽车）。这些方法共同提供了一种简单但有效的方法，以防御中毒攻击，从而在不同程度的数据中毒/知识冲突中提供了近20％的确切匹配。

Recent work in open-domain question answering (ODQA) has shown that adversarial poisoning of the search collection can cause large drops in accuracy for production systems. However, little to no work has proposed methods to defend against these attacks. To do so, we rely on the intuition that redundant information often exists in large corpora. To find it, we introduce a method that uses query augmentation to search for a diverse set of passages that could answer the original question but are less likely to have been poisoned. We integrate these new passages into the model through the design of a novel confidence method, comparing the predicted answer to its appearance in the retrieved contexts (what we call Confidence from Answer Redundancy, i.e. CAR). Together these methods allow for a simple but effective way to defend against poisoning attacks that provides gains of nearly 20% exact match across varying levels of data poisoning/knowledge conflicts.

下载PDF全文

下载文献需遵守相关版权规定

论文标题