评估野外可靠性评估的深泰勒分解

论文标题

评估野外可靠性评估的深泰勒分解

Evaluating Deep Taylor Decomposition for Reliability Assessment in the Wild

论文作者

Brandl, Stephanie, Hershcovich, Daniel, Søgaard, Anders

论文摘要

我们认为，我们需要评估“野外”模型的可解释性方法，即在专业人士做出关键决策的情况下，模型可以潜在地帮助他们。我们提出了基于深层泰勒分解的代币归因的野外评估，专业记者进行可靠性评估。我们发现，在八卦语料库上进行了微调，将这种方法与罗伯塔·洛尔格（Roberta-Large）结合使用，导致了更快，更好的人类决策，以及对记者中新闻来源的更为批判性态度。我们介绍了人类和模型理由的比较，以及对记者在循环决策中的经历的定性分析。

We argue that we need to evaluate model interpretability methods 'in the wild', i.e., in situations where professionals make critical decisions, and models can potentially assist them. We present an in-the-wild evaluation of token attribution based on Deep Taylor Decomposition, with professional journalists performing reliability assessments. We find that using this method in conjunction with RoBERTa-Large, fine-tuned on the Gossip Corpus, led to faster and better human decision-making, as well as a more critical attitude toward news sources among the journalists. We present a comparison of human and model rationales, as well as a qualitative analysis of the journalists' experiences with machine-in-the-loop decision making.

下载PDF全文

下载文献需遵守相关版权规定

论文标题