论文标题

评估野外可靠性评估的深泰勒分解

Evaluating Deep Taylor Decomposition for Reliability Assessment in the Wild

论文作者

Brandl, Stephanie, Hershcovich, Daniel, Søgaard, Anders

论文摘要

我们认为,我们需要评估“野外”模型的可解释性方法,即在专业人士做出关键决策的情况下,模型可以潜在地帮助他们。我们提出了基于深层泰勒分解的代币归因的野外评估,专业记者进行可靠性评估。我们发现,在八卦语料库上进行了微调,将这种方法与罗伯塔·洛尔格(Roberta-Large)结合使用,导致了更快,更好的人类决策,以及对记者中新闻来源的更为批判性态度。我们介绍了人类和模型理由的比较,以及对记者在循环决策中的经历的定性分析。

We argue that we need to evaluate model interpretability methods 'in the wild', i.e., in situations where professionals make critical decisions, and models can potentially assist them. We present an in-the-wild evaluation of token attribution based on Deep Taylor Decomposition, with professional journalists performing reliability assessments. We find that using this method in conjunction with RoBERTa-Large, fine-tuned on the Gossip Corpus, led to faster and better human decision-making, as well as a more critical attitude toward news sources among the journalists. We present a comparison of human and model rationales, as well as a qualitative analysis of the journalists' experiences with machine-in-the-loop decision making.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源