论文标题
一系列不幸的反事件事件:时间在反事实解释中的作用
A Series of Unfortunate Counterfactual Events: the Role of Time in Counterfactual Explanations
论文作者
论文摘要
反事实解释是可解释的人工智能研究领域中事后解释性方法的重要例子。他们为个人提供替代方案和一系列建议,以实现受欢迎的机器学习模型结果。最近,文献已经确定了反事实解释的逃亡者,例如可行性,可行性和稀疏性,这些解释应支持其在现实世界中的适用性。但是,我们表明文献已经忽略了反事实解释的时间依赖性问题。我们认为,由于他们的时间依赖性和由于提供了建议,甚至可行,可行和稀疏的反事实解释可能在实际应用中可能不合适。这是由于我们所谓的“不幸的反事件事件”的可能出现。这些事件可能是由于机器学习模型的重新培训而发生的,必须通过反事实解释来解释其结果。一系列不幸的反事实事件使那些成功实施反事实解释建议的人的努力感到沮丧。这对人们对机构能够始终如一地提供机器学习支持的决策的能力的信任产生了负面影响。我们介绍了一种方法来解决不幸的反事件事件的出现问题,该事件利用了反事实解释的历史。在本文的最后一部分中,我们提出了两种不同的策略的道德分析,以应对不幸的反事件事件的挑战。我们表明,他们应对保留信贷贷款组织的可信赖性,他们采用的决策模式以及信贷贷款的社会经济职能的符合道德责任的当务之急。
Counterfactual explanations are a prominent example of post-hoc interpretability methods in the explainable Artificial Intelligence research domain. They provide individuals with alternative scenarios and a set of recommendations to achieve a sought-after machine learning model outcome. Recently, the literature has identified desiderata of counterfactual explanations, such as feasibility, actionability and sparsity that should support their applicability in real-world contexts. However, we show that the literature has neglected the problem of the time dependency of counterfactual explanations. We argue that, due to their time dependency and because of the provision of recommendations, even feasible, actionable and sparse counterfactual explanations may not be appropriate in real-world applications. This is due to the possible emergence of what we call "unfortunate counterfactual events." These events may occur due to the retraining of machine learning models whose outcomes have to be explained via counterfactual explanation. Series of unfortunate counterfactual events frustrate the efforts of those individuals who successfully implemented the recommendations of counterfactual explanations. This negatively affects people's trust in the ability of institutions to provide machine learning-supported decisions consistently. We introduce an approach to address the problem of the emergence of unfortunate counterfactual events that makes use of histories of counterfactual explanations. In the final part of the paper we propose an ethical analysis of two distinct strategies to cope with the challenge of unfortunate counterfactual events. We show that they respond to an ethically responsible imperative to preserve the trustworthiness of credit lending organizations, the decision models they employ, and the social-economic function of credit lending.