Paper Title

Investigating the fidelity of explainable artificial intelligence methods for applications of convolutional neural networks in geoscience

Paper Authors

Antonios Mamalakis, Elizabeth A. Barnes, Imme Ebert-Uphoff

Paper Abstract

Convolutional neural networks (CNNs) have recently attracted great attention in geoscience due to their ability to capture non-linear system behavior and extract predictive spatiotemporal patterns. Given their black-box nature, however, and the importance of prediction explainability, methods of explainable artificial intelligence (XAI) are gaining popularity as a means to explain the CNN decision-making strategy. Here, we establish an intercomparison of some of the most popular XAI methods and investigate their fidelity in explaining CNN decisions for geoscientific applications. Our goal is to raise awareness of the theoretical limitations of these methods and gain insight into their relative strengths and weaknesses to help guide best practices. The considered XAI methods are first applied to an idealized attribution benchmark, where the ground truth of the network's explanation is known a priori, to help objectively assess their performance. Second, we apply XAI to a climate-related prediction setting, namely to explain a CNN that is trained to predict the number of atmospheric rivers in daily snapshots of climate simulations. Our results highlight several important issues of XAI methods (e.g., gradient shattering, inability to distinguish the sign of attribution, ignorance to zero input) that have previously been overlooked in our field and, if not considered cautiously, may lead to a distorted picture of the CNN decision-making strategy. We envision that our analysis will motivate further investigation into XAI fidelity and will help toward a cautious implementation of XAI in geoscience, which can lead to further exploitation of CNNs and deep learning for prediction problems.
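To make the class of methods under comparison concrete, below is a minimal sketch (not the authors' code) of two of the simplest gradient-based XAI attributions, raw-gradient saliency and input-times-gradient, applied to a placeholder PyTorch CNN. The architecture, input shape, and all variable names are illustrative assumptions, not the network used in the paper.

```python
# Minimal sketch of gradient-based attribution for a CNN (hypothetical setup).
import torch
import torch.nn as nn

# Hypothetical small CNN standing in for a trained geoscience network
# that maps a gridded field to a single scalar prediction.
cnn = nn.Sequential(
    nn.Conv2d(1, 8, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(8, 1),
)
cnn.eval()

# Hypothetical input: one single-channel 64x64 "daily snapshot".
x = torch.randn(1, 1, 64, 64, requires_grad=True)

out = cnn(x)          # scalar prediction (e.g., a count or index)
out.sum().backward()  # gradient of the output w.r.t. every input pixel

saliency = x.grad.clone()           # raw-gradient ("saliency") attribution
input_x_grad = x.grad * x.detach()  # "input x gradient" attribution
```

Note how the input-times-gradient attribution is identically zero wherever the input is zero, regardless of how important that pixel may be to the prediction; this is one concrete instance of the "ignorance to zero input" issue flagged in the abstract.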
