论文标题
Xinsight:通过因果关系的可解释数据分析
XInsight: eXplainable Data Analysis Through The Lens of Causality
论文作者
论文摘要
鉴于探索性数据分析的日益普及(EDA),了解EDA获得的知识的根本原因至关重要。但是,它的研究仍然不足。这项研究促进了对数据分析的透明且可解释的观点,称为可解释的数据分析(XDA)。因此,我们提出了XInsight,这是XDA的一般框架。 Xinsight提供了有关因果和非因果语义的定性和定量解释的数据分析。这样,它将显着提高人类对数据分析结果的理解和信心,从而促进现实世界中准确的数据解释和决策。 Xinsight是一种三模块的端到端管道,旨在提取因果图,将因果原语转化为XDA语义,并量化每个解释对数据事实的定量贡献。 Xinsight使用一组设计概念和优化来解决与因果关系纳入XDA相关的固有困难。关于合成和现实世界数据集以及用户研究的实验证明了Xinsight的高度有希望的功能。
In light of the growing popularity of Exploratory Data Analysis (EDA), understanding the underlying causes of the knowledge acquired by EDA is crucial. However, it remains under-researched. This study promotes a transparent and explicable perspective on data analysis, called eXplainable Data Analysis (XDA). For this reason, we present XInsight, a general framework for XDA. XInsight provides data analysis with qualitative and quantitative explanations of causal and non-causal semantics. This way, it will significantly improve human understanding and confidence in the outcomes of data analysis, facilitating accurate data interpretation and decision making in the real world. XInsight is a three-module, end-to-end pipeline designed to extract causal graphs, translate causal primitives into XDA semantics, and quantify the quantitative contribution of each explanation to a data fact. XInsight uses a set of design concepts and optimizations to address the inherent difficulties associated with integrating causality into XDA. Experiments on synthetic and real-world datasets as well as a user study demonstrate the highly promising capabilities of XInsight.