学习多个时间序列中的潜在因果关系

论文标题

学习多个时间序列中的潜在因果关系

Learning latent causal relationships in multiple time series

论文作者

Dmochowski, Jacek P.

论文摘要

确定具有多个动态元素的系统的因果结构对于几个科学学科至关重要。常规的方法是在被选为先验选择的信号之间进行因果关系的统计检验，例如与Granger因果关系进行。在这里，人们认为，在许多系统中，因果关系被嵌入在观察到的数据中作为线性混合物中表达的潜在空间。提出了一种盲目识别潜在来源的技术：将观察结果投射到成对的组件（驾驶和驱动）成对，以最大程度地提高对之间的因果关系。这导致了具有目标函数和梯度的封闭形式表达式的优化问题，可以通过现成的技术解决。在证明了具有已知潜在结构的合成数据的概念概念后，该技术将应用于人类脑和历史加密货币价格的记录。在这两种情况下，该方法都恢复了多种强大的因果关系，这些因果关系在观察到的数据中尚不明显。所提出的技术是无监督的，并且可以很容易地应用于任何多个时间序列，以阐明数据的因果关系。

Identifying the causal structure of systems with multiple dynamic elements is critical to several scientific disciplines. The conventional approach is to conduct statistical tests of causality, for example with Granger Causality, between observed signals that are selected a priori. Here it is posited that, in many systems, the causal relations are embedded in a latent space that is expressed in the observed data as a linear mixture. A technique for blindly identifying the latent sources is presented: the observations are projected into pairs of components -- driving and driven -- to maximize the strength of causality between the pairs. This leads to an optimization problem with closed form expressions for the objective function and gradient that can be solved with off-the-shelf techniques. After demonstrating proof-of-concept on synthetic data with known latent structure, the technique is applied to recordings from the human brain and historical cryptocurrency prices. In both cases, the approach recovers multiple strong causal relationships that are not evident in the observed data. The proposed technique is unsupervised and can be readily applied to any multiple time series to shed light on the causal relationships underlying the data.

下载PDF全文

下载文献需遵守相关版权规定

论文标题