论文标题
tsexplain:通过浮出水面的贡献者来解释汇总时间序列
TSEXPLAIN: Explaining Aggregated Time Series by Surfacing Evolving Contributors
论文作者
论文摘要
汇总时间序列毫不费力地生成,例如,“自2019年以来的共确认Covid-19案件和“随时间销售的总销售”。了解“为什么”和“为什么”这些关键绩效指标(KPI)随着时间的推移而发展对于做出数据信息决策至关重要。现有的解释引擎专注于解释一个汇总价值或两个关系之间的差异。但是,这没有解释KPI的持续变化。在此激励的情况下,我们提出了Tsexplain,该系统通过浮出水面的顶级贡献者来解释汇总时间序列。在引擎盖下,我们利用了关于两种关系差异作为构建块的先前工作,并制定了K分段问题,以分割时间序列,以使分段之后的每个细分都共享一致的解释,即贡献者。为了量化每个细分市场的一致性,我们提出了一种新颖的段方差设计,这是解释感知的;为了得出最佳的K分割方案,我们开发了一种有效的动态编程算法。关于合成和现实世界数据集的实验表明,我们的解释意识分割可以有效地识别汇总时间序列和跑赢大于解释 - 不合Snostic分段的演变解释。此外,我们提出了K的最佳选择策略和几种优化,以加快交互式用户体验的速度,从而提高了13倍的效率。
Aggregated time series are generated effortlessly everywhere, e.g., "total confirmed covid-19 cases since 2019" and "total liquor sales over time." Understanding "how" and "why" these key performance indicators (KPI) evolve over time is critical to making data-informed decisions. Existing explanation engines focus on explaining one aggregated value or the difference between two relations. However, this falls short of explaining KPIs' continuous changes over time. Motivated by this, we propose TSEXPLAIN, a system that explains aggregated time series by surfacing the underlying evolving top contributors. Under the hood, we leverage prior works on two-relations diff as a building block and formulate a K-Segmentation problem to segment the time series such that each segment after segmentation shares consistent explanations, i.e., contributors. To quantify consistency in each segment, we propose a novel within-segment variance design that is explanation-aware; to derive the optimal K-Segmentation scheme, we develop an efficient dynamic programming algorithm. Experiments on synthetic and real-world datasets show that our explanation-aware segmentation can effectively identify evolving explanations for aggregated time series and outperform explanation-agnostic segmentation. Further, we proposed an optimal selection strategy of K and several optimizations to speed up TSEXPLAIN for interactive user experience, achieving up to 13X efficiency improvement.