论文标题
标量数据的局部拓扑简化
Localized Topological Simplification of Scalar Data
论文作者
论文摘要
本文介绍了标量数据拓扑简化的局部算法,这是拓扑数据分析(TDA)的基本预处理步骤。鉴于标量场F和一系列可保留的极端局部,提出的局部拓扑简化(LTS)得出了接近F的函数G,仅显示了选定的极值集。具体而言,首先将与不希望的超值相关的子和超级套件组件首先在局部扁平化,然后正确地嵌入全局标量场中,从而确保这些区域可以保证 - 从组合角度来看 - 不再包含任何不希望的极点。与以前的全局方法相反,仅LTS仅和独立处理实际需要简化的域的区域,这已经导致了明显的加速。此外,由于算法的局部性质,LTS可以利用共享的记忆并行性以高平行效率同时简化区域(70%)。因此,LTS显着提高了探索简化参数及其对随后拓扑分析的影响。对于此类勘探任务,LTS带来了大量TDA管道的总体执行时间,从几分钟下降到几秒钟,平均观察到的速度比最新技术的速度高达X36。此外,在根据拓扑持久性选择保留的极端情况的特殊情况下,LTS的改编版本部分计算持久图,并同时简化了在预定义的持久性阈值下方的特征。 LTS的有效性,其并行效率及其对TDA的效果在来自不同应用领域的几个模拟和获取的数据集上,包括物理,化学和生物医学成像。
This paper describes a localized algorithm for the topological simplification of scalar data, an essential pre-processing step of topological data analysis (TDA). Given a scalar field f and a selection of extrema to preserve, the proposed localized topological simplification (LTS) derives a function g that is close to f and only exhibits the selected set of extrema. Specifically, sub- and superlevel set components associated with undesired extrema are first locally flattened and then correctly embedded into the global scalar field, such that these regions are guaranteed -- from a combinatorial perspective -- to no longer contain any undesired extrema. In contrast to previous global approaches, LTS only and independently processes regions of the domain that actually need to be simplified, which already results in a noticeable speedup. Moreover, due to the localized nature of the algorithm, LTS can utilize shared-memory parallelism to simplify regions simultaneously with a high parallel efficiency (70%). Hence, LTS significantly improves interactivity for the exploration of simplification parameters and their effect on subsequent topological analysis. For such exploration tasks, LTS brings the overall execution time of a plethora of TDA pipelines from minutes down to seconds, with an average observed speedup over state-of-the-art techniques of up to x36. Furthermore, in the special case where preserved extrema are selected based on topological persistence, an adapted version of LTS partially computes the persistence diagram and simultaneously simplifies features below a predefined persistence threshold. The effectiveness of LTS, its parallel efficiency, and its resulting benefits for TDA are demonstrated on several simulated and acquired datasets from different application domains, including physics, chemistry, and biomedical imaging.