论文标题
拓扑数据分析中对矢量化方法的调查
A Survey of Vectorization Methods in Topological Data Analysis
论文作者
论文摘要
尝试将拓扑信息纳入监督学习任务中的尝试导致创建了几种矢量化持续同源性条形码的技术。在本文中,我们研究了13种此类方法。除了描述这些方法的组织框架外,我们还全面地对付了三个众所周知的分类任务。令人惊讶的是,我们发现表现最佳的方法是一种简单的矢量化,仅由一些基本摘要统计数据组成。最后,我们提供了一个方便的Web应用程序,旨在通过各种矢量化方法来促进探索和实验。
Attempts to incorporate topological information in supervised learning tasks have resulted in the creation of several techniques for vectorizing persistent homology barcodes. In this paper, we study thirteen such methods. Besides describing an organizational framework for these methods, we comprehensively benchmark them against three well-known classification tasks. Surprisingly, we discover that the best-performing method is a simple vectorization, which consists only of a few elementary summary statistics. Finally, we provide a convenient web application which has been designed to facilitate exploration and experimentation with various vectorization methods.