接受拓扑特征训练的神经网络是否学习不同的内部表示？

论文标题

接受拓扑特征训练的神经网络是否学习不同的内部表示？

Do Neural Networks Trained with Topological Features Learn Different Internal Representations?

论文作者

McGuire, Sarah, Jackson, Shane, Emerson, Tegan, Kvinge, Henry

论文摘要

越来越多的工作利用通过拓扑数据分析提取的特征来训练机器学习模型。尽管该领域有时被称为拓扑机学习（TML），但已经获得了一些显着的成功，但了解从拓扑特征学习过程与从原始数据中学习过程有何不同。在这项工作中，我们开始通过询问接受拓扑特征的模型学习数据的内部表示，这些模型与经过原始原始数据训练的模型所学的数据有根本不同的数据来解决这个较大问题的一个组成部分。为了量化``不同''，我们利用了两个流行的指标，这些指标可用于衡量神经网络，神经缝制和中心内核对齐中数据隐藏表示的相似性。从这些内容中，我们得出了一系列结论，内容涉及拓扑特征的训练方式，并且不会改变模型所学的表示。也许毫不奇怪，我们发现，与对相应的原始数据训练和评估的模型相比，受过拓扑特征训练和评估的模型的隐藏表示形式有很大差异。另一方面，我们的实验表明，在某些情况下，可以使用简单的仿射转换来对帐（至少对于解决相应任务所需的程度）。我们猜想这意味着对原始数据训练的神经网络可能会在做出预测的过程中提取一些有限的拓扑特征。

There is a growing body of work that leverages features extracted via topological data analysis to train machine learning models. While this field, sometimes known as topological machine learning (TML), has seen some notable successes, an understanding of how the process of learning from topological features differs from the process of learning from raw data is still limited. In this work, we begin to address one component of this larger issue by asking whether a model trained with topological features learns internal representations of data that are fundamentally different than those learned by a model trained with the original raw data. To quantify ``different'', we exploit two popular metrics that can be used to measure the similarity of the hidden representations of data within neural networks, neural stitching and centered kernel alignment. From these we draw a range of conclusions about how training with topological features does and does not change the representations that a model learns. Perhaps unsurprisingly, we find that structurally, the hidden representations of models trained and evaluated on topological features differ substantially compared to those trained and evaluated on the corresponding raw data. On the other hand, our experiments show that in some cases, these representations can be reconciled (at least to the degree required to solve the corresponding task) using a simple affine transformation. We conjecture that this means that neural networks trained on raw data may extract some limited topological features in the process of making predictions.

下载PDF全文

下载文献需遵守相关版权规定

论文标题