在图像上重新访问切成薄片的Wasserstein：从矢量化到卷积

论文标题

在图像上重新访问切成薄片的Wasserstein：从矢量化到卷积

Revisiting Sliced Wasserstein on Images: From Vectorization to Convolution

论文作者

Nguyen, Khai, Ho, Nhat

论文摘要

传统的切成薄片的瓦斯汀定义在具有实现为矢量的两个概率度量之间。在比较图像的两个概率度量时，从业人员首先需要使用样品矩阵和投影矩阵之间的矩阵乘法来矢量化图像，然后将它们投影到一维空间。之后，通过平均两个相应的一维投影概率度量来评估切片的瓦斯汀。但是，这种方法有两个局限性。第一个限制是，图像的空间结构不是通过矢量化步骤有效捕获的。因此，后来的切片过程变得越来越难以收集差异信息。第二个限制是内存效率低下，因为每个切片方向都是具有与图像相同的尺寸的向量。为了解决这些局限性，我们提出了针对基于卷积运算符的图像的概率度量，用于切成薄片的新型切片方法。我们通过将步幅，扩张和非线性激活函数纳入卷积算子来得出卷积切成薄片的Wasserstein（CSW）及其变体。我们研究了CSW的指标及其样品复杂性，其计算复杂性以及与常规切片的Wasserstein距离的联系。最后，我们证明了CSW在比较图像和训练图像上的深层生成模型中的概率度量方面的良好性能比传统的切片瓦斯坦（Wasserstein）相比。

The conventional sliced Wasserstein is defined between two probability measures that have realizations as vectors. When comparing two probability measures over images, practitioners first need to vectorize images and then project them to one-dimensional space by using matrix multiplication between the sample matrix and the projection matrix. After that, the sliced Wasserstein is evaluated by averaging the two corresponding one-dimensional projected probability measures. However, this approach has two limitations. The first limitation is that the spatial structure of images is not captured efficiently by the vectorization step; therefore, the later slicing process becomes harder to gather the discrepancy information. The second limitation is memory inefficiency since each slicing direction is a vector that has the same dimension as the images. To address these limitations, we propose novel slicing methods for sliced Wasserstein between probability measures over images that are based on the convolution operators. We derive convolution sliced Wasserstein (CSW) and its variants via incorporating stride, dilation, and non-linear activation function into the convolution operators. We investigate the metricity of CSW as well as its sample complexity, its computational complexity, and its connection to conventional sliced Wasserstein distances. Finally, we demonstrate the favorable performance of CSW over the conventional sliced Wasserstein in comparing probability measures over images and in training deep generative modeling on images.

下载PDF全文

下载文献需遵守相关版权规定

论文标题