论文标题
跨维网络的希尔伯特蒸馏
Hilbert Distillation for Cross-Dimensionality Networks
论文作者
论文摘要
3D卷积神经网络在处理体积数据(例如视频和医学成像)方面揭示了卓越的性能。但是,利用3D网络的竞争性能会导致巨大的计算成本,远远超出了2D网络的竞争成本。在本文中,我们提出了一种基于希尔伯特曲线的新型跨差异性蒸馏方法,该方法促进了3D网络的知识,以提高2D网络的性能。提出的希尔伯特蒸馏方法(HD)方法通过希尔伯特曲线保留了结构信息,该曲线将高维(> = 2)表示为一维连续填充空间填充曲线。由于蒸馏2D网络由从尺寸异质3D特征转换的曲线监督,因此,在学习嵌入在训练有素的高维表示中的学习结构信息方面,2D网络具有丰富的观点。我们进一步提出了一种可变长度的希尔伯特蒸馏(VHD)方法,以动态缩短激活特征区域中希尔伯特曲线的步行步伐,并在上下文特征区域内延长步幅,迫使2D网络更多地关注从激活特征中学习。所提出的算法优于当前最新的蒸馏技术,适用于两个分类任务上的交叉维度蒸馏。此外,通过建议的方法蒸馏的2D网络通过原始的3D网络实现竞争性能,表明在现实世界中,轻质蒸馏的2D网络可能是替代笨拙的3D网络。
3D convolutional neural networks have revealed superior performance in processing volumetric data such as video and medical imaging. However, the competitive performance by leveraging 3D networks results in huge computational costs, which are far beyond that of 2D networks. In this paper, we propose a novel Hilbert curve-based cross-dimensionality distillation approach that facilitates the knowledge of 3D networks to improve the performance of 2D networks. The proposed Hilbert Distillation (HD) method preserves the structural information via the Hilbert curve, which maps high-dimensional (>=2) representations to one-dimensional continuous space-filling curves. Since the distilled 2D networks are supervised by the curves converted from dimensionally heterogeneous 3D features, the 2D networks are given an informative view in terms of learning structural information embedded in well-trained high-dimensional representations. We further propose a Variable-length Hilbert Distillation (VHD) method to dynamically shorten the walking stride of the Hilbert curve in activation feature areas and lengthen the stride in context feature areas, forcing the 2D networks to pay more attention to learning from activation features. The proposed algorithm outperforms the current state-of-the-art distillation techniques adapted to cross-dimensionality distillation on two classification tasks. Moreover, the distilled 2D networks by the proposed method achieve competitive performance with the original 3D networks, indicating the lightweight distilled 2D networks could potentially be the substitution of cumbersome 3D networks in the real-world scenario.