自适应信号差异：CNN通过现代体系结构初始化

论文标题

自适应信号差异：CNN通过现代体系结构初始化

Adaptive Signal Variances: CNN Initialization Through Modern Architectures

论文作者

Henmi, Takahiko, Zara, Esmeraldo Ronnie Rey, Hirohashi, Yoshihiro, Kato, Tsuyoshi

论文摘要

深度卷积神经网络（CNN）已经确立了对图像处理任务的表现的坚定信心。 CNN体系结构构成了各种不同类型的层，包括卷积层和最大层层。 CNN从业者广泛理解学习的稳定性取决于如何初始化每一层模型参数的初始化。如今，没有人怀疑初始化的事实上的标准方案是He等人开发的所谓凯明初始化。 Kaiming方案源自自Kaiming方案出现以来的当前使用的CNN结构要简单得多。 Kaiming模型仅由卷积和完全连接的层组成，忽略了最大池层和全球平均合并层。在这项研究中，我们再次得出了初始化方案，不是从简化的凯明（Kaiming）模型中得出的，而仅来自现代CNN体系结构，并经验研究了与当今广泛使用的事实标准标准相比，新的初始化方法的性能。

Deep convolutional neural networks (CNN) have achieved the unwavering confidence in its performance on image processing tasks. The CNN architecture constitutes a variety of different types of layers including the convolution layer and the max-pooling layer. CNN practitioners widely understand the fact that the stability of learning depends on how to initialize the model parameters in each layer. Nowadays, no one doubts that the de facto standard scheme for initialization is the so-called Kaiming initialization that has been developed by He et al. The Kaiming scheme was derived from a much simpler model than the currently used CNN structure having evolved since the emergence of the Kaiming scheme. The Kaiming model consists only of the convolution and fully connected layers, ignoring the max-pooling layer and the global average pooling layer. In this study, we derived the initialization scheme again not from the simplified Kaiming model, but precisely from the modern CNN architectures, and empirically investigated how the new initialization method performs compared to the de facto standard ones that are widely used today.

下载PDF全文

下载文献需遵守相关版权规定

论文标题