通过网络感知的适应，神经进化转移学习深度复发的神经网络

论文标题

通过网络感知的适应，神经进化转移学习深度复发的神经网络

Neuroevolutionary Transfer Learning of Deep Recurrent Neural Networks through Network-Aware Adaptation

论文作者

ElSaid, AbdElRahman, Karns, Joshua, Ororbia II, Alexander, Krutz, Daniel, Lyu, Zimeng, Desell, Travis

论文摘要

转移学习需要进行人工神经网络（ANN），该神经网络（ANN）在源数据集上进行了培训并将其调整为新的目标数据集。尽管已显示出非常强大的功能，但其使用通常受到建筑限制的限制。以前，为了重复使用和调整ANN的内部权重和结构，在附加新的输出层时，在任务中转移的ANN的基本拓扑必须保持相同，从而丢弃了旧的输出层的权重。这项工作介绍了网络感知的自适应结构转移学习（N-ASTL），这是对消除此限制的先前努力的进步。 N-ASTL利用与源网络的拓扑和权重分布相关的统计信息，以告知如何将新的输入和输出神经元集成到现有结构中。结果表明，对先前最新的最新情况有所改善，包括以前无法进行的挑战现实数据集转移的能力，并改善了对未经转移的RNN的概括。

Transfer learning entails taking an artificial neural network (ANN) that is trained on a source dataset and adapting it to a new target dataset. While this has been shown to be quite powerful, its use has generally been restricted by architectural constraints. Previously, in order to reuse and adapt an ANN's internal weights and structure, the underlying topology of the ANN being transferred across tasks must remain mostly the same while a new output layer is attached, discarding the old output layer's weights. This work introduces network-aware adaptive structure transfer learning (N-ASTL), an advancement over prior efforts to remove this restriction. N-ASTL utilizes statistical information related to the source network's topology and weight distribution in order to inform how new input and output neurons are to be integrated into the existing structure. Results show improvements over prior state-of-the-art, including the ability to transfer in challenging real-world datasets not previously possible and improved generalization over RNNs trained without transfer.

下载PDF全文

下载文献需遵守相关版权规定

论文标题