论文标题
一种培训深信仰网络的发展方法
A developmental approach for training deep belief networks
论文作者
论文摘要
深信仰网络(DBN)是随机神经网络,可以从感官数据中提取丰富的环境内部表示。 DBN在触发深度学习革命方面具有催化作用,这是第一次证明在具有许多隐藏神经元层的网络中无监督学习的可行性。这些分层体系结构融合了合理的生物学和认知特性,使其成为人类感知和认知的计算模型特别有吸引力。但是,DBN中的学习通常是以贪婪的,层次的方式进行的,这不允许模拟皮质回路的整体成熟,并防止对认知发展进行建模。在这里,我们提出IDBN,这是DBN的一种迭代学习算法,允许共同更新模型所有层的连接权重。我们在两组不同的视觉刺激上评估了提出的迭代算法,从而测量了学习模型的生成能力及其支持监督下游任务的潜力。我们还根据图理论属性跟踪网络开发,并研究IDBN对持续学习方案的潜在扩展。使用我们的迭代方法训练的DBNS实现了与贪婪同行相当的最终性能,同时允许准确地分析深网络中内部表示的逐步发展,并逐步改善任务绩效。我们的工作为使用IDBN进行建模神经认知发展铺平了道路。
Deep belief networks (DBNs) are stochastic neural networks that can extract rich internal representations of the environment from the sensory data. DBNs had a catalytic effect in triggering the deep learning revolution, demonstrating for the very first time the feasibility of unsupervised learning in networks with many layers of hidden neurons. These hierarchical architectures incorporate plausible biological and cognitive properties, making them particularly appealing as computational models of human perception and cognition. However, learning in DBNs is usually carried out in a greedy, layer-wise fashion, which does not allow to simulate the holistic maturation of cortical circuits and prevents from modeling cognitive development. Here we present iDBN, an iterative learning algorithm for DBNs that allows to jointly update the connection weights across all layers of the model. We evaluate the proposed iterative algorithm on two different sets of visual stimuli, measuring the generative capabilities of the learned model and its potential to support supervised downstream tasks. We also track network development in terms of graph theoretical properties and investigate the potential extension of iDBN to continual learning scenarios. DBNs trained using our iterative approach achieve a final performance comparable to that of the greedy counterparts, at the same time allowing to accurately analyze the gradual development of internal representations in the deep network and the progressive improvement in task performance. Our work paves the way to the use of iDBN for modeling neurocognitive development.