Paper Title
Continual Learning with Dependency Preserving Hypernetworks
Paper Authors
Paper Abstract
Humans learn continually throughout their lifespan by accumulating diverse knowledge and fine-tuning it for future tasks. When presented with a similar goal, neural networks suffer from catastrophic forgetting if the data distributions across sequential tasks are not stationary over the course of learning. An effective approach to address such continual learning (CL) problems is to use hypernetworks, which generate task-dependent weights for a target network. However, the continual learning performance of existing hypernetwork-based approaches is limited by the assumption, made to maintain parameter efficiency, that the weights are independent across layers. To address this limitation, we propose a novel approach that uses a dependency-preserving hypernetwork to generate weights for the target network while also maintaining parameter efficiency. We propose a recurrent neural network (RNN) based hypernetwork that can generate layer weights efficiently while allowing for dependencies across them. In addition, we propose novel regularisation and network growth techniques for the RNN-based hypernetwork to further improve continual learning performance. To demonstrate the effectiveness of the proposed methods, we conducted experiments on several image classification continual learning tasks and settings. We found that the proposed methods based on RNN hypernetworks outperformed the baselines in all these CL settings and tasks.
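The abstract describes the mechanism only at a high level. Below is a minimal PyTorch sketch of the core idea, not the authors' released code: a recurrent cell generates the target network's weights one layer at a time, threading its hidden state through the per-layer generation steps so that each layer's weights can depend on those generated before it. The GRU cell, two-layer target network, layer shapes, and per-task embeddings here are illustrative assumptions, not details taken from the paper.

```python
# Minimal sketch (assumptions noted above) of an RNN-based hypernetwork
# for continual learning with per-task embeddings.
import torch
import torch.nn as nn

class RNNHypernetwork(nn.Module):
    """Generates target-network layer weights one layer at a time.

    The recurrent hidden state is carried from one layer's generation
    step to the next, so later layers' weights can depend on earlier
    ones (unlike chunked hypernetworks that emit chunks independently).
    """

    def __init__(self, task_emb_dim, hidden_dim, layer_shapes):
        super().__init__()
        self.layer_shapes = layer_shapes  # e.g. [(64, 784), (10, 64)]
        self.rnn_cell = nn.GRUCell(task_emb_dim, hidden_dim)
        # One output head per target layer, mapping the shared hidden
        # state to that layer's flattened weight tensor.
        self.heads = nn.ModuleList(
            [nn.Linear(hidden_dim, s[0] * s[1]) for s in layer_shapes]
        )

    def forward(self, task_emb):
        h = torch.zeros(1, self.rnn_cell.hidden_size)
        weights = []
        for shape, head in zip(self.layer_shapes, self.heads):
            h = self.rnn_cell(task_emb.unsqueeze(0), h)  # carries cross-layer dependency
            weights.append(head(h).view(*shape))
        return weights

# Usage: one trainable embedding per task; the hypernetwork maps it
# to a full set of target-network weights.
task_embs = nn.ParameterList([nn.Parameter(torch.randn(8)) for _ in range(3)])
hnet = RNNHypernetwork(task_emb_dim=8, hidden_dim=32,
                       layer_shapes=[(64, 784), (10, 64)])
w1, w2 = hnet(task_embs[0])

def target_forward(x, w1, w2):
    # The target network consumes the generated weights functionally,
    # so only the hypernetwork and task embeddings are trained.
    return torch.relu(x @ w1.t()) @ w2.t()

logits = target_forward(torch.randn(5, 784), w1, w2)
```

Because each head maps a shared, recurrently updated hidden state to a single layer's weights, the hypernetwork's parameter count stays comparable to that of chunked hypernetworks, while the recurrence restores the cross-layer dependencies that chunking discards; this is one way to realise the efficiency-plus-dependency trade-off the abstract argues for.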
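The abstract does not spell out its "novel regularisation" term. One plausible baseline form, assuming the standard output-regularisation idea for hypernetwork-based CL (penalising drift in the weights generated for earlier tasks' embeddings) rather than the paper's specific proposal, is:

\[
\mathcal{L} \;=\; \mathcal{L}_{\text{task}}(\Theta, e_T) \;+\; \frac{\beta}{T-1} \sum_{t<T} \big\lVert h(e_t;\Theta) - h(e_t;\Theta^{*}) \big\rVert_2^2 ,
\]

where \(h(e_t;\Theta)\) denotes the weights the hypernetwork generates for task \(t\)'s embedding \(e_t\), \(\Theta^{*}\) is a frozen copy of the hypernetwork parameters from before training on the current task \(T\), and \(\beta\) trades off stability against plasticity.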