一项针对视觉任务的课堂增量学习算法的全面研究

论文标题

一项针对视觉任务的课堂增量学习算法的全面研究

A Comprehensive Study of Class Incremental Learning Algorithms for Visual Tasks

论文作者

Belouadah, Eden, Popescu, Adrian, Kanellos, Ioannis

论文摘要

在面对新数据时，人工代理人递增其能力的能力是人工智能的一个开放挑战。在这种情况下，面临的主要挑战是灾难性遗忘，即神经网络摄入新数据时，神经网络的趋势不足以使过去的数据不足。第一组方法通过提高深层模型能力来适应新知识而忘记了。第二种方法固定了深层模型的大小，并引入了一种机制，该机制是确保模型的稳定性和可塑性之间的良好妥协。虽然对第一种类型的算法进行了彻底比较，但利用固定尺寸模型的方法并非如此。在这里，我们专注于后者，将它们放在一个共同的概念和实验框架中，并提出以下贡献：（1）定义六个理想的学习算法的理想属性，并根据这些属性进行分析，（2）引入统一的正式形式化，对集体学习问题的统一性框架的范围更为彻底的记录，（3）在现有的范围中，（3）尺寸的尺寸尺寸的数量，数量的数量，数量的数字，（4）研究放牧对过去的示例选择的有用性，（5）提供了实验证据，表明可以在不使用知识蒸馏的情况下获得竞争性能，以解决灾难性的遗忘，（6）（6）通过将所有测试的方法集成在常见的开放式回源供应中。主要的实验发现是，在所有评估的设置中，现有算法都没有取得最佳结果。是否允许对过去类的有界记忆，会显着出现重要差异。

The ability of artificial agents to increment their capabilities when confronted with new data is an open challenge in artificial intelligence. The main challenge faced in such cases is catastrophic forgetting, i.e., the tendency of neural networks to underfit past data when new ones are ingested. A first group of approaches tackles forgetting by increasing deep model capacity to accommodate new knowledge. A second type of approaches fix the deep model size and introduce a mechanism whose objective is to ensure a good compromise between stability and plasticity of the model. While the first type of algorithms were compared thoroughly, this is not the case for methods which exploit a fixed size model. Here, we focus on the latter, place them in a common conceptual and experimental framework and propose the following contributions: (1) define six desirable properties of incremental learning algorithms and analyze them according to these properties, (2) introduce a unified formalization of the class-incremental learning problem, (3) propose a common evaluation framework which is more thorough than existing ones in terms of number of datasets, size of datasets, size of bounded memory and number of incremental states, (4) investigate the usefulness of herding for past exemplars selection, (5) provide experimental evidence that it is possible to obtain competitive performance without the use of knowledge distillation to tackle catastrophic forgetting and (6) facilitate reproducibility by integrating all tested methods in a common open-source repository. The main experimental finding is that none of the existing algorithms achieves the best results in all evaluated settings. Important differences arise notably if a bounded memory of past classes is allowed or not.

下载PDF全文

下载文献需遵守相关版权规定

论文标题