Alife：自适应logit正常化程序和功能重播，用于增量语义分段

论文标题

Alife：自适应logit正常化程序和功能重播，用于增量语义分段

ALIFE: Adaptive Logit Regularizer and Feature Replay for Incremental Semantic Segmentation

论文作者

Oh, Youngmin, Baek, Donghyeon, Ham, Bumsub

论文摘要

我们解决了不断识别新颖对象/内容类别的增量语义细分（ISS）的问题，而又不会忘记以前的学识。 ISS中的灾难性遗忘问题在ISS中特别严重，因为像素级地面真相标签仅适用于训练时的新型类别。为了解决该问题，基于正则化的方法利用概率校准技术从未标记的像素中学习语义信息。尽管这种技术有效，但仍然缺乏对它们的理论理解。基于重播的方法建议记住以前类别的一小部分图像。他们以大型内存足迹为代价实现最先进的性能。我们在本文中提出了一种称为Alife的新型ISS方法，该方法在准确性和效率之间提供了更好的妥协。为此，我们首先对校准技术进行了深入的分析，以更好地了解ISS的影响。基于此，我们引入了一个自适应logit正常化程序（ALI），使我们的模型能够更好地学习新类别，同时保留以前的知识。我们还提出了一个功能重播方案，该方案记住功能，而不是直接图像，以大大减少内存需求。由于功能提取器不断更改，因此还应在每个增量阶段更新记忆的功能。为了处理此问题，我们介绍了特定于类别的旋转矩阵，分别更新每个类别的功能。我们通过对标准ISS基准进行广泛的实验来证明我们的方法的有效性，并表明我们的方法在准确性和效率方面取得了更好的权衡。

We address the problem of incremental semantic segmentation (ISS) recognizing novel object/stuff categories continually without forgetting previous ones that have been learned. The catastrophic forgetting problem is particularly severe in ISS, since pixel-level ground-truth labels are available only for the novel categories at training time. To address the problem, regularization-based methods exploit probability calibration techniques to learn semantic information from unlabeled pixels. While such techniques are effective, there is still a lack of theoretical understanding of them. Replay-based methods propose to memorize a small set of images for previous categories. They achieve state-of-the-art performance at the cost of large memory footprint. We propose in this paper a novel ISS method, dubbed ALIFE, that provides a better compromise between accuracy and efficiency. To this end, we first show an in-depth analysis on the calibration techniques to better understand the effects on ISS. Based on this, we then introduce an adaptive logit regularizer (ALI) that enables our model to better learn new categories, while retaining knowledge for previous ones. We also present a feature replay scheme that memorizes features, instead of images directly, in order to reduce memory requirements significantly. Since a feature extractor is changed continually, memorized features should also be updated at every incremental stage. To handle this, we introduce category-specific rotation matrices updating the features for each category separately. We demonstrate the effectiveness of our approach with extensive experiments on standard ISS benchmarks, and show that our method achieves a better trade-off in terms of accuracy and efficiency.

下载PDF全文

下载文献需遵守相关版权规定

论文标题