论文标题
SMMIX:视觉变压器的自我激励图像混合
SMMix: Self-Motivated Image Mixing for Vision Transformers
论文作者
论文摘要
CutMix是一种重要的增强策略,它决定视力变压器(VIT)的性能和概括能力。但是,混合图像和相应标签之间的不一致会损害其功效。现有的cutmix变体通过产生更一致的混合图像或更精确的混合标签来解决此问题,但不可避免地会引入重型培训开销或需要额外的信息,从而破坏了易用性。为此,我们提出了一种新颖有效的自我激励图像混合方法(SMMIX),该方法激发了训练本身下的模型的图像和标签增强。具体而言,我们提出了一种最大的注意区域混合方法,该方法丰富了混合图像中以注意力为中心的对象。然后,我们引入了一种细粒的标签分配技术,该技术将混合图像的输出令牌与细粒度的监督共同传输。此外,我们设计了一种新颖的特征一致性约束,以对混合图像和未混合图像的对齐特征。由于自我激励范式的微妙设计,我们的SMMIX在较小的训练开销中具有重要意义,并且比其他CutMix变体更为重要。特别是,SMMIX提高了DEIT-T/S/B,CAIT-XXS-24/36和PVT-T/S/M/L的准确性超过1%以上。我们的方法的概括能力也可以在下游任务和分布数据集中证明。我们的项目可匿名在https://github.com/chenmnz/smmix上找到。
CutMix is a vital augmentation strategy that determines the performance and generalization ability of vision transformers (ViTs). However, the inconsistency between the mixed images and the corresponding labels harms its efficacy. Existing CutMix variants tackle this problem by generating more consistent mixed images or more precise mixed labels, but inevitably introduce heavy training overhead or require extra information, undermining ease of use. To this end, we propose an novel and effective Self-Motivated image Mixing method (SMMix), which motivates both image and label enhancement by the model under training itself. Specifically, we propose a max-min attention region mixing approach that enriches the attention-focused objects in the mixed images. Then, we introduce a fine-grained label assignment technique that co-trains the output tokens of mixed images with fine-grained supervision. Moreover, we devise a novel feature consistency constraint to align features from mixed and unmixed images. Due to the subtle designs of the self-motivated paradigm, our SMMix is significant in its smaller training overhead and better performance than other CutMix variants. In particular, SMMix improves the accuracy of DeiT-T/S/B, CaiT-XXS-24/36, and PVT-T/S/M/L by more than +1% on ImageNet-1k. The generalization capability of our method is also demonstrated on downstream tasks and out-of-distribution datasets. Our project is anonymously available at https://github.com/ChenMnZ/SMMix.