论文标题
通过具有人类先验知识的组成部分来识别对象,可以增强深层神经网络的对抗性鲁棒性
Recognizing Object by Components with Human Prior Knowledge Enhances Adversarial Robustness of Deep Neural Networks
论文作者
论文摘要
对抗性攻击可以轻松地基于深度神经网络(DNN)欺骗对象识别系统。尽管近年来已经提出了许多防御方法,但其中大多数仍然可以自适应地逃避。较弱的对抗性鲁棒性的原因之一可能是DNN仅由类别标签监督,并且没有像人类的识别过程那样基于部分的归纳偏见。受认知心理学众所周知的理论的启发 - 逐一识别,我们提出了一种新颖的对象识别模型摇滚(通过具有人类先验知识的组成部分来识别对象)。它首先是从图像中片段的部分部分,然后将部分分割结果与预定义的人类的先验知识进行分割结果,最后根据分数输出预测。岩石的第一阶段对应于将物体分解为人类视觉中的一部分的过程。第二阶段对应于人脑的决策过程。岩石比各种攻击环境中的经典识别模型表现出更好的鲁棒性。这些结果鼓励研究人员重新考虑当前基于DNN的对象识别模型的合理性,并探索基于零件的模型的潜力,曾经很重要但最近被忽略,以改善鲁棒性。
Adversarial attacks can easily fool object recognition systems based on deep neural networks (DNNs). Although many defense methods have been proposed in recent years, most of them can still be adaptively evaded. One reason for the weak adversarial robustness may be that DNNs are only supervised by category labels and do not have part-based inductive bias like the recognition process of humans. Inspired by a well-known theory in cognitive psychology -- recognition-by-components, we propose a novel object recognition model ROCK (Recognizing Object by Components with human prior Knowledge). It first segments parts of objects from images, then scores part segmentation results with predefined human prior knowledge, and finally outputs prediction based on the scores. The first stage of ROCK corresponds to the process of decomposing objects into parts in human vision. The second stage corresponds to the decision process of the human brain. ROCK shows better robustness than classical recognition models across various attack settings. These results encourage researchers to rethink the rationality of currently widely-used DNN-based object recognition models and explore the potential of part-based models, once important but recently ignored, for improving robustness.