论文标题

解毒器:对神经网络的毒药攻击的运行时间检测和纠正

AntidoteRT: Run-time Detection and Correction of Poison Attacks on Neural Networks

论文作者

Usman, Muhammad, Sun, Youcheng, Gopinath, Divya, Pasareanu, Corina S.

论文摘要

我们研究了针对图像分类网络的后门中毒攻击,从而以攻击者将触发器插入训练数据的一个子集中,以至于在测试时,这会导致分类器预测某些目标类别。 %文献中提出了几种旨在检测攻击的技术,但只有少数人也建议对其进行防御,通常涉及重新训练网络,而在实践中并不总是可能。我们提出了针对中毒攻击的轻质自动检测和校正技术,这些检测和校正技术基于使用带有已知标签的一小部分清洁和中毒的测试样品从网络中挖掘出来的神经元模式。基于错误分类的样本构建的图案用于对新中毒输入的运行时间检测。为了进行校正,我们提出了一种输入校正技术,该技术使用差分分析来识别检测到的有毒图像中的触发因素,然后将其重置为中性颜色。我们的检测和校正是在运行时和输入级别进行的,这与大多数专注于离线模型级防御的工作相反。我们证明,我们的技术在流行的基准上(例如MNIST,CIFAR-10和GTSRB)对流行的Badnets攻击和更复杂的DFST攻击等流行的基准(例如MNIST,CIFAR-10和GTSRB)上的现有防御能力优于现有防御。

We study backdoor poisoning attacks against image classification networks, whereby an attacker inserts a trigger into a subset of the training data, in such a way that at test time, this trigger causes the classifier to predict some target class. %There are several techniques proposed in the literature that aim to detect the attack but only a few also propose to defend against it, and they typically involve retraining the network which is not always possible in practice. We propose lightweight automated detection and correction techniques against poisoning attacks, which are based on neuron patterns mined from the network using a small set of clean and poisoned test samples with known labels. The patterns built based on the mis-classified samples are used for run-time detection of new poisoned inputs. For correction, we propose an input correction technique that uses a differential analysis to identify the trigger in the detected poisoned images, which is then reset to a neutral color. Our detection and correction are performed at run-time and input level, which is in contrast to most existing work that is focused on offline model-level defenses. We demonstrate that our technique outperforms existing defenses such as NeuralCleanse and STRIP on popular benchmarks such as MNIST, CIFAR-10, and GTSRB against the popular BadNets attack and the more complex DFST attack.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源