论文标题
nnoculation:在野外捕获adnet
NNoculation: Catching BadNets in the Wild
论文作者
论文摘要
本文提出了针对背部神经网络(BADNET)的新型两阶段防御(NNOCULUTION),该防御措施响应于该领域遇到的后置测试输入,以修复Badnet前部署和在线。在预部阶段,Nnoculation用随机的清洁验证输入扰动BadNet恢复了BadNet,以部分减少后门的对抗性影响。通过记录原始和预部填充的修补网络之间的分歧,在开发后,NNOCULUTION检测和隔离后式测试输入。然后,对Cyclegan进行了训练,以学习清洁验证和隔离输入之间的转变;即,它学会了添加触发器以清洁验证图像。后门验证图像及其正确的标签用于进一步重新审查预部部门修补的网络,从而产生我们的最终防御。对后门攻击的综合套件的经验评估表明,Nnoculation的表现优于所有最先进的防御能力,这些防御能够做出限制性假设,仅在特定的后门攻击中起作用,或者在适应性攻击方面失败。相反,Nnoculation是最小的假设,并提供了有效的防御,即使在现有防御措施无效的情况下,由于攻击者规避了其限制性假设。
This paper proposes a novel two-stage defense (NNoculation) against backdoored neural networks (BadNets) that, repairs a BadNet both pre-deployment and online in response to backdoored test inputs encountered in the field. In the pre-deployment stage, NNoculation retrains the BadNet with random perturbations of clean validation inputs to partially reduce the adversarial impact of a backdoor. Post-deployment, NNoculation detects and quarantines backdoored test inputs by recording disagreements between the original and pre-deployment patched networks. A CycleGAN is then trained to learn transformations between clean validation and quarantined inputs; i.e., it learns to add triggers to clean validation images. Backdoored validation images along with their correct labels are used to further retrain the pre-deployment patched network, yielding our final defense. Empirical evaluation on a comprehensive suite of backdoor attacks show that NNoculation outperforms all state-of-the-art defenses that make restrictive assumptions and only work on specific backdoor attacks, or fail on adaptive attacks. In contrast, NNoculation makes minimal assumptions and provides an effective defense, even under settings where existing defenses are ineffective due to attackers circumventing their restrictive assumptions.