WaveFuzz：一种清洁标签的中毒攻击，以保护您的声音

论文标题

WaveFuzz：一种清洁标签的中毒攻击，以保护您的声音

WaveFuzz: A Clean-Label Poisoning Attack to Protect Your Voice

论文作者

Ge, Yunjie, Wang, Qian, Zhang, Jingfeng, Zhou, Juntao, Zhang, Yunzhu, Shen, Chao

论文摘要

人们并不总是接受收集和滥用的语音数据。训练音频智能系统需要这些数据来构建有用的功能，但是获得权限或购买数据的成本很高，这不可避免地鼓励黑客收集这些语音数据而没有人们的意识。为了阻止黑客主动收集人们的语音数据，我们是第一个提出清洁标签中毒攻击的人，称为WaveFuzz，这可以防止情报音频模型从受保护的（中毒）语音数据中构建有用的功能，但仍然将语义信息保留给人类。具体而言，WaveFuzz会导致语音数据引起MEL频率曲线系数（MFCC）（音频信号的典型表示），以生成中毒的频率特征。然后将这些中毒功能馈送到音频预测模型，从而降低了音频智能系统的性能。从经验上讲，我们通过攻击两种代表性的智能音频系统，即说话者识别系统（SR）和语音命令识别系统（SCR）来显示WaveFuzz的功效。例如，当只有$ 10 \％$中毒的语音数据的$ 10 \％$是微调型号时，模型的准确性被降低了$ 19.78 \％$，而当训练语音数据的$ 10 \％$ $ 10 \％$时，模型的准确性下降了$ 6.07 \％$。因此，WaveFuzz是一种有效的技术，它使人们能够反击以保护自己的语音数据，这为改善隐私问题提供了新的启示。

People are not always receptive to their voice data being collected and misused. Training the audio intelligence systems needs these data to build useful features, but the cost for getting permissions or purchasing data is very high, which inevitably encourages hackers to collect these voice data without people's awareness. To discourage the hackers from proactively collecting people's voice data, we are the first to propose a clean-label poisoning attack, called WaveFuzz, which can prevent intelligence audio models from building useful features from protected (poisoned) voice data but still preserve the semantic information to the humans. Specifically, WaveFuzz perturbs the voice data to cause Mel Frequency Cepstral Coefficients (MFCC) (typical representations of audio signals) to generate the poisoned frequency features. These poisoned features are then fed to audio prediction models, which degrades the performance of audio intelligence systems. Empirically, we show the efficacy of WaveFuzz by attacking two representative types of intelligent audio systems, i.e., speaker recognition system (SR) and speech command recognition system (SCR). For example, the accuracies of models are declined by $19.78\%$ when only $10\%$ of the poisoned voice data is to fine-tune models, and the accuracies of models declined by $6.07\%$ when only $10\%$ of the training voice data is poisoned. Consequently, WaveFuzz is an effective technique that enables people to fight back to protect their own voice data, which sheds new light on ameliorating privacy issues.

下载PDF全文

下载文献需遵守相关版权规定

论文标题