语音欺骗对策：分类法，最先进

论文标题

语音欺骗对策：分类法，最先进

Voice Spoofing Countermeasures: Taxonomy, State-of-the-art, experimental analysis of generalizability, open challenges, and the way forward

论文作者

Khan, Awais, Malik, Khalid Mahmood, Ryan, James, Saravanan, Mikul

论文摘要

恶意演员可能会寻求使用不同的语音攻击来欺骗ASV系统，甚至使用它们来传播错误信息。已经提出了各种对策来检测这些欺骗攻击。由于在过去6 - 7年中对自动说话者验证（ASV）系统进行欺骗检测的大量工作，因此有必要对研究进行分类并对先进的对策进行定性和定量的比较。此外，没有现有的调查文件审查了对语音欺骗评估和扬声器验证的集成解决方案，对欺骗对策以及ASV本身或统一解决方案以使用单个模型来检测多个攻击的对抗/反质量分解攻击。此外，尚未进行任何工作以对已发表的对策进行苹果对苹果的比较，以通过在跨语料库中评估它们的普遍性。在这项工作中，我们对使用手工制作的特征，深度学习，端到端和通用欺骗解决方案进行欺骗检测的文献进行了综述，以检测语音合成（SS），语音转换（VC）和重播攻击。此外，我们还审查了综合解决方案，以欺骗评估和说话者验证，对语音对策和ASV的对抗和反法脱毛攻击。还提出了现有欺骗对策的局限性和挑战。我们在几个数据集上报告了这些对策的性能，并在各个语料库中对其进行了评估。对于实验，我们使用ASVSPOOF2019和VSDC数据集以及GMM，SVM，CNN和CNN-GRU分类器。（对于结果的再现性，可以在我们的GitHub存储库中找到测试床的代码。

Malicious actors may seek to use different voice-spoofing attacks to fool ASV systems and even use them for spreading misinformation. Various countermeasures have been proposed to detect these spoofing attacks. Due to the extensive work done on spoofing detection in automated speaker verification (ASV) systems in the last 6-7 years, there is a need to classify the research and perform qualitative and quantitative comparisons on state-of-the-art countermeasures. Additionally, no existing survey paper has reviewed integrated solutions to voice spoofing evaluation and speaker verification, adversarial/antiforensics attacks on spoofing countermeasures, and ASV itself, or unified solutions to detect multiple attacks using a single model. Further, no work has been done to provide an apples-to-apples comparison of published countermeasures in order to assess their generalizability by evaluating them across corpora. In this work, we conduct a review of the literature on spoofing detection using hand-crafted features, deep learning, end-to-end, and universal spoofing countermeasure solutions to detect speech synthesis (SS), voice conversion (VC), and replay attacks. Additionally, we also review integrated solutions to voice spoofing evaluation and speaker verification, adversarial and anti-forensics attacks on voice countermeasures, and ASV. The limitations and challenges of the existing spoofing countermeasures are also presented. We report the performance of these countermeasures on several datasets and evaluate them across corpora. For the experiments, we employ the ASVspoof2019 and VSDC datasets along with GMM, SVM, CNN, and CNN-GRU classifiers. (For reproduceability of the results, the code of the test bed can be found in our GitHub Repository.

下载PDF全文

下载文献需遵守相关版权规定

论文标题