论文标题
强大的自然语言处理:最近的进步,挑战和未来的方向
Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions
论文作者
论文摘要
最近的自然语言处理(NLP)技术已经在基准数据集上实现了高性能,这主要是由于深度学习的性能显着改善。研究界的进步导致了针对NLP任务(例如虚拟助手,语音识别和情感分析)的最先进生产系统的大大增强。但是,在接受对抗攻击测试时,这种NLP系统通常仍然失败。在当前模型的语言理解能力中,最初缺乏鲁棒性揭示了令人困扰的差距,从而在现实生活中部署了NLP系统时会产生问题。在本文中,我们通过以各个方面的系统方式总结文献来概述NLP鲁棒性研究的结构化概述。然后,我们深入研究了跨越技术,指标,嵌入和基准的鲁棒性的各个方面。最后,我们认为鲁棒性应该是多维的,对当前的研究提供见解,确定文献中的差距,以提出值得追求解决这些差距的方向。
Recent natural language processing (NLP) techniques have accomplished high performance on benchmark datasets, primarily due to the significant improvement in the performance of deep learning. The advances in the research community have led to great enhancements in state-of-the-art production systems for NLP tasks, such as virtual assistants, speech recognition, and sentiment analysis. However, such NLP systems still often fail when tested with adversarial attacks. The initial lack of robustness exposed troubling gaps in current models' language understanding capabilities, creating problems when NLP systems are deployed in real life. In this paper, we present a structured overview of NLP robustness research by summarizing the literature in a systemic way across various dimensions. We then take a deep-dive into the various dimensions of robustness, across techniques, metrics, embeddings, and benchmarks. Finally, we argue that robustness should be multi-dimensional, provide insights into current research, identify gaps in the literature to suggest directions worth pursuing to address these gaps.