论文标题
从道德上收集与具有认知障碍的人的多模式自发对话
Ethically Collecting Multi-Modal Spontaneous Conversations with People that have Cognitive Impairments
论文作者
论文摘要
为了使口语对话系统(例如Amazon Alexa或Google Assistant)更容易访问,并且对于认知障碍的人来说是自然的交互式,必须获得适当的数据。但是,与弱势用户组的多模式自发对话的记录很少,但是这些有价值的数据很具有挑战性。呼吁这些数据的研究人员在与弱势参与者合作的道德和法律问题中通常没有经验。此外,标准记录设备是不安全的,不应用于捕获敏感数据。我们花了一年的时间咨询专家,介绍如何与弱势用户群体进行道德捕获和共享多模式自发对话的录音。在本文中,我们提供了从这些专家组成的指导,以道德收集此类数据,并提出一个新系统“ Cusco” - 以牢固地捕获,运输和交换敏感数据。该框架旨在轻松遵循和实施,以鼓励进一步的类似语料库出版。使用本指南和安全的记录系统,研究人员可以审查和完善其道德措施。
In order to make spoken dialogue systems (such as Amazon Alexa or Google Assistant) more accessible and naturally interactive for people with cognitive impairments, appropriate data must be obtainable. Recordings of multi-modal spontaneous conversations with vulnerable user groups are scarce however and this valuable data is challenging to collect. Researchers that call for this data are commonly inexperienced in ethical and legal issues around working with vulnerable participants. Additionally, standard recording equipment is insecure and should not be used to capture sensitive data. We spent a year consulting experts on how to ethically capture and share recordings of multi-modal spontaneous conversations with vulnerable user groups. In this paper we provide guidance, collated from these experts, on how to ethically collect such data and we present a new system - "CUSCO" - to capture, transport and exchange sensitive data securely. This framework is intended to be easily followed and implemented to encourage further publications of similar corpora. Using this guide and secure recording system, researchers can review and refine their ethical measures.