Paper Title

Silent Speech Interfaces for Speech Restoration: A Review

Authors

Gonzalez-Lopez, Jose A., Gomez-Alanis, Alejandro, Martín-Doñas, Juan M., Pérez-Córdoba, José L., Gomez, Angel M.

Abstract

This review summarises the status of silent speech interface (SSI) research. SSIs rely on non-acoustic biosignals generated by the human body during speech production to enable communication whenever normal verbal communication is not possible or not desirable. In this review, we focus on the first case and present the latest SSI research aimed at providing new alternative and augmentative communication methods for persons with severe speech disorders. SSIs can employ a variety of biosignals to enable silent communication, such as electrophysiological recordings of neural activity, electromyographic (EMG) recordings of vocal tract movements or the direct tracking of articulator movements using imaging techniques. Depending on the disorder, some sensing techniques may be better suited than others to capture speech-related information. For instance, EMG and imaging techniques are well suited for laryngectomised patients, whose vocal tract remains almost intact but who are unable to speak after the removal of the vocal folds; these techniques fail, however, for severely paralysed individuals. From the biosignals, SSIs decode the intended message using automatic speech recognition or speech synthesis algorithms. Despite considerable advances in recent years, most present-day SSIs have only been validated in laboratory settings for healthy users. Thus, as discussed in this paper, a number of challenges remain to be addressed in future research before SSIs can be promoted to real-world applications. If these issues can be addressed successfully, future SSIs will improve the lives of persons with severe speech impairments by restoring their communication capabilities.
