论文标题
Geno:用于在现有Web应用程序上创建多模式交互的开发工具
Geno: A Developer Tool for Authoring Multimodal Interaction on Existing Web Applications
论文作者
论文摘要
应用程序中的支持语音命令给用户带来了重大好处。但是,如我们在形成性的研究中所示,在基于GUI的Web应用程序中添加这样的支持是在努力耗费的,这是由于缺乏创建多模式界面的统一支持,因此在我们的形成性研究中所示。我们提出了Geno ---用于将语音输入模式添加到现有Web应用程序的开发工具,而无需大量的NLP专业知识。 Geno为开发人员提供了高级工作流程,以指定要由语音(意图)支持的功能,创建语言模型,以检测用户话语中的意图和相关信息(参数),并通过编程方式调用相应的功能或在Web应用程序上重复GUI操作来实现意图。 Geno进一步支持语音命令中对GUI上下文的多模式引用(例如,“将[事件]移至下周”,同时指向与光标的事件”。在一项研究中,NLP专业知识的开发人员能够使用Geno为两个现有的Web应用程序添加多模式语音命令支持。
Supporting voice commands in applications presents significant benefits to users. However, adding such support to existing GUI-based web apps is effort-consuming with a high learning barrier, as shown in our formative study, due to the lack of unified support for creating multimodal interfaces. We present Geno---a developer tool for adding the voice input modality to existing web apps without requiring significant NLP expertise. Geno provides a high-level workflow for developers to specify functionalities to be supported by voice (intents), create language models for detecting intents and the relevant information (parameters) from user utterances, and fulfill the intents by either programmatically invoking the corresponding functions or replaying GUI actions on the web app. Geno further supports multimodal references to GUI context in voice commands (e.g. "move this [event] to next week" while pointing at an event with the cursor). In a study, developers with little NLP expertise were able to add multimodal voice command support for two existing web apps using Geno.