论文标题
部署用于自适应原位音景增强的物联网系统
Deployment of an IoT System for Adaptive In-Situ Soundscape Augmentation
论文作者
论文摘要
音景增强是一种新兴的降解方法,它通过引入称为“遮罩器”的其他声音来增加声学舒适感。传统上,掩蔽者的选择通常是基于专家指导或事后分析的基础,这可能是耗时的,有时甚至是任意的。此外,这通常会导致一组静态的掩蔽器,这些掩蔽器对现实世界声学环境的动态性质不灵活。克服传统音景增强的僵化性是双重的。首先,鉴于声景的快照,系统必须能够在没有人类监督的情况下选择最佳掩蔽器。其次,该系统还必须能够对声学环境的变化做出反应,并具有近乎实时的延迟。在这项工作中,我们利用云计算和物联网(IoT)的组合能力允许使用微控制器的原位聆听和播放,同时将计算昂贵的推理任务委派给云。特别是,使用无服务器云体系结构进行推理,确保无需提供计算资源而无需提供实时延迟和可扩展性。该系统的工作原型目前正在部署在经历高交通噪音的公共区域,并进行了公众评估以进行未来的改进。
Soundscape augmentation is an emerging approach for noise mitigation by introducing additional sounds known as "maskers" to increase acoustic comfort. Traditionally, the choice of maskers is often predicated on expert guidance or post-hoc analysis which can be time-consuming and sometimes arbitrary. Moreover, this often results in a static set of maskers that are inflexible to the dynamic nature of real-world acoustic environments. Overcoming the inflexibility of traditional soundscape augmentation is twofold. First, given a snapshot of a soundscape, the system must be able to select an optimal masker without human supervision. Second, the system must also be able to react to changes in the acoustic environment with near real-time latency. In this work, we harness the combined prowess of cloud computing and the Internet of Things (IoT) to allow in-situ listening and playback using microcontrollers while delegating computationally expensive inference tasks to the cloud. In particular, a serverless cloud architecture was used for inference, ensuring near real-time latency and scalability without the need to provision computing resources. A working prototype of the system is currently being deployed in a public area experiencing high traffic noise, as well as undergoing public evaluation for future improvements.