论文标题
RFMASK:带无线电信号的人类轮廓分割的简单基线
RFMask: A Simple Baseline for Human Silhouette Segmentation with Radio Signals
论文作者
论文摘要
最初在计算机视觉中定义的人类轮廓细分已获得理解人类活动的有希望的结果。但是,物理限制使现有的系统基于光相机,在低照明,烟雾和/或不透明的阻塞条件下遭受了严重的性能降解。为了克服这种局限性,在本文中,我们建议利用无线电信号,该信号可以遍历障碍物,并且不受照明条件的影响以实现轮廓分割。提出的RFMASK框架由三个模块组成。它首先将毫米波雷达在两个平面上捕获的RF信号转换为空间域,并抑制对信号处理模块的干扰。然后,它将人类对RF框架的反射定位,并通过人类检测模块从周围信号中提取特征。最后,从RF框架中提取的特征通过基于注意力的掩码生成模块聚合。为了验证我们提出的框架,我们收集了一个包含804,760台无线电框架和402,380个相机框架的数据集,并在各种场景下进行人体活动。实验结果表明,即使在具有挑战性的场景(例如,基于光学相机的方法)失败的情况下,提出的框架也可以实现令人印象深刻的人类轮廓分割。据我们所知,这是基于毫米波信号分割人类轮廓的首次研究。我们希望我们的工作可以作为基准,并激发通过无线电信号执行视觉任务的进一步研究。数据集和代码将在公开场合进行。
Human silhouette segmentation, which is originally defined in computer vision, has achieved promising results for understanding human activities. However, the physical limitation makes existing systems based on optical cameras suffer from severe performance degradation under low illumination, smoke, and/or opaque obstruction conditions. To overcome such limitations, in this paper, we propose to utilize the radio signals, which can traverse obstacles and are unaffected by the lighting conditions to achieve silhouette segmentation. The proposed RFMask framework is composed of three modules. It first transforms RF signals captured by millimeter wave radar on two planes into spatial domain and suppress interference with the signal processing module. Then, it locates human reflections on RF frames and extract features from surrounding signals with human detection module. Finally, the extracted features from RF frames are aggregated with an attention based mask generation module. To verify our proposed framework, we collect a dataset containing 804,760 radio frames and 402,380 camera frames with human activities under various scenes. Experimental results show that the proposed framework can achieve impressive human silhouette segmentation even under the challenging scenarios(such as low light and occlusion scenarios) where traditional optical-camera-based methods fail. To the best of our knowledge, this is the first investigation towards segmenting human silhouette based on millimeter wave signals. We hope that our work can serve as a baseline and inspire further research that perform vision tasks with radio signals. The dataset and codes will be made in public.