论文标题

COLAR:通过咨询典范的有效有效的在线操作检测

Colar: Effective and Efficient Online Action Detection by Consulting Exemplars

论文作者

Yang, Le, Han, Junwei, Zhang, Dingwen

论文摘要

近年来,在线行动检测吸引了增加的研究兴趣。当前的作品模拟了历史依赖性,并预期未来可以感知视频段内的动作演变并提高检测准确性。但是,现有的范式忽略了类别级建模,并且对效率没有足够的关注。考虑到一个类别,其代表性框架具有各种特征。因此,类别级建模可以为时间依赖性建模提供免费指导。本文开发了一种有效的示例性辅助机制,该机制首先测量框架和示例框架之间的相似性,然后根据相似性权重汇总示例性特征。这也是一种有效的机制,因为相似性测量和特征聚集都需要有限的计算。基于示例性辅助机制,可以通过历史框架作为示例来捕获长期依赖性,而类别级建模可以通过从类别中作为示例的代表性框架来实现。由于类别级建模的互补性,我们的方法采用了轻量级的体系结构,但在三个基准上实现了新的高性能。此外,使用时空网络来解决视频帧,我们的方法在有效性和效率之间取决了良好的权衡。代码可在https://github.com/vividle/online-action-detection上找到。

Online action detection has attracted increasing research interests in recent years. Current works model historical dependencies and anticipate the future to perceive the action evolution within a video segment and improve the detection accuracy. However, the existing paradigm ignores category-level modeling and does not pay sufficient attention to efficiency. Considering a category, its representative frames exhibit various characteristics. Thus, the category-level modeling can provide complimentary guidance to the temporal dependencies modeling. This paper develops an effective exemplar-consultation mechanism that first measures the similarity between a frame and exemplary frames, and then aggregates exemplary features based on the similarity weights. This is also an efficient mechanism, as both similarity measurement and feature aggregation require limited computations. Based on the exemplar-consultation mechanism, the long-term dependencies can be captured by regarding historical frames as exemplars, while the category-level modeling can be achieved by regarding representative frames from a category as exemplars. Due to the complementarity from the category-level modeling, our method employs a lightweight architecture but achieves new high performance on three benchmarks. In addition, using a spatio-temporal network to tackle video frames, our method makes a good trade-off between effectiveness and efficiency. Code is available at https://github.com/VividLe/Online-Action-Detection.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源