论文标题
学会在暹罗跟踪器中融合不对称特征地图
Learning to Fuse Asymmetric Feature Maps in Siamese Trackers
论文作者
论文摘要
最近,基于暹罗的跟踪器在视觉跟踪中取得了有希望的表现。最新的基于暹罗的跟踪器通常采用深度的互相关(DW-XCORR)来从两个特征图(目标和搜索区域)中获取多通道相关信息。但是,DW-XCORR在基于暹罗的跟踪中有几个局限性:它很容易被干扰器愚弄,激活的频道更少,并且对对象边界的歧视较弱。此外,DW-XCORR是一种无参数的模块,无法完全受益于大规模数据的离线学习。我们提出了一个可学习的模块,称为非对称卷积(ACM),该模块学会了在大规模数据的离线培训中更好地捕获语义相关信息。与DW-XCORR及其前身(XCORR)不同,将单个特征映射视为卷积内核,我们的ACM将串联特征映射上的卷积操作分解为两个数学上等效的操作,从而避免了在串联过程中需要具有相同大小和高度的特征图的需求。我们的ACM可以将有用的先前信息(例如边界框大小)与标准视觉功能结合在一起。此外,基于DW-Xcorror Xcorr,可以轻松地将ACM集成到现有的暹罗跟踪器中。为了证明其概括能力,我们将ACM整合到三个代表性跟踪器中:siAMFC,siamrpn ++和Siamban。我们的实验揭示了拟议的ACM的好处,该ACM优于六个跟踪基准的现有方法。在LASOT测试集中,我们基于ACM的跟踪器在成功方面(AUC)在基线上获得了5.8%的显着提高。
Recently, Siamese-based trackers have achieved promising performance in visual tracking. Most recent Siamese-based trackers typically employ a depth-wise cross-correlation (DW-XCorr) to obtain multi-channel correlation information from the two feature maps (target and search region). However, DW-XCorr has several limitations within Siamese-based tracking: it can easily be fooled by distractors, has fewer activated channels, and provides weak discrimination of object boundaries. Further, DW-XCorr is a handcrafted parameter-free module and cannot fully benefit from offline learning on large-scale data. We propose a learnable module, called the asymmetric convolution (ACM), which learns to better capture the semantic correlation information in offline training on large-scale data. Different from DW-XCorr and its predecessor(XCorr), which regard a single feature map as the convolution kernel, our ACM decomposes the convolution operation on a concatenated feature map into two mathematically equivalent operations, thereby avoiding the need for the feature maps to be of the same size (width and height)during concatenation. Our ACM can incorporate useful prior information, such as bounding-box size, with standard visual features. Furthermore, ACM can easily be integrated into existing Siamese trackers based on DW-XCorror XCorr. To demonstrate its generalization ability, we integrate ACM into three representative trackers: SiamFC, SiamRPN++, and SiamBAN. Our experiments reveal the benefits of the proposed ACM, which outperforms existing methods on six tracking benchmarks. On the LaSOT test set, our ACM-based tracker obtains a significant improvement of 5.8% in terms of success (AUC), over the baseline.