长尾相机陷阱识别的平衡域专家

论文标题

长尾相机陷阱识别的平衡域专家

Balancing Domain Experts for Long-Tailed Camera-Trap Recognition

论文作者

Park, Byeongjun, Kim, Jeongsoo, Cho, Seungju, Kim, Heeseon, Kim, Changick

论文摘要

相机陷阱图像中的标签分布是高度不平衡且长尾巴的，导致神经网络倾向于偏向经常出现的头层。尽管已经探索了长尾学习来解决数据失衡，但很少有研究考虑摄像机陷阱特征，例如多域和多帧设置。在这里，我们提出了一个统一的框架，并介绍了两个数据集以进行长尾相机陷阱识别。我们首先设计域专家，每个专家都学会平衡由数据失衡引起的不完善的决策边界，并相互补充以产生域平衡的决策界限。另外，我们提出了集中在移动对象上的流量一致性损失，期望多框架的类激活图与输入图像的光流图相匹配。此外，引入了两个长尾相机陷阱数据集WCS-LT和DMZ-LT，以验证我们的方法。实验结果显示了我们框架的有效性，并且提出的方法的表现优于先前的隐性域样本方法。

Label distributions in camera-trap images are highly imbalanced and long-tailed, resulting in neural networks tending to be biased towards head-classes that appear frequently. Although long-tail learning has been extremely explored to address data imbalances, few studies have been conducted to consider camera-trap characteristics, such as multi-domain and multi-frame setup. Here, we propose a unified framework and introduce two datasets for long-tailed camera-trap recognition. We first design domain experts, where each expert learns to balance imperfect decision boundaries caused by data imbalances and complement each other to generate domain-balanced decision boundaries. Also, we propose a flow consistency loss to focus on moving objects, expecting class activation maps of multi-frame matches the flow with optical flow maps for input images. Moreover, two long-tailed camera-trap datasets, WCS-LT and DMZ-LT, are introduced to validate our methods. Experimental results show the effectiveness of our framework, and proposed methods outperform previous methods on recessive domain samples.

下载PDF全文

下载文献需遵守相关版权规定

论文标题