论文标题
重新思考零售商店中的对象检测
Rethinking Object Detection in Retail Stores
论文作者
论文摘要
对象检测的约定标准使用一个边界框来表示每个单独的对象实例。但是,由于同一类别群体之间的严重阻塞,在仓库中与行业相关的应用中,它在仓库的情况下是不实际的。在本文中,我们提出了一项新任务,即同时反对本地化和计数,缩写为Locount,该算法需要算法将感兴趣的对象组定位为实例的数量。但是,不存在为此类任务设计的数据集或基准。为此,我们收集了一个大规模的对象本地化和计数数据集,并在零售商店中具有丰富的注释,该数据集由50,394张图像组成,其中140个类别的对象实例超过190万。与此数据集一起,我们提供了一个新的评估协议,并将培训和测试子集划分,以公平地评估算法的算法,从而为置换任务开发了新的基准测试。此外,我们将级联的本地化和计数网络视为一个强大的基线,该基线逐渐将对象的边界框进行分类和回归,并用封闭在边界框中的实例数量数量,以端到端的方式进行了训练。在拟议的数据集上进行了广泛的实验,以证明其重要性,并提供了有关故障案例的分析讨论以指示未来的方向。数据集可在https://isrc.iscas.ac.cn/gitlab/research/locount-dataset上找到。
The convention standard for object detection uses a bounding box to represent each individual object instance. However, it is not practical in the industry-relevant applications in the context of warehouses due to severe occlusions among groups of instances of the same categories. In this paper, we propose a new task, ie, simultaneously object localization and counting, abbreviated as Locount, which requires algorithms to localize groups of objects of interest with the number of instances. However, there does not exist a dataset or benchmark designed for such a task. To this end, we collect a large-scale object localization and counting dataset with rich annotations in retail stores, which consists of 50,394 images with more than 1.9 million object instances in 140 categories. Together with this dataset, we provide a new evaluation protocol and divide the training and testing subsets to fairly evaluate the performance of algorithms for Locount, developing a new benchmark for the Locount task. Moreover, we present a cascaded localization and counting network as a strong baseline, which gradually classifies and regresses the bounding boxes of objects with the predicted numbers of instances enclosed in the bounding boxes, trained in an end-to-end manner. Extensive experiments are conducted on the proposed dataset to demonstrate its significance and the analysis discussions on failure cases are provided to indicate future directions. Dataset is available at https://isrc.iscas.ac.cn/gitlab/research/locount-dataset.