OpenImage2019的第一名解决方案 - 对象检测和实例分段

论文标题

OpenImage2019的第一名解决方案 - 对象检测和实例分段

1st Place Solutions for OpenImage2019 -- Object Detection and Instance Segmentation

论文作者

Liu, Yu, Song, Guanglu, Zang, Yuhang, Gao, Yan, Xie, Enze, Yan, Junjie, Loy, Chen Change, Wang, Xiaogang

论文摘要

本文介绍了两个冠军团队的解决方案，“ MMFRUIT”用于检测轨道和“ MMFRUITSEG”的分段曲目，在OpenImage Challenge 2019中。众所周知，对于对象探测器而言，共享功能通常不适合分类和回归级别的单个阶段detction detctor cormession the Backbone，这两种功能都不适用。 \ cite {ren2015faster}基于检测器。在这场比赛中，我们观察到，即使有了共同的功能，一个对象中的不同位置对于这两个任务的表现完全不一致。 \ textit {例如。显着位置的特征通常非常适合分类，而对象边缘周围的特征则适合回归。}受到此启发，我们建议通过自我学习的最佳特征提取来解散对象分类和回归，从而导致对象分类和回归。此外，我们将Soft-NMS算法调整为ADJ-NMS以获得稳定的性能改进。最后，提出了一个精心设计的合奏策略，通过投票框架的位置和信心。我们还将介绍几种培训/推理策略和一袋技巧，这些技巧可以略有改进。鉴于这些细节的大量，我们培训和汇总了28个全球模型，具有各种骨架，头部和3+2个专家模型，并在公共和私人铅板上都获得了2019年OpenImage 2019对象检测挑战的第一名。给定这样一个良好的实例边界框，我们进一步设计了一个简单的实例级语义分割管道，并在分割挑战中获得第一名。

This article introduces the solutions of the two champion teams, `MMfruit' for the detection track and `MMfruitSeg' for the segmentation track, in OpenImage Challenge 2019. It is commonly known that for an object detector, the shared feature at the end of the backbone is not appropriate for both classification and regression, which greatly limits the performance of both single stage detector and Faster RCNN \cite{ren2015faster} based detector. In this competition, we observe that even with a shared feature, different locations in one object has completely inconsistent performances for the two tasks. \textit{E.g. the features of salient locations are usually good for classification, while those around the object edge are good for regression.} Inspired by this, we propose the Decoupling Head (DH) to disentangle the object classification and regression via the self-learned optimal feature extraction, which leads to a great improvement. Furthermore, we adjust the soft-NMS algorithm to adj-NMS to obtain stable performance improvement. Finally, a well-designed ensemble strategy via voting the bounding box location and confidence is proposed. We will also introduce several training/inferencing strategies and a bag of tricks that give minor improvement. Given those masses of details, we train and aggregate 28 global models with various backbones, heads and 3+2 expert models, and achieves the 1st place on the OpenImage 2019 Object Detection Challenge on the both public and private leadboards. Given such good instance bounding box, we further design a simple instance-level semantic segmentation pipeline and achieve the 1st place on the segmentation challenge.

下载PDF全文

下载文献需遵守相关版权规定

论文标题