课程学习具有多样性，用于监督计算机视觉任务

论文标题

课程学习具有多样性，用于监督计算机视觉任务

Curriculum Learning with Diversity for Supervised Computer Vision Tasks

论文作者

Soviany, Petru

论文摘要

课程学习技术是通过易于匹配的策略替换传统的随机培训来提高自动模型准确性的可行解决方案。但是，标准课程方法不会自动提供改进的结果，但是它受到多个元素（例如数据分布或提出的模型）的约束。在本文中，我们引入了一种新颖的课程抽样策略，该策略考虑了培训数据的多样性以及输入的难度。我们根据解决视觉搜索任务所需的人类时间来确定使用最先进的估计器的难度。我们认为这种困难指标更适合解决一般问题，因为它不是基于某些任务依赖性元素，而是基于每个图像的上下文。我们确保培训期间的多样性，对访问较少的课程的要素更加优先。我们对Pascal VOC 2007和CityScapes数据集进行了对象检测和实例分割实验，超过了随机训练的基线和标准课程方法。我们证明，当其他基于课程的策略失败时，我们的策略对于不平衡的数据集非常有效，从而导致更快的收敛和更准确的结果。

Curriculum learning techniques are a viable solution for improving the accuracy of automatic models, by replacing the traditional random training with an easy-to-hard strategy. However, the standard curriculum methodology does not automatically provide improved results, but it is constrained by multiple elements like the data distribution or the proposed model. In this paper, we introduce a novel curriculum sampling strategy which takes into consideration the diversity of the training data together with the difficulty of the inputs. We determine the difficulty using a state-of-the-art estimator based on the human time required for solving a visual search task. We consider this kind of difficulty metric to be better suited for solving general problems, as it is not based on certain task-dependent elements, but more on the context of each image. We ensure the diversity during training, giving higher priority to elements from less visited classes. We conduct object detection and instance segmentation experiments on Pascal VOC 2007 and Cityscapes data sets, surpassing both the randomly-trained baseline and the standard curriculum approach. We prove that our strategy is very efficient for unbalanced data sets, leading to faster convergence and more accurate results, when other curriculum-based strategies fail.

下载PDF全文

下载文献需遵守相关版权规定

论文标题