论文标题
积极学习不平衡的民用基础设施数据
Active Learning for Imbalanced Civil Infrastructure Data
论文作者
论文摘要
工程师对民用基础设施的老化密切监测,以确保损害和严重缺陷。由于对如此大型结构的手动检查是昂贵且耗时的,因此我们正在努力充分自动化视觉检查,以支持维护活动的优先级。为此,我们结合了无人机技术的最新进展和深度学习。不幸的是,注释成本非常高,因为我们专有的土木工程数据集必须由训练有素的工程师注释。因此,主动学习是优化模型性能和注释成本之间的权衡取舍的宝贵工具。我们的用例与经典的主动学习环境有所不同,因为我们的数据集患有沉重的类不平衡,并且与其他活跃的学习研究相比,已经标记的数据库更大。我们提出了一种新颖的方法,能够通过辅助二元歧视器代替传统的主动学习获取功能,以在这种挑战性的环境中运行。我们通过实验表明,我们的新方法在CIFAR-10和我们的专有数据集上的表现分别优于表现最佳的传统活性学习方法(BALD)5%和38%的精度。
Aging civil infrastructures are closely monitored by engineers for damage and critical defects. As the manual inspection of such large structures is costly and time-consuming, we are working towards fully automating the visual inspections to support the prioritization of maintenance activities. To that end we combine recent advances in drone technology and deep learning. Unfortunately, annotation costs are incredibly high as our proprietary civil engineering dataset must be annotated by highly trained engineers. Active learning is, therefore, a valuable tool to optimize the trade-off between model performance and annotation costs. Our use-case differs from the classical active learning setting as our dataset suffers from heavy class imbalance and consists of a much larger already labeled data pool than other active learning research. We present a novel method capable of operating in this challenging setting by replacing the traditional active learning acquisition function with an auxiliary binary discriminator. We experimentally show that our novel method outperforms the best-performing traditional active learning method (BALD) by 5% and 38% accuracy on CIFAR-10 and our proprietary dataset respectively.