论文标题
深度增强特征金字塔网络,用于从倾斜图像对建筑物的遮挡感知验证
Depth-Enhanced Feature Pyramid Network for Occlusion-Aware Verification of Buildings from Oblique Images
论文作者
论文摘要
检测城市环境中建筑物的变化至关重要。仅使用Nadir图像的现有方法遭受了建筑物和其他地区之间模棱两可的特征和遮挡的严重问题。此外,城市环境中的建筑物的规模差异很大,这在使用单一尺度功能时会导致性能问题。为了解决这些问题,本文提出了一个融合的特征金字塔网络,该网络利用颜色和深度数据来对现有建筑物的3D验证2D脚印,从斜图像进行2D脚印。首先,倾斜图像的颜色数据充满了从3D网格模型呈现的深度信息。其次,在功能金字塔网络中融合了多尺度功能,以卷入颜色和深度数据。最后,来自Nadir和斜图像的多视图信息都在强大的投票程序中使用,以标记现有建筑物的变化。使用ISPRS基准数据集和深圳数据集的实验评估表明,就召回率和精确度而言,所提出的方法分别优于Resnet和有效网络的表现分别优于5 \%和2 \%。我们证明了所提出的方法可以成功检测所有已更改的建筑物。因此,只有在管道更新过程中需要手动检查标记为更改的人;这大大减少了手动质量控制要求。此外,消融研究表明,使用深度数据,特征金字塔模块和多视图投票策略可以导致清晰而渐进的改进。
Detecting the changes of buildings in urban environments is essential. Existing methods that use only nadir images suffer from severe problems of ambiguous features and occlusions between buildings and other regions. Furthermore, buildings in urban environments vary significantly in scale, which leads to performance issues when using single-scale features. To solve these issues, this paper proposes a fused feature pyramid network, which utilizes both color and depth data for the 3D verification of existing buildings 2D footprints from oblique images. First, the color data of oblique images are enriched with the depth information rendered from 3D mesh models. Second, multiscale features are fused in the feature pyramid network to convolve both the color and depth data. Finally, multi-view information from both the nadir and oblique images is used in a robust voting procedure to label changes in existing buildings. Experimental evaluations using both the ISPRS benchmark datasets and Shenzhen datasets reveal that the proposed method outperforms the ResNet and EfficientNet networks by 5\% and 2\%, respectively, in terms of recall rate and precision. We demonstrate that the proposed method can successfully detect all changed buildings; therefore, only those marked as changed need to be manually checked during the pipeline updating procedure; this significantly reduces the manual quality control requirements. Moreover, ablation studies indicate that using depth data, feature pyramid modules, and multi-view voting strategies can lead to clear and progressive improvements.