论文标题
在差异条件下,卷积神经网络的统件估计
Homography Estimation with Convolutional Neural Networks Under Conditions of Variance
论文作者
论文摘要
平面同型估计是许多计算机视觉问题的基础,例如同时定位和映射(SLAM)和增强现实(AR)。但是,即使是最新的算法,高方差的条件也混淆了。在本报告中,我们使用卷积神经网络(CNN)分析了两种最近发表的方法的性能,这些方法旨在替代更传统的基于功能匹配的方法来估计同型同构的估计。我们对基于CNN的方法的评估尤其着重于在明显的噪声,照明转移和遮挡条件下测量性能。我们还以不同程度的噪声来衡量训练CNN的好处。此外,我们比较使用颜色图像代替灰度图像的效果,以输入到CNN。最后,我们将结果与使用SIFT,SURF和ORB的基线功能匹配的同型估计方法进行比较。我们发现,可以对CNN进行训练,以防止噪音更强大,但是在噪音的情况下,准确性的成本很小。此外,在极端差异条件下,CNN的性能要比其基于功能匹配的对应物要好得多。关于颜色输入,我们得出结论,如果CNN体系结构没有更改以利用颜色平面中的其他信息,则使用颜色输入或灰度输入的性能差异可以忽略不计。关于接受噪声浪费的输入训练的CNN,我们表明,训练CNN对特定噪声的特定噪声导致了有关CNN表现最佳的噪声水平的“ Goldilocks区”。
Planar homography estimation is foundational to many computer vision problems, such as Simultaneous Localization and Mapping (SLAM) and Augmented Reality (AR). However, conditions of high variance confound even the state-of-the-art algorithms. In this report, we analyze the performance of two recently published methods using Convolutional Neural Networks (CNNs) that are meant to replace the more traditional feature-matching based approaches to the estimation of homography. Our evaluation of the CNN based methods focuses particularly on measuring the performance under conditions of significant noise, illumination shift, and occlusion. We also measure the benefits of training CNNs to varying degrees of noise. Additionally, we compare the effect of using color images instead of grayscale images for inputs to CNNs. Finally, we compare the results against baseline feature-matching based homography estimation methods using SIFT, SURF, and ORB. We find that CNNs can be trained to be more robust against noise, but at a small cost to accuracy in the noiseless case. Additionally, CNNs perform significantly better in conditions of extreme variance than their feature-matching based counterparts. With regard to color inputs, we conclude that with no change in the CNN architecture to take advantage of the additional information in the color planes, the difference in performance using color inputs or grayscale inputs is negligible. About the CNNs trained with noise-corrupted inputs, we show that training a CNN to a specific magnitude of noise leads to a "Goldilocks Zone" with regard to the noise levels where that CNN performs best.