使用激光镜头和摄像头的视觉地图本地化调查

论文标题

使用激光镜头和摄像头的视觉地图本地化调查

A Survey on Visual Map Localization Using LiDARs and Cameras

论文作者

Mahdi, Elhousni, Xinming, Huang

论文摘要

随着自动驾驶行业正在缓慢成熟，视觉地图本地化正在迅速成为尽可能准确定位汽车的标准方法。由于相机或激光镜等视觉传感器返回的丰富数据，研究人员能够构建具有各种细节的不同类型的地图，并使用它们来实现高水平的车辆定位精度和在城市环境中的稳定性。与流行的SLAM方法相反，视觉图的定位依赖于预构建的地图，并且仅通过避免误差积累或漂移来提高定位准确性。我们将视觉图定位定义为两个阶段的过程。在位置识别的阶段，通过将视觉传感器输出与一组地理标记的地图区域进行比较，可以确定车辆在地图中的初始位置。随后，在MAP指标定位的阶段，通过将视觉传感器的输出与正在穿越的地图的当前区域保持一致，在整个地图上移动时跟踪车辆。在本文中，我们针对这两个阶段进行了调查，讨论和比较基于激光雷达的，基于相机和跨模式的视觉图本地化的最新方法，以突出每种方法的优势。

As the autonomous driving industry is slowly maturing, visual map localization is quickly becoming the standard approach to localize cars as accurately as possible. Owing to the rich data returned by visual sensors such as cameras or LiDARs, researchers are able to build different types of maps with various levels of details, and use them to achieve high levels of vehicle localization accuracy and stability in urban environments. Contrary to the popular SLAM approaches, visual map localization relies on pre-built maps, and is focused solely on improving the localization accuracy by avoiding error accumulation or drift. We define visual map localization as a two-stage process. At the stage of place recognition, the initial position of the vehicle in the map is determined by comparing the visual sensor output with a set of geo-tagged map regions of interest. Subsequently, at the stage of map metric localization, the vehicle is tracked while it moves across the map by continuously aligning the visual sensors' output with the current area of the map that is being traversed. In this paper, we survey, discuss and compare the latest methods for LiDAR based, camera based and cross-modal visual map localization for both stages, in an effort to highlight the strength and weakness of each approach.

下载PDF全文

下载文献需遵守相关版权规定

论文标题