学习综合到实际转移以进行本地化和导航任务

论文标题

学习综合到实际转移以进行本地化和导航任务

Learning Synthetic to Real Transfer for Localization and Navigational Tasks

论文作者

Pietrantoni, Maxime, Chidlovskii, Boris, Silander, Tomi

论文摘要

自主导航包括能够在没有人为干预或监督的情况下导航的代理，它会影响高级计划和低水平控制。导航位于多个学科的十字路口，它结合了计算机视觉，机器人技术和控制的概念。这项工作旨在在模拟中创建导航管道，其转移到现实世界中可以尽可能少地进行。考虑到有限的时间和要解决的各种问题，绝对的导航表演并不是主要目标。重点是研究SIM2REAL差距，这是现代机器人技术和自动导航的主要瓶颈之一。为了设计导航管道，出现了四个主要挑战；环境，本地化，导航和计划。 Igibson模拟器的照片真实纹理和物理引擎被选中。在公制方法上选择了一种解决空间表示的拓扑方法，因为它们可以更好地推广到新环境，并且对变化的变化不太敏感。导航管道被分解为本地化模块，计划模块和本地导航模块。这些模块利用三个不同的网络：图像表示提取器，通道检测器和本地策略。通过针对这些特定任务创建的一些相关数据集对专门量身定制的任务进行了培训。本地化是代理在特定空间表示方面进行本地定位的能力。对于多种转化，它必须可靠，可重复和健壮。使用在辅助任务上训练的深神经网络作为特征描述符提取器，将本地化作为图像检索任务解决。本地政策接受了与ROS Navigation堆栈聚集的专家轨迹的行为克隆培训。

Autonomous navigation consists in an agent being able to navigate without human intervention or supervision, it affects both high level planning and low level control. Navigation is at the crossroad of multiple disciplines, it combines notions of computer vision, robotics and control. This work aimed at creating, in a simulation, a navigation pipeline whose transfer to the real world could be done with as few efforts as possible. Given the limited time and the wide range of problematic to be tackled, absolute navigation performances while important was not the main objective. The emphasis was rather put on studying the sim2real gap which is one the major bottlenecks of modern robotics and autonomous navigation. To design the navigation pipeline four main challenges arise; environment, localization, navigation and planning. The iGibson simulator is picked for its photo-realistic textures and physics engine. A topological approach to tackle space representation was picked over metric approaches because they generalize better to new environments and are less sensitive to change of conditions. The navigation pipeline is decomposed as a localization module, a planning module and a local navigation module. These modules utilize three different networks, an image representation extractor, a passage detector and a local policy. The laters are trained on specifically tailored tasks with some associated datasets created for those specific tasks. Localization is the ability for the agent to localize itself against a specific space representation. It must be reliable, repeatable and robust to a wide variety of transformations. Localization is tackled as an image retrieval task using a deep neural network trained on an auxiliary task as a feature descriptor extractor. The local policy is trained with behavioral cloning from expert trajectories gathered with ROS navigation stack.

下载PDF全文

下载文献需遵守相关版权规定

论文标题