论文标题
使用基于边界的自动差异信息增益度量对未知2D环境的机器人探索
Robotic Exploration of Unknown 2D Environment Using a Frontier-based Automatic-Differentiable Information Gain Measure
论文作者
论文摘要
自主机器人探索的路径规划方法的核心是一种启发式,它鼓励探索未知环境区域。这种启发式方法通常是使用基于边界或信息理论方法计算的。基于边界的方法将探索路径的信息获益定义为从路径可见的边界单元或边界的数量。但是,这种信息增益量度的离散性和非差异性质使使用基于梯度的方法很难优化。相反,信息理论方法将信息增益定义为传感器测量和探索图之间的相互信息。但是,相互信息梯度的计算涉及有限的差异,因此在计算上很昂贵。这项工作提出了一个探索计划框架,该框架结合了基于抽样的路径计划和基于梯度的路径优化。该框架的主要贡献是对信息增益作为可区分功能的新型重新重新制定。这使我们可以通过其他可区分质量度量(例如平滑度)同时优化信息增益。在使用Turtlebot3汉堡机器人的模拟和硬件实验中验证了拟议的规划框架的有效性。
At the heart of path-planning methods for autonomous robotic exploration is a heuristic which encourages exploring unknown regions of the environment. Such heuristics are typically computed using frontier-based or information-theoretic methods. Frontier-based methods define the information gain of an exploration path as the number of boundary cells, or frontiers, which are visible from the path. However, the discrete and non-differentiable nature of this measure of information gain makes it difficult to optimize using gradient-based methods. In contrast, information-theoretic methods define information gain as the mutual information between the sensor's measurements and the explored map. However, computation of the gradient of mutual information involves finite differencing and is thus computationally expensive. This work proposes an exploration planning framework that combines sampling-based path planning and gradient-based path optimization. The main contribution of this framework is a novel reformulation of information gain as a differentiable function. This allows us to simultaneously optimize information gain with other differentiable quality measures, such as smoothness. The proposed planning framework's effectiveness is verified both in simulation and in hardware experiments using a Turtlebot3 Burger robot.