论文标题
从可见性约束的演示中推断出障碍和路径有效性
Inferring Obstacles and Path Validity from Visibility-Constrained Demonstrations
论文作者
论文摘要
从演示中学习的许多方法都认为示威者对完整的环境有了解。但是,在许多情况下,示威者只看到了一部分环境,并且在收集信息时不断地重新译来。要计划新的路径或重建环境,我们必须考虑示威者的可见性约束和重新介绍过程,据我们所知,这在以前的工作中尚未完成。我们考虑了在2D环境中推断障碍物配置的问题,该问题从能够朝任何方向看到但不能通过障碍物看见的点机器人的路径。给定一组\ textit {调查点},描述了演示者在何处获得新信息和候选路径,我们在环境的单元格分解上构建了约束满意度问题(CSP)。我们将与CSP分配的一组障碍物参数化,并从集合中进行样本以找到有效的环境。我们表明,存在一种概率完整的,但不是完全可牵引的算法,可以保证空间中的新路径是不安全或可能安全的。我们还提出了一种不完整但经验成功的启发式引导算法,我们将实验应用于1)计划新路径和2)恢复对环境的概率表示。
Many methods in learning from demonstration assume that the demonstrator has knowledge of the full environment. However, in many scenarios, a demonstrator only sees part of the environment and they continuously replan as they gather information. To plan new paths or to reconstruct the environment, we must consider the visibility constraints and replanning process of the demonstrator, which, to our knowledge, has not been done in previous work. We consider the problem of inferring obstacle configurations in a 2D environment from demonstrated paths for a point robot that is capable of seeing in any direction but not through obstacles. Given a set of \textit{survey points}, which describe where the demonstrator obtains new information, and a candidate path, we construct a Constraint Satisfaction Problem (CSP) on a cell decomposition of the environment. We parameterize a set of obstacles corresponding to an assignment from the CSP and sample from the set to find valid environments. We show that there is a probabilistically-complete, yet not entirely tractable, algorithm that can guarantee novel paths in the space are unsafe or possibly safe. We also present an incomplete, but empirically-successful, heuristic-guided algorithm that we apply in our experiments to 1) planning novel paths and 2) recovering a probabilistic representation of the environment.