深度负担的远见：计划将来可以做什么

论文标题

深度负担的远见：计划将来可以做什么

Deep Affordance Foresight: Planning Through What Can Be Done in the Future

论文作者

Xu, Danfei, Mandlekar, Ajay, Martín-Martín, Roberto, Zhu, Yuke, Savarese, Silvio, Fei-Fei, Li

论文摘要

在现实的环境中进行计划需要在大型计划空间中进行搜索。负担是一个简化此搜索的强大概念，因为它们在给定情况下建模了哪些行动可以成功。但是，经典的负担概念不适合长时间的计划，因为它只会告知机器人行动的直接结果，而不是最适合实现长期目标的行动。在本文中，我们介绍了一种新的负担能力表示，使机器人能够通过建模将来的行动来理解行动对行动的长期影响，从而告知机器人在实现任务目标中采取的最佳动作。根据新的表示，我们开发了一种学习对计划的方法，即深度负担的预见（DAF），该方法通过试用和错误学习了部分环境模型的参数化运动技能。我们在两个具有挑战性的操纵域上评估DAF，并表明它可以有效地学习执行多步骤任务，在不同任务之间共享学习的负担能力表示，并学会使用高维图像输入进行计划。可从https://sites.google.com/stanford.edu/daf获得其他材料

Planning in realistic environments requires searching in large planning spaces. Affordances are a powerful concept to simplify this search, because they model what actions can be successful in a given situation. However, the classical notion of affordance is not suitable for long horizon planning because it only informs the robot about the immediate outcome of actions instead of what actions are best for achieving a long-term goal. In this paper, we introduce a new affordance representation that enables the robot to reason about the long-term effects of actions through modeling what actions are afforded in the future, thereby informing the robot the best actions to take next to achieve a task goal. Based on the new representation, we develop a learning-to-plan method, Deep Affordance Foresight (DAF), that learns partial environment models of affordances of parameterized motor skills through trial-and-error. We evaluate DAF on two challenging manipulation domains and show that it can effectively learn to carry out multi-step tasks, share learned affordance representations among different tasks, and learn to plan with high-dimensional image inputs. Additional material is available at https://sites.google.com/stanford.edu/daf

下载PDF全文

下载文献需遵守相关版权规定

论文标题