论文标题
利用终身多代理探索经验
Leveraging Experience in Lifelong Multi-Agent Pathfinding
论文作者
论文摘要
在终生的多代理路径查找(L-MAPF)中,一组代理执行的任务流由代理商在共享图上访问的多个位置组成,同时避免彼此碰撞。 L-MAPF通常通过将其划分为多个连续的方法来解决,从而像滚动 - 莫利兹碰撞分辨率(RHCR)算法一样,类似的“一击” MAPF查询。因此,对一个查询的解决方案会告知下一个查询,这导致与代理人的开始和目标位置相似,以及如何从一个查询到下一个查询。因此,解决一个MAPF查询的经验可能会用于加速解决下一个问题。尽管有这种直觉,但当前的L-MAPF计划者从头开始求解连续的MAPF查询。在本文中,我们引入了一种新的RHCR启发方法,称为EXRHCR,该方法利用其组成的MAPF查询中的经验。特别是,EXRHCR采用了基于优先级的搜索(PBS)的扩展,这是最先进的MAPF求解器。我们称之为EXPB的扩展程序允许在上一个MAPF实例中使用PBS使用的代理之间的优先级进行搜索。我们从经验上证明,EXRHCR解决了比RHCR快39%的L-MAPF实例,并且通过增加计划者可以在给定时间预算中应对的代理人的数量来增加给定的任务流的系统吞吐量。
In Lifelong Multi-Agent Path Finding (L-MAPF) a team of agents performs a stream of tasks consisting of multiple locations to be visited by the agents on a shared graph while avoiding collisions with one another. L-MAPF is typically tackled by partitioning it into multiple consecutive, and hence similar, "one-shot" MAPF queries, as in the Rolling-Horizon Collision Resolution (RHCR) algorithm. Therefore, a solution to one query informs the next query, which leads to similarity with respect to the agents' start and goal positions, and how collisions need to be resolved from one query to the next. Thus, experience from solving one MAPF query can potentially be used to speedup solving the next one. Despite this intuition, current L-MAPF planners solve consecutive MAPF queries from scratch. In this paper, we introduce a new RHCR-inspired approach called exRHCR, which exploits experience in its constituent MAPF queries. In particular, exRHCR employs an extension of Priority-Based Search (PBS), a state-of-the-art MAPF solver. The extension, which we call exPBS, allows to warm-start the search with the priorities between agents used by PBS in the previous MAPF instances. We demonstrate empirically that exRHCR solves L-MAPF instances up to 39% faster than RHCR, and has the potential to increase system throughput for given task streams by increasing the number of agents a planner can cope with for a given time budget.