论文标题
通过人造货币重复游戏中紧迫性意识的最佳路由
Urgency-aware Optimal Routing in Repeated Games through Artificial Currencies
论文作者
论文摘要
当人们选择最小化个人延迟的路线时,与中央凸起的路线相比,总的拥塞可能要高得多。然而,集中式路由与自身利益药物的存在不相容。我们如何调和两者?在本文中,我们在重复的游戏框架中解决了这个问题,并提出了一种基于人工货币的公平激励机制,该机制以系统最佳的方式将自私的代理路由,同时考虑了他们的时间偏好。我们在平行网络中实例化框架,从而反复(例如,每天)从通用启动节点到末端节点进行反复通勤。此后,我们专注于特定的两项案例,该案例基于人造货币,在第一个,快速弧旅行时会收取代理,而在第二次旅行时,它们会得到奖励。我们假设代理商是合理的,并通过一款游戏对他们的选择进行建模,在这种游戏中,每个代理商的目的是最大程度地减少当今不适的结合,并由他们的紧迫性加权,以及在此期间剩下的时间里遇到的平均不适感(例如,一周)。我们表明,如果明智地选择了人工货币的价格,则路由模式会融合到系统最佳解决方案,同时适应代理商的紧迫性。我们通过数值模拟来补充我们的研究。我们的结果表明,与集中式最佳但紧迫的统一政策相比,可以实现系统最佳解决方案,同时将代理人的感知不适减少14-20%。
When people choose routes minimizing their individual delay, the aggregate congestion can be much higher compared to that experienced by a centrally-imposed routing. Yet centralized routing is incompatible with the presence of self-interested agents. How can we reconcile the two? In this paper we address this question within a repeated game framework and propose a fair incentive mechanism based on artificial currencies that routes selfish agents in a system-optimal fashion, while accounting for their temporal preferences. We instantiate the framework in a parallel-network whereby agents commute repeatedly (e.g., daily) from a common start node to the end node. Thereafter, we focus on the specific two-arcs case whereby, based on an artificial currency, the agents are charged when traveling on the first, fast arc, whilst they are rewarded when traveling on the second, slower arc. We assume the agents to be rational and model their choices through a game where each agent aims at minimizing a combination of today's discomfort, weighted by their urgency, and the average discomfort encountered for the rest of the period (e.g., a week). We show that, if prices of artificial currencies are judiciously chosen, the routing pattern converges to a system-optimal solution, while accommodating the agents' urgency. We complement our study through numerical simulations. Our results show that it is possible to achieve a system-optimal solution whilst reducing the agents' perceived discomfort by 14-20% when compared to a centralized optimal but urgency-unaware policy.