论文标题
自主驾驶的类似人类的决策:一种非合作游戏理论方法
Human-Like Decision Making for Autonomous Driving: A Noncooperative Game Theoretic Approach
论文作者
论文摘要
考虑到以后的人类驱动的车辆和自动驾驶汽车(AV)将来会在很长一段时间内在道路上共存,如何将AV合并到人类驾驶员交通生态学中,并将AVS及其与人驾驶员的不正确效果最小化,这是值得考虑的问题。此外,不同的乘客对AV有不同的需求,因此,如何为不同乘客提供个性化选择是AVS的另一个问题。因此,本文为AVS设计了类似人类的决策框架。为AVS制定了不同的驾驶风格和社交互动特征,以驾驶安全性,骑行舒适性和旅行效率,这在决策的建模过程中被考虑。然后,将NASH平衡和Stackelberg游戏理论应用于非合作决策。此外,将潜在的现场方法和模型预测控制(MPC)结合在一起,以处理AVS的运动预测和计划,该预测为决策模块提供了预测的运动信息。最后,进行了两种典型的测试场景,即合并和超车,以评估考虑到不同类似人类行为的不同人类的决策框架的可行性和有效性。测试结果表明,两种游戏理论方法都可以为AV提供合理的类似人类的决策。与NASH平衡方法相比,在正常的驾驶方式下,使用Stackelberg游戏理论方法做出决策的成本价值降低了20%以上。
Considering that human-driven vehicles and autonomous vehicles (AVs) will coexist on roads in the future for a long time, how to merge AVs into human drivers traffic ecology and minimize the effect of AVs and their misfit with human drivers, are issues worthy of consideration. Moreover, different passengers have different needs for AVs, thus, how to provide personalized choices for different passengers is another issue for AVs. Therefore, a human-like decision making framework is designed for AVs in this paper. Different driving styles and social interaction characteristics are formulated for AVs regarding driving safety, ride comfort and travel efficiency, which are considered in the modeling process of decision making. Then, Nash equilibrium and Stackelberg game theory are applied to the noncooperative decision making. In addition, potential field method and model predictive control (MPC) are combined to deal with the motion prediction and planning for AVs, which provides predicted motion information for the decision-making module. Finally, two typical testing scenarios of lane change, i.e., merging and overtaking, are carried out to evaluate the feasibility and effectiveness of the proposed decision-making framework considering different human-like behaviors. Testing results indicate that both the two game theoretic approaches can provide reasonable human-like decision making for AVs. Compared with the Nash equilibrium approach, under the normal driving style, the cost value of decision making using the Stackelberg game theoretic approach is reduced by over 20%.