论文标题
使用深钢筋学习的车辆自主排的强大纵向控制
Robust Longitudinal Control for Vehicular Autonomous Platoons Using Deep Reinforcement Learning
论文作者
论文摘要
在过去的几年中,研究人员在车辆排的背景下应用了机器学习策略,以提高合作运输的安全性和效率。加强学习方法已用于对合作自适应巡航控制系统的纵向间距控制中,但是迄今为止,在这种情况下,没有一个研究解决了干扰拒绝的问题。诸如模型中不确定参数和外部干扰的特征可能会阻止代理以巡航速度行驶时达到空间距的错误。另一方面,复杂的通信拓扑导致特定的培训过程,这些过程无法推广到其他情况下,要求每次配置更改时重新训练。因此,在本文中,我们提出了一种概括车辆排的训练过程的方法,使每个代理的加速命令都独立于网络拓扑。此外,我们已经将加速度输入建模为具有整体作用的术语,因此,当国家受到未知效应打扰的状态时,人工神经网络能够学习纠正措施。我们通过使用不同的网络拓扑,不确定参数和外部力的实验来说明提案的有效性。就稳态误差和过冲响应而言,进行了比较分析,是针对最新文献进行的。这些发现提供了有关在自主排的控制中使用强化学习的概括和鲁棒性的新见解。
In the last few years, researchers have applied machine learning strategies in the context of vehicular platoons to increase the safety and efficiency of cooperative transportation. Reinforcement Learning methods have been employed in the longitudinal spacing control of Cooperative Adaptive Cruise Control systems, but to date, none of those studies have addressed problems of disturbance rejection in such scenarios. Characteristics such as uncertain parameters in the model and external interferences may prevent agents from reaching null-spacing errors when traveling at cruising speed. On the other hand, complex communication topologies lead to specific training processes that can not be generalized to other contexts, demanding re-training every time the configuration changes. Therefore, in this paper, we propose an approach to generalize the training process of a vehicular platoon, such that the acceleration command of each agent becomes independent of the network topology. Also, we have modeled the acceleration input as a term with integral action, such that the Artificial Neural Network is capable of learning corrective actions when the states are disturbed by unknown effects. We illustrate the effectiveness of our proposal with experiments using different network topologies, uncertain parameters, and external forces. Comparative analyses, in terms of the steady-state error and overshoot response, were conducted against the state-of-the-art literature. The findings offer new insights concerning generalization and robustness of using Reinforcement Learning in the control of autonomous platoons.