使用深入的强化学习，基于端到端的基于视觉的自适应巡航控制（ACC）

论文标题

使用深入的强化学习，基于端到端的基于视觉的自适应巡航控制（ACC）

End-to-End Vision-Based Adaptive Cruise Control (ACC) Using Deep Reinforcement Learning

论文作者

Wei, Zhensong, Jiang, Yu, Liao, Xishun, Qi, Xuewei, Wang, Ziran, Wu, Guoyuan, Hao, Peng, Barth, Matthew

论文摘要

本文提出了一种名为Double Deep Q-Networks的深钢筋学习方法，以设计基于端到端视觉的自适应巡航控制（ACC）系统。建立了Unity的高速公路场景的模拟环境，这是一种游戏引擎，既可以提供车辆的物理型号又提供用于训练和测试的功能数据。在内燃机（ICE）车辆和电动汽车（EV）的强化学习模型中，实施了与以下距离和油门/制动力相关的精心设计的奖励功能，以执行自适应巡航控制。评估了不同车辆类型的间隙统计数据和总能耗，以探索奖励功能与动力总成特性之间的关系。与传统的基于雷达的ACC系统或循环模拟相比，根据预设奖励功能，提出的基于视觉的ACC系统可以生成更好的间隙调节轨迹或更平滑的速度轨迹。提出的系统可以很好地适应前面车辆的不同速度轨迹并实时操作。

This paper presented a deep reinforcement learning method named Double Deep Q-networks to design an end-to-end vision-based adaptive cruise control (ACC) system. A simulation environment of a highway scene was set up in Unity, which is a game engine that provided both physical models of vehicles and feature data for training and testing. Well-designed reward functions associated with the following distance and throttle/brake force were implemented in the reinforcement learning model for both internal combustion engine (ICE) vehicles and electric vehicles (EV) to perform adaptive cruise control. The gap statistics and total energy consumption are evaluated for different vehicle types to explore the relationship between reward functions and powertrain characteristics. Compared with the traditional radar-based ACC systems or human-in-the-loop simulation, the proposed vision-based ACC system can generate either a better gap regulated trajectory or a smoother speed trajectory depending on the preset reward function. The proposed system can be well adaptive to different speed trajectories of the preceding vehicle and operated in real-time.

下载PDF全文

下载文献需遵守相关版权规定

论文标题