具有可学习的成本功能的可靠集成运动预测和计划

论文标题

具有可学习的成本功能的可靠集成运动预测和计划

Differentiable Integrated Motion Prediction and Planning with Learnable Cost Function for Autonomous Driving

论文作者

Huang, Zhiyu, Liu, Haochen, Wu, Jingda, Lv, Chen

论文摘要

相应地预测周围交通参与者的未来状态，并计划安全，平稳且符合社会的轨迹对于自动驾驶汽车至关重要。当前的自主驾驶系统有两个主要问题：预测模块通常与计划模块分开，并且计划的成本功能很难指定和调整。为了解决这些问题，我们提出了一个可区分的综合预测计划框架（DIPP），该框架也可以从数据中学习成本函数。具体而言，我们的框架使用可区分的非线性优化器作为运动计划者，该框架将神经网络给出的周围剂的预测轨迹作为输入，并优化了自动驾驶汽车的轨迹，使所有操作都能可区分，包括成本功能权重。提出的框架经过大规模的现实驾驶数据集进行了训练，以模仿整个驾驶场景中的人类驾驶轨迹，并在开环和闭环的方式中进行了验证。开环测试结果表明，所提出的方法的表现优于各种指标的基线方法，并提供以计划为中心的预测结果，从而使计划模块能够输出轨迹接近人类驱动因素的轨迹。在闭环测试中，提出的方法的表现优于各种基线方法，表明能够处理复杂的城市驾驶场景和鲁棒性，以防止分配转移。重要的是，我们发现计划和预测模块的联合培训比在开环和闭环测试中使用单独的训练有素的预测模块进行计划要比计划更好。此外，消融研究表明，框架中的可学习组件对于确保计划稳定性和性能至关重要。

Predicting the future states of surrounding traffic participants and planning a safe, smooth, and socially compliant trajectory accordingly is crucial for autonomous vehicles. There are two major issues with the current autonomous driving system: the prediction module is often separated from the planning module and the cost function for planning is hard to specify and tune. To tackle these issues, we propose a differentiable integrated prediction-planning framework (DIPP) that can also learn the cost function from data. Specifically, our framework uses a differentiable nonlinear optimizer as the motion planner, which takes as input the predicted trajectories of surrounding agents given by the neural network and optimizes the trajectory for the autonomous vehicle, enabling all operations to be differentiable, including the cost function weights. The proposed framework is trained on a large-scale real-world driving dataset to imitate human driving trajectories in the entire driving scene and validated in both open-loop and closed-loop manners. The open-loop testing results reveal that the proposed method outperforms the baseline methods across a variety of metrics and delivers planning-centric prediction results, allowing the planning module to output trajectories close to those of human drivers. In closed-loop testing, the proposed method outperforms various baseline methods, showing the ability to handle complex urban driving scenarios and robustness against the distributional shift. Importantly, we find that joint training of planning and prediction modules achieves better performance than planning with a separate trained prediction module in both open-loop and closed-loop tests. Moreover, the ablation study indicates that the learnable components in the framework are essential to ensure planning stability and performance.

下载PDF全文

下载文献需遵守相关版权规定

论文标题