论文标题
社交二元:基于社交互动模式的多模式轨迹预测
Social-DualCVAE: Multimodal Trajectory Forecasting Based on Social Interactions Pattern Aware and Dual Conditional Variational Auto-Encoder
论文作者
论文摘要
行人轨迹预测是多个公用事业领域的基本任务,例如自动驾驶,自动机器人和监视系统。未来的轨迹预测是多模式的,受到与场景环境的物理互动以及行人之间复杂的社交互动的影响。现有的主要文献通过深度学习网络学习了社交互动的表示,而没有利用明确的互动模式。不同的相互作用模式,例如避免或避免碰撞,将产生下一步运动的不同趋势,因此,对社会互动模式的认识对于轨迹预测很重要。此外,社会互动模式是有关隐私或缺乏标签的。为了共同解决上述问题,我们为多模式轨迹预测提供了社会双重的条件变异自动编码器(社交二元),该预测基于一种生成模型,不仅基于过去的轨迹,而且还基于不经验的交互模式分类。在生成了未标记的社会互动模式的类别分布之后,根据过去的轨迹和社会相互作用模式进行的DUALCVAE是通过估计潜在变量预测的多模式轨迹预测的。在训练过程中,差异结合是最小化目标。对广泛使用的轨迹基准进行评估所提出的模型,并优于先前的最新方法。
Pedestrian trajectory forecasting is a fundamental task in multiple utility areas, such as self-driving, autonomous robots, and surveillance systems. The future trajectory forecasting is multi-modal, influenced by physical interaction with scene contexts and intricate social interactions among pedestrians. The mainly existing literature learns representations of social interactions by deep learning networks, while the explicit interaction patterns are not utilized. Different interaction patterns, such as following or collision avoiding, will generate different trends of next movement, thus, the awareness of social interaction patterns is important for trajectory forecasting. Moreover, the social interaction patterns are privacy concerned or lack of labels. To jointly address the above issues, we present a social-dual conditional variational auto-encoder (Social-DualCVAE) for multi-modal trajectory forecasting, which is based on a generative model conditioned not only on the past trajectories but also the unsupervised classification of interaction patterns. After generating the category distribution of the unlabeled social interaction patterns, DualCVAE, conditioned on the past trajectories and social interaction pattern, is proposed for multi-modal trajectory prediction by latent variables estimating. A variational bound is derived as the minimization objective during training. The proposed model is evaluated on widely used trajectory benchmarks and outperforms the prior state-of-the-art methods.