Chinese Journal of Intelligent Science and Technology ›› 2023, Vol. 5 ›› Issue (2): 180-199.doi: 10.11959/j.issn.2096-6652.202317

• Surveys and Prospectives • Previous Articles     Next Articles

Deep learning-based multimodal trajectory prediction methods for autonomous driving: state of the art and perspectives

Jun HUANG1,2,3, Yonglin TIAN1,4, Xingyuan DAI1,5, Xiao WANG3,6,7,8, Zhixing PING4,5,9   

  1. 1 The State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
    2 Macau University of Science and Technology, Macau 999780, China
    3 Qingdao Academy of Intelligent Industries, Qingdao 266000, China
    4 Beijing Huairou Academy of Parallel Sensing, Beijing 101400, China
    5 Beijing Engineering Research Center of Intelligent Systems and Technology, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
    6 School of Artificial Intelligence, Anhui University, Hefei 230601, China
    7 Engineering Research Center of Autonomous Unmanned System Technology, Ministry of Education, Hefei 230601, China
    8 Anhui Provincial Engineering Research Center for Unmanned System and Intelligent Technology, Hefei 230601, China
    9 North Automatic Control Technology Institute, Taiyuan 030006, China
  • Revised:2023-05-26 Online:2023-06-15 Published:2023-06-10
  • Supported by:
    The National Natural Science Foundation of China(62173329)

Abstract:

Although deep learning methods have achieved better results than traditional trajectory prediction algorithms, there are still problems such as information loss, interaction and uncertainty difficulties in modelling, and lack of interpretability of predictions when implementing multimodal high-precision prediction for autonomous vehicles in heterogeneous, highly dynamic and complex changing environments.The newly developed Transformer's long-range modelling capability and parallel computing ability make it a great success not only in the field of natural language processing, but also in solving the above problems when extended to the task of multimodal trajectory prediction for autonomous driving.Based on this, the aim of this paper is to provide a comprehensive summary and review of past deep neural network-based approaches, in particular the Transformer-based approach.The advantages of Transformer over traditional sequential network, graphical neural network and generative model were also analyzed and classified in relation to existing challenges, simultaneously.Transformer models can be better applied to multimodal trajectory prediction tasks, and that such models have better generalisation and interpretability.Finally, the future directions of multimodal trajectory prediction were presented.

Key words: Transformer, sequential network, graph neural network, generative model, trajectory prediction, multimodal

CLC Number: 

No Suggested Reading articles found!