Trajectory prediction plays a crucial role in autonomous driving tasks, as accurately and rapidly predicting the future trajectories of traffic participants can significantly enhance the safety and robustness of autonomous driving systems. This paper presents a novel trajectory prediction model that follows the encoder–decoder paradigm, achieving precise and rapid predictions of future vehicle trajectories by efficiently aggregating the spatiotemporal and interaction information of agents in traffic scenarios. We propose an agent–agent interaction information extraction module based on a sparse graph attention mechanism, which enables the efficient aggregation of interaction information between agents. Additionally, we introduce a non-autoregressive query generation method that accelerates the model inference speed by generating the decoding queries in parallel. Comparative experiments with the existing advanced algorithms show that our method improves the multimodal trajectory prediction metrics for the Minimum Average Displacement Error (minADE), the Minimum Final Displacement Error (minFDE), and the Miss Rate (MR) by an average of 9.1%, 11.8%, and 14.6%, respectively, while the inference time is only 33.7% of the average time taken by the other algorithms. Finally, we demonstrate the effectiveness of the various modules proposed in this paper through ablation studies.
Read full abstract