Abstract
Intelligent vehicle driving systems aim to control a vehicle's driving behavior in real time, without human intervention, by perceiving and monitoring the surrounding environment. Automatically describing images of traffic scenes, one of the key problems in intelligent vehicle driving technology, has drawn attention since the field's inception. In recent years, a variety of automatic image description techniques have been proposed, among which the attention-based encoder-decoder framework has achieved good results. In this paper we discuss fusing information from multiple aspects of traffic-scene images. First, we introduce visual attention, text attention, and image-topic attention, which generate weighted visual features, attentive text information, and global image-topic information, respectively. We then propose an adaptive two-stage merging network based on the encoder-decoder framework, which fully integrates the three kinds of information in two stages while automatically computing the proportion of each kind at every time step. Extensive experiments on the COCO2014 and Flickr30K datasets demonstrate the effectiveness and advantages of the proposed method.
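The abstract does not give the merging network's exact equations, but the idea of automatically computing per-source proportions at each decoding time step can be sketched as a softmax gate over the three attention contexts. The following is a minimal illustrative sketch, not the authors' implementation; the gate parameterization (`W_g`), dimensions, and function names are assumptions for illustration.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(x - np.max(x))
    return e / e.sum()

def adaptive_fuse(visual_ctx, text_ctx, topic_ctx, hidden, W_g):
    """Fuse three attention contexts with a learned gate (illustrative).

    visual_ctx, text_ctx, topic_ctx: (d,) context vectors from visual,
        text, and image-topic attention, respectively.
    hidden: (d,) decoder hidden state at the current time step.
    W_g: (3, d) hypothetical gate parameters mapping the hidden state
        to one score per information source.
    """
    contexts = np.stack([visual_ctx, text_ctx, topic_ctx])  # (3, d)
    scores = W_g @ hidden            # one scalar score per source
    weights = softmax(scores)        # proportions at this time step
    fused = weights @ contexts       # (d,) convex combination
    return fused, weights

# Usage: with distinct toy contexts, the fused vector's entries expose
# the per-source proportions chosen at this step.
d = 4
visual = np.array([1.0, 0.0, 0.0, 0.0])
text = np.array([0.0, 1.0, 0.0, 0.0])
topic = np.array([0.0, 0.0, 1.0, 0.0])
hidden = np.ones(d)
W_g = np.eye(3, d)
fused, weights = adaptive_fuse(visual, text, topic, hidden, W_g)
```

Because the gate depends on the decoder state, the proportions change from word to word, which is what lets the model lean on visual evidence for concrete objects and on topic or text information for more abstract words.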
IEEE Transactions on Intelligent Transportation Systems