Abstract

A large number of 3D object tracking methods based on convolutional neural networks have recently been investigated and applied across a variety of applications. Although most of them handle partial occlusion well, the intricate interweaving of moving agents (e.g., pedestrians and vehicles) can lead to inferior 3D object tracking performance in complex traffic scenes. To boost the performance of 3D object tracking under severe occlusion, we present an end-to-end deep learning framework with a driving behavior-aware model that takes full advantage of spatial-temporal details in consecutive frames and learns driving behavior from object variations in 2D center point, depth, rotation and translation in parallel. In contrast to prior work, the novelty of our approach lies in formulating driving behavior that reasons about the possible motion trajectories of the investigated target for autonomous systems. Experiments show that our method outperforms state-of-the-art approaches on 3D object tracking on the challenging nuScenes dataset.

Highlights

  • Multi-object tracking (MOT), also called multi-target tracking (MTT), is an essential component technology in many computer vision applications such as autonomous driving [1]–[3] and robot collision prediction [4], [5]

  • Compared with the state-of-the-art CenterTrack framework, which is based solely on object 2D displacement supervised feature representations, our driving behavior-aware hierarchical architecture encodes object motion components and object variations in consecutive frames, producing a substantially better high-level knowledge-based 2D displacement offset for 3D object tracking in complex traffic scenes

  • By exploring the object variations in motion components, which consist of 2D center offset, depth offset, and rotation and translation offsets in consecutive frames, our framework, in contrast to prior work [3], formulates driving behavior for efficient 3D object tracking with a finer 2D displacement

Summary

INTRODUCTION

Multi-object tracking (MOT), also called multi-target tracking (MTT), is an essential component technology in many computer vision applications such as autonomous driving [1]–[3] and robot collision prediction [4], [5]. Inspired by prior works [28]–[30], we consider a natural formulation in which the movements of road agents with different poses and scales are determined by human driving behavior. Based on this formulation, instead of encoding object center offsets on the 2D plane for 3D tracking [3], we take full advantage of spatial-temporal details across consecutive frames and propose an end-to-end deep learning framework to learn driving behavior from variations in 2D center point, depth, rotation and translation in the magnitude and direction of hidden-state vectors.
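The motion components described above can be illustrated with a minimal sketch. The function and field names below are hypothetical, not from the paper's implementation; it simply shows the per-object variations in 2D center, depth, rotation and translation that the framework learns from between two consecutive frames:

```python
import numpy as np

def motion_components(prev, curr):
    """Per-object motion variations between consecutive frames.

    `prev` and `curr` are dicts describing one tracked object:
      'center'      : (2,) 2D center point on the image plane
      'depth'       : scalar depth from the camera
      'rotation'    : scalar yaw angle in radians
      'translation' : (3,) 3D position

    Returns a dict of offsets (an illustrative representation only).
    """
    center_offset = np.asarray(curr['center']) - np.asarray(prev['center'])
    depth_offset = curr['depth'] - prev['depth']
    # Wrap the rotation difference into (-pi, pi] so a small turn is
    # never mistaken for a near-full revolution.
    d_rot = curr['rotation'] - prev['rotation']
    rotation_offset = (d_rot + np.pi) % (2.0 * np.pi) - np.pi
    translation_offset = (np.asarray(curr['translation'])
                          - np.asarray(prev['translation']))
    return {
        'center_offset': center_offset,
        'depth_offset': depth_offset,
        'rotation_offset': rotation_offset,
        'translation_offset': translation_offset,
    }
```

In the actual framework these per-frame offsets would be produced and consumed by learned network heads rather than computed from ground-truth boxes; the sketch only fixes the quantities involved.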

PRELIMINARIES
ARCHITECTURE OVERVIEW
EXPERIMENTS
METRICS
IMPLEMENTATION DETAILS
EVALUATION ON nuScenes DATASET
Findings
CONCLUSION