Advances in Deep Learning Methods for Visual Tracking: Literature Review and Fundamentals

Xiao-Qin Zhang,Tian-Yu Tong,Chen-Xiang Fan,Tao Wang,Run-Hua Jiang,Peng-Cheng Huang

doi:10.1007/s11633-020-1274-8

Xiao-Qin Zhang, Tian-Yu Tong + Show 4 more

Open Access

https://doi.org/10.1007/s11633-020-1274-8

Copy DOI

Abstract

Recently, deep learning has achieved great success in visual tracking tasks, particularly in single-object tracking. This paper provides a comprehensive review of state-of-the-art single-object tracking algorithms based on deep learning. First, we introduce basic knowledge of deep visual tracking, including fundamental concepts, existing algorithms, and previous reviews. Second, we briefly review existing deep learning methods by categorizing them into data-invariant and data-adaptive methods based on whether they can dynamically change their model parameters or architectures. Then, we conclude with the general components of deep trackers. In this way, we systematically analyze the novelties of several recently proposed deep trackers. Thereafter, popular datasets such as Object Tracking Benchmark (OTB) and Visual Object Tracking (VOT) are discussed, along with the performances of several deep trackers. Finally, based on observations and experimental results, we discuss three different characteristics of deep trackers, i.e., the relationships between their general components, exploration of more effective tracking frameworks, and interpretability of their motion estimation components.

Highlights

Single object tracking is a fundamental and critical task in the fields of computer vision and video processing
To facilitate the development of single object tracking algorithms based on deep learning, in this work, we conclude with the general components of existing deep-learning-based tracking algorithms and present the popular components of deep neural networks, which are proposed for improving the representative ability of the features in Papers [29] [30] [31] [32] [33]
We find that since different components in the deep trackers have their special characteristics, improving only a single component sometimes cannot facilitate the tracking process

Summary

Introduction

Single object tracking is a fundamental and critical task in the fields of computer vision and video processing. Benchmark 2013 (OTB-2013)[27] and Visual Object Tracking 2013 (VOT-2013)[28], have been proposed to evaluate the performance of these tracking algorithms With these developments, several papers reviewed the advancements and challenges in deep-learning-based tracking algorithms. To facilitate the development of single object tracking algorithms based on deep learning, in this work, we conclude with the general components of existing deep-learning-based tracking algorithms and present the popular components of deep neural networks, which are proposed for improving the representative ability of the features in. We present popular metrics used for evaluating the tracking performance on popular tracking datasets

Deep learning models

Data-invariant methods

Data-adaptive methods

Deep tracker components

Feature extraction module

Motion estimation module

Regression module

Loss function

Visual tracking datasets

Object tracking benchmark datasets

Visual object tracking datasets

Large-scale single object tracking dataset

Evaluation metrics

Performance evaluation

Robustness

Quantitative results

Discussions

Relationship among different components

Exploration of more effective frameworks

Interpretability of motion estimation module

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Automation and Computing	Publication Date: Mar 4, 2021
Citations: 15	License type: open-access

R Discovery Prime

R Discovery Prime

Advances in Deep Learning Methods for Visual Tracking: Literature Review and Fundamentals

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Automation and Computing

Lead the way for us

Similar Papers

Person Re-identification and Tracking in Video Surveillance

-

16 Jun 2020
16 Jun 2020

Visual tracking with multilevel feature, similarity attention, color constraint, and global redetection
Song Guiling ... Lu Ru
International Journal of Advanced Robotic Systems | VOL. 18
Song Guiling, et. al.Song Guiling ... Lu Ru
01 Sep 2021
International Journal of Advanced Robotic Systems | VOL. 18

LSTM guided ensemble correlation filter tracking with appearance model pool
Monika Jain ... Clinton Fookes
Computer Vision and Image Understanding | VOL. 195
Monika Jain, et. al.Monika Jain ... Clinton Fookes
25 Feb 2020
Computer Vision and Image Understanding | VOL. 195

MFCFSiam: A Correlation-Filter-Guided Siamese Network with Multifeature for Visual Tracking
Chenpu Li ... Ke Zang
Wireless Communications and Mobile Computing | VOL. 2020
Chenpu Li, et. al.Chenpu Li ... Ke Zang
23 Dec 2020
Wireless Communications and Mobile Computing | VOL. 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Advances in Deep Learning Methods for Visual Tracking: Literature Review and Fundamentals

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Automation and Computing