Self-Supervised Deep Correlation Tracking.

Di Yuan, Po-Yao Huang, Qiao Liu, Xiaojun Chang, Zhenyu He

doi:10.1109/tip.2020.3037518

Abstract

The training of a feature extraction network typically requires abundant manually annotated training samples, making this a time-consuming and costly process. Accordingly, we propose an effective self-supervised learning-based tracker in a deep correlation framework (named self-SDCT). Motivated by the forward-backward tracking consistency of a robust tracker, we propose a multi-cycle consistency loss as self-supervised information for learning a feature extraction network from adjacent video frames. At the training stage, we generate pseudo-labels of consecutive video frames by forward-backward prediction under a Siamese correlation tracking framework and utilize the proposed multi-cycle consistency loss to learn a feature extraction network. Furthermore, we propose a similarity dropout strategy that drops low-quality training sample pairs, and we adopt a cycle trajectory consistency loss in each sample pair to improve the training loss function. At the tracking stage, we employ the pre-trained feature extraction network to extract features and utilize a Siamese correlation tracking framework to locate the target using forward tracking alone. Extensive experimental results indicate that the proposed self-supervised deep correlation tracker (self-SDCT) achieves competitive tracking performance compared with state-of-the-art supervised and unsupervised tracking methods on standard evaluation benchmarks.
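
The forward-backward consistency idea in the abstract can be illustrated with a toy sketch. This is not the self-SDCT implementation: it assumes 1-D "frames" of raw intensities instead of learned features, a plain sliding dot product instead of a Siamese correlation network, and a single cycle instead of the multi-cycle loss. The names `correlate`, `track`, and `cycle_consistency_error` are hypothetical and chosen only for this example.

```python
# Toy illustration of forward-backward (cycle) consistency.
# Assumption: frames are 1-D lists of intensities; the paper works on
# learned deep features of video frames instead.

def correlate(template, search):
    """Slide the template over the search signal and return the response."""
    m = len(template)
    return [
        sum(t * s for t, s in zip(template, search[i:i + m]))
        for i in range(len(search) - m + 1)
    ]

def track(frame_a, pos_a, frame_b, size):
    """Find in frame_b the patch of length `size` that starts at pos_a in frame_a."""
    template = frame_a[pos_a:pos_a + size]
    response = correlate(template, frame_b)
    # The peak of the correlation response is the predicted position.
    return max(range(len(response)), key=response.__getitem__)

def cycle_consistency_error(frames, start, size):
    """Track forward through all frames, then backward, and compare with start."""
    pos = start
    for a, b in zip(frames, frames[1:]):      # forward pass
        pos = track(a, pos, b, size)
    reversed_frames = frames[::-1]
    for a, b in zip(reversed_frames, reversed_frames[1:]):  # backward pass
        pos = track(a, pos, b, size)
    # Squared distance between the initial and the back-tracked position.
    return (pos - start) ** 2

# A small pattern [1, 3, 1] moving one step to the right per frame.
frames = [
    [0, 0, 1, 3, 1, 0, 0, 0],
    [0, 0, 0, 1, 3, 1, 0, 0],
    [0, 0, 0, 0, 1, 3, 1, 0],
]
error = cycle_consistency_error(frames, start=2, size=3)
```

In this sketch a tracker that follows the pattern correctly returns to its starting position, so the error is zero; a training signal of this kind needs no manual labels because the starting position itself serves as the target.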

Journal: IEEE Transactions on Image Processing
Publication Date: Dec 9, 2020
Citations: 282

Similar Papers

Video object detection based on the spatial-temporal convolution feature memory model
Wenjun Dai ... Libin Guo
01 Jul 2020

Tampering detection and localization in digital video using temporal difference between adjacent frames of actual and reconstructed video clip
Vaishali Joshi ... Sanjay Jain
International Journal of Information Technology | VOL. 12
01 Jan 2019

A Novel Hierarchical Model-Based Frame Rate Up-Conversion via Spatio-temporal Conditional Random Fields
M.J Shafiee ... P Fieguth
01 Dec 2011

Image-registration-based local noise reduction for noisy video sequences
Nan Jiang ... Jennie Si
05 May 2006
