Abstract

Offline-trained Siamese networks are not robust to the environmental complications of visual object tracking. Without online learning, a Siamese network cannot exploit instance-level domain knowledge or adapt to appearance changes of the target. In this paper, a new lightweight Siamese network is proposed for feature extraction. To cope with the dynamics of targets and backgrounds, the weights of the proposed Siamese network are updated online during the tracking process. To enhance discrimination capability, the cross-entropy loss is integrated into the contrastive loss. Inspired by the face verification algorithm DeepID2, a Bayesian verification model is applied for candidate selection; in general, visual object tracking can benefit from face verification algorithms. Numerical results suggest that the newly developed algorithm achieves comparable performance on public benchmarks.
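The integration of cross-entropy loss into the contrastive loss described above can be sketched as follows. The paper's exact formulation (the margin, the trade-off weight, and how a match probability is produced from the feature distance) is not reproduced here, so `margin`, `lam`, and the distance-to-probability mapping below are illustrative assumptions:

```python
import numpy as np

def contrastive_loss(d, y, margin=1.0):
    # y = 1 for a matching pair, 0 for a non-matching pair; d = Euclidean distance
    return y * d**2 + (1 - y) * np.maximum(margin - d, 0.0)**2

def cross_entropy_loss(p, y, eps=1e-12):
    # Binary cross-entropy on the predicted match probability p
    return -(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps))

def improved_loss(f1, f2, y, margin=1.0, lam=0.5):
    # Combine the two losses; lam is an assumed trade-off weight.
    d = np.linalg.norm(f1 - f2)
    p = 1.0 / (1.0 + np.exp(d - margin))  # map distance to a pseudo-probability
    return contrastive_loss(d, y, margin) + lam * cross_entropy_loss(p, y)
```

A matching pair of identical features should incur a much smaller combined loss than the same pair labeled as non-matching, which is what drives the network toward more discriminative features.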

Highlights

  • As a fundamental and challenging task, visual object tracking has a variety of applications, such as smart video surveillance, autopilot, human–computer interaction and video communication [1,2,3]. Its goal is to estimate the position and scale variation of the target in a video sequence, where the target's initial state is given in the first frame.

  • To illustrate the characteristics of the proposed algorithm, the improved contrastive loss (OSNV) tracker is compared with nine state-of-the-art tracking methods. According to their working principles, these algorithms can be classified into four classes: (i) Siamese-like tracking algorithms, including SiamFC_3s [7] and SINT_noflow [9], both of which train an offline Siamese network to extract feature vectors; (ii) algorithms based on convolutional neural networks (CNNs), e.g., MDNet [10] and SANet [11]; (iii) algorithms based on correlation filters, e.g., ECO [12], KCF [32] and MCPF [13]; (iv) algorithms based on hand-crafted features, e.g., MEEM [33] and TGPR [34].

Summary

Introduction

As a fundamental and challenging task, visual object tracking has a variety of applications, such as smart video surveillance, autopilot, human–computer interaction and video communication [1,2,3]. To exploit the representation capabilities of CNNs, Tao et al. [9] proposed a matching function based on a Siamese network to extract feature vectors, named Siamese Instance Search for Tracking (SINT); this method was trained with the contrastive loss. Different from SiamFC and SINT, the algorithms MDNet and SANet train an offline model and update part of it in the inference phase; these two algorithms are supervised by the logistic loss and have achieved superior performance on the online tracking benchmark (OTB). The contributions of this paper are as follows: the proposed tracker can learn from the domain knowledge of the target and adapt to its appearance changes; an improved contrastive loss integrated with the cross-entropy loss is introduced to update the Siamese network; and the Bayesian verification model is transferred for candidate selection.
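The transferred Bayesian verification step can be illustrated with a simplified sketch: the feature vector of each candidate is compared with the target template via a log-likelihood ratio under "same target" vs. "different target" hypotheses, and the highest-scoring candidate is selected. The diagonal-Gaussian models and helper names below are assumptions for illustration only; DeepID2-style joint Bayesian verification learns full covariance models offline.

```python
import numpy as np

def verification_score(x, t, var_intra, var_extra):
    """Log-likelihood ratio of 'same target' vs. 'different target' for the
    feature difference x - t, under zero-mean diagonal-Gaussian models.
    var_intra / var_extra are assumed per-dimension variances estimated offline."""
    d = x - t
    ll_intra = -0.5 * np.sum(d**2 / var_intra + np.log(2 * np.pi * var_intra))
    ll_extra = -0.5 * np.sum(d**2 / var_extra + np.log(2 * np.pi * var_extra))
    return ll_intra - ll_extra

def select_candidate(candidates, template, var_intra, var_extra):
    # Pick the candidate feature vector with the highest verification score.
    scores = [verification_score(c, template, var_intra, var_extra)
              for c in candidates]
    return int(np.argmax(scores))
```

A candidate whose features lie close to the template receives a positive score (the intra-class model explains the difference better), while a distant candidate receives a negative one, so ranking by this score implements the candidate selection.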

Siamese Network for Visual Object Tracking
Online Algorithms for Visual Object Tracking
Loss Function for CNNs in Visual Tracking
Bayesian Verification Model
Proposed Algorithm
Siamese Network
Cross-Entropy Loss
Contrastive Loss
Improved Contrastive Loss
Implementation of the Bayesian Verification Model
Implementation Details
Experimental Validations
Ablation Study
Evaluation on OTB-2013
Evaluation on OTB-2015
Evaluation on OTB-50
Evaluation on VOT-2016
Evaluation on TempleColor
Qualitative Evaluation
Failure Case
Conclusions
