Abstract

Recently, Siamese network based trackers have been greatly developed and achieved state-of-the-art performance on multiple benchmarks. However, the decision-making mechanism needs to be studied more deeply in order to obtain higher accuracy. In this paper, we propose a novel Siamese network based visual tracking method, which enhances decision-making ability by Spatially Constrained Correlation Filter (SCCF) and Saliency Prior Context (SPC) model. We use the deep features extracted from Siamese networks to train the SCCF via the efficient Alternating Direction Method of Multipliers (ADMM), and our SCCF applies a penalizing matrix to suppress the boundary effect well. Meanwhile, we regard the end-to-end output of Siamese networks as a priori probability and utilize the spatio-temporal relationship to establish the SPC model. The SPC model can handle the various cases of feature distributions generated from different targets and their contexts. Further, we also take measures to solve some challenging problems in visual tracking, such as target scale change and target occlusion. We conduct extensive experiments to demonstrate the effectiveness of the proposed method, which obtains currently the best results on three large tracking benchmarks, including OTB-2013, OTB-2015, and VOT-2016.

Highlights

  • Visual tracking has a large range of applications in multimedia processing, e.g. automatic driving, robotics, and human computer interaction

  • In DeepTrack [5], a candidate pool of multiple Convolutional Neural Network (CNN) is employed as a data-driven model of different instances of the target object, and the tracking task is regarded as a classification problem

  • We propose a novel Siamese Visual Tracking Network (SVTN) with Spatially Constrained Correlation Filter (SCCF) and Saliency Prior Context (SPC) model

Read more

Summary

INTRODUCTION

Visual tracking has a large range of applications in multimedia processing, e.g. automatic driving, robotics, and human computer interaction. We propose a novel Siamese Visual Tracking Network (SVTN) with Spatially Constrained Correlation Filter (SCCF) and Saliency Prior Context (SPC) model. Inspired by the state-of-the-art method CFNet [7], the Siamese networks perform the Correlation Filters (CFs) as the decision-making mechanism and are pre-trained offline with large-scale image pairs in an endto-end manner. CFnet, we extract the deep features from Siamese networks to train another correlation filter and apply a spatial constraint on the filter to alleviate the boundary effects. This spatial constrained function is a penalizing matrix that assigns higher bias weights on the background region.

RELATED WORKS
F Tak and ς k
ONLINE TRACKING
OCCLUSION-AWARE UPDATE
VISUAL COMPARISONS
Background
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call