Abstract

Convolutional neural networks (CNNs) have been applied to visual tracking with demonstrated success in recent years. However, the performance of CNN-based trackers can be further improved, because the predicted upright bounding box cannot tightly enclose the target due to factors such as deformations and rotations. Besides, many existing CNN-based trackers neglect to distinguish the occluded state of the target from non-occluded states, which causes the samples collected during occlusions wrongly update the tracker to focus on other objects. To address these problems, we propose to adaptively utilize the level set segmentation and bounding box regression techniques to obtain a tight enclosing box, and design a CNN to recognize whether the target is occluded. Extensive experimental results on a large benchmark dataset demonstrate the effectiveness of the proposed method compared to several state-of-the-art tracking algorithms.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call