Abstract

Most existing trackers perform well in the occurrence of short-term occlusions and appearance and illumination variations, but struggle with the challenges of long-term tracking which include heavy or long-term occlusions, and out-of-view objects. We propose a spatial-temporal aware long-term object tracking method in this paper, to manage the challenges of long term tracking. Firstly, we present a spatial-temporal aware correlation filter, which jointly models the spatial and temporal information of the target within the correlation filter framework to locate a visible target from frame to frame. Then, we design a tracking uncertainty detection mechanism to activate the re-detector in case of track failure. The mechanism relies on the variations of correlation response measured spatially, and the appearance similarity estimated temporally. Finally, we propose a spatial-temporal aware long-term re-detector to recover the target when it becomes visible. In the re-detection process, a large number of candidates are sampled, evaluated, and refined spatially to obtain accurate locations. Additionally, reliable memory retained through conservative template updating enables the recovery of the target by similarity matching. The spatial-temporal information is explicitly encoded in each component, which operates collaboratively to boost the overall performance. Extensive experimental results conducted on publicly available benchmark datasets demonstrate that the proposed method performs favorably when compared to other state-of-the-art trackers.

Highlights

  • Visual object tracking is a fundamental task in the field of computer vision which is used in a variety of applications including video surveillance, robot navigation, autonomous driving, and human-computer interaction

  • This study differs from existing studies [22]–[24], [48]–[51] in several aspects: a) Full advantage is taken of spatial and temporal cues, and the cues in each component of the method are encoded to achieve performance improvement; b) A designed failure detection mechanism based on the spatial correlation response variation and temporal appearance similarity change is used to determine current object state and activate the re-detector when necessary, rather than using an overlap rate or maximal response score; c) An effective re-detector is adopted as a long-term component, which works together with the reliable memories retained through conservative template updating to recover the object from track failure

  • TColor-128 dataset: We evaluate the proposed method against eight state-of-the-art trackers including kernelized correlation filter (KCF) [6], LCT [22], Staple [9], MEEM [55], spatially regularized discriminative correlation filter (SRDCF) [7], DeepSRDCF [11], spatial-temporal regularized CF (STRCF) [19], and multi-task correlation particle filter (MCPF) [14]

Read more

Summary

INTRODUCTION

Visual object tracking is a fundamental task in the field of computer vision which is used in a variety of applications including video surveillance, robot navigation, autonomous driving, and human-computer interaction. We present a spatial-temporal aware long-term object tracking method in this paper. Our method includes three components: a short-term tracker, tracking uncertainty detection, and a long-term re-detector. When the short-term tracker is detected as unreliable, a spatial-temporal aware long-term re-detector is activated to recover the target from track failure. The experimental results demonstrate that the STACF and tracking uncertainty detection mechanism work well in conjunction with the long-term re-detector to recover the tracked target when it becomes visible. The STACF creates a robust and efficient appearance model It acts as a short-term component to locate a visible target and helps evaluate the sampled particles in the long-term component. (2) A spatial-temporal aware long-term re-detector is proposed, which is activated by a designed tracking uncertainty detection mechanism to recover the target from track failure.

RELATED WORK
CORRELATION FILTERS
SHORT-TERM COMPONENT
TRACKING UNCERTAINTY DETECTION
SPATIAL-TEMPORAL AWARE LONG-TERM RE-DETECTOR
EXPERIMENTS
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call