Abstract Correlation filter based trackers have achieved appealing performance and high efficiency in recent years. However, for long-term tracking where target objects undergo dramatic appearance variation due to heavy occlusion or out-of-view, conventional correlation filter based tracking algorithms would be distracted by irrelevant objects. Once the trained tracker loses its way, it is impossible to recover the information for the following frames as the model has drifted. In this paper, we decompose the long-term tracking task into tracking and detection. Tracker learns separate correlation filters for explicit translation and scale estimation. Specifically, in order to improve tracking accuracy, the convolutional features for translation filter are extracted, and the scale filter is learned using the target appearance sampled at different scales. Detector trains an online long-term filter and applies it to the entire frame to generate detection proposals. By exploiting these detection proposals, it helps the tracker to recover from problems such as temporary or persistent occlusions. In this way, the proposed approach could handle the model drifting problem effectively for long-term tracking with more accurate estimation of object scale and location. Extensive experimental results on large-scale benchmark sequences have shown the robustness of the proposed method.
Read full abstract