Abstract
Object tracking is one of the most fundamental and important fields in computer vision with a wide range of applications. Although great progress has been made in object tracking combined with detection, there is still enormous challenges in real-time applications and for the computer cannot effectively capture the temporal correlations of targets and background clutter. In order to improve the performance of tracking algorithms under complex unconstrained conditions, we propose a novel tracking framework based on adaptive detection, called adaptive detection tracking (ADT). First, we exploit the temporal correlation of the recurrent neural network to predict the target’s motion direction and efficiently update the region of interest (RoI) in the narrow range of the next frame. Then, the algorithm utilizes the correlation filter to initialize the defined region of interest based on the threshold. If the Interaction of Union (IoU) of the predicted bounding box and the groundtruth bounding box is greater than the set threshold, the predicted bounding box will be directly output as the tracking results, whereas the detection is adaptively carried out in the determined RoI. Finally, the predicted bounding box refines the direction model as the input of the next frame to complete the whole tracking flow. Our proposed adaptive detection tracking mechanism can efficiently realize non-frame-by-frame adaptive detection with excellent tracking accuracy and is more robust in the unconstrained scenes, especially for occlusion. Comprehensive experiments demonstrate that our approach consistently achieves state-of-the-art results and runs in real-time on six large tracking benchmarks, including OTB100, VOT2016, VOT2017, TC128, UAV123 and LaSOT datasets.
Highlights
Object tracking utilizes the bounding box of the target to predict the target’s position and the whole trajectory in the subsequent frames, which has been widely used in various aspects of human daily life and military security [1], [2]
2) Occluded robustness: our proposed tracker combined with direction prediction model and correlation filter, which can make full use of temporal reliability and spatial effectiveness to highlight the importance of the motion state overtime, especially for heavy occlusion
RELATED WORK Since the key point of this paper is tracking based on adaptive detection, we provide a brief review on two aspects, which can be roughly categorized as object detection and object tracking
Summary
Object tracking utilizes the bounding box of the target to predict the target’s position and the whole trajectory in the subsequent frames, which has been widely used in various aspects of human daily life and military security [1], [2]. The adaptive tracker developed from correlation filter and the direction prediction model is proposed, which is initialized by the predicted RoI. Our proposed algorithm effectively combines the adaptive detection module, which can adaptively realize object detection based on the variances of object and scene, compared with our previous frame-by-frame detection tracking mechanism. The main contributions in this paper are summarized as follows: 1) Adaptive detection: we propose a novel adaptive detection mechanism to realize non-frame-by-frame detection, which can further update and localize an adaptive bounding box in real-time tracking as the VOLUME 8, 2020 object changes shape and size in complex unconstrained conditions. 2) Occluded robustness: our proposed tracker combined with direction prediction model and correlation filter, which can make full use of temporal reliability and spatial effectiveness to highlight the importance of the motion state overtime, especially for heavy occlusion.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.