Abstract

Object tracking based on visible images may fail when the visible images are unreliable, for example when the illumination condition is poor. Infrared images reveal thermal radiation of objects and are insensitive to these factors. Due to the complementary features of visible and infrared images, RGB-infrared fusion tracking has attracted widespread attention recently. In this paper, an RGB-infrared fusion tracking method based on the fully convolutional Siamese Networks, termed as SiamFT, is proposed. Visible and infrared images are firstly processed by two Siamese Networks, namely visible network and infrared network, respectively. Then, convolutional features of visible and infrared template images extracted from two Siamese Networks are concatenated to form fused template image. Convolutional features of visible and infrared search image are fused through the proposed feature fusion network adaptively. In particular, a modality weight computation method based on the response value of Siamese network is proposed to predict the reliability of different images. Cross-relation is then applied to the fused template feature and the fused search feature to produce the final response map, based on which the tracking results can be obtained. Extensive experiments indicate that the proposed SiamFT shows better performance than the-state-of-art fusion tracking algorithms at real-time speed.

Highlights

  • Object tracking has received increasing attention in recent years due to its wide applications in many areas, such as robotics, surveillance, and human-machine interface

  • The proposed SiamFT achieves the best results in terms of success rate (SR) and the second best results in terms of precision rate (PR) among all compared trackers on the nineteen RGB-infrared videos

  • SiamFT stays in the rank of top 3 in 16 videos in terms of SR and in 13 videos in terms of PR

Read more

Summary

Introduction

Object tracking has received increasing attention in recent years due to its wide applications in many areas, such as robotics, surveillance, and human-machine interface. A lot of algorithms have been proposed to perform object tracking, among which the most popular ones are based on deep learning and correlation filter (CF). Most video tracking algorithms are developed for visible images (RGB images) [1]. Despite remarkable progress, tracking algorithms based on visible images may fail as they may be unreliable in certain circumstance. Infrared images reveal thermal information of objects and are insensitive to these factors.

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.